With the increased usage of video streaming for work purposes, there is a strong need to monitor live and recorded video broadcasts. The quality checks could be as simple as signal errors, issues with subtitles, and audio language, which human operators typically monitor. It becomes very difficult sometimes for live broadcasts.
The simple usage of artificial intelligence (AI) can automate many of the monitoring tasks done by human resources. The AI-based detections can help you to analyze the content of an HTTP Live Streaming (HLS) video stream. AWS reckognition performs an example set of monitoring checks in near real-time (<15 seconds).
Below is an attempt to familiarize you with the technologies and standards used in this solution –
- HLS is an HTTP adaptive bitrate streaming communications protocol.
- AWS Elemental MediaLive is a real-time video service that lets you create live outputs for broadcast and streaming.
- Amazon Rekognition Custom Labels allows you to build models to identify the objects and scenes specific to your business needs.
A robust broadcast quality control solution should monitor various aspects of the live streams:
Traditional image and audio analysis algorithms can be used for some situations, and many are better suited for detection using Machine Learning (ML) models.
- The video ingestion pipeline produces HLS streams using AWS Elemental MediaLive and is stored in Amazon Simple Storage Service (Amazon S3)
- A video processing pipeline orchestrated by AWS Step Functions based automation which performs monitoring checks on extracted frames and audio from every video segment
Automated Verification Outcomes:
- Audio silence detection – based on a predefined volume threshold.
- Logo verification – Known logos from images are well suited for Convolutional Neural Networks (CNN) based ML models. Object detection models have been created using Amazon Rekognition Custom Labels.
- Program type verification: Whether the video looks like the type of program it should be. To ensure this, a custom image classification model was created using Amazon Rekognition Custom Labels.
- Character/person identification: Whether this video shows the correct person or actor. For verifying this, the face image extraction feature of Amazon Rekognition to look for persons/ actors on screen can be leveraged along with Rekognition Custom Labels to train a model to recognize a specific show/ program.
In combination with Media Live and Amazon AI/ML services, you can easily broadcast the content of your dream. The near real-time AI/ML intelligence would bring more efficiency to expect accuracy up to your need. Stay tuned for more media solutions using new-age technologies.
Written By,
Abhinav Abhishek
Solutions Director, Rapyder Cloud Solutions
Want to know? Contact us now for a free consultation