AutoML Video Intelligence

AutoML Video Intelligence has a graphical interface that makes it easy to train your own custom models to classify and track objects within videos, even if you have minimal machine learning experience. It’s ideal for projects that require custom labels which aren’t covered by the pre-trained Video Intelligence API.

Video Intelligence API

Video Intelligence API has pre-trained machine learning models that automatically recognize a vast number of objects, places, and actions in stored and streaming video. It’s highly efficient for common use cases and improves over time as new concepts are introduced.

Which video product is right for you?

You can work with either one or reap the benefits of both products by using Video Intelligence API to quickly categorize content using thousands of predefined labels and using AutoML Video Intelligence to create additional custom labels to suit your specific needs.

AutoML Video IntelligenceVideo Intelligence API
USER INTERFACE
Use APIsUse REST and RPC APIs.
Use a graphical UIUse a graphical user interface.
PREDEFINED OR CUSTOM LABELING
Classify video using predefined labelsPre-trained models leverage vast libraries of pre-defined labels.
Classify video using custom labelsTrain models to classify video via labels you choose.
ADDITIONAL FEATURES
Detect shot changesDetect scene changes in a segment or throughout the video.
Detect and track objectsDetect and track objects, how many, where they are within the frame (bounding box), and when they show up (timestamp).
Detect and extract textDetect and extract text using OCR, know where it is within the frame (bounding box), and when it shows up (timestamp).
Moderate contentDetect explicit content (adult, violent, etc.) within images.
Analyze streaming and stored videoAnalyze streaming video and stored video.
Automate video transcription for closed captioning and subtitlesTranscribe speech to text with punctuation. Refine results with alternatives provided for transcribed words or phrases. Censor profanities. Transcribe up to two audio tracks from multitrack video files. Currently supports English.
 composer require google/cloud-videointelligence 
  'gs://cloud-samples-data/video/cat.mp4',

    'features' => [Feature::LABEL_DETECTION]

];

$operation = $video->annotateVideo($options);



# Wait for the request to complete.

$operation->pollUntilComplete();



# Print the result.

if ($operation->operationSucceeded()) {

    $results = $operation->getResult()->getAnnotationResults()[0];

    # Process video/segment level label annotations

    foreach ($results->getSegmentLabelAnnotations() as $label) {

        printf('Video label description: %s' . PHP_EOL, $label->getEntity()->getDescription());

        foreach ($label->getCategoryEntities() as $categoryEntity) {

            printf('  Category: %s' . PHP_EOL, $categoryEntity->getDescription());

        }

        foreach ($label->getSegments() as $segment) {

            $start = $segment->getSegment()->getStartTimeOffset();

            $end = $segment->getSegment()->getEndTimeOffset();

            printf('  Segment: %ss to %ss' . PHP_EOL,

                $start->getSeconds() + $start->getNanos()/1000000000.0,

                $end->getSeconds() + $end->getNanos()/1000000000.0

            );

            printf('  Confidence: %f' . PHP_EOL, $segment->getConfidence());

        }

    }

} else {

    print_r($operation->getError());

} 

Reference Docs : Google Cloud Video Intelligence for PHP

Github Url : GoogleCloudPlatform

Reference from Google Cloud