AutoML Video Intelligence
AutoML Video Intelligence has a graphical interface that makes it easy to train your own custom models to classify and track objects within videos, even if you have minimal machine learning experience. It’s ideal for projects that require custom labels that aren’t covered by the pre-trained Video Intelligence API.
Video Intelligence API
Video Intelligence API has pre-trained machine learning models that automatically recognize a vast number of objects, places, and actions in stored and streaming video. It’s highly efficient for common use cases and improves over time as new concepts are introduced.
Which video product is right for you?
You can use either product on its own or combine them: use Video Intelligence API to quickly categorize content with thousands of predefined labels, and use AutoML Video Intelligence to create additional custom labels for your specific needs.
| Feature | AutoML Video Intelligence | Video Intelligence API |
|---|---|---|
| USER INTERFACE | | |
| Use APIs: Use REST and RPC APIs. | | |
| Use a graphical UI: Use a graphical user interface. | | |
| PREDEFINED OR CUSTOM LABELING | | |
| Classify video using predefined labels: Pre-trained models leverage vast libraries of predefined labels. | | |
| Classify video using custom labels: Train models to classify video via labels you choose. | | |
| ADDITIONAL FEATURES | | |
| Detect shot changes: Detect scene changes in a segment or throughout the video. | | |
| Detect and track objects: Detect and track objects, how many there are, where they are within the frame (bounding box), and when they show up (timestamp). | | |
| Detect and extract text: Detect and extract text using OCR, know where it is within the frame (bounding box), and when it shows up (timestamp). | | |
| Moderate content: Detect explicit content (adult, violent, etc.) within video. | | |
| Analyze streaming and stored video: Analyze streaming video and stored video. | | |
| Automate video transcription for closed captioning and subtitles: Transcribe speech to text with punctuation. Refine results with alternatives provided for transcribed words or phrases. Censor profanities. Transcribe up to two audio tracks from multitrack video files. Currently supports English. | | |
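
The table mentions that tracked objects come with a bounding box describing where they sit in the frame. Video Intelligence API reports object-tracking boxes in normalized form, with each edge expressed as a fraction of the frame size between 0 and 1. The sketch below shows one way to turn such a box into pixel coordinates. The function name `normalizedBoxToPixels`, the returned array shape, and the 1280x720 frame size are assumptions made for illustration; they are not part of the client library.

```php
<?php
declare(strict_types=1);

/**
 * Convert a normalized bounding box (edges as fractions of the frame, 0 to 1)
 * into pixel coordinates. Hypothetical helper, not part of the client library.
 */
function normalizedBoxToPixels(
    float $left,
    float $top,
    float $right,
    float $bottom,
    int $frameWidth,
    int $frameHeight
): array {
    # Clamp each edge into [0, 1] so a slightly out-of-range value cannot
    # produce coordinates outside the frame.
    $clamp = fn (float $v): float => max(0.0, min(1.0, $v));

    $x1 = (int) round($clamp($left) * $frameWidth);
    $y1 = (int) round($clamp($top) * $frameHeight);
    $x2 = (int) round($clamp($right) * $frameWidth);
    $y2 = (int) round($clamp($bottom) * $frameHeight);

    return ['x' => $x1, 'y' => $y1, 'width' => $x2 - $x1, 'height' => $y2 - $y1];
}

# Example: a box covering the lower middle of an assumed 1280x720 frame.
$box = normalizedBoxToPixels(0.25, 0.5, 0.75, 1.0, 1280, 720);
printf('x=%d y=%d width=%d height=%d' . PHP_EOL, $box['x'], $box['y'], $box['width'], $box['height']);
# prints: x=320 y=360 width=640 height=360
```

The helper takes plain floats rather than a library object so it can be tried without credentials; in real use the four edges would come from the bounding box attached to each tracked frame.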

Install the PHP client library with Composer:

    composer require google/cloud-videointelligence

Then detect labels in a video stored in Cloud Storage:

    <?php
    require __DIR__ . '/vendor/autoload.php';

    use Google\Cloud\VideoIntelligence\V1\VideoIntelligenceServiceClient;
    use Google\Cloud\VideoIntelligence\V1\Feature;

    # Instantiate a client.
    $video = new VideoIntelligenceServiceClient();

    # Execute a label detection request on a sample video.
    $options = [
        'inputUri' => 'gs://cloud-samples-data/video/cat.mp4',
        'features' => [Feature::LABEL_DETECTION]
    ];
    $operation = $video->annotateVideo($options);

    # Wait for the request to complete.
    $operation->pollUntilComplete();

    # Print the result.
    if ($operation->operationSucceeded()) {
        $results = $operation->getResult()->getAnnotationResults()[0];

        # Process video/segment level label annotations.
        foreach ($results->getSegmentLabelAnnotations() as $label) {
            printf('Video label description: %s' . PHP_EOL, $label->getEntity()->getDescription());
            foreach ($label->getCategoryEntities() as $categoryEntity) {
                printf('  Category: %s' . PHP_EOL, $categoryEntity->getDescription());
            }
            foreach ($label->getSegments() as $segment) {
                # Offsets are reported as whole seconds plus nanoseconds.
                $start = $segment->getSegment()->getStartTimeOffset();
                $end = $segment->getSegment()->getEndTimeOffset();
                printf('  Segment: %ss to %ss' . PHP_EOL,
                    $start->getSeconds() + $start->getNanos() / 1000000000.0,
                    $end->getSeconds() + $end->getNanos() / 1000000000.0
                );
                printf('  Confidence: %f' . PHP_EOL, $segment->getConfidence());
            }
        }
    } else {
        print_r($operation->getError());
    }
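
The sample above converts each time offset to seconds inline by adding the whole seconds to the nanoseconds divided by one billion. As a minimal sketch, that arithmetic could be pulled into small helpers so the start and end of a segment are formatted the same way. The names `offsetToSeconds` and `formatSegment`, and the choice of three decimal places, are assumptions for illustration rather than anything the client library provides.

```php
<?php

/**
 * Convert an offset given as whole seconds plus nanoseconds into
 * fractional seconds. Hypothetical helper, not part of the client library.
 */
function offsetToSeconds(int $seconds, int $nanos): float
{
    # One second is 1e9 nanoseconds.
    return $seconds + $nanos / 1e9;
}

/**
 * Format a segment's start and end offsets as a readable range,
 * rounded to millisecond precision.
 */
function formatSegment(int $startSeconds, int $startNanos, int $endSeconds, int $endNanos): string
{
    return sprintf(
        '%.3fs to %.3fs',
        offsetToSeconds($startSeconds, $startNanos),
        offsetToSeconds($endSeconds, $endNanos)
    );
}

# Example with made-up offsets: a segment from 0s to 14.833664s.
echo formatSegment(0, 0, 14, 833664000), PHP_EOL;
# prints: 0.000s to 14.834s
```

Inside the label loop, the values passed in would be the results of `getSeconds()` and `getNanos()` on the start and end offsets, cast to `int` where needed.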

Reference docs: Google Cloud Video Intelligence for PHP

GitHub: GoogleCloudPlatform

Reference from Google Cloud
