Wednesday, May 10, 2017

Computer Vision API - Microsoft Cognitive Services

The MS Computer Vision API is a powerful set of REST APIs (web services) that take an image as input, process it, and return the image analysis in a structured format such as JSON. For example, the API can flag adult content, recognize text and characters (OCR), do face recognition, and so on.
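To make the "image in, JSON out" idea concrete, here is a minimal Python sketch. The endpoint URL, API version, header name, and response fields are assumptions based on how the service is commonly documented, so check them against the official reference. The key is a placeholder, and the sample response is hand-written for illustration, not real service output. The helper names `build_analyze_request` and `summarize` are made up for this example.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request

# Assumed values: region host, API version, and key are placeholders.
ENDPOINT = "https://westus.api.cognitive.microsoft.com/vision/v1.0/analyze"
SUBSCRIPTION_KEY = "YOUR_SUBSCRIPTION_KEY"


def build_analyze_request(image_url, features=("Tags", "Adult")):
    """Build (but do not send) a POST request asking for analysis of an image URL."""
    query = urlencode({"visualFeatures": ",".join(features)})
    body = json.dumps({"url": image_url}).encode("utf-8")
    headers = {
        "Content-Type": "application/json",
        "Ocp-Apim-Subscription-Key": SUBSCRIPTION_KEY,  # assumed header name
    }
    return Request(f"{ENDPOINT}?{query}", data=body, headers=headers, method="POST")


def summarize(analysis_json):
    """Pull the tag names and the adult-content flag out of a JSON response string."""
    analysis = json.loads(analysis_json)
    tags = [tag["name"] for tag in analysis.get("tags", [])]
    is_adult = analysis.get("adult", {}).get("isAdultContent", False)
    return {"tags": tags, "is_adult": is_adult}


# Hand-written sample in the assumed response shape (not real service output).
SAMPLE_RESPONSE = (
    '{"tags": [{"name": "person", "confidence": 0.98}],'
    ' "adult": {"isAdultContent": false}}'
)

if __name__ == "__main__":
    request = build_analyze_request("https://example.com/photo.jpg")
    print(request.get_method(), request.full_url)
    print(summarize(SAMPLE_RESPONSE))
```

To actually send the request you would pass it to `urllib.request.urlopen` with a valid key. The sketch stops short of that so it runs without network access.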

The MS Computer Vision API is built on hundreds of image-processing algorithms, which internally use a lot of complex math to process and analyze images.

Microsoft has bundled these image-processing APIs into the Computer Vision API and hooked it up to its cloud platform, so basically you pay for every single API call. You might be wondering how, and in what scenarios, you can use it.

Here is a typical scenario where you can use the Computer Vision API:

You have a camera, an IoT device connected to the internet, set up in front of your home, apartment, or office. As soon as someone walks up and knocks on the door, the camera takes a photo of the person and sends it to the MS Azure cloud. There you call the MS Computer Vision API to process the photo, and based on the result you decide whether or not to open the door. You can use it the same way in any scenario where you need to process images.
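The last step of the scenario above, turning the analysis result into a decision, could be sketched like this in Python. Everything here is hypothetical: the function name `should_open_door`, the "person" tag, the confidence threshold, and the assumed result shape (a `tags` list of name/confidence pairs). A real door would need a much stricter policy than "someone is there".

```python
def should_open_door(analysis, min_confidence=0.9):
    """Hypothetical policy: open only if a 'person' tag meets the confidence threshold.

    `analysis` is assumed to be the parsed JSON result, shaped like
    {"tags": [{"name": "person", "confidence": 0.97}, ...]}.
    """
    return any(
        tag.get("name") == "person" and tag.get("confidence", 0.0) >= min_confidence
        for tag in analysis.get("tags", [])
    )


if __name__ == "__main__":
    # Hand-written results for illustration, not real service output.
    visitor = {"tags": [{"name": "person", "confidence": 0.97}]}
    stray_dog = {"tags": [{"name": "dog", "confidence": 0.99}]}
    print(should_open_door(visitor))
    print(should_open_door(stray_dog))
```

Keeping the decision in its own small function means the policy can be changed or tested without touching the code that talks to the cloud.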

Here are just a few of the things you can do with images using the CV APIs: tagging, categorizing, identifying, OCR, face recognition, generating thumbnails, and a lot more. Video can be handled too, frame by frame (video is an illusion: when you display a set of frames, say 24 or more images per second, the eye perceives them as a movie).
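Since a video is just a sequence of frames and every API call is billed, one simple approach is to send only a sample of the frames. The Python sketch below picks evenly spaced frame indices. The function name `frames_to_analyze` and its parameters are made up for this example, and extracting the actual frames from a video file is left out.

```python
def frames_to_analyze(total_frames, fps=24, samples_per_second=1):
    """Return the indices of evenly spaced frames, about `samples_per_second` per second."""
    if fps <= 0 or samples_per_second <= 0:
        raise ValueError("fps and samples_per_second must be positive")
    # Distance between sampled frames; never smaller than one frame.
    step = max(1, fps // samples_per_second)
    return list(range(0, total_frames, step))


if __name__ == "__main__":
    # A 3-second clip at 24 frames per second has 72 frames.
    indices = frames_to_analyze(72, fps=24, samples_per_second=1)
    print(indices)       # [0, 24, 48]
    print(len(indices))  # 3 API calls instead of 72
```

Sampling one frame per second is an arbitrary choice here. How many frames you really need depends on how fast the scene changes.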

A simple client program to invoke the Computer Vision API: