• Trending
  • Latest
  • All
  • News
  • Business
  • Politics
  • Science
  • World
  • Tech
Argo closes $2.6 billion round from VW at a $7.25 billion valuation

Argo closes $2.6 billion round from VW at a $7.25 billion valuation

9 months ago
What is hazard pay, and why are Amazon and other companies ending it for essential workers?

What is hazard pay, and why are Amazon and other companies ending it for essential workers?

9 months ago
What makes something smell good or bad?

What makes something smell good or bad?

9 months ago
Antibody injections could fight COVID-19 infections – an infectious disease expert explains the prospects

Antibody injections could fight COVID-19 infections – an infectious disease expert explains the prospects

9 months ago
Sandsoft Games in Saudi Arabia will make games for Middle Eastern players

Sandsoft Games in Saudi Arabia will make games for Middle Eastern players

9 months ago
Netflix’s Rebecca review: director Ben Wheatley flattens a classic

Netflix’s Rebecca review: director Ben Wheatley flattens a classic

4 months ago
China’s Didi Chuxing says ride-hailing orders are back to pre-pandemic levels

China’s Didi Chuxing says ride-hailing orders are back to pre-pandemic levels

9 months ago
Fairphone 3 Plus review: sustainability comes with compromises

Fairphone 3 Plus review: sustainability comes with compromises

4 months ago
Publishers sue Internet Archive over Open Library ebook lending

Publishers sue Internet Archive over Open Library ebook lending

9 months ago
Cyberpunk 2077’s Microsoft store listing now has a warning for bugs

Cyberpunk 2077’s Microsoft store listing now has a warning for bugs

3 months ago
Vin Diesel is a dinosaur hunter in Ark 2

Vin Diesel is a dinosaur hunter in Ark 2

3 months ago
Assassin’s Creed Valhalla can run at 60fps on PS5 and Xbox Series X and S with new update

Assassin’s Creed Valhalla can run at 60fps on PS5 and Xbox Series X and S with new update

3 months ago
  • Disclaimer
  • Cookie Policy
  • Privacy & Policy
  • Contact
Monday, March 8, 2021
Fylosophi
  • Home
  • News
    • All
    • Business
    • Politics
    • Science
    • World

    Meet the woman who’s making consumer boycotts great again

    New campaign wants you to raise funds for abuse victims by ditching the razor

    Twitter tweaks video again, adding view counts for some users

    A beginner’s guide to the legendary Tim Tam biscuit, now available in America

    India is bringing free Wi-Fi to more than 1,000 villages this year

    Betterment moves beyond robo-advising with human financial planners

    Magical fish basically has the power to conjure its own Patronus

    This Filipino guy channels his inner Miss Universe by strutting in six-inch heels and speedos

    Oil spill off India’s southern coast leaves fisherman stranded, marine life impacted

    You can now play Bill Gates’ first PC game and run over donkeys on your iPhone, Apple Watch

    Trending Tags

    • Donald Trump
    • Future of News
    • Climate Change
    • Market Stories
    • Election Results
    • Flat Earth
  • Tech
    • All
    • Apps
    • Gear
    • Mobile
    • Startup

    Rap group call out publication for using their image in place of ‘gang’

    Meet the woman who’s making consumer boycotts great again

    New campaign wants you to raise funds for abuse victims by ditching the razor

    Twitter tweaks video again, adding view counts for some users

    A beginner’s guide to the legendary Tim Tam biscuit, now available in America

    India is bringing free Wi-Fi to more than 1,000 villages this year

    Betterment moves beyond robo-advising with human financial planners

    People are handing out badges at Tube stations to tackle loneliness

    Trump’s H-1B Visa Bill spooks India’s IT companies

    Oil spill off India’s southern coast leaves fisherman stranded, marine life impacted

    Trending Tags

    • Flat Earth
    • Sillicon Valley
    • Mr. Robot
    • MotoGP 2017
    • Golden Globes
    • Future of News
No Result
View All Result
Fylosophi
No Result
View All Result
Home Entrepreneurship

Microsoft explains how it improved automatic image captioning in Azure Cognitive Services

by khan
October 15, 2020
in Entrepreneurship
0
Microsoft researchers say NLP bias studies must consider role of social hierarchies like racism
497
SHARES
1.4k
VIEWS
Share on FacebookShare on Twitter

Microsoft right now launched a brand new pc imaginative and prescient service it claims can generate picture captions which might be, in some circumstances, extra correct than human-written descriptions. The corporate calls the service, which is offered as a part of Azure Cognitive Providers Laptop Imaginative and prescient, a “important analysis breakthrough” and an instance of its dedication to accessible AI.

Computerized picture captioning has quite a few broad use circumstances, firstly helping customers with disabilities. In response to the World Well being Group, the variety of individuals of all ages who’re visually impaired is estimated to be 285 million, of whom 39 million are blind.

Accuracy turns into all of the extra important when vision-impaired customers depend on captioning for every day duties. In response to a research by researchers at Indiana College, the College of Washington, and Microsoft, blind individuals have a tendency to position a number of belief in mechanically generated captions, constructing unsupported narratives to reconcile variations between picture contexts and incongruent captions. When requested to establish captions of photos on Twitter that is likely to be incorrect, even blind customers who describe themselves as being expert and constant about double-checking tended to belief automated captions, the researchers discovered — regardless of whether or not the captions make sense.

In early 2017, Microsoft up to date Workplace 365 apps like Phrase and PowerPoint with automated picture captioning, drawing on Cognitive Providers Laptop Imaginative and prescient. (Cognitive Providers is a cloud-based suite of APIs and SDKs accessible to builders constructing AI and machine studying capabilities into their apps and providers.) Extra lately, the corporate launched Seeing AI, a cellular app designed to assist low- and impaired-vision customers navigate the world round them.

However whereas Workplace 365 and Seeing AI might mechanically caption photos higher than some AI baselines, Microsoft engineers pursued new methods to enhance them additional.

The engineers describe their approach in a September paper printed on Arxiv.org, a server for preprints. Known as visible vocabulary pretraining, or VIVO for brief, it leverages massive quantities of pictures with out annotations to study a vocabulary for picture captioning. (Usually, coaching automated captioning fashions requires corpora that include annotations supplied by human labelers.) The vocabulary includes an embedding house the place options of picture areas and tags of semantically related objects are mapped into vectors which might be shut to one another (e.g., “individual” and “man,” “accordion” and “instrument”). As soon as the visible vocabulary is established, an automated picture captioning mannequin could be fine-tuned utilizing a knowledge set of photos and corresponding captions.

Above: Picture captioning outcomes on nocaps. B: A baseline with out including VIVO pretraining. V: With VIVO
pretraining. Crimson textual content represents novel objects. The bounding field shade is brighter when the similarity is greater.

Picture Credit score: Microsoft

In the course of the mannequin coaching course of, a number of tags are randomly masked and the mannequin is requested to foretell the masked tags conditioned on the picture area options and the opposite tags. Despite the fact that the dataset used for fine-tuning solely covers a small subset of the most typical objects within the visible vocabulary, the VIVO-pretrained mannequin can generalize to any photos that depict related scenes (e.g., individuals sitting on a sofa collectively). In truth, it’s one of many few caption-generating pretraining strategies that doesn’t depend on caption annotations, enabling it to work with present picture knowledge units developed for picture tagging and object detection duties.

Microsoft benchmarked the VIVO-pretrained mannequin on nocaps, a take a look at designed to encourage the event of picture captioning fashions that may study visible ideas from different sources of knowledge. Evaluated on tens of hundreds of human-generated captions describing hundreds of photos, the mannequin achieved state-of-the-art outcomes with substantial enchancment for objects it hadn’t seen earlier than. Furthermore, on a metric referred to as consensus-based picture description analysis (CIDEr), which goals to measure the similarity of a generated caption towards floor reality sentences written by people, the mannequin surpassed human efficiency by a statistically important margin.

Along with the most recent model of the Cognitive Providers Laptop Imaginative and prescient API, Microsoft says the mannequin is now included in Seeing AI. It is going to roll out to Microsoft services together with Phrase and Outlook, for Home windows and Mac, and PowerPoint for Home windows, Mac, and internet later this yr, changing a picture captioning mannequin that’s been used since 2015.

“Given the advantage of this, we’ve labored to speed up the combination of this analysis breakthrough and get it into manufacturing and Azure AI,” Eric Boyd, company vp of AI platform at Microsoft, advised VentureBeat by way of telephone earlier this week. “It’s one factor to have a breakthrough of one thing that works in a fragile setup within the lab. However to have one thing that [in a few months] we are able to have pressure-tested and working at scale and a part of Azure … showcases how we’re in a position to go from the analysis breakthrough to getting issues out into manufacturing.”

Share199Tweet124
khan

khan

  • Trending
  • Comments
  • Latest
Argo closes $2.6 billion round from VW at a $7.25 billion valuation

Argo closes $2.6 billion round from VW at a $7.25 billion valuation

June 2, 2020
What is hazard pay, and why are Amazon and other companies ending it for essential workers?

What is hazard pay, and why are Amazon and other companies ending it for essential workers?

June 1, 2020
What makes something smell good or bad?

What makes something smell good or bad?

June 2, 2020
Antibody injections could fight COVID-19 infections – an infectious disease expert explains the prospects

Antibody injections could fight COVID-19 infections – an infectious disease expert explains the prospects

June 1, 2020
The UN says a new computer simulation tool could boost global development

The UN says a new computer simulation tool could boost global development

0
Third Pixel feature drop improves AI-powered Adaptive Battery, integrates Recorder and Google Assistant

Third Pixel feature drop improves AI-powered Adaptive Battery, integrates Recorder and Google Assistant

0
Luna Labs’s Replay automates creation of mobile game video ads

Luna Labs’s Replay automates creation of mobile game video ads

0
Robo-umps are coming to Major League Baseball, and the game will never be the same

Robo-umps are coming to Major League Baseball, and the game will never be the same

0
Samsung Display promises thinner laptop bezels with unproven under-screen webcams

Samsung Display promises thinner laptop bezels with unproven under-screen webcams

January 14, 2021
The Oculus Quest is getting multi-user support soon

The Oculus Quest is getting multi-user support soon

January 14, 2021
Super Nintendo World opening delayed due to Osaka state of emergency

Super Nintendo World opening delayed due to Osaka state of emergency

January 14, 2021
Hack together your own e-paper smartwatch with this $50 open-source kit

Hack together your own e-paper smartwatch with this $50 open-source kit

January 14, 2021
Fylosophi

Copyright © 2020 Fylosophi.

Navigate Site

  • Disclaimer
  • Cookie Policy
  • Privacy & Policy
  • Contact

Follow Us

No Result
View All Result
  • Home
  • News
    • Politics
    • Business
    • World
    • Science
  • Tech
    • Apps
    • Gear
    • Mobile
    • Startup

Copyright © 2020 Fylosophi.

Login to your account below

Forgotten Password?

Fill the forms bellow to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In