Award Banner
Award Banner

Meta releases AI model that can check other AI models' work

Meta releases AI model that can check other AI models' work
Meta's researchers used entirely AI-generated data to train the evaluator model, eliminating human input at that stage as well.
PHOTO: Reuters

NEW YORK — Facebook owner Meta said on Oct 18 it was releasing a batch of new AI models from its research division, including a "Self-Taught Evaluator" that may offer a path toward less human involvement in the AI development process.

The release follows Meta's introduction of the tool in an August paper, which detailed how it relies upon the same "chain of thought" technique used by OpenAI's recently released o1 models to get it to make reliable judgements about models' responses.

That technique involves breaking down complex problems into smaller logical steps and appears to improve the accuracy of responses on challenging problems in subjects like science, coding and math.

Meta's researchers used entirely AI-generated data to train the evaluator model, eliminating human input at that stage as well.

The ability to use AI to evaluate AI reliably offers a glimpse at a possible pathway toward building autonomous AI agents that can learn from their own mistakes, two of the Meta researchers behind the project told Reuters.

Many in the AI field envision such agents as digital assistants intelligent enough to carry out a vast array of tasks without human intervention.

Self-improving models could cut out the need for an often expensive and inefficient process used today called Reinforcement Learning from Human Feedback, which requires input from human annotators who must have specialised expertise to label data accurately and verify that answers to complex math and writing queries are correct.

"We hope, as AI becomes more and more super-human, that it will get better and better at checking its work, so that it will actually be better than the average human," said Jason Weston, one of the researchers.

"The idea of being self-taught and able to self-evaluate is basically crucial to the idea of getting to this sort of super-human level of AI," he said.

Other companies including Google and Anthropic have also published research on the concept of RLAIF, or Reinforcement Learning from AI Feedback. Unlike Meta, however, those companies tend not to release their models for public use.

Other AI tools released by Meta on Oct 18 included an update to the company's image-identification Segment Anything model, a tool that speeds up LLM response generation times and datasets that can be used to aid the discovery of new inorganic materials. 

Read Also
Intel, AMD team up to confront rising challenge from Arm
digicult
Intel, AMD team up to confront rising challenge from Arm

Source: Reuters

homepage

trending

trending
    2 taken to hospital after Toa Payoh flat fire linked to PMD battery
    Singapore poly grad receiving 'bouquet' of roast duck and bitter gourd at graduation goes viral
    1,000 flats at former Keppel Club golf course to be offered in October BTO exercise
    Fatal Second Link accident: Singaporean pleads not guilty to dangerous driving, lawyers say he lost control of Maserati
    3 weeks' jail for man who molested stewardess on SIA flight
    Singapore and Changi cannot be complacent, says PM Wong during groundbreaking ceremony of Terminal 5
    'I hate you': Addy Lee details fallout with Quan Yi Fong and Eleanor Lee in livestream
    Baby suspected to have been eaten by monitor lizard in Thailand, only head found
    CL, BabyMetal, Foo Fighters: Singapore concert calendar for 2025
    Woman dies in fatal crash along Punggol Road, vape pods found in car
    Mexican beauty influencer shot to death during TikTok livestream
    Ghib Ojisan opens up about birth of baby girl, taking on confinement nanny role to care for wife: 'I want to be there for her'

Singapore

Singapore
    • Defence Minister Ng Eng Hen on Singapore's place in the world, SAF's evolution and 24 years in politics
    • 'Not a one-off exercise': PM Wong launches latest tranche of $500 CDC vouchers
    • Covid-19 cases going up, but variants are not more transmissible or severe: MOH, CDA
    • Woman sues mother for evicting her; judge dismisses her claim of right to stay indefinitely
    • Maid who stabbed employer’s mother-in-law 26 times has murder charge reduced on appeal
    • Daily roundup: Singapore and Changi cannot be complacent, says PM Wong during groundbreaking ceremony of Terminal 5 — and other top stories today
    • Cleaner who molested 10-year-old girl twice in one day at school gets nearly a year in jail
    • Stray cat in Punggol dies from 'deliberate abuse'; NParks investigating
    • Man arrested for allegedly attacking parent with metal chair after Singapore Youth League match
    • 'His legacy lives on': Singapore's cricket community mourns coach Arjun Menon who was 'brutally murdered' in Malawi

Entertainment

Entertainment
    • Director of K-drama Nine Puzzles 'pulled strings' to get these famous actors to cameo in the show
    • Lee Do-hyun and Monsta X's Hyungwon complete military service, Cha Eun-woo speculated to enlist soon
    • Taiwanese comedian Nono found guilty of attempted rape, sentenced to 2 1/2 years' jail
    • 'My heart feels an unbearable ache': Hong Ling reveals miscarriage earlier this year
    • 'I found blood all over my body': Hong Kong former actor Wong He reveals being sexually assaulted twice in 2024
    • 'Difficult, demanding, diva': Rui En recalls feeling vilified for refusing to act in intimate scenes
    • Demi Lovato and Jutes reportedly set to tie the knot on Memorial Day weekend
    • Liam Gallagher to be grandfather for first time
    • Jennifer Lopez suffers facial injury during 2025 American Music Awards rehearsals
    • Tom Cruise dazzles Cannes for Mission: Impossible premiere

Lifestyle

Lifestyle
    • 'A new chapter begins': 8 local indie bookshops unite to launch one-stop online platform
    • Chicken Supremo owners retiring after 34 years, hawker stall to continue under new owner
    • 'Why didn't my mum try harder?' Woman serving jail time confronts painful past in Mother's Day visit
    • Sizzling exhibits, games and freebies: McDonald's launching first McSpicy Museum at Bugis Junction
    • Swensen's wedding? Restaurant's buffet concept to open in the west with space for large-scale event hosting
    • Spring in full bloom: Festive fun for all ages in Hong Kong
    • Battle of Middle East budget airlines: Which ones are worth it?
    • The ultimate work-from-home homebuyer checklist (that most people still overlook)
    • 6 inspiring local mum-preneurs in celebration of Mother's Day
    • I let my spontaneous INFP friend plan our day out – here's how we got around hassle-free

Digicult

Digicult
    • A $500 wake-up call: How the Samsung Galaxy Ring made me realise my stress
    • Monster Hunter Wilds producer explains how game has remained unique and fresh over 20 years
    • World's best Dota 2 teams to compete for $1m prize pool in Singapore in November
    • Google Pixel 9a: The best AI-centric phone under $800 in 2025?
    • Western intelligence agencies warn spyware threat targeting Taiwan, Tibetan rights advocates
    • Taiwan says China using generative AI to ramp up disinformation and 'divide' the island
    • Russian court fines Telegram app for refusal to remove anti-government content, TASS reports
    • One Beijing man's quest to keep cooking — and connecting with Americans — on camera
    • Nintendo Switch 2 to launch in June with US$449.99 price tag
    • Games in April: RPGs, racing and Ronaldo in a fighting game

Money

Money
    • Wall Street equity indexes close higher after US-China tariff truce
    • Giant deal: Malaysian company to acquire Cold Storage and Giant supermarket chains in Singapore
    • Apec warns of tariff impact on trade as members seek deals with US
    • Family of Koufu Group founders to buy Caldecott Hill GCB site for $58m
    • This US-owned factory in China made toys for Walmart. Tariffs put it on life support
    • Are you paying more than you should with dealer financing?
    • Best credit card promotions in Singapore (May 2025): Citibank, DBS, HSBC, UOB and more
    • Why paying minimum on credit cards may cost you in the long run
    • Here's where you can find the biggest 2-bedder condos under $1.8m in 2025
    • Best fixed deposit rates in Singapore (May 2025): Minimum deposits from $500, rates up to 2.50%

Latest

Latest
  • Daily roundup: 1,000 flats at former Keppel Club golf course to be offered in October BTO exercise — and other top stories today
  • Ukraine peace talks mired in confusion as Putin stays away
  • Trump: India has offered US a trade deal with no tariffs
  • Trump says US close to a nuclear deal with Iran
  • Mike Lynch's yacht doomed by extreme wind, interim report finds
  • Australia PM Albanese meets Indonesia counterpart in first international visit since re-election
  • Malaysia PM discusses MH17 downing with Russia's Putin
  • US Justice Department to meet families of 737 MAX victims on Boeing criminal case
  • Pope Leo says he will make 'every effort' for world peace

In Case You Missed It

In Case You Missed It
  • 'Dog will return soon': GE2025 independent candidate Jeremy Tan wants to contest again
  • Ong Ye Kung leads PAP team to victory while elder brother Howard Ong loses in Australia's election on the same day
  • Tan Kiat How weighs in on viral video of Gan Kim Yong being ignored by passers-by in Punggol
  • PSP's Tan Cheng Bock turns 85; SDP's Paul Tambyah joins celebration at Teban Gardens
  • PM Wong urges voters to 'choose leaders of good character' in PAP's first party political broadcast
  • It is 'important for Singapore's democracy' that WP wins more seats, says Pritam in election broadcast
  • GE2025: PSP, RDU, SDP, PPP, PAR, NSP promise to push for policy changes if elected to Parliament in first political broadcast
  • 'Everyone has the right to express their feelings': WP candidates address four-cornered fight in Tampines GRC
  • PAP's Desmond Lee responds to opposition's calls for GST exemption, says 'we want to make it progressive'
This website is best viewed using the latest versions of web browsers.