The floodgates have opened for building AI reasoning models on the cheap.
Researchers at Stanford and the University of Washington have developed a model that performs comparably to OpenAI o1 and DeepSeek R1 models in math and coding — for less than $50 of cloud compute credits.
What's more, the model was trained on only 1,000 questions, and training took just 26 minutes on 16 Nvidia H100 GPUs. Stanford researcher Niklas Muennighoff said in an email to Mashable that the cost is an estimate based on the GPU runtime and number of H100 GPUs used.
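For a rough sense of that estimate, the arithmetic can be sketched as below. Only the GPU count, the runtime, and the $50 ceiling come from the reporting; the function names are illustrative, and the resulting per-GPU-hour figure is simply what those three numbers imply, not a quoted cloud price.

```python
def gpu_hours(num_gpus: int, minutes: float) -> float:
    """Total GPU-hours consumed by a run on num_gpus for the given minutes."""
    return num_gpus * minutes / 60


def implied_max_hourly_rate(cost_ceiling_usd: float, hours: float) -> float:
    """Highest per-GPU-hour price that keeps the run under the cost ceiling."""
    return cost_ceiling_usd / hours


# Figures reported in the article: 16 H100s, 26 minutes, under $50.
hours = gpu_hours(num_gpus=16, minutes=26)
rate = implied_max_hourly_rate(cost_ceiling_usd=50.0, hours=hours)

print(f"{hours:.2f} GPU-hours")                # 6.93 GPU-hours
print(f"${rate:.2f} per GPU-hour at most")     # $7.21 per GPU-hour at most
```

In other words, the run used about seven GPU-hours, so any rental price below roughly $7.21 per H100-hour keeps it under $50.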
The AI industry of late is all about how new approaches to the pre- and post-training process can massively cut computing costs, as evidenced by DeepSeek's disruptive impact. On top of that, developers can now build on existing AI models at little or no cost, through APIs, open-source access, and even by distilling data from closed-source models, bringing costs down even more.
According to the team's research paper, which was published last Friday, s1 was trained on a dataset consisting of "1,000 carefully curated questions paired with reasoning traces and answers distilled from Gemini Thinking Experimental." Google's Gemini Thinking Experimental model is accessible with daily limits through AI Studio. While it's a closed-source model, that clearly hasn't stopped researchers from making use of its responses.
Next, the researchers took an "off the shelf" pretrained model from Qwen, the Alibaba-owned lab, and performed supervised fine-tuning on their curated dataset. Then, the team created a token budget to control the amount of compute used when testing the model. If s1 went over budget on thinking tokens, it was cut off and forced to generate whatever answer it had come up with. If the researchers wanted the model to spend more "test-time compute" on a problem, they would simply tell the model to "wait," which extended its thinking time and led to more accurate results.
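The token-budget idea described above can be illustrated with a toy sketch. This is not the researchers' code: the `next_token` callback, the `</think>` end marker, and the `toy_model` stand-in are all hypothetical, chosen only to show the two controls (cut thinking off at a cap, or replace an early stop with "Wait" to extend it).

```python
from typing import Callable, List

END_THINK = "</think>"  # hypothetical end-of-thinking marker
WAIT = "Wait"           # word used to nudge the model to keep reasoning


def budget_forced_thinking(
    next_token: Callable[[List[str]], str],
    prompt: List[str],
    max_tokens: int,
    extra_waits: int = 0,
) -> List[str]:
    """Collect thinking tokens under a cap, optionally extending with WAIT."""
    thinking: List[str] = []
    waits_left = extra_waits
    while len(thinking) < max_tokens:
        token = next_token(prompt + thinking)
        if token == END_THINK:
            if waits_left == 0:
                break  # the model stopped on its own, within budget
            # Suppress the stop and ask the model to keep thinking.
            waits_left -= 1
            token = WAIT
        thinking.append(token)
    # Either the model stopped or the budget ran out: close the thinking phase.
    thinking.append(END_THINK)
    return thinking


def toy_model(context: List[str]) -> str:
    """Stand-in model: tries to stop after three consecutive 'step' tokens."""
    recent = 0
    for tok in reversed(context):
        if tok != "step":
            break
        recent += 1
    return END_THINK if recent >= 3 else "step"


print(budget_forced_thinking(toy_model, ["Q"], max_tokens=10))
print(budget_forced_thinking(toy_model, ["Q"], max_tokens=10, extra_waits=1))
print(budget_forced_thinking(toy_model, ["Q"], max_tokens=2))
```

The first call lets the toy model stop naturally, the second swaps its first stop for "Wait" so it reasons for longer, and the third cuts it off after two tokens. With a real model, the answer would then be generated from whatever thinking was collected.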
By controlling the amount of time and compute spent on a problem, the researchers were able to show how increased thinking time leads to improved performance.
The s1 model is one example of open-source reasoning models that have been developed for a fraction of the cost of flagship models from Google and OpenAI. In January, UC Berkeley researchers released an open-source reasoning model called Sky-T1 that cost $450, "demonstrating that it is possible to replicate high-level reasoning capabilities affordably and efficiently," per its blog post. There's also the open-source rStar-Math reasoning model from Microsoft Research Asia, Tulu 3 from nonprofit research institute Ai2, and Hugging Face's own initiative to replicate DeepSeek's R1.
As high-quality models become cheaper and more accessible, we're starting to see a power shift from the few AI heavy hitters to the many.