{"id":89565,"date":"2025-01-12T01:37:32","date_gmt":"2025-01-12T01:37:32","guid":{"rendered":"https:\/\/neclink.com\/index.php\/2025\/01\/12\/researchers-open-source-sky-t1-a-reasoning-ai-model-that-can-be-trained-for-less-than-450\/"},"modified":"2025-01-12T01:37:32","modified_gmt":"2025-01-12T01:37:32","slug":"researchers-open-source-sky-t1-a-reasoning-ai-model-that-can-be-trained-for-less-than-450","status":"publish","type":"post","link":"https:\/\/neclink.com\/index.php\/2025\/01\/12\/researchers-open-source-sky-t1-a-reasoning-ai-model-that-can-be-trained-for-less-than-450\/","title":{"rendered":"Researchers open source Sky-T1, a &#8216;reasoning&#8217; AI model that can be trained for less than $450"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">So-called reasoning AI models are becoming easier \u2014 and cheaper \u2014 to develop. <\/p>\n<p class=\"wp-block-paragraph\">On Friday, NovaSky, a team of researchers based out of UC Berkeley\u2019s Sky Computing Lab, released Sky-T1-32B-Preview, a reasoning model that\u2019s competitive with an <a href=\"https:\/\/techcrunch.com\/2024\/09\/12\/openai-unveils-a-model-that-can-fact-check-itself\/\">earlier version of OpenAI\u2019s o1<\/a> on a number of key benchmarks. Sky-T1 appears to be the first truly open source reasoning model in the sense that it can be <a href=\"https:\/\/techcrunch.com\/2024\/10\/28\/we-finally-have-an-official-definition-for-open-source-ai\/\">replicated from scratch<\/a>; the team released the data set they used to train it as well as the necessary training code.<\/p>\n<p class=\"wp-block-paragraph\">\u201cRemarkably, Sky-T1-32B-Preview was trained for less than $450,\u201d the team wrote in a <a rel=\"nofollow\" href=\"https:\/\/novasky-ai.github.io\/posts\/sky-t1\/\">blog post<\/a>, \u201cdemonstrating that it is possible to replicate high-level reasoning capabilities affordably and efficiently.\u201d<\/p>\n<p class=\"wp-block-paragraph\">$450 might not sound that affordable. But it wasn\u2019t long ago that the price tag for training a model with comparable performance <a rel=\"nofollow\" href=\"https:\/\/www.forbes.com\/sites\/katharinabuchholz\/2024\/08\/23\/the-extreme-cost-of-training-ai-models\/\">often ranged in the millions of dollars<\/a>. Synthetic training data, or training data generated by other models, has helped drive costs down. Palmyra X 004, a model recently released by AI company Writer, trained almost entirely on\u00a0<a href=\"https:\/\/techcrunch.com\/2024\/10\/13\/the-promise-and-perils-of-synthetic-data\/\">synthetic data<\/a>, reportedly cost just $700,000 to develop.<\/p>\n<p class=\"wp-block-paragraph\">Unlike most AI, reasoning models effectively fact-check themselves, which\u00a0<a href=\"https:\/\/techcrunch.com\/2024\/08\/27\/why-ai-cant-spell-strawberry\/\">helps them to avoid some of the\u00a0pitfalls\u00a0that normally trip up models<\/a>. Reasoning models take a little longer \u2014 usually seconds to minutes longer \u2014 to arrive at solutions compared to a typical non-reasoning model. The upside is, they tend to be more reliable in domains such as physics, science, and mathematics.<\/p>\n<p class=\"wp-block-paragraph\">The NovaSky team says it used another reasoning model, <a href=\"https:\/\/techcrunch.com\/2024\/11\/27\/alibaba-releases-an-open-challenger-to-openais-o1-reasoning-model\/\">Alibaba\u2019s QwQ-32B-Preview<\/a>, to generate the initial training data for Sky-T1, then \u201ccurated\u201d the data mixture and leveraged OpenAI\u2019s <a href=\"https:\/\/techcrunch.com\/2024\/07\/18\/openai-unveils-gpt-4o-mini-a-small-ai-model-powering-chatgpt\/\">GPT-4o-mini<\/a> to refactor the data into a more workable format. Training the 32-billion-parameter Sky-T1 took about 19 hours using a rack of 8 Nvidia H100 GPUs. (Parameters roughly correspond to a model\u2019s problem-solving skills.)<\/p>\n<p class=\"wp-block-paragraph\">According to the NovaSky team, Sky-T1 performs better than an early preview version of o1 on MATH500, a collection of \u201ccompetition-level\u201d math challenges. The model also beats the preview of o1 on a set of difficult problems from LiveCodeBench, a coding evaluation. <\/p>\n<p class=\"wp-block-paragraph\">However, Sky-T1 falls short of the o1 preview on GPQA-Diamond, which contains physics, biology, and chemistry-related questions a PhD graduate would be expected to know.<\/p>\n<p class=\"wp-block-paragraph\">Also important to note is that OpenAI\u2019s <a href=\"https:\/\/techcrunch.com\/2024\/12\/17\/openai-brings-its-o1-reasoning-model-to-its-api-for-certain-developers\/\">GA release of o1<\/a> is a stronger model than the preview version of o1, and that OpenAI is expected to release an even better-performing reasoning model, <a href=\"https:\/\/techcrunch.com\/2024\/12\/20\/openai-announces-new-o3-model\/\">o3<\/a>, in the weeks ahead.<\/p>\n<p class=\"wp-block-paragraph\">But the NovaSky team says that Sky-T1 only marks the start of their journey to develop open source models with advanced reasoning capabilities. <\/p>\n<p class=\"wp-block-paragraph\">\u201cMoving forward, we will focus on developing more efficient models that maintain strong reasoning performance and exploring advanced techniques that further enhance the models\u2019 efficiency and accuracy at test time,\u201d the team wrote in the post. \u201cStay tuned as we make progress on these exciting initiatives.\u201d<\/p>\n<\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/techcrunch.com\/2025\/01\/11\/researchers-open-source-sky-t1-a-reasoning-ai-model-that-can-be-trained-for-less-than-450\/\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>So-called reasoning AI models are becoming easier \u2014 and cheaper \u2014 to develop. On Friday, NovaSky, a team of researchers based out of UC Berkeley\u2019s<\/p>\n","protected":false},"author":1,"featured_media":89566,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[149],"tags":[],"class_list":["post-89565","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-business"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/posts\/89565","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/comments?post=89565"}],"version-history":[{"count":0,"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/posts\/89565\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/media\/89566"}],"wp:attachment":[{"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/media?parent=89565"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/categories?post=89565"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/tags?post=89565"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}