We collect cookies to analyze our website traffic and performance; we never collect any personal data. Cookie Policy
Accept
NEW YORK DAWN™NEW YORK DAWN™NEW YORK DAWN™
Notification Show More
Font ResizerAa
  • Home
  • Trending
  • New York
  • World
  • Politics
  • Business
    • Business
    • Economy
    • Real Estate
  • Crypto & NFTs
  • Tech
  • Lifestyle
    • Lifestyle
    • Food
    • Travel
    • Fashion
    • Art
  • Health
  • Sports
  • Entertainment
Reading: Do new AI reasoning fashions require new approaches to prompting?
Share
Font ResizerAa
NEW YORK DAWN™NEW YORK DAWN™
Search
  • Home
  • Trending
  • New York
  • World
  • Politics
  • Business
    • Business
    • Economy
    • Real Estate
  • Crypto & NFTs
  • Tech
  • Lifestyle
    • Lifestyle
    • Food
    • Travel
    • Fashion
    • Art
  • Health
  • Sports
  • Entertainment
Follow US
NEW YORK DAWN™ > Blog > Technology > Do new AI reasoning fashions require new approaches to prompting?
Do new AI reasoning fashions require new approaches to prompting?
Technology

Do new AI reasoning fashions require new approaches to prompting?

Last updated: January 14, 2025 1:18 am
Editorial Board Published January 14, 2025
Share
SHARE

The period of reasoning AI is effectively underway.

After OpenAI as soon as once more kickstarted an AI revolution with its o1 reasoning mannequin launched again in September 2024 — which takes longer to reply questions however with the payoff of upper efficiency, particularly on advanced, multi-step issues in math and science — the industrial AI subject has been flooded with copycats and rivals.

There’s DeepSeek’s R1, Google Gemini 2 Flash Pondering, and simply as we speak, LlamaV-o1, all of which search to supply related built-in “reasoning” to OpenAI’s new o1 and upcoming o3 mannequin households. These fashions interact in “chain-of-thought” (CoT) prompting — or “self-prompting” — forcing them to replicate on their evaluation midstream, double again, examine over their very own work and in the end arrive at a greater reply than simply capturing it out of their embeddings as quick as potential, as different massive language fashions (LLMs) do.

But the excessive price of o1 and o1-mini ($15.00/1M enter tokens vs. $1.25/1M enter tokens for GPT-4o on OpenAI’s API) has brought about some to balk on the supposed efficiency positive factors. Is it actually value paying 12X as a lot as the everyday, state-of-the-art LLM?

Because it seems, there are a rising variety of converts — however the important thing to unlocking reasoning fashions’ true worth could lie within the consumer prompting them in another way.

In brief, as an alternative of the human consumer writing prompts for the o1 mannequin, they need to take into consideration writing “briefs,” or extra detailed explanations that embody plenty of context up-front about what the consumer needs the mannequin to output, who the consumer is and what format through which they need the mannequin to output data for them.

As Hylak writes on Substack:

With most fashions, we’ve been skilled to inform the mannequin how we wish it to reply us. e.g. ‘You’re an skilled software program engineer. Assume slowly and punctiliously“

That is the other of how I’ve discovered success with o1. I don’t instruct it on the how — solely the what. Then let o1 take over and plan and resolve its personal steps. That is what the autonomous reasoning is for, and may truly be a lot quicker than for those who had been to manually overview and chat because the “human in the loop”.

Hylak additionally features a nice annotated screenshot of an instance immediate for o1 that produced a helpful outcomes for an inventory of hikes:

This weblog publish was so useful, OpenAI’s personal president and co-founder Greg Brockman re-shared it on his X account with the message: “o1 is a different kind of model. Great performance requires using it in a new way relative to standard chat models.”

I attempted it myself on my recurring quest to be taught to talk fluent Spanish and right here was the consequence, for these curious. Maybe not as spectacular as Hylak’s well-constructed immediate and response, however positively exhibiting sturdy potential.

Screenshot 2025 01 13 at 6.39.12%E2%80%AFPM

Individually, even in relation to non-reasoning LLMs corresponding to Claude 3.5 Sonnet, there could also be room for normal customers to enhance their prompting to get higher, much less constrained outcomes.

As Louis Arge, former Teton.ai engineer and present creator of neuromodulation system openFUS, wrote on X, “one trick i’ve discovered is that LLMs trust their own prompts more than my prompts,” and supplied an instance of how he satisfied Claude to be “less of a coward” by first “trigger[ing] a fight” with him over its outputs.

All of which works to point out that immediate engineering stays a useful ability because the AI period wears on.

Every day insights on enterprise use circumstances with VB Every day

If you wish to impress your boss, VB Every day has you coated. We provide the inside scoop on what firms are doing with generative AI, from regulatory shifts to sensible deployments, so you possibly can share insights for optimum ROI.

An error occured.

vb daily phone

You Might Also Like

Why most enterprise AI coding pilots underperform (Trace: It's not the mannequin)

Google’s new framework helps AI brokers spend their compute and gear finances extra correctly

Ai2's new Olmo 3.1 extends reinforcement studying coaching for stronger reasoning benchmarks

Cohere’s Rerank 4 quadruples the context window over 3.5 to chop agent errors and enhance enterprise search accuracy

Nous Analysis simply launched Nomos 1, an open-source AI that ranks second on the notoriously brutal Putnam math examination

TAGGED:approachesmodelspromptingreasoningrequire
Share This Article
Facebook Twitter Email Print

Follow US

Find US on Social Medias
FacebookLike
TwitterFollow
YoutubeSubscribe
TelegramFollow
Popular News
10 books to learn in November, from Margaret Atwood’s new memoir to John Irving’s newest
Entertainment

10 books to learn in November, from Margaret Atwood’s new memoir to John Irving’s newest

Editorial Board November 1, 2025
There is a new vaccine for pneumococcal illness in Australia. This is what to know
Pacers’ Tyrese Haliburton has some Reggie Miller in him going into ECF vs. Knicks
Assessment: Karen Russell’s Mud Bowl ‘Antidote’ is much more bold than ‘Swamplandia!’
Fed’s Kashkari says officials are ‘a long way’ from backing off inflation fight.

You Might Also Like

GPT-5.2 first impressions: a strong replace, particularly for enterprise duties and workflows
Technology

GPT-5.2 first impressions: a strong replace, particularly for enterprise duties and workflows

December 12, 2025
OpenAI's GPT-5.2 is right here: what enterprises must know
Technology

OpenAI's GPT-5.2 is right here: what enterprises must know

December 11, 2025
Marble enters the race to convey AI to tax work, armed with  million and a free analysis device
Technology

Marble enters the race to convey AI to tax work, armed with $9 million and a free analysis device

December 11, 2025
Making a glass field: How NetSuite is engineering belief into AI
Technology

Making a glass field: How NetSuite is engineering belief into AI

December 11, 2025

Categories

  • Health
  • Sports
  • Politics
  • Entertainment
  • Technology
  • Art
  • World

About US

New York Dawn is a proud and integral publication of the Enspirers News Group, embodying the values of journalistic integrity and excellence.
Company
  • About Us
  • Newsroom Policies & Standards
  • Diversity & Inclusion
  • Careers
  • Media & Community Relations
  • Accessibility Statement
Contact Us
  • Contact Us
  • Contact Customer Care
  • Advertise
  • Licensing & Syndication
  • Request a Correction
  • Contact the Newsroom
  • Send a News Tip
  • Report a Vulnerability
Term of Use
  • Digital Products Terms of Sale
  • Terms of Service
  • Privacy Policy
  • Cookie Settings
  • Submissions & Discussion Policy
  • RSS Terms of Service
  • Ad Choices
© 2024 New York Dawn. All Rights Reserved.
Welcome Back!

Sign in to your account

Lost your password?