We collect cookies to analyze our website traffic and performance; we never collect any personal data. Cookie Policy
Accept
NEW YORK DAWN™NEW YORK DAWN™NEW YORK DAWN™
Notification Show More
Font ResizerAa
  • Home
  • Trending
  • New York
  • World
  • Politics
  • Business
    • Business
    • Economy
    • Real Estate
  • Crypto & NFTs
  • Tech
  • Lifestyle
    • Lifestyle
    • Food
    • Travel
    • Fashion
    • Art
  • Health
  • Sports
  • Entertainment
Reading: TikTok dad or mum firm ByteDance releases new open supply Seed-OSS-36B mannequin with 512K token context
Share
Font ResizerAa
NEW YORK DAWN™NEW YORK DAWN™
Search
  • Home
  • Trending
  • New York
  • World
  • Politics
  • Business
    • Business
    • Economy
    • Real Estate
  • Crypto & NFTs
  • Tech
  • Lifestyle
    • Lifestyle
    • Food
    • Travel
    • Fashion
    • Art
  • Health
  • Sports
  • Entertainment
Follow US
NEW YORK DAWN™ > Blog > Technology > TikTok dad or mum firm ByteDance releases new open supply Seed-OSS-36B mannequin with 512K token context
TikTok dad or mum firm ByteDance releases new open supply Seed-OSS-36B mannequin with 512K token context
Technology

TikTok dad or mum firm ByteDance releases new open supply Seed-OSS-36B mannequin with 512K token context

Last updated: August 21, 2025 2:01 am
Editorial Board Published August 21, 2025
Share
SHARE

TikTok is making headlines once more at this time after the White Home joined the favored social media utility — however its dad or mum firm ByteDance, a Chinese language internet big, additionally had a shock announcement up its sleeve.

The corporate’s Seed Group of AI researchers at this time launched Seed-OSS-36B on AI code sharing web site Hugging Face.

Seed-OSS-36B is new line of open supply, giant language fashions (LLM) designed for superior reasoning, and developer-focused usability with an extended token context — that’s, how a lot info the fashions can settle for as inputs after which output in a single change — than many competing LLMs from U.S. tech corporations, even leaders reminiscent of OpenAI and Anthropic.

The gathering introduces three primary variants:

AI Scaling Hits Its Limits

Energy caps, rising token prices, and inference delays are reshaping enterprise AI. Be part of our unique salon to find how prime groups are:

Turning power right into a strategic benefit

Architecting environment friendly inference for actual throughput features

Unlocking aggressive ROI with sustainable AI methods

Safe your spot to remain forward: https://bit.ly/4mwGngO

Seed-OSS-36B-Base with artificial information

Seed-OSS-36B-Base with out artificial information

Seed-OSS-36B-Instruct

In releasing each artificial and non-synthetic variations of the Seed-OSS-36B-Base mannequin, the Seed Group sought to stability sensible efficiency with analysis flexibility.

The synthetic-data variant, educated with extra instruction information, persistently delivers stronger scores on customary benchmarks and is meant as a higher-performing general-purpose possibility.

The non-synthetic mannequin, in contrast, omits these augmentations, making a cleaner basis that avoids potential bias or distortion launched by artificial instruction information.

By offering each, the workforce offers utilized customers entry to improved outcomes whereas guaranteeing researchers retain a impartial baseline for finding out post-training strategies.

In the meantime, the Seed-OSS-36B-Instruct mannequin differs in that it’s post-trained with instruction information to prioritize job execution and instruction following, relatively than serving purely as a basis mannequin.

All three fashions are launched below the Apache-2.0 license, permitting free use, modification, and redistribution by researchers and builders working for enterprises.

Which means they can be utilized to energy business purposes, inner to an organization or exterior/customer-facing, with out paying ByteDance any licensing charges or for utility programming interface (API) utilization.

This continues the summer time 2025 development of Chinese language corporations transport highly effective open supply fashions with OpenAI trying to meet up with its personal open supply gpt-oss duet launched earlier this month.

The Seed Group positions Seed-OSS for worldwide purposes, emphasizing versatility throughout reasoning, agent-like job execution, and multilingual settings.

The Seed Group, fashioned in 2023, has focused on constructing basis fashions that may serve each analysis and utilized use instances.

Design and core options

The structure behind Seed-OSS-36B combines acquainted design decisions reminiscent of causal language modeling, grouped question consideration, SwiGLU activation, RMSNorm, and RoPE positional encoding.

Every mannequin carries 36 billion parameters throughout 64 layers and helps a vocabulary of 155,000 tokens.

One of many defining options is its native long-context functionality, with a most size of 512,000 tokens, designed to course of prolonged paperwork and reasoning chains with out efficiency loss.

That’s twice the size of OpenAI’s new GPT-5 mannequin household and is roughly equal to about 1,600 pages of textual content, the size of a Christian Bible.

One other distinguishing ingredient is the introduction of a considering funds, which lets builders specify how a lot reasoning the mannequin ought to carry out earlier than delivering a solution.

It’s one thing we’ve seen from different current open supply fashions as effectively, together with Nvidia’s new Nemotron-Nano-9B-v2, additionally out there on Hugging Face.

In observe, this implies groups can tune efficiency relying on the complexity of the duty and the effectivity necessities of deployment.

Budgets are beneficial in multiples of 512 tokens, with 0 offering a direct response mode/

Aggressive efficiency on third-party benchmarks

Benchmarks printed with the discharge place Seed-OSS-36B among the many stronger giant open-source fashions. The Instruct variant, specifically, posts state-of-the-art ends in a number of areas.

Math and reasoning: Seed-OSS-36B-Instruct achieves 91.7 p.c on AIME24 and 65 on BeyondAIME, each representing open-source “state-of-the-art” (SOTA).

Coding: On LiveCodeBench v6, the Instruct mannequin data 67.4, one other SOTA rating.

Lengthy-context dealing with: On RULER at 128K context size, it reaches 94.6, marking the very best open-source end result reported.

Base mannequin efficiency: The synthetic-data Base variant delivers 65.1 on MMLU-Professional and 81.7 on MATH, each state-of-the-art ends in their classes.

The no-synthetic Base model, whereas barely behind on many measures, proves aggressive in its personal proper.

It outperforms its artificial counterpart on GPQA-D, offering researchers with a cleaner, instruction-free baseline for experimentation.

For enterprises evaluating open choices, these outcomes recommend Seed-OSS provides sturdy potential throughout math-heavy, coding, and long-context workloads whereas nonetheless offering flexibility for analysis use instances.

Entry and deployment

Past efficiency, the Seed Group highlights accessibility for builders and practitioners. The fashions will be deployed utilizing Hugging Face Transformers, with quantization help in each 4-bit and 8-bit codecs to scale back reminiscence necessities.

In addition they combine with vLLM for scalable serving, together with configuration examples and API server directions.

To decrease boundaries additional, the workforce consists of scripts for inference, immediate customization, and power integration.

For technical leaders managing small groups or working below funds constraints, these provisions are positioned to make experimentation with 36-billion-parameter fashions extra approachable.

Licensing and issues for enterprise decision-makers

With the fashions supplied below Apache-2.0, organizations can undertake them with out restrictive licensing phrases, an necessary issue for groups balancing authorized and operational issues.

For resolution makers evaluating the open-source panorama, the discharge brings three takeaways:

State-of-the-art benchmarks throughout math, coding, and long-context reasoning.

A stability between higher-performing synthetic-trained fashions and clear analysis baselines.

Accessibility options that decrease operational overhead for lean engineering groups.

By putting sturdy efficiency and versatile deployment below an open license, ByteDance’s Seed Group has added new choices for enterprises, researchers, and builders alike.

Day by day insights on enterprise use instances with VB Day by day

If you wish to impress your boss, VB Day by day has you lined. We provide the inside scoop on what corporations are doing with generative AI, from regulatory shifts to sensible deployments, so you’ll be able to share insights for max ROI.

An error occured.

You Might Also Like

AI denial is turning into an enterprise threat: Why dismissing “slop” obscures actual functionality positive factors

GAM takes purpose at “context rot”: A dual-agent reminiscence structure that outperforms long-context LLMs

The 'reality serum' for AI: OpenAI’s new technique for coaching fashions to admit their errors

Anthropic vs. OpenAI pink teaming strategies reveal completely different safety priorities for enterprise AI

Inside NetSuite’s subsequent act: Evan Goldberg on the way forward for AI-powered enterprise methods

TAGGED:512KByteDancecompanycontextmodelopenparentreleasesSeedOSS36BsourceTikTokToken
Share This Article
Facebook Twitter Email Print

Follow US

Find US on Social Medias
FacebookLike
TwitterFollow
YoutubeSubscribe
TelegramFollow
Popular News
G.O.P. Governors Cause Havoc by Busing Migrants to East Coast
Trending

G.O.P. Governors Cause Havoc by Busing Migrants to East Coast

Editorial Board August 6, 2022
Unlocking recollections of the previous with the soundtrack of a lifetime
Brooklyn Welcomes a New Heart for Previously Incarcerated Artists
How primate eye monitoring reveals new insights into the evolution of language
Greater than 1.2 million medical gadget side-effect studies not submitted inside authorized timeframe, evaluation finds

You Might Also Like

Nvidia's new AI framework trains an 8B mannequin to handle instruments like a professional
Technology

Nvidia's new AI framework trains an 8B mannequin to handle instruments like a professional

December 4, 2025
Gong examine: Gross sales groups utilizing AI generate 77% extra income per rep
Technology

Gong examine: Gross sales groups utilizing AI generate 77% extra income per rep

December 4, 2025
AWS launches Kiro powers with Stripe, Figma, and Datadog integrations for AI-assisted coding
Technology

AWS launches Kiro powers with Stripe, Figma, and Datadog integrations for AI-assisted coding

December 4, 2025
Workspace Studio goals to unravel the true agent drawback: Getting staff to make use of them
Technology

Workspace Studio goals to unravel the true agent drawback: Getting staff to make use of them

December 4, 2025

Categories

  • Health
  • Sports
  • Politics
  • Entertainment
  • Technology
  • Art
  • World

About US

New York Dawn is a proud and integral publication of the Enspirers News Group, embodying the values of journalistic integrity and excellence.
Company
  • About Us
  • Newsroom Policies & Standards
  • Diversity & Inclusion
  • Careers
  • Media & Community Relations
  • Accessibility Statement
Contact Us
  • Contact Us
  • Contact Customer Care
  • Advertise
  • Licensing & Syndication
  • Request a Correction
  • Contact the Newsroom
  • Send a News Tip
  • Report a Vulnerability
Term of Use
  • Digital Products Terms of Sale
  • Terms of Service
  • Privacy Policy
  • Cookie Settings
  • Submissions & Discussion Policy
  • RSS Terms of Service
  • Ad Choices
© 2024 New York Dawn. All Rights Reserved.
Welcome Back!

Sign in to your account

Lost your password?