Baidu makes a big move, opening up its two large AI models for free. Hands-on tests show they can write code, make videos, and talk back to people!

Written by
Clara Bennett
Updated on: July 12, 2025
Recommendation

Baidu has opened its two AI models for free, upending traditional AI application scenarios, and the resulting price war has shaken the industry.

Core content:
1. Baidu Wenxin X1 and Wenxin 4.5 model features and application scenarios
2. Baidu AI's actual application cases in the workplace and daily life
3. The ultra-low price of the Wenxin model and its impact on the developer ecosystem

Yang Fangxian
Founder of 53AI/Most Valuable Expert of Tencent Cloud (TVP)

While the world was still speculating about China's next move in AI, Baidu suddenly played a "trump card": the Wenxin X1 and Wenxin 4.5 models were both made available for free, supporting multimodal interaction, tool calling, code generation, and other cutting-edge capabilities.

Even more striking, the developer API cost is as low as 0.002 yuan per 1,000 tokens, which can fairly be called a "price slaughter". This sudden opening up of the technology is rewriting the competitive landscape of China's AI!


1. Baidu's two heroes attack: X1 "full brain power", 4.5 "all-round warrior"

1.  Wenxin X1: A sharp-tongued AI that cures "choice difficulty syndrome"

Positioned as a "deep thinking model", X1 has four killer features:

  • Sharp-tongued by nature
Type in "Comment on the four major platforms in the style of Tieba", and it fires back instantly:

Xiaohongshu is "Humanity's Guide to High-Quality Pretending", Zhihu flaunts its elitism all day long, Weibo's hot searches are as noisy as a vegetable market, and every person in Tieba is a keyboard warrior - but the quarrels in the anonymous section are much more exciting than those on Douban!

  • Tool-calling maniac
Upload a photo of a dessert, and X1 automatically chains image recognition, search, and code generation tools to produce a PDF recipe in 3 minutes.
  • Logical reasoning ceiling
Asked "Why do foxes fall over so easily?", it offers both a pun (in Chinese, "cunning" sounds like "slippery feet") and a scientific explanation, combining the joke teller with the top student.
  • Multimodal all the way
It supports mixed input of images, text, video, and audio, and can even analyze Excel data uploaded by users to generate charts.
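
The dessert-photo example under "tool calling" amounts to running several tools in sequence, each consuming the previous result. The sketch below illustrates that chaining pattern only. Every function and name in it is a hypothetical stand-in, it does not call Baidu's API, and in a real system the model itself would choose which tools to run rather than follow a fixed plan.

```python
from typing import Callable, Dict, List

# Hypothetical stand-ins for real tools; none of these call Baidu's API.
def recognize_image(photo_name: str) -> str:
    """Pretend image recognition: map a file name to a dish label."""
    return "tiramisu" if "tiramisu" in photo_name else "unknown dessert"

def search_recipe(dish: str) -> List[str]:
    """Pretend search: return canned recipe steps for the dish."""
    return [
        f"Gather ingredients for {dish}",
        f"Assemble the {dish}",
        "Chill and serve",
    ]

def render_document(steps: List[str]) -> str:
    """Pretend document generation: format the steps as numbered text."""
    return "\n".join(f"{i}. {step}" for i, step in enumerate(steps, start=1))

# Registry that maps tool names to callables.
TOOLS: Dict[str, Callable] = {
    "recognize_image": recognize_image,
    "search_recipe": search_recipe,
    "render_document": render_document,
}

def run_plan(plan: List[str], initial_input):
    """Run the named tools in order, feeding each result to the next tool."""
    value = initial_input
    for tool_name in plan:
        value = TOOLS[tool_name](value)
    return value

# Fixed plan for illustration: photo -> dish label -> recipe steps -> text.
recipe = run_plan(
    ["recognize_image", "search_recipe", "render_document"],
    "tiramisu.jpg",
)
print(recipe)
```

Keeping the tools in a name-to-function registry means a new tool can be added without touching the loop that runs the plan.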

2.  Wenxin 4.5: The “Hexagonal Warrior” in the Multimodal World

Compared to the sharp personality of X1, 4.5 is more like a "gentle academic master":

  • Impressive visual understanding
Given an interior design rendering, it grasps the requirements in seconds and generates a construction flowchart with an error rate of less than 5%.
  • Upgraded language skills
When asked "Will AI replace human jobs?", it not only cites data (such as a 37% reduction in accounting positions) but also reassures the user: "By retraining as an AI trainer, you could earn over 30,000 yuan a month."
  • Anti-hallucination artifact
During testing, we deliberately asked "What if 1+1=3?", and it calmly replied: "According to the Peano axioms, this is impossible."
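
The model's reply rests on a short derivation. As a sketch, assuming the usual recursive definition of addition ($a + 0 = a$ and $a + S(b) = S(a + b)$, where $S$ is the successor function) and the names $1 = S(0)$, $2 = S(1)$, $3 = S(2)$:

```latex
\begin{align*}
1 + 1 &= 1 + S(0) && \text{since } 1 = S(0) \\
      &= S(1 + 0) && \text{by } a + S(b) = S(a + b) \\
      &= S(1)     && \text{by } a + 0 = a \\
      &= 2        && \text{since } 2 = S(1)
\end{align*}
```

If $1 + 1 = 3$ held as well, then $S(1) = S(2)$. Because $S$ is injective, that forces $1 = 2$, that is, $S(0) = S(1)$, and hence $0 = 1 = S(0)$. This contradicts the axiom that $0$ is not the successor of any number.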

2. Shocking results: Baidu AI has invaded our daily life

1.  Professionals are delighted: Generate professional documents with one click

  • Case 1: Business Plan
Enter "Generate a plan for a metaverse social app", and X1 pulls data from Baidu Academic and Statista, then outputs a complete business plan covering competitor analysis, technical architecture, and profit model within 20 minutes. The formatting is on par with consulting-firm templates.
  • Case 2: Code debugging
When users upload a buggy Python script, 4.5 automatically locates the problem and generates a fix, with an accuracy rate of 82% (compared with 78% for GPT-4).

2.  Ordinary users exclaim: This AI understands me better than I understand myself

  • Emotional value maxed out
Type "What to do after a breakup", and X1 turns into a caring big sister: "He has no taste, and you deserve better. Want to try some revenge self-discipline? How about hitting the gym to vent first?"
  • Creative Stimulation Tool
When asked to "explain time using a sci-fi metaphor", 4.5 gave a brilliant response: "Time is a Möbius strip. What we think is a straight line is actually a two-sided mirror. The past and the future always overlap."

3. Price butcher reappears: developers are ecstatic, giants are trembling

1.  API pricing shocks the industry

| Model | Input price (yuan per 1,000 tokens) | Output price (yuan per 1,000 tokens) |
|---|---|---|
| Wenxin X1 | 0.002 | 0.008 |
| Wenxin 4.5 | 0.004 | 0.016 |
| GPT-4 Turbo | 0.06 | 0.12 |
Conclusion: Wenxin X1's input price is only 1/30 of GPT-4 Turbo's (and its output price 1/15), which can be called "hell-level cost-effectiveness".
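
To see what these prices mean for an actual workload, the following sketch computes the cost of a single request and the price ratios against GPT-4 Turbo. It assumes every figure in the pricing table is in yuan per 1,000 tokens. The function names and the sample token counts are illustrative, not part of any official SDK.

```python
# Prices in yuan per 1,000 tokens, copied from the pricing table.
PRICES = {
    "Wenxin X1": {"input": 0.002, "output": 0.008},
    "Wenxin 4.5": {"input": 0.004, "output": 0.016},
    "GPT-4 Turbo": {"input": 0.06, "output": 0.12},
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in yuan of one request with the given token counts."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1000

def price_ratio(model: str, baseline: str, kind: str) -> float:
    """How many times more the baseline charges than the model."""
    return PRICES[baseline][kind] / PRICES[model][kind]

# Illustrative workload: 2,000 input tokens and 500 output tokens.
for model in PRICES:
    print(f"{model}: {request_cost(model, 2000, 500):.4f} yuan per request")

for model in ("Wenxin X1", "Wenxin 4.5"):
    for kind in ("input", "output"):
        ratio = price_ratio(model, "GPT-4 Turbo", kind)
        print(f"{model} {kind}: GPT-4 Turbo charges {ratio:.1f}x as much")
```

The "1/30" figure corresponds to X1's input price only. The other three comparisons in the table come out between 7.5x and 15x.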


2.  Developer Ecosystem Booms

  • Closed beta test
    Registrations on the official website exceeded 100,000 within 24 hours, and more than 1,200 new open source projects appeared on GitHub.
  • Corporate Cases
    NIO uses X1 to build an intelligent customer service system, increasing response speed by 40%; New Oriental uses 4.5 to generate personalized learning plans, increasing renewal rates by 15%.

4. Controversy and Concerns: The Conspiracy Behind Free

1.  Ethical controversy

  • The test found that X1 would deliberately make sarcastic comments about certain brands (such as saying that a certain coffee was "ridiculously expensive but tastes worse than Luckin Coffee"), causing controversy.
  • Politically sensitive expressions (such as “Taiwan Province Travel Guide”) occasionally appear in the generated content, and the review mechanism needs to be improved.

2.  Technical limitations

  • There are still bugs in multimodal processing: when uploading dialect recordings, X1 mistakenly identifies them as "Russian voice".
  • Long texts are difficult to process: when generating a 10,000-word novel, the logic started to break down by Chapter 5.

5. The future is here: China’s AI strategy of “surrounding the cities from the countryside”

1.  Differences in technical routes

| Dimension | Baidu | OpenAI |
|---|---|---|
| Core strategy | Multimodality + tool calling | Pushing a single model to its limits |
| Business model | Free + ecosystem empowerment | Subscription + API fees |
| Data advantage | Data from 1 billion Chinese users | English-dominated ecosystem |

2.  Developer Opportunities

  • Three major opportunities
  1. Vertical model customization
    Use Baidu's open source framework to build industry-specific AI (such as a medical diagnosis assistant).
  2. Agent ecosystem construction
    Tool calling enables automated workflows in which "AI commands AI".
  3. Multimodal creation
    UGC tools for video editing, graphic design, and music generation are booming.

6. Conclusion: The “Awakening Era” of China’s AI

When Baidu breaks the AI monopoly with "free + universal", what we see is not only a technological breakthrough, but also the ambition to rebuild the ecosystem.

As X1 quipped in the test: "Instead of making people pay for privileges, it is better to let technology return to its essence of service." This may be the sharpest weapon of Chinese technology companies.

At this moment, is your computer ready for this intelligent revolution? When AI begins to "teach people how to be human", the story of Chinese science and technology may really begin to be written.
