TestingCatalog AI News 🗞 – Telegram
TestingCatalog AI News 🗞
4.06K subscribers
2.69K photos
326 videos
40 files
3.78K links
Reporting AI nonsense. A future news media, driven by virtual assistants 🤖
Download Telegram
TestingCatalog AI News 🗞
A mention of Grok 4.1 was spotted in the code 👀 - grok-4-1-non-thinking-w-tool - grok-4-1-non-thinking-no-tool-1111b
BREAKING 🚨: Grok 4.1 Beta is rolling out on the Grok web! It is available as a standalone option, next to the existing Grok 4 modes.

Testing time 👀
5
BREAKING 🚨: A new Gemini 3 tooltip has been added to AI Studio. Preparation is underway.

"For Gemini 3, best results at default 1.0. Lower values may impact reasoning."
7👍1
GPT-5.1 Thinking High from OpenAI claims a top spot on ARC AGI 2 benchmark and dethrones Grok 4.

Was GPT-5.1 underhyped? 👀
👍4
This media is not supported in your browser
VIEW IN TELEGRAM
Poe introduced group chats for up to 200 people to enable collaborative work between AI and humans.

Multi-human AI system 👀
👍6👎2
Poe gets group AI chats for up to 200 people

Quora’s Poe now supports group chats with up to 200 participants, allowing simultaneous collaboration with multiple AI models and users. The feature supports over 200 models and is available across devices with synced chat history.

🗞 #ai
👍31
xAI launches Grok 4.1 across Grok and X apps

xAI's Grok 4.1 leads real-world conversational AI with top rankings in major benchmarks, improved factual accuracy, and higher emotional intelligence, now available to all users for free across platforms.

🗞 #grok
👍6🎉1
Gemini 3 time, November 18 👀
🔥158🤝5
TestingCatalog AI News 🗞
Gemini 3 time, November 18 👀
Gemini 3 Pro benchmarks are wild!

- Humanity’s Last Exam: 37.5%
- ARC-AGI-2: 31.1%

True SOTA 👀
🔥10👍3🥱2
BREAKING 🚨: Gemini 3 Pro Preview is finally rolling out on AI Studio!

It is happening 👀
🔥11👍6
BREAKING 🚨: NVIDIA and Microsoft will invest up to $10bn and $5bn respectively in Anthropic.

Claude is now also available on Microsoft Azure.
😁9👨‍💻51
BREAKING 🚨: Gemini 3 Deep Think gets 41% on HLE and 45.1% on ARC_AGI-2!

"In testing, Gemini 3 Deep Think outperforms Gemini 3 Pro’s already impressive performance on Humanity’s Last Exam and GPQA Diamond."
🔥6🤯4👎1
Gemini 3 Era has officially started 🔥

Seems like Gemini 3 will be rolling out everywhere, including Google and 3rd party products and APIs!
12🔥5🏆2
BREAKING 🚨: Most important Gemini 3-related changes on LMArena for you to know!

- Gemini is now the top 1
- Gemini crossed 1500 score
- Grok is now top 2
🔥115👎1
BREAKING 🚨: Google is launching Antigravity, a free vibe coding IDE.

- Agent model: access to Gemini 3 Pro, Claude Sonnet 4.5, GPT-OSS
- Unlimited Tab completions
- Unlimited Command requests
- Generous rate limits *

Free testing time 👀
🔥12🤔32