TestingCatalog AI News 🗞 – Telegram
Reporting AI nonsense. A future news media, driven by virtual assistants 🤖
Google to enable research automation on Gemini Enterprise

Google is developing a multi-agent system in Gemini for Enterprise that generates and ranks up to 100 ideas using a 40-minute tournament-style evaluation, targeting advanced enterprise and research…
BREAKING 🚨: Google is working on multi-agent systems to help you refine ideas with tournament-like evaluation. Each run takes around 40 minutes and brings you 100 detailed ideas on a given research topic.

Two new multi-agent systems are being developed for Gemini Enterprise:
- Idea Generation - "Create a multi-agent innovation session"
- Co-Scientist - "Drive novel scientific discovery with Co-Scientist"

Co-Scientist 3-step workflow 👀
- Tell Co-Scientist what you plan to research, point it to relevant data, and set your evaluation criteria.
- A team of agents will generate ideas on your topic using their available data.
- The agents will evaluate the ideas against your criteria and rank them, tournament-style.
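
To make the "tournament-style" ranking step concrete, here is a minimal sketch of one way such a ranking could work: a round-robin where every idea is compared against every other idea and ideas are ordered by number of wins. This is an illustration only, not Google's implementation. The names `rank_ideas`, `judge`, and the `criteria_scores` values are hypothetical, and a real system would presumably use an agent as the judge rather than a lookup table.

```python
from itertools import combinations


def rank_ideas(ideas, judge):
    """Rank ideas by round-robin pairwise comparison (hypothetical sketch).

    `judge(a, b)` must return whichever of the two ideas it prefers.
    """
    wins = {idea: 0 for idea in ideas}
    # Every idea meets every other idea exactly once.
    for a, b in combinations(ideas, 2):
        wins[judge(a, b)] += 1
    # Most wins first; ties keep their original input order (stable sort).
    return sorted(ideas, key=lambda idea: wins[idea], reverse=True)


# Stand-in for the evaluation criteria: made-up scores per idea.
criteria_scores = {"idea A": 0.2, "idea B": 0.9, "idea C": 0.5, "idea D": 0.7}


def judge(a, b):
    # Prefer the idea with the higher (assumed) criteria score.
    return a if criteria_scores[a] >= criteria_scores[b] else b


print(rank_ideas(list(criteria_scores), judge))
# ['idea B', 'idea D', 'idea C', 'idea A']
```

With 100 ideas a full round-robin means 4,950 comparisons, which may be part of why a run is reported to take around 40 minutes, though that link is speculation.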

Google is not only automating research but also preparing a product that will enable others to do so.

This is the next level 🤯
Google works on multi-agent builder for Gemini Enterprise

Gemini Enterprise is expanding its automation capabilities with broader platform integrations, in-product agent evaluations, and multi-agent workflows, supporting reuse and orchestration across technical and operational teams.

🗞 #gemini
AI Studio is about to get a dedicated mobile app at the beginning of next year.

Easy top 1 in App Stores? 👀
Google tests Agentspace Live feature for Gemini Enterprise

Google is testing a Gemini Live mode in Enterprise, hinting at voice-based collaboration via Agentspace. Updates also include a personalized "For You" section and memory management tools, aimed at boosting team productivity in business settings.

🗞 #gemini
Kimi K2 Thinking is now available on Perplexity to all users! Only the Thinking version is currently available.
A mention of Grok 4.1 was spotted in the code 👀

- grok-4-1-non-thinking-w-tool
- grok-4-1-non-thinking-no-tool-1111b
BREAKING 🚨: Grok 4.1 Beta is rolling out on the Grok web! It is available as a standalone option, next to the existing Grok 4 modes.

Testing time 👀
BREAKING 🚨: A new Gemini 3 tooltip has been added to AI Studio. Preparation is underway.

"For Gemini 3, best results at default 1.0. Lower values may impact reasoning."
GPT-5.1 Thinking High from OpenAI claims a top spot on the ARC-AGI-2 benchmark and dethrones Grok 4.

Was GPT-5.1 underhyped? 👀
Poe introduced group chats for up to 200 people to enable collaborative work between AI and humans.

Multi-human AI system 👀
Poe gets group AI chats for up to 200 people

Quora’s Poe now supports group chats with up to 200 participants, allowing simultaneous collaboration with multiple AI models and users. The feature supports over 200 models and is available across devices with synced chat history.

🗞 #ai
xAI launches Grok 4.1 across Grok and X apps

xAI's Grok 4.1 leads real-world conversational AI with top rankings in major benchmarks, improved factual accuracy, and higher emotional intelligence, now available to all users for free across platforms.

🗞 #grok
Gemini 3 time, November 18 👀
Gemini 3 Pro benchmarks are wild!

- Humanity’s Last Exam: 37.5%
- ARC-AGI-2: 31.1%

True SOTA 👀