[News] OpenAI strengthens its presence in Korea with Kakao partnership
OpenAI and Kakao have formed a strategic partnership to integrate OpenAI's technology into Kakao's services through three main projects: developing a Korean language AI agent, incorporating OpenAI's technology across Kakao's ecosystem, and implementing ChatGPT Enterprise for Kakao's workforce.
https://dataphoenix.info/openai-strengthens-its-presence-in-korea-with-kakao-partnership/
OpenAI and Kakao have formed a strategic partnership to integrate OpenAI's technology into Kakao's services through three main projects: developing a Korean language AI agent, incorporating OpenAI's technology across Kakao's ecosystem, and implementing ChatGPT Enterprise for Kakao's workforce.
https://dataphoenix.info/openai-strengthens-its-presence-in-korea-with-kakao-partnership/
[News] Mistral revamped its AI assistant Le Chat and introduced new paid service tiers
Mistral is expanding its AI assistant Le Chat with new iOS and Android apps, a Pro subnoscription tier, and enterprise features while competing with established players through fast performance and unique capabilities like on-premises deployment.
https://dataphoenix.info/mistral-revamped-its-ai-assistant-le-chat-and-introduced-new-paid-service-tiers/
Mistral is expanding its AI assistant Le Chat with new iOS and Android apps, a Pro subnoscription tier, and enterprise features while competing with established players through fast performance and unique capabilities like on-premises deployment.
https://dataphoenix.info/mistral-revamped-its-ai-assistant-le-chat-and-introduced-new-paid-service-tiers/
[News] AlphaGeometry2 has surpassed IMO gold medalist performance in geometry problems
AlphaGeometry2 (AG2) is an enhanced neuro-symbolic system for solving geometry problems, which Google DeepMind claims outperforms IMO gold medalists. Although AG2 has some limitations, it undoubtedly has taken DeepMind one step closer to "solving geometry".
https://dataphoenix.info/alphageometry2-has-surpassed-imo-gold-medalist-performance-in-geometry-problems/
AlphaGeometry2 (AG2) is an enhanced neuro-symbolic system for solving geometry problems, which Google DeepMind claims outperforms IMO gold medalists. Although AG2 has some limitations, it undoubtedly has taken DeepMind one step closer to "solving geometry".
https://dataphoenix.info/alphageometry2-has-surpassed-imo-gold-medalist-performance-in-geometry-problems/
[News] Researchers looking for benchmarks relevant to average AI users turn to the NPR Sunday Puzzle
A new benchmark based on the NPR's Sunday Puzzle riddles aims to test LLMs' general reasoning skills. The findings are remarkable: reasoning models like OpenAI's o1 do best at the benchmark, and some replicate behaviors such as "giving up" or showing "frustration" when stuck on difficult problems.
https://dataphoenix.info/researchers-looking-for-benchmarks-relevant-to-average-ai-users-turn-to-the-npr-sunday-puzzle/
A new benchmark based on the NPR's Sunday Puzzle riddles aims to test LLMs' general reasoning skills. The findings are remarkable: reasoning models like OpenAI's o1 do best at the benchmark, and some replicate behaviors such as "giving up" or showing "frustration" when stuck on difficult problems.
https://dataphoenix.info/researchers-looking-for-benchmarks-relevant-to-average-ai-users-turn-to-the-npr-sunday-puzzle/
[News] AI Highlights Review: February 1–10
DeepSeek released Janus-Pro, a new iteration of its image generation models; OpenAI strikes strategic partnerships in Asia; Mistral AI has enhanced its Le Chat assistant with new features; Eleven Labs raised a $180M Series C; Tülu 3 showcases Ai2's open source post-training recipe; and more.
https://dataphoenix.info/ai-highlights-review-february-1-10/
DeepSeek released Janus-Pro, a new iteration of its image generation models; OpenAI strikes strategic partnerships in Asia; Mistral AI has enhanced its Le Chat assistant with new features; Eleven Labs raised a $180M Series C; Tülu 3 showcases Ai2's open source post-training recipe; and more.
https://dataphoenix.info/ai-highlights-review-february-1-10/
[News] Google One AI Premium individual subscribers can now access NotebookLM Plus
Google has integrated NotebookLM Plus, the wildly popular AI-powered notebook and research assistant, into its Google One AI Premium subnoscription plan at no additional cost. In addition, the company also introduced a 50% discount on Google One AI Premium for US-based students under 18.
https://dataphoenix.info/google-one-ai-premium-individual-subscribers-can-now-access-notebooklm-plus/
Google has integrated NotebookLM Plus, the wildly popular AI-powered notebook and research assistant, into its Google One AI Premium subnoscription plan at no additional cost. In addition, the company also introduced a 50% discount on Google One AI Premium for US-based students under 18.
https://dataphoenix.info/google-one-ai-premium-individual-subscribers-can-now-access-notebooklm-plus/
[News] LangChain kicked off a series of experiments to study AI agent performance
A recent experiment by LangChain tests the performance of five leading LLMs by Anthropic, OpenAI, and Meta within single-agent architectures for single-domain tasks. The findings reveal that performance degrades as the domains to manage, context complexity, and available tools increase.
https://dataphoenix.info/langchain-kicked-off-a-series-of-experiments-to-study-ai-agent-performance-2/
A recent experiment by LangChain tests the performance of five leading LLMs by Anthropic, OpenAI, and Meta within single-agent architectures for single-domain tasks. The findings reveal that performance degrades as the domains to manage, context complexity, and available tools increase.
https://dataphoenix.info/langchain-kicked-off-a-series-of-experiments-to-study-ai-agent-performance-2/
[News] A BBC study shows AI assistants distort current affairs even when given a source
BBC research reveals that leading AI tools including ChatGPT, Perplexity, Microsoft Copilot, and Google Gemini produced significant factual errors in about 20% of responses and altered or fabricated quotes in over 10% of cases when answering news questions using BBC articles as sources.
https://dataphoenix.info/a-bbc-study-shows-ai-assistants-distort-current-affairs-even-when-given-a-trustable-source/
BBC research reveals that leading AI tools including ChatGPT, Perplexity, Microsoft Copilot, and Google Gemini produced significant factual errors in about 20% of responses and altered or fabricated quotes in over 10% of cases when answering news questions using BBC articles as sources.
https://dataphoenix.info/a-bbc-study-shows-ai-assistants-distort-current-affairs-even-when-given-a-trustable-source/
[News] Former DeepMind scientist Simon Kohl launches Latent Labs to transform protein design
Founded by Former DeepMind scientist Simon Kohl, Latent Labs has emerged from stealth with $50M in funding to develop AI models for protein design. Latent Labs aims to accelerate drug discovery through partnerships with biotech and pharmaceutical companies.
https://dataphoenix.info/former-deepmind-scientist-simon-kohl-launches-latent-labs-to-transform-protein-design/
Founded by Former DeepMind scientist Simon Kohl, Latent Labs has emerged from stealth with $50M in funding to develop AI models for protein design. Latent Labs aims to accelerate drug discovery through partnerships with biotech and pharmaceutical companies.
https://dataphoenix.info/former-deepmind-scientist-simon-kohl-launches-latent-labs-to-transform-protein-design/
[News] Arize AI raised $70M to bring evaluation and observability into the 'agentic AI' era
Arize AI has raised $70 million in Series C funding to expand its AI evaluation and observability platform. Arize's platform helps companies ensure their AI systems work reliably in real-world applications by empowering them with testing, debugging, and optimization tools.
https://dataphoenix.info/arize-ai-raised-a-70m-series-c-to-bring-ai-evaluation-and-obrevability-to-the/
Arize AI has raised $70 million in Series C funding to expand its AI evaluation and observability platform. Arize's platform helps companies ensure their AI systems work reliably in real-world applications by empowering them with testing, debugging, and optimization tools.
https://dataphoenix.info/arize-ai-raised-a-70m-series-c-to-bring-ai-evaluation-and-obrevability-to-the/
[News] Former OpenAI CTO Mira Murati Launches Thinking Machines Lab to Make AI More Accessible
Former OpenAI CTO Mira Murati has launched Thinking Machines Lab, a new AI startup focused on making artificial intelligence more customizable and accessible while advancing AI capabilities, with a team of prominent AI researchers from OpenAI and other leading companies.
https://dataphoenix.info/former-openai-cto-mira-murati-launches-thinking-machines-lab-to-make-ai-more-accessible/
Former OpenAI CTO Mira Murati has launched Thinking Machines Lab, a new AI startup focused on making artificial intelligence more customizable and accessible while advancing AI capabilities, with a team of prominent AI researchers from OpenAI and other leading companies.
https://dataphoenix.info/former-openai-cto-mira-murati-launches-thinking-machines-lab-to-make-ai-more-accessible/
[News] Lingo.dev Raises $4.2M to Automate App Localization Using AI
Lingo.dev, a startup offering AI-powered UI localization automation for developers, has secured $4.2 million in seed funding to provide comprehensive app, database, and website translation services beyond simple text conversion.
https://dataphoenix.info/lingo-dev-raises-4-2m-to-automate-app-localization-using-ai/
Lingo.dev, a startup offering AI-powered UI localization automation for developers, has secured $4.2 million in seed funding to provide comprehensive app, database, and website translation services beyond simple text conversion.
https://dataphoenix.info/lingo-dev-raises-4-2m-to-automate-app-localization-using-ai/
[News] Mistral AI released Mistral Saba, an LLM specializing in Middle East and South Asia languages
Mistral AI has launched Mistral Saba, a language model optimized for Arabic that works well with Indian-origin languages, positioning the startup to expand its presence in Middle Eastern and South Asian markets.
https://dataphoenix.info/mistral-ai-released-mistral-saba-an-llm-specialized-in-languages-from-the-middle-east-and-south-asia/
Mistral AI has launched Mistral Saba, a language model optimized for Arabic that works well with Indian-origin languages, positioning the startup to expand its presence in Middle Eastern and South Asian markets.
https://dataphoenix.info/mistral-ai-released-mistral-saba-an-llm-specialized-in-languages-from-the-middle-east-and-south-asia/
🔥1
[News] Meta's first-ever AI developer conference, LlamaCon, will take place on April 29
Meta announced LlamaCon, its first developer conference for its generative AI ecosystem on April 29. The LlamaCon announcement follows a year of rapid Llama model releases that gained 650M+ downloads and 85,000+ derivatives.
https://dataphoenix.info/metas-first-ever-ai-developer-conference-llamacon-will-take-place-on-april-29/
Meta announced LlamaCon, its first developer conference for its generative AI ecosystem on April 29. The LlamaCon announcement follows a year of rapid Llama model releases that gained 650M+ downloads and 85,000+ derivatives.
https://dataphoenix.info/metas-first-ever-ai-developer-conference-llamacon-will-take-place-on-april-29/
[News] The recently launched Grok 3 is already a subject of controversy
xAI launched Grok 3 and Grok 3 mini with reasoning capabilities and a DeepSearch agent. Shortly after their release, the models sparked controversy when xAI compared their consensus-based math scores to competitors' single-attempt metrics, highlighting ongoing issues with benchmark comparisons.
https://dataphoenix.info/the-recently-launched-grok-3-is-already-a-subject-of-controversy/
xAI launched Grok 3 and Grok 3 mini with reasoning capabilities and a DeepSearch agent. Shortly after their release, the models sparked controversy when xAI compared their consensus-based math scores to competitors' single-attempt metrics, highlighting ongoing issues with benchmark comparisons.
https://dataphoenix.info/the-recently-launched-grok-3-is-already-a-subject-of-controversy/
[News] Anthropic raises the stakes in the generative AI race with Claude 3.7 Sonnet and Claude Code
Anthropic launches Claude 3.7 Sonnet, the first hybrid reasoning AI that offers both quick responses and visible step-by-step thinking. It excels at coding tasks and comes with Claude Code, a new terminal tool for developers.
https://dataphoenix.info/anthropic-raises-the-stakes-in-the-generative-ai-race-with-claude-3-7-sonnet-and-claude-code/
Anthropic launches Claude 3.7 Sonnet, the first hybrid reasoning AI that offers both quick responses and visible step-by-step thinking. It excels at coding tasks and comes with Claude Code, a new terminal tool for developers.
https://dataphoenix.info/anthropic-raises-the-stakes-in-the-generative-ai-race-with-claude-3-7-sonnet-and-claude-code/
[News] OpenAI releases the long-awaited GPT-4.5/Orion, its last non-chain-of-thought model
OpenAI has released GPT-4.5, its largest AI model to date, featuring improved knowledge and emotional intelligence. But while GPT-4.5 is a clear improvement over GPT-4o, it falls behind the o series in benchmark evaluations and comes with dramatically higher costs.
https://dataphoenix.info/openai-releases-the-long-awaited-gpt-4-5-orion-its-last-non-chain-of-thought-model-2/
OpenAI has released GPT-4.5, its largest AI model to date, featuring improved knowledge and emotional intelligence. But while GPT-4.5 is a clear improvement over GPT-4o, it falls behind the o series in benchmark evaluations and comes with dramatically higher costs.
https://dataphoenix.info/openai-releases-the-long-awaited-gpt-4-5-orion-its-last-non-chain-of-thought-model-2/
[News] Anthropic has raised an additional $3.5B in a Series E round
Anthropic announced earlier this week that it has secured $3.5B in a Series E funding round led by Lightspeed Venture Partners, with new and existing investors participating. This Series E announcement follows the Claude 3.7 Sonnet and Claude Code launches.
https://dataphoenix.info/anthropic-has-raised-an-additional-3-5b-in-a-series-e-round/
Anthropic announced earlier this week that it has secured $3.5B in a Series E funding round led by Lightspeed Venture Partners, with new and existing investors participating. This Series E announcement follows the Claude 3.7 Sonnet and Claude Code launches.
https://dataphoenix.info/anthropic-has-raised-an-additional-3-5b-in-a-series-e-round/
[News] Cohere's open research lab released Aya Vision, a "best-in-class" open-weights vision model
Cohere for AI released the Aya Vision models (8B/32B), which support 23 languages, perform remarkably well in image captioning, visual Q&A, and translations, and outperform competitors in benchmarks. Aya Vision is available via Cohere Playground, WhatsApp, Kaggle, and Hugging Face.
https://dataphoenix.info/coheres-open-research-lab-released-aya-vision-a-best-in-class-open-weights-vision-model/
Cohere for AI released the Aya Vision models (8B/32B), which support 23 languages, perform remarkably well in image captioning, visual Q&A, and translations, and outperform competitors in benchmarks. Aya Vision is available via Cohere Playground, WhatsApp, Kaggle, and Hugging Face.
https://dataphoenix.info/coheres-open-research-lab-released-aya-vision-a-best-in-class-open-weights-vision-model/
🔥2
[News] Scrunch AI has raised $4M in a seed round to optimize how businesses appear in AI search
Scrunch AI recently exited beta with $4M in seed funding from Mayfield to help businesses maintain visibility in AI-generated search results. Scrunch AI has seen promising adoption, having already secured 25 enterprise customers, including Lenovo and Crunchbase.
https://dataphoenix.info/scrunch-ai-has-raised-4m-in-a-seed-round-to-optimize-how-businesses-appear-in-ai-search/
Scrunch AI recently exited beta with $4M in seed funding from Mayfield to help businesses maintain visibility in AI-generated search results. Scrunch AI has seen promising adoption, having already secured 25 enterprise customers, including Lenovo and Crunchbase.
https://dataphoenix.info/scrunch-ai-has-raised-4m-in-a-seed-round-to-optimize-how-businesses-appear-in-ai-search/
[News] Google's new 'AI mode' adds more AI to Search and enables users to ask complex questions
Google has launched an AI mode for Search: an experimental product that leverages AI to handle complex queries typically addressed by performing multiple traditional web searches. In parallel, the company also announced it expanded access to its AI Overviews, now powered by Gemini 2.0.
https://dataphoenix.info/googles-new-ai-mode-lets-users-extends-ai-features-in-search/
Google has launched an AI mode for Search: an experimental product that leverages AI to handle complex queries typically addressed by performing multiple traditional web searches. In parallel, the company also announced it expanded access to its AI Overviews, now powered by Gemini 2.0.
https://dataphoenix.info/googles-new-ai-mode-lets-users-extends-ai-features-in-search/