
Agentic AI: The next frontier in artificial intelligence

The future of Agentic AI will depend on balancing innovation with responsibility. By addressing ethical considerations and ...

14 hours ago, on MSN

AI took giant strides in 2024, as AGI comes into view

Artificial intelligence enjoyed a banner year in 2024. The frontier technology captured awards, corralled investors, charmed ...

devdiscourse, 4 days ago

AI's strategic facade: How LLMs master the art of deception?

The study on alignment faking offers critical insights into the nuanced behaviour of advanced AI systems. By uncovering how ...

5 days ago

Buyer beware: OpenAI’s o1 reasoning model is an entirely different beast

Editor’s note: This guest commentary by Anthony Diamond of Seattle-based Pioneer Square Labs originally appeared on PSL’s ...

5 days ago

Alibaba announces advanced experimental visual reasoning QVQ-72B AI model

Alibaba Cloud, the cloud computing arm of China Alibaba Group Ltd., has unveiled QVQ-72B-Preview, an experimental open-source ...

5 days ago

Microsoft: First In Line For AGI

Investors gain insight into OpenAI's "o" series reasoning models and Microsoft's advantage, but face competition from Amazon ...

techxplore, 6 days ago

Language AIs in 2024: Size, guardrails and steps toward AI agents

I research the intersection of artificial intelligence, natural language processing and human reasoning as the director of ...

decrypt, 8 days ago

OpenAI's o3 Hits Human-Level Scores, But Is It Good Enough to Be AGI?

OpenAI’s new o3 AI model achieved an unprecedented score on the "think like a human" benchmark, sparking a fierce debate over AGI or artificial general intelligence.

11 days ago

OpenAI teases its most powerful reasoning model named o3

OpenAI just introduced us to its most powerful reasoning model named o3. However, we don't know when the company will release ...

martech, 11 days ago

4 key features in Salesforce’s Agentforce 2.0

The new version of Agentforce adds a pre-built skills library, Slack integration, a more powerful reasoning engine and an agent testing center.

marktechpost, 18 days ago

Alibaba Qwen Researchers Introduced ProcessBench: A New AI Benchmark for Measuring the ...

Qwen Team and Alibaba Inc. researchers introduce PROCESSBENCH, a robust benchmark designed to measure language models’ capabilities in identifying erroneous steps within mathematical reasoning. This ...

Seeking Alpha, 19 days ago

Meta launches AI model Motivo for humanoid agents; mind reasoning program for machine learning

... is to decouple reasoning from language representation, and was inspired by how humans can plan high-level thoughts to communicate. As an example, the company said that when giving a presentation ...
