Illinois and New York tighten AI rules as capital concentrates

Policy Highlights5

Illinois passes comprehensive AI safety regulation with support from OpenAI, Anthropic - MSN

Illinois enacted comprehensive AI safety regulation, reported as passing with support from OpenAI and Anthropic. Specific bill number and statutory details are not provided in the record.

POLICY HIGHLIGHTS

N.Y. lawmakers pass bill aimed at protecting minors from AI chatbots - Spectrum News NY1

New York lawmakers passed a bill intended to protect minors from AI chatbots. The record does not provide the bill number or the enacted text details.

POLICY HIGHLIGHTS

Saudi Arabia Publishes National Data Monetization Policy Details - Let's Data Science

Let's Data Science reports that Saudi Arabia published details of a national data monetization policy. The record does not provide the primary legal instrument text or an official publication date.

POLICY HIGHLIGHTS

BoE, FCA and HMT joint statement on AI frontier models and cyber resilience - JD Supra

A joint statement from the Bank of England, the Financial Conduct Authority, and HM Treasury on AI frontier models and cyber resilience. The record appears to be an official policy statement rather than a binding regulation.

POLICY HIGHLIGHTS

Cyberspace Administration of China: Strengthen Online Ecosystem Governance, Regulate and Manage Minors' Internet Use - aastocks.com

A report attributes to the Cyberspace Administration of China (CAC) an action to strengthen online ecosystem governance and regulate/manage minors' internet use. The source provided is a news/RSS item without the underlying official instrument text.

POLICY HIGHLIGHTS

Research5

View in Research →

DeXposure-Claw: An Agentic System for DeFi Risk Supervision

Decentralized finance exposes supervisors to fast-moving, networked credit risks. General-purpose LLM agents fit this setting poorly: they over-read weak evidence and recommend high-stakes interventions, while existing evaluations offer no regulator-aligned way to measure the resulting false alarms. We introduce DeXposure-Claw, a forecast-grounded agentic supervision system that routes LLM decisio

RESEARCH

DynAMO:Dynamic Asset Management Orchestration via Topological Multi-Agent Scheduling

While LLM-powered agents offer end-to-end automation for industrial asset lifecycles, real-world Industry 4.0 deployment is hindered by latency, concurrency instability, and safety risks. We present DynAMO (Dynamic Asset Management Orchestration), a deployment-ready engine using a Plan-then-Execute architecture to generate verifiable workflow graphs. DynAMO supports both SequentialWorkflow (topolo

RESEARCH

Patronus: Identifying and Mitigating Transferable Backdoors in Pre-trained Language Models

The ``Pre-train, then fine-tune'' paradigm has revolutionized Natural Language Processing (NLP). In this context, transferable backdoors pose a severe threat to the Pre-trained Language Models (PLMs) supply chain, yet defensive research remains nascent, primarily relying on detecting anomalies in the output feature space. We identify a critical flaw that fine-tuning on downstream tasks inevitably

RESEARCH

A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't)

Instruction fine-tuning of large language models (LLMs) often involves selecting a subset of instruction training data from a large candidate pool, using a small query set from the target task. Despite growing interest, the literature on targeted instruction selection remains fragmented and opaque: methods vary widely in selection budgets, often omit zero-shot baselines, and frequently entangle th

RESEARCH

Multimodal Evaluator Preference Collapse: Cross-Modal Contagion in Self-Evolving Agents

When AI agents use language models to evaluate their own outputs in a feedback loop, systematic biases emerge. We show that Evaluator Preference Collapse (EPC) is dramatically amplified in multimodal settings. Using GPT-4o to evaluate DeepSeek-chat across text and visual tasks, we find that a single strategy (step_by_step) absorbs 48.4% of all weight -- 3.2x the collapse observed in text-only self

RESEARCH

Funding3

View in Deal Flow →

Cursor

Capital remains highly concentrated in a small number of outsized transactions, with infrastructure and adjacent platform assets absorbing the largest checks. The pattern points to late-stage conviction and balance-sheet strength rather than broad-based seed activity.

FUNDING

Anysphere

Capital remains highly concentrated in a small number of outsized transactions, with infrastructure and adjacent platform assets absorbing the largest checks. The pattern points to late-stage conviction and balance-sheet strength rather than broad-based seed activity.

FUNDING

SpaceX

Capital remains highly concentrated in a small number of outsized transactions, with infrastructure and adjacent platform assets absorbing the largest checks. The pattern points to late-stage conviction and balance-sheet strength rather than broad-based seed activity.

FUNDING

Talent Moves3

View in Talent Tracker →

head of enterprise AI sales

Named moves continue to matter where they alter commercial reach or technical authority: one senior sales departure and return, plus a chief-scientist promotion, signal internal reshaping at major AI firms. The desk is about leverage, not churn.

TALENT MOVES

Chief Scientist

Named moves continue to matter where they alter commercial reach or technical authority: one senior sales departure and return, plus a chief-scientist promotion, signal internal reshaping at major AI firms. The desk is about leverage, not churn.

TALENT MOVES

head of enterprise AI sales

Named moves continue to matter where they alter commercial reach or technical authority: one senior sales departure and return, plus a chief-scientist promotion, signal internal reshaping at major AI firms. The desk is about leverage, not churn.

TALENT MOVES

Benchmarks3

View in Model Tracker →

Llama 3 (70B) — gsm8k

Benchmark updates are sparse but still useful: the cycle adds fresh scores across core reasoning and knowledge tests, keeping attention on where open-weight systems are landing relative to established baselines. Even a small set of entries can move the comparison frame when they cluster around general-purpose capability.

BENCHMARKS

Llama 3 (70B) — math

Benchmark updates are sparse but still useful: the cycle adds fresh scores across core reasoning and knowledge tests, keeping attention on where open-weight systems are landing relative to established baselines. Even a small set of entries can move the comparison frame when they cluster around general-purpose capability.

BENCHMARKS

Llama 3 (70B) — mmlu

Benchmark updates are sparse but still useful: the cycle adds fresh scores across core reasoning and knowledge tests, keeping attention on where open-weight systems are landing relative to established baselines. Even a small set of entries can move the comparison frame when they cluster around general-purpose capability.

BENCHMARKS