Connect with us

AI

ASEAN AI Cooperation: Five Ways to Compound the Gains

Published

on

In October 2025, ASEAN finance ministers gathered in Kuala Lumpur and announced that negotiations for the bloc’s landmark Digital Economy Framework Agreement had reached “substantial conclusion” — 73% of core provisions agreed after 14 bruising rounds of talks. The remaining 27%? Cross-border data flows, digital identity, financial services. In other words, everything AI actually runs on. That gap between ambition and architecture is the central tension of South-east Asia’s AI moment: a region capable of producing $1 trillion in incremental GDP by 2030 from artificial intelligence, yet currently organized in ways that will guarantee it captures far less. The five moves that could change that are neither secret nor complicated. The question is whether ten governments have the collective will to execute them together.

The Infrastructure Is Outrunning the Institutions

The macro picture is genuinely dazzling. South-east Asia attracted more than $55 billion in AI infrastructure commitments in 2025, as hyperscalers from Microsoft to Google to Amazon bet heavily on the region’s growth trajectory. The bloc’s digital economy, already worth approximately $300 billion in 2025, could double to $2 trillion by 2030 if the ASEAN Digital Economy Framework Agreement — DEFA — is implemented effectively, according to analysis published by the World Economic Forum. Malaysia is importing compute at a pace that would have seemed improbable two years ago: $6.45 billion worth of GPUs in just the first four months of 2025, more than any other country in the region. Johor, the Malaysian state that borders Singapore, is developing 4.5 times its operational data center capacity — the fastest-growing hub in South-east Asia. Across the bloc, AI is projected to contribute between 10% and 18% of regional GDP by 2030, a figure that covers a wide range precisely because the outcome depends entirely on policy choices not yet made.

Yet hardware alone doesn’t compound. The physical layer is racing ahead of the institutional layer — the governance frameworks, talent pipelines, and data-sharing agreements that would allow ten fragmented national markets to function as a single AI economy. Five structural moves, pursued collectively and with some urgency, could change that.

One: Harmonize Regulation Before Fragmentation Calcifies

The ASEAN AI cooperation agenda crystallized most visibly in January 2026, when Digital Ministers gathered in Hanoi and adopted what became the Hanoi Digital Declaration — a commitment to deepen AI cooperation through policy harmonization and enhanced joint safety efforts. The sixth ASEAN Digital Ministers’ Meeting, held on January 15–16, 2026 under the theme “From Connectivity to Connected Intelligence,” formally endorsed the ASEAN AI Safety Network, established in 2025 and headquartered in Kuala Lumpur, as the region’s platform for regulatory preparedness. Malaysian Digital Minister Gobind Singh Deo announced that his country would host the secretariat. The symbolism was pointed: the region’s fastest-growing data center market staking a claim as the governance hub too.

The problem is that ten countries currently operate ten distinct AI regulatory regimes. Vietnam enacted South-east Asia’s first binding AI law — No. 134/2025 — in late 2025. Indonesia is finalizing mandatory requirements. Malaysia is considering dedicated legislation. Thailand has a draft law. The 2024 ASEAN Guide on AI Governance and Ethics offers shared principles — transparency, fairness, accountability — but remains voluntary. In some parts of ASEAN, before the Guide was even published, six of the ten member states had already formulated their own national AI strategies, each with distinct emphases and risk tolerances.

The gap between voluntary principles and binding rules is where foreign investment stalls and regional AI deployment fractures into national silos. DEFA could close that gap — but only if its AI governance and data protection provisions survive the final round of negotiations intact, with signature expected by end-2026. That is not assured.

Two: Build Shared Compute, Not Competing Fiefdoms

Why ASEAN’s AI gains will compound only at regional scale

The second structural move is a coordinated approach to compute infrastructure. Malaysia’s GPU import numbers and Johor’s data center boom are impressive, but they reflect national rather than regional logic — each government competing for the same scarce pool of hyperscaler investment, power supply, and land. Singapore’s 1.4 gigawatts of data center capacity already operates at 1.4% vacancy, the lowest rate in Asia-Pacific. Data center electricity consumption across the bloc is projected to rise from 9.8 terawatt-hours in 2025 to 22 TWh by 2030, and the energy-climate dilemma is acute: ASEAN’s power mix still leans heavily on fossil fuels, and Johor has already rejected nearly 30% of data center applications on energy efficiency grounds.

A regional approach — coordinating renewable energy procurement, computing capacity allocation, and grid upgrades across borders — would be demonstrably more efficient than each government racing independently for scarce power. The Johor-Singapore Special Economic Zone, which includes a planned 1,000-megawatt solar farm to supply clean energy to cross-border data infrastructure, hints at what bilateral energy cooperation could look like at scale. Scaled to an ASEAN-wide compute compact, that model could materially reduce both costs and the bloc’s carbon exposure from AI.

What is ASEAN’s AI strategy for 2030?

ASEAN’s emerging AI strategy centers on five pillars: regulatory harmonization through DEFA and the ASEAN AI Governance Guide; shared compute and energy infrastructure; a regional talent mobility framework; trusted cross-border data corridors; and collective AI deployment on shared public challenges like climate and health. The overarching goal is to position the bloc as the world’s fourth-largest economy by 2030, with AI contributing between 10% and 18% of regional GDP.

Three: Invest in Scientists, Not Just Users

The third move — and arguably the most urgent — is a serious AI talent strategy. Not the short-course upskilling that generates favorable headlines in ministerial statements, but sustained investment in the AI scientists who can build models rather than merely operate them.

The scale of the workforce challenge is significant. More than 164 million workers — over half of ASEAN’s labour force — are expected to face disruptions from generative AI, with automation reducing some roles while augmenting others requiring complex analytical judgment. The skills required for jobs in South-east Asia are expected to change by 72% between 2016 and 2030 — nearly double the rate of change seen in the prior 14 years. Indonesia alone will need 9 million additional ICT professionals by 2030, a target that looks nearly impossible against the region’s current educational infrastructure. In some parts of ASEAN, over 75% of employers report that fresh graduates are not job-ready for digital roles.

Still, the talent challenge has a structural dimension that job-readiness statistics don’t fully capture. Singapore consistently drains engineers and data scientists from neighboring markets, deepening supply gaps in Malaysia and Thailand. Mutual Recognition Arrangements — the formal mechanisms for cross-border professional mobility — currently benefit only around 1.5% of ASEAN’s labour force. If the region doesn’t expand talent mobility and invest in frontier research capacity, it risks producing a generation of skilled users of American and Chinese AI models rather than scientists who develop ASEAN’s own.

That distinction matters enormously for long-run competitiveness. Malaysia trained more than 734,000 individuals through Microsoft’s AI skilling initiative as of October 2025. The numbers are real. Yet building a regional AI economy on another company’s foundation models is not the same as having scientific depth of your own.

Four and Five: Data Corridors and Collective Deployment

The downstream consequences of compounding — or failing to

The fourth move is unlocking cross-border data flows. AI is only as useful as the data training it, and right now, divergent privacy rules, data localization mandates, and inconsistent consent frameworks leave ASEAN’s data fragmented into national pools too shallow for genuinely powerful applications. The ASEAN AI Safety Network has begun developing the concept of “trusted data corridors” — a mechanism discussed at the January 2026 ministerial that would allow data to move across borders under agreed standards, broadly analogous to the EU’s adequacy decisions that enable transatlantic flows. DEFA’s outstanding provisions on personal data protection and cross-border transfers are precisely the ones that have proved hardest to negotiate, precisely because they touch national sovereignty most directly.

The payoff from getting this right is substantial. DEFA’s successful implementation could double ASEAN’s digital economy from $1 trillion to $2 trillion by 2030 — a differential that reflects largely the value of integrated data flows versus fragmented ones.

The fifth move is arguably the most distinctive ASEAN contribution to the global AI agenda: deploying AI collectively on problems that are inherently regional in scope. Climate change doesn’t respect borders. Neither do infectious diseases. Agricultural supply chains, maritime logistics, and disaster early-warning systems all operate at a scale that single-country AI deployments cannot optimize — but that an integrated bloc of 680 million people, pooling data and co-funding models, absolutely could. The ASEAN Responsible AI Roadmap 2025–2030 gestures toward this logic, but the institutional machinery for genuine joint deployment — shared datasets, co-funded foundation models, regional procurement frameworks — remains thin. The COVID-19 pandemic exposed how badly the region needed coordinated health data infrastructure. An ASEAN health AI compact, building on lessons from that period, would be the most concrete near-term demonstration of what cooperative AI deployment actually looks like in practice.

AI is expected to add $1 trillion to South-east Asia’s GDP by 2030, positioning the bloc as the world’s fourth-largest economy — but that figure represents a ceiling, achievable only if structural barriers to regional AI integration are removed. Companies operating across multiple ASEAN markets would benefit from a single compliance framework rather than ten overlapping ones. Small and medium enterprises, which make up the overwhelming majority of ASEAN’s private sector, would gain access to AI capabilities currently available only to multinationals with the resources to navigate regulatory complexity in every jurisdiction.

The Case Against Regional Ambition

Not everyone finds this vision compelling, and the skeptical case deserves a fair hearing.

ASEAN’s institutional culture — built on consensus, non-interference, and the diplomatic shorthand of “the ASEAN Way” — has always struggled to produce binding commitments on questions touching national sovereignty. Data is sovereign. AI models trained on citizens’ data are, in some national readings, instruments of industrial policy and security as much as economic efficiency. Vietnam’s decision to enact its own binding AI law rather than wait for ASEAN consensus reflects a rational calculation: national control, achieved faster, beats regional harmonization at a slower pace and weaker standard.

There are genuine analytical grounds for that position. The 2024 ASEAN AI Governance Guide produced a framework built on multi-stakeholder models drawing from the OECD AI Principles and UNESCO’s Ethics recommendations — sensible as guidance, but deliberately non-binding to preserve national flexibility. Singapore’s AI governance focus on financial services and the city-state’s role as a regulatory laboratory looks very different from Indonesia’s emphasis on agriculture, healthcare, and equity inclusion. A binding regional framework risks being either too lowest-common-denominator to be useful, or too prescriptive to fit ten very different economies at very different stages of digital development.

The energy constraint adds a harder edge to the skepticism. If ASEAN’s data center power consumption rises from 9 TWh today to 68 TWh by 2030 — as research from the ASEAN Centre for Energy projects — the bloc’s AI ambitions could collide directly with its Paris Agreement commitments. Building shared AI infrastructure is only virtuous if it is also clean, and that constraint may prove more binding than any governance framework.

What Compounding Actually Requires

The honest accounting is this: ASEAN has built the hardware layer of an AI economy with impressive speed. The $55 billion in commitments, the GPU imports, the solar farms and submarine cables — all of it represents genuine structural transformation, not merely ministerial ambition. What the region has not yet built is the institutional layer of trust: the harmonized rules, the open data channels, the talent networks, and the habits of joint deployment that would allow those investments to compound into durable, broadly shared economic gains.

The five moves — regulatory harmonization through DEFA, shared compute and clean energy infrastructure, frontier talent investment and mobility, trusted cross-border data flows, and collective deployment on regional public challenges — are not novel proposals. Every significant ASEAN policy document published since 2024 contains at least three of them. The ASEAN Responsible AI Roadmap 2025–2030, the Hanoi Digital Declaration, the ASEAN AI Guide’s expanded Generative AI edition released in January 2025 — all reflect genuine regional consensus on the direction of travel.

What they do not reflect, yet, is consistent execution.

Compounding, in finance and in policy alike, works only if you stay the course. The region has the assets. It now needs the discipline.


Discover more from The Economy

Subscribe to get the latest posts sent to your email.

Continue Reading
Click to comment

Leave a Reply

Analysis

South-east Asia Has Never Produced an Enterprise Software Giant. AI Might Change That.

Published

on

Southeast Asia has minted 64 unicorns. It has built ride-hailing empires, mobile payment networks, and e-commerce platforms that reach hundreds of millions of consumers across one of the most demographically compelling markets on earth. What it has never built — not once, not even close — is an enterprise software company worth the name. No SAP, no Salesforce, no ServiceNow emerged from Singapore or Jakarta or Ho Chi Minh City. The $4 trillion category that generates the most durable recurring revenue in global technology has, for three decades, belonged entirely to companies founded in Walldorf and San Francisco. The arrival of artificial intelligence is the most serious challenge to that arrangement yet.

A Market Built on Someone Else’s Software

The enterprise software market across Southeast Asia generated approximately $4 billion in revenue in 2025, according to Statista — a figure that flatters the region’s actual technological dependence, since the overwhelming majority of that spend flows directly to SAP, Oracle, Salesforce, and Microsoft. Local vendors, where they exist at all, typically occupy narrow verticals: payroll, point-of-sale, inventory management. Not the full-stack, cross-functional platforms that generate the kind of compounding recurring revenue capable of becoming a $50 billion company.

Yet the capital environment is shifting decisively. AI-related investments accounted for 32% of all private funding raised in Southeast Asia in the first half of 2025, with more than 680 AI startups collectively raising over $2.3 billion in the year to June, according to regional ecosystem analysis by Second Talent. That is not merely a financing phenomenon. It is the precondition for a structural realignment — one that, for the first time, gives a Southeast Asian software company a credible route to building at genuine enterprise scale.

The Structural Explanation — and Why It’s Starting to Break Down

Why has Southeast Asia never produced an enterprise software giant?

For most of the past two decades, building enterprise software in Southeast Asia has existed in a state of structural impossibility. The model rests on a simple foundation: win a large domestic market, develop a replicable product, and export it. The United States gave SAP and Oracle a homogenous, English-speaking buyer base of enormous size. Germany gave SAP its first industrial clients. India gave Infosys an outsourcing wedge into the same corporations. Southeast Asia gave its founders ten countries, eight hundred language variants, and ten divergent sets of tax codes, data-localisation rules, and labour law frameworks.

The consequence is identifiable and consistent. Vishal Harnal, managing partner at 500 Global overseeing the firm’s Southeast Asian activities, stated it plainly in 2025: there is “very little B2B software in Southeast Asia, almost none of it,” and virtually every large software exit in 500 Global’s portfolio came from the United States, not the regional one. The domestic corporate buyer class was simply too thin. Southeast Asia’s economy is dominated by family conglomerates — the Jardine Mathesons and Salim Groups of the world — and by SMEs that historically resisted dollar-denominated SaaS contracts and preferred either bespoke implementations or whatever SAP subsidiary had just set up offices in their city. The Southeast Asia ERP market was valued at approximately $1.74 billion in 2024, growing at a 10% annual rate, according to UniVDatos — healthy growth, but spread across an archipelago of fragmented national markets, still dominated by Western incumbents.

What has changed is the cost structure of building software itself. Enterprise software was expensive in 2003 because it required large direct-sales teams, multi-year implementations, and deep relationships with CIOs who controlled multi-million dollar procurement budgets. The generative AI layer has compressed all of that. A conversational interface, built on top of an open-weight model fine-tuned for Bahasa Indonesia or Vietnamese, can replace months of workflow configuration. A Southeast Asian company that previously needed a $500,000 SAP implementation can now automate meaningfully from a local founder charging usage-based fees in local currency. The buyer is no longer a CIO with a multi-year budget cycle. It’s a logistics manager in Surabaya who wants her invoicing done by Thursday.

The software market in Southeast Asia has always had demand. What it lacked was a product architecture that could satisfy that demand at a price point local buyers would accept. AI changes the economics.

The Leapfrog Thesis — and Why This Time Might Actually Differ

How is AI enabling Southeast Asia to leapfrog traditional SaaS models?

Southeast Asia skipped the desktop era almost entirely, going mobile-first in ways that became case studies for markets from sub-Saharan Africa to Latin America. The same structural logic is now being applied to enterprise software. As Insignia Ventures Partners has documented, the region is “leapfrogging SaaS to AI in the same way it leapfrogged the computer to mobile,” and the conditions support the claim. Cloud adoption among Southeast Asian businesses sits at roughly 32%, compared to over 70% in the United States and Australia. That gap is not a handicap. It means the installed base of legacy SaaS contracts — the kind that trap American CFOs in multi-year Salesforce renewals — simply doesn’t exist here. There is no incumbent workflow to migrate away from.

Southeast Asia never locked itself into the SaaS subscription model that now encumbers Western enterprises. With cloud penetration at just 32% versus over 70% in the US, switching costs are close to zero. AI-native tools — priced on usage, built around conversational interfaces, and localised for regional languages — can displace legacy workflows in weeks rather than years.

The language question, long the most intractable barrier to building regional software, is being attacked directly. In May 2025, A*STAR launched an upgraded version of MERaLiON, a multimodal large language model supporting Malay, Vietnamese, Thai, Tamil, Bahasa Indonesia, and Mandarin, capable of handling the code-switching that characterises how Southeast Asians actually communicate — switching mid-sentence between English and Tagalog, or Thai and Mandarin. AI Singapore’s parallel SEA-LION project, funded with a S$70 million government commitment, is building a multilingual AI ecosystem covering 11 regional languages and designed explicitly for cost-sensitive enterprise deployment.

The commercial implication is visible at the company level. Diaflow, a Singapore-based AI-native workflow platform that raised its seed round from Insignia Ventures in February 2026, was built explicitly around the conviction that button-and-click enterprise software had failed the region. Founder Jonathan Viet Pham described the genesis of the company: years of failed enterprise automation projects that “didn’t save them time, didn’t save them money,” because companies were locked in the old mindset of menus and clicks. “Nobody wanted to change their behavior to another software.” Diaflow’s response was to abandon the button-and-click interface entirely and build for fully conversational, automated workflows. It is one of dozens of similar bets being placed across the region now.

Kata.ai, an Indonesian conversational AI company, raised significant funding in 2025 and launched enterprise-grade solutions that reportedly reduced customer service costs by 40% for Indonesian banking clients in 2026. Vietnam International Bank built ViePro, a generative AI financial assistant trained on proprietary banking data, on Amazon Bedrock — delivering real-time responses in Vietnamese across mortgage, credit card, and vehicle loan queries. Neither of these is a software giant yet. Both are proof that the enterprise application layer is buildable locally.

Implications: The Moat, the Hyperscaler Signal, and the Regulatory Paradox

The downstream consequences of this shift extend well beyond individual startups. The hyperscalers are reading the same data. Amazon Web Services recorded 38% year-on-year growth in AI adoption across ASEAN in 2024, with 29% of regional businesses — roughly 21 million companies — now using AI. AWS has committed $9 billion to Singapore through 2028 and $5 billion to Thailand. Microsoft pledged $1.7 billion to Indonesian cloud and AI infrastructure. Salesforce announced a $1 billion investment in Singapore in March 2025, specifically to expand its Agentforce AI platform and co-innovate with local enterprises. These are not speculative positions. They reflect the conclusion that Southeast Asia’s enterprise application layer will be large, and that whoever owns the distribution into it will capture meaningful value.

What’s often missed in this conversation is the regulatory paradox. The data-sovereignty patchwork that has historically terrified foreign vendors — Singapore’s PDPA, Indonesia’s PDP Law, Vietnam’s AI Law enacted December 2025 — is, for a local founder with regional expertise, a competitive moat. A company that builds a compliance engine capable of satisfying Bank Indonesia’s regulatory sandbox, Vietnam’s data-residency requirements, and Thailand’s forthcoming cloud controls has constructed something that a company in Menlo Park cannot cheaply replicate. The complexity is front-loaded and painful; the defensibility compounds over time.

SAP’s announcement of a €150 million R&D hub in Vietnam, made in August 2025, is instructive from the incumbent side: even Western enterprise software giants are now investing in regional engineering capacity, because local language and regulatory nuance has become too important to manage from a global centre. The competition is finally taking the region seriously as a place to build, not just to sell into.

The picture that emerges is not one company about to displace SAP. It’s an ecosystem undergoing a structural reorientation — away from consumer applications and toward the enterprise software layer that generates the most durable recurring revenue in technology.

The Counterargument: Most of This Will Fail

The case against Southeast Asia producing an enterprise software giant is not trivial. It is, in several respects, still the more defensible position.

Research cited by Insignia Ventures puts the global failure rate of generative AI projects at 95% on an ROI basis. Southeast Asia’s version of this failure follows a consistent pattern: a promising proof-of-concept, funded by a government grant or a local corporate pilot, that never scales beyond its first customer. The gap between individual AI tool adoption and genuine enterprise transformation remains wide. While three-quarters of employees in Singapore use AI tools individually, only 15% of SMEs have managed to integrate AI at the enterprise level — a figure cited directly by Singapore’s Minister for Digital Development and Information in early 2026. Interest is not the problem. Institutional change is.

The talent constraint is structural, not cyclical. Machine learning engineers and data scientists remain scarce across the region. Salaries in Vietnam, the Philippines, and Indonesia rose 18–21% in 2025, which sounds encouraging until you note it’s partly the result of hyperscaler expansion competing for the same engineers. Companies best positioned to build durable enterprise software — those requiring deeply technical founders and the ability to retain ML talent — are disproportionately clustered in Singapore, where the cost of that talent approaches US rates.

Fragmented regulation, rather than always creating a moat, can simply create paralysis. A startup attempting to build a genuine cross-border enterprise platform faces ten different data-localisation regimes and procurement processes that explicitly reward the incumbency of SAP and Oracle. The result is that “regional enterprise software” has historically meant “Singapore plus one adjacent market” — not the genuine ten-country scale that would constitute an ASEAN platform. That pattern has resisted every generation of optimistic founders so far.

That said, the honest critique must acknowledge what it cannot explain: why this generation — armed with open-weight models, usage-based pricing, local LLMs, and zero legacy SaaS installed base to compete against — will simply repeat the failures of their predecessors rather than exploit the structural opening those predecessors never had.

Closing

The honest answer to whether Southeast Asia will finally produce an enterprise software giant is: probably not in the shape the question implies. The SAP model — one vendor, one platform, forty years of global dominance — was a product of historical conditions specific to Germany in the 1970s. What the region might produce is something structurally different: a cluster of AI-native companies, built on local language models and embedded regulatory expertise, capable of delivering enterprise-grade automation at a price point and user experience that Western incumbents cannot match. A smaller ambition in one sense. In another, a more interesting one — and more likely to actually materialise.

The leapfrog, when it arrives, will look less like SAP and more like GCash.


Discover more from The Economy

Subscribe to get the latest posts sent to your email.

Continue Reading

Analysis

The Race to the Regulators: Why AI Pre-Deployment Testing Has Arrived

Published

on

For most of the past two years, the dominant assumption in Washington’s corridors was that the Trump administration would keep its hands off frontier AI. The January 2025 revocation of Biden’s executive order on AI risk seemed to cement that posture. So when the U.S. Department of Commerce’s Center for AI Standards and Innovation announced on May 5, 2026 that it had signed formal agreements with Google DeepMind, Microsoft, and Elon Musk’s xAI — granting federal evaluators access to unreleased AI models — the pivot was sharper than most observers had anticipated.

The catalyst was not abstract policy debate. It was a model.

When security researchers at Mozilla pointed Anthropic’s new Mythos system at their code, the experience produced something close to vertigo. Bobby Holley, Firefox’s chief technology officer, said Mythos had elevated AI from a competent software engineer to something resembling a world-class, elite security researcher. That description — and its implications for every unpatched vulnerability in every network connected to the internet — lit a fire under the White House that no deregulatory talking point could easily extinguish. The Washington Post

The new AI pre-deployment testing agreements are Washington’s answer. They are voluntary, technically non-binding, and carefully constructed to avoid the language of mandates. They are also, in their quiet way, a structural reckoning with just how consequential the next generation of AI models may be.

What the CAISI Agreements Actually Do

The Center for AI Standards and Innovation announced agreements with Google DeepMind, Microsoft, and Elon Musk’s xAI that will allow the U.S. government to evaluate artificial intelligence models before they are publicly available. CAISI will conduct pre-deployment evaluations and targeted research. The announcement builds on earlier partnerships struck with OpenAI and Anthropic in 2024, which were the first of their kind. CNBC

The scope is broader than a checkbox exercise. CAISI has completed more than 40 evaluations to date, including assessments involving unreleased AI models. Developers frequently provide models with reduced or removed safeguards to support evaluations focused on national security-related capabilities and risks. The agreements also support testing in classified environments and enable participation from evaluators across government agencies through the TRAINS Taskforce, a group of interagency experts focused on AI-related national security issues. Executive Gov

That last point matters. A model tested with its guardrails intact tells evaluators relatively little about what it’s genuinely capable of doing. By examining systems in their more uninhibited state, CAISI can probe for the kinds of capabilities — automated cyberattack sequencing, biochemical synthesis guidance, manipulation of critical infrastructure — that frontier labs are increasingly warning about in their own internal research.

CAISI’s evaluations focus on demonstrable risks, such as cybersecurity, biosecurity, and chemical weapons. These aren’t theoretical threat categories. They are the precise domains in which advanced reasoning models have begun to demonstrate capabilities that, even in controlled settings, have prompted unusual candour from the labs building them. National Institute of Standards and Technology

Prior to evaluating U.S.-based AI models, CAISI recently examined the Chinese model DeepSeek, concluding it underperformed in several areas including accuracy, security and cost efficiency. That context is not incidental. Part of what’s driving Washington’s urgency is the competitive dimension — the fear that adversaries may be racing toward capabilities that American agencies don’t fully understand, even in their own country’s frontier models. Nextgov.com

CAISI Director Chris Fall has framed the institutional mission with deliberate precision. “Independent, rigorous measurement science is essential to understanding frontier AI and its national security implications,” Fall said. “These expanded industry collaborations help us scale our work in the public interest at a critical moment.” Federal News Network

What Does CAISI’s AI Pre-Deployment Testing Actually Involve?

CAISI conducts pre-release evaluations of frontier AI models by accessing versions with reduced or removed safety filters, testing in classified environments, and deploying an interagency task force — the TRAINS Taskforce — across government agencies. Evaluations focus on cybersecurity, biosecurity, and chemical weapons risks. The center has completed over 40 such assessments to date.

That question has real commercial stakes attached to it. NIST said the partnerships would help the agency and the tech companies exchange information, spur voluntary product improvements, and ensure the government had a clear understanding of what AI models were capable of doing. For the companies involved, this framing is tolerable — even attractive. A pre-release government endorsement, implicit or explicit, is worth something in enterprise procurement conversations. It’s harder to challenge a model that CAISI has already looked at. Cybersecurity Dive

Yet the capacity problem is glaring. CSET Senior Research Analyst Jessica Ji noted that government agencies simply don’t have the same amount of resources as big tech companies — either the manpower, technical staff, or access to compute — to run rigorous evaluations of these models. CAISI is a relatively lean organisation operating against labs that employ thousands of the world’s most skilled AI researchers. The asymmetry between evaluator and evaluated has no obvious near-term solution. CSET

The FDA Analogy — and Why It’s Both Tempting and Dangerous

The policy frame that has seized Washington’s imagination is, perhaps inevitably, the Food and Drug Administration. National Economic Council Director Kevin Hassett told Fox Business that the administration is studying a possible executive order to give a clear roadmap for how future AI models that create vulnerabilities should go through a process so that they’re released into the wild after they’ve been proven safe, just like an FDA drug. Bloomberg

The analogy is rhetorically clean. It is also, on closer inspection, strained in ways that matter for how any eventual mandatory regime would function in practice.

Drug approval is predicated on a relatively bounded hypothesis: does this compound do what it claims, without causing specified harms? The FDA’s clinical trial infrastructure, built over decades, evaluates outcomes in controlled populations against defined endpoints. Frontier AI models behave differently. Their capabilities emerge non-linearly from scale, training data, and interaction patterns that no pre-deployment test suite can exhaustively simulate. A model that passes a red-teaming exercise on Tuesday may discover a novel attack vector in production by Thursday.

CAISI conducts post-deployment evaluations to track risks that emerge after launch, since AI systems often behave differently under real-world conditions — including adversarial inputs and dataset drift — than they do in controlled testing environments. This acknowledgment, buried in the operational details of how CAISI works, quietly concedes what the FDA analogy papers over: there is no clean approval moment. Safety is a continuous process, not a gate. Arnav

Still, the political logic of the FDA frame is sound. It gives the administration a vocabulary for oversight that doesn’t require it to announce a regulatory regime. “Proven safe before release” is a message that plays well. The implementation will be considerably messier.

A bipartisan group of 32 House lawmakers has written to National Cyber Director Sean Cairncross urging immediate action to confront the high volume of cyber vulnerability disclosures cropping up from advanced AI systems. The letter marks an escalation in pressure on the Trump administration to confront the risks posed by frontier AI cyber models. That kind of bipartisan pressure — rare in contemporary Washington — signals that this issue has moved beyond the usual partisan channels. Axios

Second-Order Effects: Markets, Enterprise, and the Voluntary-to-Mandatory Gradient

The agreements announced on May 5 are voluntary. That status, however, may have a shorter shelf life than the companies involved are counting on.

National Economic Council Director Hassett said it’s “really quite likely” that any testing spelled out under an executive order would ultimately extend to all AI companies. “I think Mythos is the first of them, but it’s incumbent on us to build a system,” he said. When a White House economic adviser publicly floats universal applicability, the “voluntary” characterisation begins to function more as a transitional state than a permanent arrangement. Insurance Journal

For enterprise buyers, the near-term implications are more concrete. A CAISI evaluation — particularly one conducted in a classified environment, with results shared selectively across agencies — effectively creates an informal tier of government-vetted AI systems. The companies that have signed these agreements (Google DeepMind, Microsoft, xAI, OpenAI, and Anthropic) are, not coincidentally, the same companies that supply the overwhelming majority of frontier AI infrastructure to federal agencies. A new entrant — a well-capitalised European lab, or a fast-scaling domestic startup — that hasn’t been through the CAISI process faces an implicit disadvantage in federal procurement, regardless of whether any formal mandate exists.

The market signal is already visible. Following the announcement, Microsoft’s stock was down 0.6 percent in midday trading, while Alphabet, Google’s parent company, was trending in the opposite direction — up 1.3 percent. These are small moves, and reading too much into single-session trading is unwise. But the divergence may reflect a market reading of which company has the most to gain from tighter relationships with Washington’s AI oversight apparatus. Al Jazeera

The international dimension compounds the picture. The EU’s AI Act, which came into full force in August 2025, imposes mandatory conformity assessments on high-risk AI systems. The CAISI framework, built on voluntary agreements and classified evaluations, is a fundamentally different architecture — one shaped by American deregulatory instincts even as it begins to converge toward similar outcomes. The question of mutual recognition, or regulatory fragmentation, will land on the desks of trade negotiators before the decade is out.

The Counterargument: Testing Without Teeth?

Not everyone views the CAISI expansion as a meaningful check on frontier AI risk. Critics — some within the AI safety research community, others in civil liberties organisations — have raised a set of concerns that deserve a serious hearing rather than a dismissal.

The first is structural: evaluations conducted under voluntary agreements give the evaluated parties significant influence over what the evaluators can access, how results are framed, and whether findings lead to any material consequence. The new agreements allow CAISI to evaluate new AI models and their potential impact on national security and public safety ahead of their launch, and to conduct research and testing after AI models are deployed. What the agreements do not stipulate, publicly at least, is what happens when CAISI finds something troubling. The absence of a defined enforcement mechanism isn’t a technicality — it’s the central design question. CNN

The second concern is about scope creep in the opposite direction. The agreements build upon OpenAI and Anthropic’s agreements in 2024, which were the first of this kind. Each iteration has expanded the framework’s reach without a parallel expansion of CAISI’s evaluation capacity or legal authority. If the executive order now under consideration mandates testing without addressing the resource gap Jessica Ji identified, the process risks becoming a compliance ritual rather than a genuine safety check — something labs can credential-wash without fundamentally altering their deployment timelines. The Hill

Industry groups have been supportive: Business Software Alliance Senior Vice President Aaron Cooper said that CAISI brings the necessary expertise to work with private sector partners to evaluate frontier models for safety and national security risks, and called it the right institutional home within government. Industry enthusiasm for a regulatory body is not, historically, a reliable indicator of rigorous oversight. It can equally signal confidence that the oversight will remain manageable. Nextgov.com

A Framework in Formation

The agreements signed on May 5 are neither a regulatory revolution nor a fig leaf. They are something more interesting and more ambiguous than either characterisation allows.

Washington has moved from ignoring frontier AI risk to institutionalising a mechanism for examining it — in under eighteen months, and largely under the pressure of a single model’s demonstrated capabilities. That is, by the standards of government technology policy, fast. The CAISI framework exists, it has now absorbed five of the most significant frontier labs, and it has begun to develop the institutional muscle memory that eventually becomes precedent.

What it lacks is clarity on consequences. The voluntary-to-mandatory gradient that Hassett suggested — extending CAISI-style testing to all AI companies — would represent a genuine structural shift. Whether such an order arrives, and whether it comes with enforcement mechanisms or remains aspirational, will determine whether the May 5 announcements are remembered as a turning point or a photo opportunity.

The FDA comparison is imperfect. The analogy is imprecise. But the underlying instinct — that something this powerful, moving this fast, probably shouldn’t enter the world completely unexamined — is harder to argue with every week that passes.

The question now isn’t whether Washington will test frontier AI before it ships. It’s whether the testing, when it finds something, will actually matter.


Discover more from The Economy

Subscribe to get the latest posts sent to your email.

Continue Reading

AI Governance

Is AI Already Putting Graduates Out of Work? The Grim Reality Facing the Class of 2026

Published

on

Consider a sweltering commencement ceremony in Florida this past May. As the sea of black-robed graduates wiped sweat from their brows, a guest speaker—a prominent regional tech executive—stepped to the podium. When he cheerfully urged the Class of 2026 to “embrace the boundless frontier of the AI revolution,” the response was not polite applause. It was a low, rolling wave of boos.

It was a startling breach of academic decorum, yet a profoundly rational economic response. For these twenty-somethings clutching newly minted degrees, artificial intelligence is not an abstract marvel or a stock market catalyst. It is the algorithm that just rescinded their job offers.

If you ask the architects of American economic policy, however, this anxiety is entirely misplaced. On May 11, White House National Economic Council Director Kevin Hassett appeared on CNBC to assuage fears about an automated workforce. “There’s no sign in the data that AI is costing anybody their job right now,” Hassett stated flatly, arguing instead that corporate AI adoption drives rapid revenue and even employment growth.

The Economist recently highlighted this exact sentiment as a symptom of a widening disconnect between macroeconomic theory and microeconomic reality, wryly noting that someone in Washington ought to break the news to America’s Class of 2026. The dissonance is jarring, but it is not inexplicable. When high-level policymakers look for “signs in the data,” they are gazing at aggregate, national statistics. But if you peer beneath the tranquil surface of overall employment, a far more turbulent reality reveals itself. Are we seeing mass layoffs across the entire economy? No. Is AI putting graduates out of work before they even have a chance to begin their careers? Absolutely.

As white-collar automation accelerates at a breakneck pace, the AI impact on class of 2026 job market dynamics serves as a canary in the digital coal mine. We are witnessing a surgical hollowing out of the entry-level tier—a grim reality that forces us to ask not just what jobs will survive, but how a generation will manage to start their professional lives at all.

The Macro Illusion vs. The Micro Reality

To understand why Hassett’s optimism feels like a slap in the face to a twenty-two-year-old, one must understand how corporate restructuring works in the algorithmic age. When companies utilize automation to drive efficiency, they rarely execute spectacular, headline-grabbing mass layoffs of their senior staff. Instead, they rely on a quieter, less visible lever: they simply stop hiring juniors.

Entry-level hiring acts as the economy’s primary shock absorber during periods of structural technological change. The Federal Reserve Bank of New York paints a sobering picture of this phenomenon. In the first quarter of 2026, the unemployment rate for recent college graduates hovered stubbornly at 5.7%—noticeably higher than the national aggregate. Even more troubling is the underemployment rate for this demographic, which currently sits at a staggering 41.5%. Nearly half of all recent degree holders are working in roles that do not require a four-year university education.

This statistical reality undercuts the rosy narrative pushed by algorithmic optimists. The true crisis of graduate unemployment AI exposed fields isn’t found in the termination of existing contracts; it is found in the evaporation of open requisitions. Data from early-career platforms like Handshake and workforce intelligence firm Revelio Labs corroborate this stealth contraction, showing sustained drops in entry-level corporate postings over the past twenty-four months.

When a task can be automated, the job that primarily consisted of that task disappears. Historically, entry-level jobs were defined by routine, repetitive cognitive labor: organizing spreadsheets, writing boilerplate code, drafting foundational marketing copy, and conducting preliminary legal research. Today, large language models and agentic AI handle these tasks for fractions of a penny on the dollar. The entry level jobs disappearing AI phenomenon is not a future projection; it is a present-tense corporate strategy.

Dissecting the Data: The AI-Exposed Graduate Squeeze

The pain, however, is not distributed evenly across the graduating class. We are witnessing a brutal divergence based on a major’s vulnerability to generative models.

Recent labor market analyses indicate a staggering ~6.6 percentage point worse employment drop for graduates entering high-AI exposure fields compared to those in low-AI exposure sectors. A nursing graduate or a civil engineering student—professions requiring complex physical interaction and real-world spatial reasoning—faces an entirely different economic landscape than a marketing or information sciences major.

Nowhere is this dichotomy starker than in the tech sector itself. The computer science grads job prospects AI paradox is the defining irony of the Class of 2026. The very students who dedicated four years to mastering the architecture of the digital world are finding themselves displaced by their own industry’s creations.

Consider the recent restructuring at major tech firms. In early 2026, Cloudflare announced roughly 1,100 job cuts, with executives explicitly pointing to “agentic AI” that now runs thousands of internal operations daily. Coinbase reduced its headcount by 14%, with CEO Brian Armstrong publicly noting, “Over the past year, I’ve watched engineers use AI to ship in days what used to take a team weeks.” When senior engineers become a 10x multiplier of their own productivity thanks to AI copilots, the mathematical necessity of hiring a dozen junior developers to support them vanishes.

The Bifurcation of Skills: Is AI Replacing Entry Level Coding Jobs?

This brings us to the most pressing question whispered in university computer labs across the globe: is AI replacing entry level coding jobs?

The nuanced answer is that AI is not replacing all coding jobs, but it has entirely annihilated the “routine coder.” For decades, the software engineering pipeline operated on an apprenticeship model. Companies hired vast cohorts of junior developers to perform grunt work—QA testing, debugging simple errors, and writing basic, repetitive scripts. This labor was not highly valued for its innovation; it was valued because it served as the training wheels for the next generation of senior architects.

“We used to hire ten juniors right out of college, knowing only two would eventually become elite senior developers,” notes one anonymous hiring manager at a Fortune 500 tech firm. “Today, we hire two, give them enterprise-grade AI tools, and expect senior-level architectural thinking within six months.”

This shift highlights a brutal skills bifurcation. The labor market has violently split into “AI-fluent problem solvers” and “routine task executors.” The National Association of Colleges and Employers (NACE) recently published their Job Outlook 2026 Spring Update, revealing a fascinating contradiction. Overall, employers project a 5.6% increase in hiring for the Class of 2026. Yet, beneath that aggregate number lies a massive qualitative shift: the demand for AI skills in entry-level jobs has nearly tripled since the fall of 2025, now appearing in 13.3% of all entry-level postings.

Employers are not necessarily abandoning the youth; they are demanding that the youth arrive at their desks performing like seasoned veterans, augmented by silicon. If a graduate views their computer science degree as a certificate that qualifies them to write basic Python loops, they will find themselves permanently unemployable. If they view it as a foundational framework to direct, edit, and orchestrate AI systems, they become indispensable.

The Corporate Pipeline Paradox

While companies celebrate the short-term margin expansion granted by this AI-driven efficiency, they are blindly stumbling into a catastrophic long-term trap: the corporate pipeline paradox.

If consulting firms, investment banks, and tech conglomerates structurally eliminate their entry-level cohorts, where exactly will their mid-level managers and senior executives come from in 2036? Expertise is not downloaded; it is forged through the very “grunt work” that AI has now cannibalized. By severing the bottom rung of the career ladder, corporations are burning their own future human capital to heat today’s quarterly earnings reports.

Oxford Economics and the Stanford Digital Economy Lab have both published extensive research on the productivity booms associated with generative AI. According to estimates by Goldman Sachs, generative AI could eventually raise global GDP by 7%. Yet, these macroeconomic models rarely account for the generational friction borne by twenty-two-year-olds.

The international comparison adds another layer of complexity. In the UK and the European Union, stringent labor protections and the slow turning of bureaucratic wheels have somewhat insulated recent graduates from immediate tech-driven displacement. However, this regulatory shield is a double-edged sword. While it protects existing jobs, it also makes European firms highly hesitant to hire new graduates, exacerbating youth unemployment and stifling the continent’s competitive edge in an AI-dominated global market. The American model—ruthless, dynamic, and unapologetically Darwinian—may ultimately adapt faster, but the human cost is currently being paid by the Class of 2026.

Higher Education’s Existential Crisis

As the corporate world reshapes itself overnight, the higher education sector remains glacially slow to react. Universities are charging premium tuitions to teach a 2019 curriculum in a 2026 reality.

When the Bureau of Labor Statistics aggregates long-term occupational outlooks, they base their models on historical trends. But historical trends are useless when the fundamental nature of cognitive labor has been rewritten. Professors who ban the use of generative AI in their classrooms are actively handicapping their students. Teaching a student to code, write, or analyze data without the use of AI is akin to teaching an accountant to balance a ledger without Microsoft Excel. It is an exercise in archaic purity that has no place in the modern workforce.

Universities must pivot from teaching information retrieval and routine execution to teaching critical curation, systems thinking, and AI orchestration. The most valuable skill for a 2026 graduate is not knowing the answer, but knowing how to interrogate an AI agent until it produces the optimal solution, and possessing the domain expertise to verify that solution’s accuracy.

The Way Forward: Navigating the Algorithmic Squeeze

Despite the sobering data, the AI impact on class of 2026 job market is not a story of inescapable doom. It is, rather, a profound evolutionary pressure. The graduates who will thrive in this environment are those who understand that they are no longer competing against machines; they are competing against other graduates using machines.

To survive the great algorithmic squeeze, early-career professionals must lean heavily into the very traits that silicon cannot replicate. The NACE data is explicitly clear on this: when employers review resumes for the Class of 2026, the deciding factors between equally qualified candidates are consistently polished teamwork, high emotional intelligence, cross-disciplinary problem-solving, and elite communication skills.

An AI can write a flawless legal brief, but it cannot read the temperature of a courtroom. An AI can generate a perfect marketing strategy, but it cannot sit across from a hesitant client and build genuine, empathetic trust. The entry-level jobs of the future will not be about executing tasks; they will be about managing relationships, both human and digital.

The booing at that Florida commencement was not just a primal expression of anxiety; it was a demand for a modernized social contract between technology, capital, and labor. Kevin Hassett and Washington’s macroeconomic optimists may see “no sign in the data” today, but they are looking at the lagging indicators of a bygone era. For the Class of 2026, the data is lived experience. Their reality is grim, their climb is steeper, and their margin for error is nonexistent. Yet, if they can master the machine rather than be replaced by it, they will become the architects of an entirely new economy—one where human ingenuity remains the ultimate, irreplaceable premium.


Discover more from The Economy

Subscribe to get the latest posts sent to your email.

Continue Reading

Trending

Copyright © 2025 The Economy, Inc . All rights reserved .

Discover more from The Economy

Subscribe now to keep reading and get access to the full archive.

Continue reading