Author: Denis Avetisyan
A new analysis maps the evolving demand for human skills in an era of increasingly capable artificial intelligence.

Research introduces the Skill Automation Feasibility Index (SAFI) to benchmark large language models and assess the potential for skill obsolescence, emergence, and transition.
Despite rapid advances in artificial intelligence, a clear understanding of which skills are most susceptible to automation remains elusive. This challenge is addressed in ‘The AI Skills Shift: Mapping Skill Obsolescence, Emergence, and Transition Pathways in the LLM Era’, which introduces the Skill Automation Feasibility Index (SAFI) to benchmark large language models, including LLaMA, Mistral, and Gemini, across a comprehensive range of 35 skills. Our findings reveal a surprising divergence: while LLMs demonstrate strong capabilities in structured reasoning tasks, they lag significantly in areas requiring nuanced human communication, suggesting a future characterized by AI augmentation rather than wholesale job displacement. Will this pattern of capability-demand inversion reshape workforce development and necessitate a renewed focus on uniquely human skills?
The Automation Paradox: A Tale of Shifting Skills
The accelerating development of artificial intelligence isn’t creating a uniformly automated workforce; instead, it’s fostering a distinctly polarized labor market. While routine, codifiable tasks (data entry, simple assembly, and repetitive analysis) are increasingly vulnerable to automation, skills demanding complex socio-emotional intelligence, creativity, and critical thinking remain largely resistant. This divergence isn’t simply about job displacement; it’s about a widening gap in earning potential. Individuals possessing uniquely human skills are likely to see their value increase, while those focused on easily automated tasks face potential wage stagnation or job loss, potentially exacerbating existing economic inequalities and necessitating a re-evaluation of workforce training and education priorities.
Conventional methods of skill evaluation often prove inadequate when assessing the potential of Large Language Models (LLMs) to perform complex tasks. These assessments typically focus on isolated skills or predefined parameters, failing to capture the emergent abilities of LLMs within realistic, dynamic scenarios. While an LLM might score poorly on a standardized test of ‘critical thinking,’ it can nonetheless generate remarkably coherent and insightful text, demonstrating a functional equivalent of the skill in a practical context. This discrepancy arises because LLMs excel at pattern recognition and contextual adaptation – capabilities not easily quantified by traditional metrics. Consequently, relying on established skill assessments risks both underestimating the automation potential of LLMs and misidentifying which human skills are truly resilient to technological disruption. A more holistic and context-aware approach is therefore crucial for accurately gauging the feasibility of automating various human capabilities.
Current methods for evaluating automation potential often rely on task decomposition, analyzing whether a machine can perform individual steps, but fail to assess if it can reliably integrate those steps into a cohesive, adaptable process – a crucial distinction given the emergent capabilities of artificial intelligence. Researchers are advocating for a shift towards holistic skill assessments, focusing on the contextual understanding, problem-solving, and nuanced judgment that define complex human work. This necessitates developing benchmarks that move beyond isolated task completion and instead measure an AI’s ability to navigate ambiguity, learn from limited data, and generalize knowledge to unforeseen situations – effectively gauging not just what a machine can do, but how well it can do it within a dynamic real-world environment. Such a nuanced approach promises a more accurate prediction of which skills are truly at risk, and which will likely remain the domain of human expertise.

SAFI: Measuring Skills in the Age of Foundation Models
The Skill Automation Feasibility Index (SAFI) method assesses skills by converting them into text-based representations suitable for analysis by Large Language Models (LLMs). This approach utilizes the LLM’s capacity for semantic understanding to evaluate skill proficiency based on textual descriptions of tasks and knowledge requirements. By framing skills as text, SAFI enables automated, scalable assessment, circumventing the limitations of traditional, manual evaluation methods. The system accepts skill definitions, analyzes them using the LLM, and outputs a skill profile, allowing for comparisons across individuals or models. This text-based approach facilitates high-throughput evaluation and reduces the costs associated with human-based skill assessment.
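The paper does not publish SAFI's implementation, but the pipeline described above (skill definition in, LLM analysis, score out) can be sketched roughly as follows. Here `query_llm` is a hypothetical placeholder for any LLM backend such as Mistral or Gemini, stubbed with a canned reply so the sketch runs standalone; the prompt wording is an assumption, not the paper's actual prompt.

```python
# Hypothetical sketch of SAFI's text-based skill assessment.
# `query_llm` stands in for a real LLM backend; here it returns a
# canned reply so the example is self-contained and runnable.

def query_llm(prompt: str) -> str:
    """Stub: a real implementation would call an LLM API here."""
    return "Automation feasibility: 42.2"

def safi_score(skill: str, description: str) -> float:
    """Frame a skill as text and elicit a 0-100 automation feasibility score."""
    prompt = (
        f"Skill: {skill}\n"
        f"Description: {description}\n"
        "On a scale of 0 to 100, how feasible is it for a large language "
        "model to automate this skill? Reply as 'Automation feasibility: <score>'."
    )
    reply = query_llm(prompt)
    # Extract the numeric score from the model's structured reply.
    return float(reply.rsplit(":", 1)[-1].strip())

print(safi_score("Active Listening",
                 "Giving full attention to what other people are saying"))
```

Because the skill profile is just parsed text, the same loop scales to any number of skill definitions without human raters in the loop.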
The SAFI methodology incorporates a standardized skill taxonomy derived from O*NET, the Occupational Information Network, to facilitate a comprehensive and comparable assessment of automation potential. O*NET provides a nationally recognized, detailed classification of worker attributes and skill requirements across various occupations. By anchoring SAFI’s evaluations to this established framework, the method ensures broad coverage of relevant skills and enables consistent, objective comparisons of skill profiles across different Large Language Models and, ultimately, across various job roles. This standardization mitigates subjectivity and allows for meaningful aggregation and analysis of automation risk and opportunity at a granular level.
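In practice, anchoring to a taxonomy amounts to mapping each evaluated skill onto a standard category before scoring, so results aggregate consistently across models and occupations. The grouping below is a simplified illustration, not O*NET's actual content model or the paper's full 35-skill set.

```python
# Toy O*NET-style grouping; real O*NET categories and the paper's
# 35-skill set differ from this illustrative slice.
TAXONOMY = {
    "basic_skills":     ["reading_comprehension", "active_listening", "mathematics"],
    "technical_skills": ["programming"],
    "social_skills":    ["persuasion", "negotiation"],
}

def category_of(skill: str) -> str:
    """Return the taxonomy category a skill belongs to, or 'uncategorized'."""
    for category, members in TAXONOMY.items():
        if skill in members:
            return category
    return "uncategorized"

print(category_of("programming"))     # technical_skills
print(category_of("service_design"))  # uncategorized
```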
Benchmarking of Large Language Models (LLMs) – specifically Mistral Large, Gemini 2.5 Flash, and LLaMA 3.3 70B – was conducted utilizing the SAFI automation metric. Results demonstrated a high degree of consistency in skill profiles generated by each model; the standard deviation across all evaluated skills was only 3.6 points. This indicates that, despite architectural differences, these LLMs exhibit a largely convergent understanding of skills as defined by the SAFI methodology and its underlying O*NET taxonomy, suggesting the potential for reliable and consistent skill assessment across different LLM implementations.
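The reported cross-model agreement can be checked mechanically: for each skill, compute the spread of the three models' scores. The per-model numbers below are illustrative stand-ins, not the paper's raw data, and the aggregation (per-skill population standard deviation) is an assumption about how the 3.6-point figure was derived.

```python
from statistics import pstdev

# Illustrative per-skill scores for three models (not the paper's raw data).
scores = {
    "mathematics": {
        "mistral_large": 73.2, "gemini_2.5_flash": 71.0, "llama_3.3_70b": 70.4,
    },
    "active_listening": {
        "mistral_large": 42.2, "gemini_2.5_flash": 45.1, "llama_3.3_70b": 40.9,
    },
}

for skill, by_model in scores.items():
    # Low spread means the models largely agree on this skill's score.
    spread = pstdev(by_model.values())
    print(f"{skill}: spread across models = {spread:.1f} points")
```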

The Unexpected Twist: Undervalued Human Skills
Analysis of Skill Automation Feasibility Index (SAFI) data reveals a counterintuitive trend: skills widely considered essential in roles susceptible to AI disruption, specifically active listening and reading comprehension, receive comparatively low scores of 42.2 and 45.5, respectively. Because SAFI measures how feasibly an LLM can automate a skill, these low values indicate that current models remain weakest in precisely the human-communication skills that pervade AI-exposed occupations. In other words, where human demand for a skill is highest, LLM capability tends to be lowest: the capability-demand inversion at the core of these findings.
Skill Automation Feasibility Index (SAFI) scores are highest for mathematics and programming, registering at 73.2 and 71.8 respectively, reflecting LLMs’ strength in structured, rule-governed reasoning. Yet high feasibility has not translated into displacement. On the current trajectory of AI development, these tools function as powerful augmentation technologies: professionals proficient in mathematics and programming are demonstrating increased productivity and efficiency through AI-assisted platforms, so demand for these skills persists despite their theoretical potential for full automation.
The future labor market is projected to increasingly reward skills that are demonstrably difficult for artificial intelligence to replicate. Current data indicates that while technical skills like mathematics and programming receive high Skill Automation Feasibility Index (SAFI) scores, core human skills, specifically active listening and reading comprehension, score low, reflecting their relative immunity to automation. This discrepancy suggests a potential shift in economic value: uniquely human cognitive and interpersonal abilities will become premium assets as AI absorbs increasingly complex computational tasks. The implication is that investment in developing and refining these distinctly human capabilities will be crucial for workforce adaptability and long-term economic success.

The AI Impact Matrix: Navigating the Shifting Landscape
The AI Impact Matrix offers a novel approach to understanding the evolving landscape of work by synthesizing two crucial datasets: Skill Automation Feasibility Index (SAFI) scores and data from the Anthropic Economic Index. This combination allows for a visual mapping of skills, categorizing them based on their vulnerability to automation and the potential for future opportunity. By plotting skills on this matrix, analysts can readily identify areas where human capabilities are likely to be displaced by artificial intelligence, as well as those where skills are in high demand and poised for growth. This isn’t merely a prediction of job losses; the matrix highlights the shift in required competencies, providing actionable insights for workforce development and strategic investment in education and training programs. Ultimately, the AI Impact Matrix serves as a dynamic tool for navigating the complexities of the AI revolution and ensuring a future where technology complements, rather than replaces, human potential.
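A minimal rendering of such a matrix is a quadrant classification over the two axes. The thresholds and the demand figures below are assumptions for illustration, since the article specifies neither; only the two SAFI scores come from the text.

```python
# Quadrant classification for a hypothetical AI Impact Matrix.
# SAFI = automation feasibility (0-100); demand = adoption/usage signal.
# Cut-offs and demand values are illustrative assumptions.

def impact_quadrant(safi: float, demand: float,
                    safi_cut: float = 60.0, demand_cut: float = 50.0) -> str:
    if safi >= safi_cut and demand >= demand_cut:
        return "automation-ready"   # feasible and heavily used
    if safi >= safi_cut:
        return "latent automation"  # feasible but little adopted
    if demand >= demand_cut:
        return "human premium"      # in demand but hard to automate
    return "low priority"

skills = {
    "programming":      (71.8, 80.0),  # SAFI from the article; demand assumed
    "mathematics":      (73.2, 70.0),
    "active_listening": (42.2, 65.0),
}
for name, (safi, demand) in skills.items():
    print(f"{name}: {impact_quadrant(safi, demand)}")
```

Under these assumed thresholds, programming and mathematics land in the automation-ready quadrant while active listening lands in the human-premium quadrant, the region the article argues warrants strategic investment.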
Current applications of artificial intelligence overwhelmingly prioritize enhancing human abilities rather than outright replacing them, a trend evidenced by data revealing 78.7% of AI interactions are focused on augmentation. This pattern isn’t merely anecdotal; research detailed in ‘Agents of Chaos’ demonstrates a consistent preference for tools that amplify existing skillsets, enabling workers to become more efficient and productive. The implications are significant, suggesting a near-term future where AI functions primarily as a collaborative partner, handling repetitive tasks and providing data-driven insights, while human expertise remains central to complex problem-solving and strategic decision-making. This collaborative dynamic shifts the focus from fears of widespread job displacement to the necessity of reskilling and adaptation, preparing the workforce to effectively leverage these new AI-powered capabilities.
The convergence of high Skill Automation Feasibility Index (SAFI) scores and escalating AI adoption signals a clear trajectory toward automation for certain skill sets. These areas, characterized by tasks easily replicated by artificial intelligence, present both a challenge and an opportunity for workforce adaptation. Conversely, skills demonstrating low SAFI scores, yet maintaining strong demand, necessitate focused strategic investment. Prioritizing education and training in these uniquely human domains – those requiring complex problem-solving, critical thinking, and nuanced interpersonal skills – becomes paramount. This proactive approach ensures a resilient workforce capable of leveraging AI as a tool, rather than being displaced by it, fostering innovation and sustained economic growth in an evolving landscape.

The relentless march of ‘progress’ continues, predictably. This paper’s exploration of the Skill Automation Feasibility Index, another attempt to quantify the unquantifiable, merely confirms what seasoned engineers already suspect. LLMs demonstrate proficiency in structured reasoning, but stumble on the messy nuances of human communication. It’s a familiar story: automation excels at the predictable, leaving the genuinely complex for humans to wrestle with. As David Hilbert famously stated, ‘We must be able to answer the question: What are the ultimate foundations of mathematics?’, a sentiment that echoes here. The quest to define ‘skill’ and automate it will always hit the limits of what can be formally defined. The result won’t be wholesale job replacement, but a shifting landscape where humans and AI awkwardly coexist, patching each other’s shortcomings. Everything new is just the old thing with worse docs.
What’s Next?
The Skill Automation Feasibility Index, while a useful cartography of the present, inevitably sketches a map that will soon require revision. Benchmarking against LLMs is a moving target; today’s emergent capability becomes tomorrow’s baseline expectation. The real challenge isn’t identifying what these models can do, but predicting the unexpected ways production will discover their limitations. A high SAFI score doesn’t guarantee automation, only a temporary reprieve from the inevitable edge cases.
The study rightly points to a future of augmentation, but the definition of ‘augmentation’ is suspiciously optimistic. It implies a harmonious partnership. History suggests a more adversarial relationship: humans patching the cracks in automated systems, adding bespoke logic to correct algorithmic drift. The gap in ‘human communication skills’ isn’t a feature, it’s a holding pattern. Someone will build a module to address it, and then someone else will find a way to break it in a surprising context.
Future work should focus less on perfecting the index and more on quantifying the cost of maintaining these systems. Not the financial cost, but the cognitive load on the humans left to manage the fallout. The true metric isn’t automation feasibility, but ‘sustainability of suffering’. Legacy isn’t a bug; it’s a memory of better times. And these systems, inevitably, will become legacy.
Original article: https://arxiv.org/pdf/2604.06906.pdf
Contact the author: https://www.linkedin.com/in/avetisyan/
2026-04-09 16:51