FILE Empirical Validation Program: Testing AI-Era Leadership

From Human-AI Co-Created Leadership Theory to Testable Scientific Program

Author: Guillaume Mariani
AI tools: ChatGPT, Claude, Copilot, Gemini, Le Chat, and Perplexity
Date: May 2026
Arc 5: The FILE School of Thought
Version: Second strengthened version, integrating the peer reviews of Version 1 by the six AI collaborators

Abstract

The Five Intelligences of Leadership Evolution (FILE) proposes that leadership in AI-mediated environments requires the integration of Augmented, Emotional, Cultural, Political, and Adaptive Intelligence. This paper develops a FILE Empirical Validation research agenda without claiming that FILE has already been empirically validated. It presents FILE as a testable research program by defining its propositions, non-claims, falsifiability conditions, boundary conditions, latent constructs, operational variables, hypotheses, measurement strategy, pilot-study architecture, ethical safeguards, and educational implications.

Its central methodological contribution is a Roadmap-to-Falsifiability Matrix linking each research phase to conditions under which FILE could be supported, revised, narrowed, merged, or rejected. The paper also outlines how FILE may inform future Management, Leadership, and Technology degrees while insisting that such applications remain provisional until tested.

This second strengthened version integrates the intermediate peer-review recommendations of six AI collaborators. It adds a sharper contribution statement, clearer construct-boundary safeguards, preregistration and open-science discipline, primary and secondary outcome distinctions, pilot and full-scale sample-size guidance, organizational recruitment protocols, qualitative integration logic, technological boundary conditions, cross-national ethics safeguards, and a more focused doctoral dissertation pathway. The purpose is not to make FILE appear complete, but to make it more testable, more falsifiable, more ethically grounded, and more suitable as a foundation for future doctoral research.

FILE is strongest not when it claims to be complete, but when it invites the forms of evidence that could support, revise, narrow, or reject it.

Keywords: FILE Empirical Validation; FILE; Five Intelligences of Leadership Evolution; Augmented Intelligence; Emotional Intelligence; Cultural Intelligence; Political Intelligence; Adaptive Intelligence; AI leadership; leadership theory; human-AI collaboration; AI governance; empirical validation; falsifiability; leadership research agenda; latent constructs; Management Leadership and Technology; MLT degrees; socio-technical systems; Relational Commons; Ecosystemic Empowerment

1. Introduction — Why FILE Empirical Validation Now Needs an Empirical Research Agenda

FILE Empirical Validation is the next essential step for turning the Five Intelligences of Leadership Evolution from a conceptual framework into a testable research program. Leadership theory in the age of artificial intelligence faces a structural challenge that distinguishes it from prior generations of leadership scholarship. Earlier frameworks — including transformational leadership, adaptive leadership, authentic leadership, distributed leadership, and servant leadership — were developed primarily for organizational environments in which the central coordination problem was human-to-human.

The emergence of AI-mediated organizational environments changes these conditions. AI does not merely add a new managerial tool to existing organizations. It alters how decisions are generated, how judgment is distributed, how authority is exercised, how responsibility is assigned, and how human agency may be preserved or weakened within increasingly automated workflows.

The FILE framework — The Five Intelligences of Leadership Evolution — was developed in response to this challenge. It proposes that leadership in the age of AI requires the integration of five interdependent intelligences: Augmented Intelligence, Emotional Intelligence, Cultural Intelligence, Political Intelligence, and Adaptive Intelligence.

FILE’s proposed contribution is to explain how multiple human intelligences interact with AI-mediated systems to preserve human judgment, legitimacy, agency, relational responsibility, and adaptive capacity in leadership contexts.

This second strengthened version clarifies that contribution by moving from conceptual articulation to methodological accountability. It does not add new theoretical ambition to FILE; rather, it clarifies how FILE can be measured, bounded, challenged, revised, and translated into a feasible doctoral research program.

However, FILE has now reached a point where conceptual elaboration alone is insufficient. A framework that proposes to inform leadership theory, human-AI governance, future Management, Leadership, and Technology degrees, and public understanding of AI-era leadership must become empirically testable. In this sense, FILE Empirical Validation requires not only conceptual clarity, but also measurable constructs, comparison models, power-aware research design, open-science discipline, and possible falsification.

This paper therefore introduces the FILE empirical research agenda. It is not an empirical report. It does not present findings showing that FILE has been validated. Rather, it proposes the methodological architecture through which FILE could become a testable scientific program.

To ensure accountability, the paper includes a Roadmap-to-Falsifiability Matrix in Section 24. This matrix explicitly ties each research phase to possible disconfirming evidence, so that FILE can be tested, revised, narrowed, merged, or rejected where evidence requires it. The purpose of FILE Empirical Validation is not to defend the framework, but to expose it to disciplined empirical testing.

A deeper critique of FILE’s theoretical weaknesses, conceptual risks, and failure modes will be developed in Paper 2, “The Weaknesses and Limits of FILE.” A fuller epistemological analysis of human-AI co-created knowledge will be developed in Paper 4, “The Epistemology of Augmented Knowledge.” The present paper has a narrower task: to specify how FILE can be tested.

In short, FILE proposes that leadership effectiveness in AI-mediated environments depends on the coordinated exercise of five human intelligences that preserve human judgment, legitimacy, relational responsibility, agency, and adaptive capacity in socio-technical systems.

2. Core Theoretical Framework — What FILE Empirical Validation Tests

FILE advances six core theoretical propositions. These propositions should be read as conceptual claims requiring empirical translation, not as established findings. Each proposition contributes to FILE Empirical Validation by defining what must be operationalized, measured, compared, and potentially falsified.

Proposition 1 — The Multi-Intelligence Claim

FILE proposes that effective leadership in AI-mediated organizational environments requires the integration of multiple intelligences operating interdependently rather than in isolation.

Unit of analysis: individual leader, leadership dyad, executive team.

Empirical translation: test whether leaders with more balanced multi-intelligence profiles demonstrate better leadership outcomes than leaders with high scores on only one or two dimensions.

Proposition 2 — The Augmented Intelligence Claim

FILE proposes that Augmented Intelligence is a distinctive leadership capacity: the ability to work productively with AI systems without surrendering human judgment, responsibility, or critical evaluation.

Unit of analysis: individual leader, human-AI workflow, organizational decision process.

Empirical translation: test whether leaders with higher Augmented Intelligence show better decision quality, lower automation bias, stronger human oversight, and better calibrated AI use in AI-mediated tasks.

Proposition 3 — The Persistence of Human Intelligences Claim

FILE proposes that Emotional Intelligence, Cultural Intelligence, Political Intelligence, and Adaptive Intelligence remain essential even as AI systems become more capable of generating emotionally appropriate, culturally adapted, politically strategic, or adaptively responsive outputs.

Unit of analysis: individual leader, stakeholder relationship, team climate, organizational governance process.

Empirical translation: test whether human scores on these intelligences predict leadership outcomes beyond AI-generated task performance, AI literacy, or general digital leadership capability.

This proposition must not be interpreted as a human-vs-AI benchmark. The purpose is not to compare humans and machines as general intelligences. The narrower purpose is to examine whether human leadership capacities retain explanatory value in AI-mediated organizational settings when AI-generated task outputs are present.

In empirical studies, AI-generated outputs should be treated as task artifacts or decision-support inputs, not as evidence of machine leadership. Researchers may compare how leaders interpret, verify, contest, contextualize, or responsibly use AI-generated outputs, but they should not infer from output quality alone that AI systems possess leadership intelligence. This preserves the distinction between task performance and accountable leadership capacity.

Proposition 4 — The Interdependence Claim

FILE proposes that the five intelligences are not merely additive traits. They function as an interdependent profile.

Unit of analysis: individual leader profile, leadership team configuration, organizational FILE maturity.

Empirical translation: test whether latent profile analysis, cluster analysis, configurational models, or interaction models predict outcomes better than simple additive models.

Proposition 5 — The Maturity and Development Claim

FILE proposes that the five intelligences are developable capacities rather than fixed traits.

Unit of analysis: individual leader development trajectory, leadership program cohort, educational curriculum, organizational maturity trajectory.

Empirical translation: test whether FILE-based interventions produce measurable improvements in FILE-related capacities and associated leadership outcomes.

Proposition 6 — The Organizational and Educational Scope Claim

FILE proposes that its framework applies not only to individual leaders but also to teams, organizations, institutional systems, and educational programs.

Unit of analysis: team, organization, educational institution, governance system.

Empirical translation: test whether FILE maturity indicators at team and organizational levels predict outcomes beyond sector, size, AI maturity, and existing leadership models.

These propositions are related but independently testable. Evidence against one proposition would not necessarily invalidate the entire framework. FILE should be treated as a research program whose components can be confirmed, revised, limited, merged, or rejected through evidence.

If only some FILE dimensions survive empirical testing, FILE should be revised accordingly rather than defended as a fixed five-intelligence system. The framework’s credibility depends on its willingness to adapt to evidence.

3. What FILE Empirical Validation Does Not Claim

FILE does not claim to explain all leadership outcomes.

FILE does not claim to replace existing leadership theories.

FILE does not claim that AI automatically improves leadership.

FILE does not claim that all leaders require the same intelligence profile.

FILE does not claim to have already been empirically validated.

FILE does not claim universal applicability across all cultures.

FILE does not claim that its five intelligences are exhaustive; other leadership capacities may prove equally important in AI-mediated environments.

FILE does not claim that human-AI co-creation automatically produces reliable knowledge.

FILE does not claim that relational responsibility, organizational legitimacy, or adaptive capacity can be reduced to survey scores alone.

FILE does not claim that the existence of an assessment instrument would, by itself, validate the theory.

FILE does not claim that AI-generated task performance is equivalent to leadership intelligence.

These boundaries are not rhetorical modesty. They define the conditions under which FILE Empirical Validation can be responsibly studied.

4. FILE Empirical Validation, Falsifiability, Boundary Conditions, and Risks of Overclaiming

A framework that cannot be falsified cannot function as a serious scientific program. FILE must therefore specify what kinds of empirical findings would count against its claims.

The Multi-Intelligence Claim would require revision if single-factor models — such as emotional intelligence alone, general cognitive ability, AI literacy alone, transformational leadership alone, or digital leadership alone — consistently explain leadership effectiveness in AI-mediated contexts as well as or better than a multi-intelligence model.

The Augmented Intelligence Claim would require revision if leaders who defer directly to AI recommendations without independent judgment consistently produce equal or better outcomes than leaders who first generate independent assessments and then use AI critically.

The Persistence of Human Intelligences Claim would be challenged if Emotional, Cultural, Political, and Adaptive Intelligence scores show no incremental relationship with leadership outcomes once AI literacy, AI task performance, digital leadership, or AI-generated outputs are controlled.

The Interdependence Claim would be weakened if additive models predict outcomes as well as or better than profile-based, interaction-based, or configurational models.

The Maturity and Development Claim would be weakened if FILE-based educational or leadership interventions produce no measurable gains compared with control or comparison groups.

The Organizational and Educational Scope Claim would require limitation if FILE indicators predict individual-level outcomes but fail to aggregate meaningfully at team, organizational, institutional, or curriculum levels.

FILE’s propositions may vary across cultural context, sector, organizational AI maturity, leadership level, crisis intensity, institutional environment, and degree of human discretion. For example, FILE may be most relevant in contexts where human judgment, stakeholder legitimacy, and socio-technical coordination remain central. It may be less relevant in highly routinized micro-task environments, low-discretion roles, low-automation sectors, organizations with minimal formal AI governance structures, or early-stage contexts where AI is not yet integrated into consequential leadership processes.

FILE is most applicable in contexts where leaders retain discretion, where human judgment interacts with AI systems, and where legitimacy, trust, relational responsibility, and socio-technical coordination matter. FILE is less applicable in highly routinized micro-task environments, low-discretion roles, early-stage organizations without AI governance structures, or sectors with minimal human-AI interaction.

In practical terms, Propositions 1–3 are likely to be most testable in contexts where leaders exercise non-trivial discretion over AI deployment, stakeholder relationships, and human-AI interpretation. Propositions 5–6 may be least applicable in highly routinized micro-task environments, organizations with minimal formal governance structures, or contexts where AI use is too marginal to affect leadership practice.

These boundary conditions should be examined through the cross-cultural testing strategy, the MLT cultural-adaptation logic, the deterministic/probabilistic AI-environment taxonomy, and the Roadmap-to-Falsifiability Matrix.

Three risks require special attention: confirmation bias in construct development, construct bundling, and premature generalization. Paper 2 will examine FILE’s weaknesses, limits, and possible failure modes in greater depth. In this paper, the purpose is only to define the conditions under which FILE Empirical Validation can be responsibly tested.

5. Technological Boundary Conditions — Deterministic Automation and Probabilistic Multi-Agent Ecosystems

AI-mediated environments are not all the same. A leadership framework designed for the age of AI must distinguish between different types of socio-technical systems.

A critical systemic boundary condition should be drawn between Deterministic Automation Architectures and Probabilistic Multi-Agent Ecosystems.

Deterministic Automation Architectures refer to rigid, rule-based, linear systems in which automation follows predefined pathways. Examples may include legacy workflow routing, fixed enterprise-resource-planning rules, automated compliance checklists, or predictable task-execution systems. In such environments, traditional digital competency, operational excellence, or task-execution models may retain substantial predictive utility.

Probabilistic Multi-Agent Ecosystems refer to generative, semi-autonomous, adaptive, and networked AI environments in which outputs are probabilistic, interactions are dynamic, and consequences may be emergent. Examples may include generative AI copilots, agentic workflow systems, human-AI decision loops, AI-assisted strategy processes, or multi-agent systems that influence professional judgment, stakeholder communication, governance, and organizational learning.

FILE’s constructs — especially Augmented Intelligence and Political Intelligence — are likely to be most expressive in probabilistic, generative, semi-autonomous, and multi-agent environments where outcomes are uncertain, non-linear, and socially consequential. In highly rigid, deterministic legacy environments where AI functions primarily as a linear automation engine, the expression of FILE dimensions may be constrained, and narrower digital-competency or task-execution models may explain outcomes sufficiently.

This taxonomy does not make FILE irrelevant to deterministic environments. It clarifies that FILE’s distinctive value is most likely to emerge where human judgment must interpret, govern, contest, and contextualize AI-mediated outputs under conditions of uncertainty, stakeholder consequence, and distributed accountability.

6. FILE Empirical Validation Through Latent Constructs — Operationalizing the Five Intelligences

For empirical purposes, the five intelligences should be treated as latent constructs, not directly observable traits. A latent construct is inferred from multiple indicators because the underlying capacity cannot be observed in a single behavior, self-report item, or performance score. For FILE Empirical Validation to be credible, each intelligence must be distinguishable from existing measures while remaining theoretically connected to them.

6.1 Augmented Intelligence and FILE Empirical Validation

Augmented Intelligence may be defined as the latent capacity to integrate AI systems into judgment, decision-making, coordination, and workflow design without surrendering human responsibility.

Its closest existing constructs include AI literacy, digital literacy, digital leadership, technology acceptance, and human-AI teaming. FILE differs from these constructs because it does not focus only on knowledge of AI tools, acceptance of technology, or digital fluency. It focuses on human-AI judgment integration under explicit responsibility: the ability to know when to use AI, when not to use it, how to verify outputs, how to preserve human accountability, and how to design workflows in which machine assistance does not become uncritical dependence.

Incremental-validity hypothesis: FILE Augmented Intelligence will explain additional variance in calibrated AI use, automation-bias resistance, and AI governance quality beyond AI literacy, digital leadership, and general cognitive ability.

6.2 Emotional Intelligence and FILE Empirical Validation

Emotional Intelligence may be operationalized as the capacity to recognize, regulate, and respond appropriately to affective and relational dynamics in leadership contexts.

Its closest existing constructs include ability-based emotional intelligence, mixed-model emotional intelligence, empathy, emotional regulation, and relational leadership. FILE differs by situating emotional intelligence specifically in AI-mediated contexts, where leaders may rely on algorithmic systems while remaining accountable for the human consequences of those decisions. The distinctive issue is not whether a leader can recognize emotion in general, but whether the leader can preserve relational responsibility when decisions, communications, or evaluations are partially mediated by AI.

Incremental-validity hypothesis: FILE Emotional Intelligence will explain additional variance in trust, psychological safety, and employee dignity in AI-mediated workplaces beyond standard emotional intelligence measures.

6.3 Cultural Intelligence and FILE Empirical Validation

Cultural Intelligence may be defined as the capacity to interpret and act effectively across cultural, professional, disciplinary, institutional, linguistic, and symbolic differences.

Its closest existing constructs include cross-cultural intelligence, intercultural competence, global leadership, and diversity competence. FILE differs by extending cultural intelligence beyond national or ethnic culture to include techno-social translation across professional communities, algorithmic systems, institutional norms, and symbolic contexts. A leader may need to translate between engineers, regulators, employees, educators, and citizens as much as between national cultures.

Incremental-validity hypothesis: FILE Cultural Intelligence will explain additional variance in cross-functional coordination, culturally legitimate AI deployment, and stakeholder trust beyond conventional cultural intelligence scales.

6.4 Political Intelligence and FILE Empirical Validation

Political Intelligence may be defined as the capacity to navigate power, legitimacy, stakeholder conflict, coalition formation, institutional constraints, and governance trade-offs.

Its closest existing constructs include political skill, social astuteness, influence tactics, stakeholder management, and institutional awareness. FILE differs by emphasizing legitimacy and governance in AI-mediated systems, not only interpersonal influence. Political Intelligence in FILE concerns how leaders align power with responsibility, build coalitions around contested technologies, anticipate institutional consequences, and prevent AI transformation from becoming a technocratic imposition.

Incremental-validity hypothesis: FILE Political Intelligence will explain additional variance in stakeholder alignment, AI governance legitimacy, and implementation feasibility beyond political skill and transformational leadership.

6.5 Adaptive Intelligence and FILE Empirical Validation

Adaptive Intelligence may be defined as the capacity to learn, unlearn, revise assumptions, and remain effective under uncertainty, disruption, and incomplete information.

Its closest existing constructs include resilience, learning agility, adaptive performance, cognitive flexibility, ambidextrous leadership, and dynamic capabilities. FILE differs by emphasizing adaptive judgment under socio-technical disruption. It is not merely the ability to cope with stress or change behavior. It is the ability to revise leadership assumptions when AI changes workflows, authority structures, labor relations, and governance risks.

Incremental-validity hypothesis: FILE Adaptive Intelligence will explain additional variance in crisis decision quality, organizational learning, and responsible adaptation to AI disruption beyond resilience, learning agility, and adaptive performance.

7. Measurement Instruments for FILE Empirical Validation

FILE should not begin measurement from nothing. Existing research offers relevant starting points. A responsible approach to FILE Empirical Validation must therefore compare FILE against existing leadership theories and adjacent constructs.

For Emotional Intelligence, ability-based and self-report instruments may inform items related to emotional perception, regulation, empathy, trust-building, and relational functioning. However, FILE-specific items must be added to capture AI-mediated relational responsibility.

For Cultural Intelligence, existing CQ instruments provide useful foundations around metacognitive, cognitive, motivational, and behavioral dimensions. However, FILE-specific items must be added to capture techno-social translation across professional, institutional, and algorithmic contexts.

For Political Intelligence, political skill measures and related instruments can inform the measurement of social astuteness, influence, networking, and sincerity. However, FILE-specific Political Intelligence requires new items focused on AI governance, stakeholder legitimacy, institutional consequence, and contestability.

For Adaptive Intelligence, adaptive performance, learning agility, resilience, and crisis response instruments may provide partial foundations. However, new items are required to capture algorithmic stress, socio-technical disruption, and revision of assumptions under AI-mediated uncertainty.

For Augmented Intelligence, measurement is less mature. Possible foundations include AI literacy, digital literacy, human-AI teaming, automation trust, automation bias, decision support, and calibrated oversight. However, the core of the construct requires primarily original item generation because no mature scale fully captures human-AI judgment integration under explicit responsibility.

At the outcome level, FILE should engage with instruments measuring psychological safety, leadership effectiveness, learning organization, dynamic capabilities, trust, ethical climate, and organizational resilience.

The central measurement challenge is to show that FILE is not simply a relabeling of existing constructs. This requires convergent validity, discriminant validity, and incremental validity testing.

As provisional guidance, convergent validity would be expected when FILE-specific constructs correlate moderately with theoretically adjacent measures, for example in the approximate range of .30 to .60. Correlations that are too low may indicate conceptual disconnection; correlations that are too high may indicate redundancy. Discriminant validity would be threatened if a FILE construct correlates so highly with an existing construct that it cannot be empirically distinguished from it.

For example, if FILE Emotional Intelligence correlates above approximately .80 with a validated emotional intelligence scale across multiple samples and models, this would trigger a recommendation to merge, relabel, narrow, or drop the FILE-specific construct rather than defend it as distinct.

These thresholds should be treated as methodological heuristics, not rigid rules. They may require adjustment for newly developed constructs such as Augmented Intelligence, where no established convergent-validity benchmarks yet exist. Final interpretation should depend on measurement quality, sample characteristics, model fit, theoretical coherence, and evidence accumulated across contexts.

8. Construct-Boundary Table for FILE Empirical Validation

A central risk for FILE Empirical Validation is construct proliferation: giving new names to capacities already captured by existing constructs. The following table clarifies what each intelligence is, what it is not, and what adjacent constructs it must be distinguished from.

This table defines the conceptual boundaries that future empirical studies must test; it does not assume that all five constructs will remain distinct after factor analysis.

These boundaries are intended to operationalize guardrails against jangle fallacies and construct bundling, so that FILE constructs are abandoned, merged, relabeled, or narrowed if empirical overlap with adjacent measures proves excessive.

FILE construct	What it is	What it is not	Adjacent constructs to distinguish from
Augmented Intelligence	The capacity to integrate AI into judgment, decision-making, workflow design, and accountability without surrendering human responsibility	Mere AI literacy, digital fluency, prompt engineering, or enthusiasm for technology	AI literacy, digital leadership, technology acceptance, human-AI teaming, automation trust
Emotional Intelligence	The capacity to preserve trust, empathy, emotional regulation, dignity, and relational responsibility in AI-mediated leadership contexts	Generic empathy, charisma, emotional display, or interpersonal warmth alone	Emotional intelligence, empathy, relational leadership, psychological safety
Cultural Intelligence	The capacity to translate across national, professional, institutional, disciplinary, linguistic, and symbolic contexts in socio-technical systems	Diversity vocabulary alone, global exposure alone, or cultural politeness	Cultural intelligence, intercultural competence, global leadership, diversity competence
Political Intelligence	The capacity to navigate power, legitimacy, stakeholder conflict, coalitions, governance, and institutional consequences in AI transformation	Manipulation, office politics, influence tactics alone, or personal ambition	Political skill, stakeholder management, institutional awareness, negotiation competence
Adaptive Intelligence	The capacity to revise assumptions, learn under uncertainty, recover from disruption, and exercise judgment in changing socio-technical environments	Stress tolerance alone, resilience alone, flexibility alone, or change acceptance	Resilience, learning agility, adaptive performance, cognitive flexibility, dynamic capabilities

This table does not prove construct distinctiveness. It defines what future empirical studies must test. If factor analysis or incremental-validity testing shows that a FILE construct collapses into an existing construct, that dimension should be revised, narrowed, merged, relabeled, or removed.

9. Operational Variables Table for FILE Empirical Validation

The operational variables table translates FILE Empirical Validation into measurable indicators, tools, levels of analysis, and expected outcomes.

FILE Construct	Latent Dimension	Possible Indicators	Measurement Tool	Level of Analysis	Expected Outcome
Augmented Intelligence	AI literacy and calibrated oversight	Output evaluation, automation-bias detection, tool-task fit, verification routines	De novo FILE items; AI literacy adaptation; scenario tasks; workflow audit; 360 ratings	Individual, team, organization	Better human-AI collaboration; reduced misuse of AI outputs
Augmented Intelligence	Human-AI workflow design	Role clarity, escalation protocols, fallback procedures, human-in-the-loop routines	Process audit; observation; interviews	Team, organization	Stronger AI governance and operational reliability
Emotional Intelligence	Relational attunement	Empathy, regulation, trust-building, conflict de-escalation	EI scale adaptation; 360 feedback; climate surveys	Individual, team	Higher trust and psychological safety
Emotional Intelligence	Relational responsibility	Accountability for the human effects of AI-mediated decisions	FILE-specific vignette tasks; subordinate ratings; interviews	Individual, team	Lower alienation; stronger employee trust
Cultural Intelligence	Cross-context interpretation	Translation across cultures, disciplines, professions, institutions	CQ scale adaptation; case interpretation; multisource ratings	Individual, team, organization	Better coordination across silos and cultures
Cultural Intelligence	Symbolic sensitivity	Recognition of cultural misfit, normative blind spots, symbolic harms	FILE-specific scenario tasks; qualitative coding	Individual, organization	More legitimate and context-sensitive AI deployment
Political Intelligence	Stakeholder and power diagnosis	Stakeholder mapping, coalition awareness, legitimacy analysis	De novo FILE items; political skill inspiration; governance audit; simulations	Individual, organization, institution	Better stakeholder alignment and implementation feasibility
Political Intelligence	Responsible influence	Negotiation, coalition-building, ethical use of power	360 ratings; network analysis; simulations	Team, organization	Stronger governance and institutional legitimacy
Adaptive Intelligence	Learning under uncertainty	Assumption revision, learning from error, resilience, recovery	Adaptive performance adaptation; longitudinal tracking	Individual, team, organization	Better crisis navigation and organizational learning
Adaptive Intelligence	Judgment under ambiguity	Decision quality under incomplete information; disciplined adjustment	Scenario tasks; supervisor ratings; after-action reviews	Individual, team	Stronger strategic judgment

Relational Commons and Ecosystemic Empowerment are not sixth and seventh intelligences. They are cross-cutting emergent constructs or outcome conditions.

10. Cross-Cutting Constructs in FILE Empirical Validation — Relational Commons and Ecosystemic Empowerment

The Relational Commons refers to the shared field of trust, dignity, psychological safety, meaning, and relational quality within teams, organizations, and institutions. It is not located in one individual. It emerges through repeated interaction, governance patterns, communication norms, leadership behavior, and socio-technical design.

Ecosystemic Empowerment refers to the extent to which a socio-technical system expands human agency, participation, contestability, and meaningful influence across organizational and institutional levels. It concerns whether AI deployment concentrates power or distributes capability.

FILE Construct	Latent Dimension	Possible Indicators	Measurement Tool	Level of Analysis	Expected Outcome
Relational Commons	Shared trust, dignity, meaning, psychological safety	Voice, dignity, psychological safety, trust, meaning	Climate surveys; focus groups; interviews; ethnography	Team, organization	Healthier collaboration and reduced automation anxiety
Ecosystemic Empowerment	Distributed agency and participation	Voice, autonomy, contestability, governance participation	Employee surveys; governance metrics; stakeholder interviews	Organization, institution, ecosystem	More legitimate adoption and stronger human agency

These constructs may function as outcomes, mediators, or contextual conditions in FILE research. They allow FILE to move beyond individual leader traits and examine the broader social conditions produced by AI-era leadership.

11. Levels of Socio-Technical Analysis and Aggregation Logic

Cross-level analysis is essential because FILE Empirical Validation cannot rely only on individual leader self-reports. FILE should therefore be studied across multiple socio-technical levels:

Individual level: leader capacities, biases, judgment, intelligence profiles, and developmental trajectories.
Team level: human-AI collaboration, psychological safety, shared mental models, team learning, and workflow coordination.
Organizational level: AI governance, digital transformation maturity, leadership systems, resource allocation, and organizational resilience.
Interorganizational ecosystem level: supply chains, platform relationships, alliances, data-sharing arrangements, and cross-boundary trust.
Institutional and governance level: regulation, compliance, standards, public legitimacy, and AI governance regimes.
Educational and curriculum level: MLT degrees, executive education, learning outcomes, and competency development.
Societal level: labor-market effects, public trust, human dignity, social stratification, and civic consequences of AI-mediated systems.

Team-level and organizational-level FILE maturity should be treated as partly compositional and partly emergent.

Some elements may be compositional. For example, average individual Augmented Intelligence or Adaptive Intelligence scores within a team may be aggregated if there is sufficient within-group agreement.

Other elements are emergent. Relational Commons and Ecosystemic Empowerment cannot be reduced to the sum of individual scores. They arise from interaction patterns, governance systems, shared norms, and socio-technical design.

Aggregation should therefore be empirically justified rather than assumed. Researchers should use standard multilevel indices such as rwg(j) for within-group agreement, ICC(1) for the proportion of variance attributable to group membership, and ICC(2) for reliability of group means.

Where aggregation is not justified, researchers should use multilevel modeling, multilevel structural equation modeling, random-coefficient models, or latent-aggregation approaches rather than simple averaging.

This prevents the atomistic fallacy of drawing organizational conclusions from individual data and the ecological fallacy of inferring individual behavior from aggregate patterns.

12. FILE Empirical Validation Hypotheses, Mediators, Moderators, and Competing Explanations

The following hypotheses are illustrative. They show how FILE Empirical Validation can move from conceptual propositions to testable empirical claims without claiming that the relationships have already been demonstrated.

Where appropriate, variables may be measured using 7-point Likert scales, scenario-based judgment tasks, 360-degree feedback, behavioral observations, governance audits, and three-wave longitudinal designs spaced at six-month intervals.

H1 — Augmented Intelligence and AI Decision Quality

Leaders one standard deviation higher in FILE Augmented Intelligence will show higher calibrated AI-use scores and lower automation-bias scores in AI-mediated decision tasks, controlling for AI literacy, digital leadership, general cognitive ability, and leadership experience.

H2 — Relational Commons and Team Outcomes

Teams with higher aggregate Emotional, Cultural, and Political Intelligence will report stronger Relational Commons, which will predict psychological safety and team learning behavior over three measurement waves.

H3 — FILE Maturity and AI Governance

Organizations with higher FILE maturity will show stronger AI governance quality, mediated by Ecosystemic Empowerment and stakeholder participation, controlling for organizational size, sector, and AI maturity.

H4 — Adaptive Intelligence Under Disruption

Adaptive Intelligence will predict crisis decision quality in AI-mediated environments beyond resilience, learning agility, and adaptive performance measures.

H5 — Augmented Intelligence as Integrator

Augmented Intelligence will moderate the relationship between Emotional, Cultural, Political, and Adaptive Intelligence and AI transformation outcomes, such that these relationships are stronger when Augmented Intelligence is high.

H6 — MLT Curriculum Development

Students in MLT-aligned curricula will show greater gains in Augmented Intelligence, Political Intelligence, and Adaptive Intelligence than students in traditional management curricula, controlling for baseline competencies and prior experience.

H7 — Incremental Validity Hypothesis

FILE maturity will explain additional variance in AI governance quality, psychological safety, and human-AI collaboration quality beyond established leadership theories, adjacent validated constructs, and digital/AI leadership measures.

H8 — Cross-Cultural Measurement Hypothesis

The five FILE constructs will show at least partial configural and metric invariance across cultural clusters; if they do not, FILE’s cross-cultural claims must be limited or revised.

13. Priority Outcomes for the First Doctoral-Stage Validation Phase

Because FILE’s full research agenda is broad, the first doctoral-stage validation phase should prioritize a limited set of outcomes. This prevents dispersion and gives the research program a feasible empirical core.

The recommended priority outcomes are:

Calibrated AI use: the leader’s ability to use AI outputs critically, appropriately, and with human accountability.
AI governance quality: the presence of responsible, transparent, contestable, and human-centered AI decision processes.
Relational Commons: the quality of trust, dignity, psychological safety, voice, and meaning in AI-mediated teams.
Ecosystemic Empowerment: the extent to which AI transformation expands rather than reduces human agency and participation.
Psychological safety in AI-mediated teams: the degree to which team members feel safe questioning AI outputs, raising concerns, and challenging automation-driven decisions.

For a first doctoral-stage study, calibrated AI use and AI governance quality should be treated as primary outcomes because they are closest to FILE’s distinctive AI-era contribution. Relational Commons, Ecosystemic Empowerment, and psychological safety should be treated as secondary, mediating, or contextual outcomes unless the study is specifically designed around team climate, organizational culture, or employee experience.

A doctoral study should not attempt to validate all possible FILE outcomes at once. A stronger design would test a small number of outcomes deeply rather than many outcomes superficially.

14. Incremental Validity in FILE Empirical Validation

A central requirement for FILE’s credibility is incremental validity. It is not enough to show that FILE correlates with desirable outcomes. FILE must show that it explains something beyond established constructs. Incremental validity is one of the strongest tests of FILE Empirical Validation, because it asks whether FILE adds explanatory value beyond existing theories.

A basic incremental-validity test may proceed through a three-block hierarchical regression or SEM approach.

Block 1 — General Leadership Theories

Enter established leadership constructs such as transformational leadership, servant leadership, authentic leadership, and adaptive leadership.

Purpose: test how much variance in AI-era leadership outcomes is already explained by classic leadership theories.

Statistical check: record baseline R² or model fit.

Block 2 — Adjacent Modern and Technical Constructs

Add standard emotional intelligence, cultural intelligence, political skill, adaptive performance, AI literacy, digital leadership, psychological safety, learning organization, and dynamic capabilities.

Purpose: test whether adjacent validated constructs and digital/AI-specific constructs explain additional variance beyond general leadership theories.

Statistical check: record ΔR² or Δχ² from Block 1.

Block 3 — FILE-Specific Constructs

Add Augmented Intelligence, FILE-specific Emotional Intelligence, FILE-specific Cultural Intelligence, FILE-specific Political Intelligence, FILE-specific Adaptive Intelligence, Relational Commons, and Ecosystemic Empowerment.

Purpose: test whether FILE explains additional variance beyond both classic leadership theories and adjacent validated constructs.

Statistical check: record ΔR² or Δχ² from Block 2.

Before entering Block 3, researchers should conduct a pre-Block-3 multicollinearity assessment. This should include examination of correlations among Block 1 and Block 2 variables, variance inflation factors, tolerance values, eigenvalues, and condition indices. If Block 2 variables already show severe multicollinearity, the researcher should not interpret Block 3 effects without first addressing the measurement structure of the baseline model.

At each block transition, researchers should report multicollinearity diagnostics, including variance inflation factors, tolerance values, eigenvalues, and condition indices.

As a rule of thumb, VIF values above 5.0 or condition indices above 30 indicate severe multicollinearity. In such cases, researchers should not automatically claim incremental validity. Instead, they should consider model respecification, factor collapsing, latent factor restructuring, bifactor modeling, parceling where theoretically justified, or orthogonalization through appropriate statistical procedures.

If Block 3 produces a non-significant ΔR² or Δχ², FILE’s incremental contribution is not supported in that model. If only some FILE dimensions contribute, the framework should be revised accordingly. Incremental validity must be interpreted dimension by dimension, not as a blanket validation of the entire framework.

FILE should not claim superiority over existing theories in general. The more precise research question is: under what conditions, for which outcomes, and at which levels of analysis does FILE provide incremental explanatory value?

The incremental-validity testing sequence described above presupposes a measurement strategy rigorous enough to support meaningful comparison; that strategy is developed in the following section.

15. Measurement Strategy, Psychometric Validation, and FILE Maturity Model

A credible measurement strategy should proceed in stages:

construct clarification and item generation;
expert review and content validation;
exploratory factor analysis;
confirmatory factor analysis;
reliability and validity testing;
measurement invariance testing;
common-method and social-desirability bias mitigation;
multi-source data collection.

Where possible, future FILE empirical studies should preregister hypotheses, primary outcomes, exclusion rules, and analytic plans before data collection, while clearly distinguishing exploratory construct development from confirmatory hypothesis testing.

Sample Size and Power Guidance

Because FILE proposes multiple latent constructs and multi-level relationships, empirical validation will require careful attention to sample size and statistical power. These numbers should be treated as provisional guidance rather than rigid universal rules.

For early item development and expert review, a small expert sample may be sufficient if the goal is content refinement rather than statistical inference. A Delphi panel or expert interview study might begin with 12–30 experts across leadership, AI governance, organizational psychology, education, and management.

For exploratory factor analysis, early FILE studies should target at least N ≥ 200 as a minimum threshold when the item pool is limited and communalities are adequate. A stronger exploratory factor analysis should aim for N ≥ 300, and larger samples are preferable when items are numerous or factor structure is uncertain.

For confirmatory factor analysis, studies should target at least N ≥ 300 for relatively simple models and N ≥ 500 for complex multi-factor models. For SEM, sample requirements depend on model complexity, number of estimated parameters, missingness, indicator quality, and distributional assumptions.

For pilot SEM models, provisional targets of N ≥ 100–150 may be acceptable if the model is simple, the number of indicators is limited, and the purpose is feasibility or effect-size estimation rather than definitive validation. Studies should aim for power ≥ .80 for detecting medium effects where feasible.

For the six-month mixed-methods pilot, a first-wave study could target approximately 150–250 individual participants, for example across two to three organizations or three to five teams per organization, for survey-based constructs and scenario tasks. If an experimental design includes experimental, active control, and waitlist groups, an illustrative target would be N = 150 leaders, or approximately 50 per group, with recruitment adjusted upward to account for attrition. Assuming approximately 10% attrition, researchers might recruit about N = 165 to retain N = 150.

If organizational access is limited, a fallback N = 100 may still support preliminary analyses for medium-large effects, but it should be interpreted as feasibility and effect-size estimation, not definitive validation. Researchers should use tools such as G*Power, R packages such as pwr, or simulation-based approaches to verify power once the final measurement model is specified.

For full-scale validation, especially when multilevel structural equation modeling is planned, researchers should consider Monte Carlo power simulation. A cautious full-scale validation target may use a ratio of approximately 10 participants per estimated parameter, an individual-level baseline of roughly N ≥ 350 valid responses, and a multilevel sampling framework of approximately J ≥ 50 teams with at least five members per team. These are not fixed rules; they are planning anchors for sufficiently powered validation studies.

For intervention studies, power analysis should be conducted before data collection. The first pilot should be treated primarily as a feasibility, measurement, and effect-size estimation study rather than a definitive causal test of FILE’s full theoretical architecture.

Measurement Invariance Testing for FILE Empirical Validation Across Cultures

Measurement invariance is essential for FILE Empirical Validation across cultures, sectors, and institutional contexts. To ensure that FILE constructs are comparable across cultural contexts, future research should use Multi-Group Confirmatory Factor Analysis.

The relevant stages include configural invariance, metric invariance, scalar invariance, and strict invariance. Strict invariance is desirable but not always required for practical comparison.

As operational thresholds, researchers may treat measurement invariance as supported when ΔCFI < .01 and ΔRMSEA < .015.

If full invariance fails, researchers should test partial invariance, identify non-invariant items, and limit cross-cultural comparisons accordingly.

Provisional FILE Maturity Model

Awareness: recognition that AI-era leadership requires more than technical adoption.
Adoption: initial use of FILE-relevant practices.
Integration: coordinated use of multiple intelligences in real leadership contexts.
Orchestration: systematic alignment across teams and organizational processes.
Embodiment: sustained, reflexive, context-sensitive exercise of the five intelligences.

This maturity model should be treated as a heuristic for testing, not as a validated developmental sequence.

Construct Priority Map — Years 1–4

Period	Priority Constructs	Measurement Approach	Rationale
Year 1	Augmented Intelligence	Primarily de novo item generation, drawing lightly on AI literacy and human-AI teaming literature	No mature scale fully captures human-AI judgment integration under explicit responsibility
Year 1	FILE-specific Political Intelligence	Primarily de novo item generation, drawing lightly on political skill and stakeholder governance literature	Existing political skill scales do not capture AI governance, legitimacy, and contestability
Year 1	Adaptive Intelligence	Hybrid approach: adapt adaptive performance and learning-agility foundations while adding items for algorithmic stress and socio-technical disruption	Balances existing foundations with AI-era specificity
Year 2	FILE-specific Emotional Intelligence	Adapt validated EQ item pools while adding items for AI-mediated relational responsibility	Extends emotional intelligence into human-AI decision contexts
Year 2	FILE-specific Cultural Intelligence	Adapt validated CQ item pools while adding items for techno-social translation across professional, institutional, and algorithmic contexts	Extends cultural intelligence beyond conventional cross-cultural competence
Year 2	Relational Commons and Ecosystemic Empowerment	Develop multi-level climate, governance, and participation indicators	Central cross-cutting constructs for FILE’s socio-technical claims
Year 3	FILE maturity model	Build developmental and diagnostic indicators after construct stability improves	Requires preliminary evidence of construct structure
Year 4	Cross-cultural invariance and curriculum validation	Test measurement invariance and educational outcomes	Requires stronger measurement base before scaling

16. Research Designs for FILE Empirical Validation

A serious FILE Empirical Validation research program requires multiple research designs:

expert Delphi studies;
scale development studies;
cross-sectional surveys;
mixed-methods case studies;
longitudinal studies;
intervention studies;
curriculum pilots;
cross-cultural validation studies.

No single study can validate FILE. The framework requires cumulative evidence across methods, samples, levels, and contexts.

Qualitative Integration Logic

Qualitative methods should not function merely as illustrative examples. They should play four distinct roles in FILE Empirical Validation.

First, qualitative data can support construct discovery by identifying language, dilemmas, behaviors, tensions, decision patterns, and forms of algorithmic friction not captured by existing scales.

Second, qualitative data can support process tracing by showing how the five intelligences appear in real organizational sequences, such as AI adoption, resistance, crisis response, stakeholder conflict, workflow redesign, or governance implementation.

Third, qualitative data can support contextual explanation by identifying why FILE may function differently across sectors, cultures, institutions, leadership levels, or degrees of AI maturity.

Fourth, qualitative data can support theory refinement by showing when FILE constructs are too broad, too narrow, overlapping, culturally misaligned, or missing important dimensions.

Mixed-methods studies should therefore specify whether qualitative data are being used for triangulation, explanation, construct refinement, or theory development.

Exploratory Sequential Mixed-Methods Design

A strong early FILE study may use an Exploratory Sequential Mixed-Methods Design. In this design, qualitative work precedes quantitative instrument development. Semi-structured interviews, expert panels, field observations, case studies, and ethnographic data can help identify candidate dimensions of Augmented Intelligence, Political Intelligence, Adaptive Intelligence, Relational Commons, and Ecosystemic Empowerment.

Where appropriate, qualitative strands may draw on constructivist grounded theory, institutional ethnography, thematic analysis, or case-comparison logic. The purpose is not to use qualitative material as anecdotal illustration, but to generate theoretically meaningful categories that can inform item development.

Qualitative data should be collected and analyzed until the researcher has sufficient conceptual saturation for the study’s scope. A formal joint-display matrix can then map qualitative codes to tentative psychometric indicators. For example, recurring interview themes about “checking AI before acting,” “knowing when to override an AI recommendation,” or “protecting employees from automated evaluation” may become candidate indicators of Augmented Intelligence and relational responsibility.

Qualitative findings can also diagnose null or unexpected quantitative findings. If a FILE dimension fails to show incremental validity, qualitative data may help determine whether the failure reflects poor measurement, contextual misfit, cultural interpretation, insufficient variance, or a genuine theoretical weakness.

17. Concrete Pilot Study Proposal for FILE Empirical Validation

The pilot study should be understood as an early step in FILE Empirical Validation, not as proof that FILE has been confirmed.

Study title: A Six-Month Mixed-Methods Pilot Study of FILE Maturity in AI-Transformed Organizations

Illustrative timeline: Q1–Q3 2027

Preferred sample: three organizations undergoing AI transformation

Design: mixed-methods, three-wave study with baseline, midpoint, and endpoint measures

The pilot should include an explicit comparison structure: experimental group, active control group, and optional waitlist group if feasible.

The experimental group receives a FILE-aligned intervention covering Augmented Intelligence, Emotional Intelligence, Cultural Intelligence, Political Intelligence, Adaptive Intelligence, Relational Commons, and Ecosystemic Empowerment.

The active control group receives generic AI literacy training or generic leadership training without FILE integration.

This design allows the pilot to assess whether FILE adds value beyond generic leadership development or AI literacy training.

Independent variables: Augmented Intelligence, Emotional Intelligence, Cultural Intelligence, Political Intelligence, Adaptive Intelligence, FILE maturity.

Mediators: Relational Commons, Ecosystemic Empowerment.

Outcomes: calibrated AI use, AI governance quality, psychological safety, team learning, perceived transformation effectiveness.

Controls: leadership style, organizational size, sector, AI maturity, leadership level.

The pilot would not validate FILE conclusively. It would produce preliminary effect sizes, refine constructs, identify measurement problems, clarify boundary conditions, and inform the design of larger-scale studies.

Minimum Viable Pilot Options

If access to three organizations is not feasible, the pilot may be redesigned as a minimum viable study.

Option A — Single-organization pilot: one organization undergoing AI transformation, with multiple teams or departments serving as comparison units.

Option B — Executive education pilot: one executive education cohort, with pre/post measurement and comparison between FILE-specific and generic leadership modules.

Option C — MLT-style learning cohort: one university, business school, or professional program cohort testing FILE-related learning outcomes through case analysis, scenario tasks, reflective portfolios, and 360-degree feedback.

Option D — Expert-panel validation: a Delphi or expert-review study focused on construct refinement and item development before field deployment.

A minimum viable pilot should not be treated as causal validation. Its purpose is to test usability, measurement clarity, feasibility, participant burden, and preliminary effect sizes. If access to multiple organizations proves difficult, a minimum viable pilot could be conducted with a single organization or executive education cohort, focusing on testing the measurement model and basic associations rather than full cross-organizational comparisons. This should be treated as an initial feasibility and measurement study, not as a definitive test of FILE.

Non-Disruptive Operational Alignment Protocol

Organizational access is likely to be one of the most important feasibility challenges in FILE Empirical Validation. Research designs should therefore minimize disruption to participating organizations.

A practical access strategy may use a Non-Disruptive Operational Alignment Protocol. Under this protocol, FILE-based interventions are integrated into pre-existing executive education, leadership-development, AI-governance, digital-transformation, or technical-upskilling cycles rather than requiring separate operational downtime.

Where experimental timing is possible, researchers may consider a stepped-wedge cluster randomized design. In such a design, teams or cohorts receive the FILE intervention at different points in time, allowing comparison across phases while ensuring that all participating teams eventually receive the intervention. This may reduce corporate risk, improve executive buy-in, and preserve operational continuity.

This design should not be presented as universally feasible. It is an access-friendly option that may be appropriate where organizations agree to phased implementation and where ethical review permits staged comparison.

18. Sampling Strategies, Data Sources, and Organizational Access

FILE research should include diverse samples: executives, middle managers, frontline supervisors, cross-functional teams, AI transformation units, employees, educators, students, technical staff, public-sector leaders, nonprofit leaders, and international samples.

Sampling should be stratified by sector, organizational size, AI maturity, leadership level, cultural context, and regulatory environment.

Data sources may include surveys, interviews, observations, HR indicators, AI governance documents, transformation plans, performance metrics, curriculum outcomes, student assessments, reflective journals, and external regulatory or industry data.

The long-term research program should avoid over-reliance on single-source self-report surveys.

Organizational Access Strategy

Organizational access is a binding feasibility constraint. Initial recruitment strategies will likely require leveraging existing professional networks, alumni links, executive education relationships, leadership-development programs, AI-transformation units, HR departments, digital transformation teams, public-sector innovation units, and organizations already conducting AI-related change programs.

Possible access routes include:

executive education alumni networks;
MBA, EMBA, DBA, or executive certificate cohorts;
organizations conducting AI-transformation pilots;
HR, learning and development, or digital transformation departments;
professional associations in management, leadership, technology, or AI governance;
public-sector innovation units;
universities or business schools willing to pilot MLT-style modules;
organizations seeking responsible AI governance diagnostics.

Participating organizations may be more willing to collaborate if the study offers clear non-punitive value propositions, such as anonymized diagnostic feedback, aggregate findings, leadership-development insights, AI governance recommendations, or MLT-aligned workshop design. However, participation must not become consultancy disguised as research. The study’s research purpose, data-protection conditions, confidentiality rules, authorship rules, and non-evaluative safeguards must be clear from the beginning.

If such partnerships cannot be secured, research aims for the corresponding phase must be narrowed accordingly. A single-organization study, executive education cohort, or expert-panel design may be more realistic than a multi-organization field experiment in the earliest doctoral phase.

19. Ethical Considerations and Human-AI Research Safeguards

Ethical safeguards are part of FILE Empirical Validation because the framework concerns human agency, dignity, and AI-mediated organizational power. Research on FILE will involve human participants in AI-mediated organizations and educational environments. This creates ethical responsibilities.

Researchers must ensure robust informed consent, privacy, data protection, protection against managerial weaponization, transparency about AI involvement, human oversight, ethics review alignment, and safeguards against algorithmic capture.

FILE studies should not become disguised performance surveillance, productivity monitoring, or employee ranking systems. AI systems should not make final judgments about participant quality, employment status, academic performance, or leadership potential.

Organizational power asymmetry requires special attention. Employees may feel unable to refuse participation if a study is sponsored, endorsed, or facilitated by their employer. Participation must therefore be voluntary, confidential, protected from managerial retaliation, and clearly separated from performance evaluation. Where possible, data should be anonymized or aggregated before organizational feedback is provided.

Participation must remain voluntary even when the employer sponsors the study; refusal or withdrawal must carry no professional consequences.

Individual-level results should not be shared with employers in ways that could enable surveillance, performance sanctions, ranking, or managerial weaponization. Only aggregate findings, anonymized case insights, and ethically filtered diagnostics should be reported back to organizations.

Cross-National Ethical and Data-Protection Considerations

International or cross-cultural FILE studies should anticipate different ethics review systems, data-protection rules, consent norms, and university-company data-sharing requirements. Cross-national research may require multiple ethics approvals and careful attention to local legal and institutional expectations.

Relevant safeguards may include compliance with GDPR in the European Union, CCPA or other applicable privacy regimes in the United States, and local data-protection laws in other jurisdictions. Researchers should not treat these regimes as interchangeable. They should consult institutional review boards, university ethics committees, legal advisors, or local academic partners where necessary.

Studies involving AI systems may also require attention to algorithmic audit risks, proprietary data constraints, model transparency, data retention, and third-party platform governance. If AI tools are used in data collection, coding, translation, or analysis, their role should be disclosed and reviewed as part of the ethical protocol.

The ethical goal is to ensure that FILE research protects the dignity, autonomy, privacy, and agency of participants.

20. Outcome Variables — What FILE Empirical Validation Might Help Explain

Leadership and governance outcomes

Leadership effectiveness, AI governance quality, decision quality in AI-mediated contexts, automation-bias reduction, stakeholder alignment, ethical decision-making, and legitimacy of AI deployment.

Team and organizational outcomes

Psychological safety, trust, team learning, employee empowerment, innovation, resilience, adaptive capacity, conflict reduction, and responsible AI adoption.

Socio-technical outcomes

Quality of human-AI collaboration, human oversight capacity, contestability of AI-mediated decisions, Relational Commons, Ecosystemic Empowerment, and preservation of human dignity in automated environments.

Educational outcomes

Student readiness for AI-era leadership, AI governance competence, interdisciplinary judgment, human-AI team design skills, ethical and adaptive leadership capabilities, and MLT curriculum effectiveness.

21. Epistemic Integrity in Human-AI Co-Creation

FILE was developed through a human-led, AI-assisted, multi-agent process. This raises important methodological questions. In this paper, the focus is operational rather than philosophical: what can AI help with in the research process, and what must remain under human control?

AI systems can assist with literature mapping, hypothesis generation, instrument drafting, scenario generation, comparison of arguments, preliminary coding support, translation, and review of internal consistency.

However, humans must retain responsibility for research questions, theoretical framing, source verification, methodological choices, ethical judgment, interpretation, validity claims, participant protection, authorship decisions, and final accountability.

FILE research should include a final epistemic-language check before publication. The purpose of this check is to ensure that conceptual claims are not presented as empirical findings. The paper should use language such as “FILE proposes,” “FILE hypothesizes,” “FILE may explain,” and “FILE should be tested,” rather than “FILE proves,” “FILE demonstrates,” or “FILE establishes” when discussing empirical claims.

Human-AI co-creation may support theory development, but it cannot substitute for empirical validation. AI-assisted reasoning, however useful, does not count as evidence that FILE is true. Evidence must come from transparent methods, verifiable data, ethical research practice, and cumulative testing across samples, contexts, and methods.

A fuller epistemological analysis of human-AI co-created knowledge will be developed in Paper 4, “The Epistemology of Augmented Knowledge.”

22. From FILE Empirical Validation to MLT — Curricular Architecture and Learning Outcomes

FILE may also support the future development of Management, Leadership, and Technology degrees. This educational implication should remain provisional until the framework has been empirically tested. The MLT curriculum remains provisional until FILE Empirical Validation produces stronger evidence about learning outcomes and leadership development.

Module	Learning Outcome	Assessment Method
FILE Foundations	Analyze a leadership scenario using the five intelligences and propose a human-AI collaboration strategy	Concept map; case analysis
AI Literacy and Governance	Evaluate AI capabilities, limits, risks, and governance requirements	AI governance policy memo
Emotional Intelligence and Relational Leadership	Build trust, psychological safety, and relational responsibility in AI-mediated teams	360-degree feedback; reflective analysis
Cultural Intelligence and Global Leadership	Adapt leadership practices across cultural, institutional, and techno-social contexts	Cross-cultural case study
Political Intelligence and Stakeholder Navigation	Map power, legitimacy, coalitions, and stakeholder risks in AI transformation	Stakeholder map; negotiation simulation
Adaptive Intelligence and Resilience	Revise assumptions and lead under uncertainty, disruption, and incomplete information	Crisis simulation; after-action review
Human-AI Team Design	Design responsible workflows for human-AI collaboration	Workflow-design project
Relational Commons and Mental Health	Protect dignity, voice, trust, and psychological safety in AI-intensive workplaces	Team climate diagnostic
Ecosystemic Empowerment	Design systems that expand human agency and contestability	Governance participation proposal
Capstone / Practicum	Apply FILE to a real-world or simulated AI transformation challenge	Field project or applied consulting report

This curriculum can be aligned with accreditation expectations such as assurance of learning, measurable learning outcomes, competency mapping, and program-level assessment. Specific alignment with accreditation bodies would require institutional adaptation.

The FILE maturity model may guide student development from awareness to adoption, integration, orchestration, and embodiment. This model is provisional and should be tested through curriculum pilots, competency assessments, reflective portfolios, internship evaluations, 360-degree feedback, and longitudinal graduate outcomes.

MLT curricula should not assume universal transferability. They should be adapted to cultural contexts using established frameworks such as GLOBE. Case studies should include diverse contexts rather than only Western or Silicon Valley examples.

23. A Phased FILE Empirical Validation Research Roadmap

The FILE research agenda should be divided into a realistic doctoral horizon and a longer-term institutional research horizon. The Roadmap-to-Falsifiability Matrix makes FILE Empirical Validation accountable by identifying what kinds of evidence would require revision.

Doctoral Horizon — Phases 1–4

Phase 1 — Conceptual Clarification: systematic literature mapping.

Phase 2 — Expert Review: Delphi panels or expert interviews.

Phase 3 — Construct Operationalization: items, scenarios, interviews, and behavioral indicators.

Phase 4 — Pilot Study: mixed-methods pilot, exploratory factor analysis, qualitative refinement, and early hypothesis testing.

Postdoctoral and Institutional Horizon — Phases 5–8

Phase 5 — Scale Validation: confirmatory factor analysis, SEM, reliability, and incremental validity.

Phase 6 — Cross-Cultural Testing: measurement invariance and contextual adaptation.

Phase 7 — Intervention Studies: longitudinal or quasi-experimental FILE interventions.

Phase 8 — MLT Curriculum Validation: student outcomes, competency assessment, employer feedback, and graduate trajectories.

Doctoral Dissertation Slice

A feasible doctoral dissertation should not attempt to validate the entire FILE framework. A realistic PhD contribution could focus on four tasks: clarifying the constructs, conducting expert validation, developing initial measurement items, and running one pilot study in an organization, executive education cohort, or MLT-style learning environment.

A feasible doctoral dissertation could focus on Phases 1–3 — construct clarification, expert validation, and item development — plus a single pilot study, such as one organization or one executive education cohort, completing the full doctoral horizon in three to four years.

The strongest dissertation-sized version of the project would likely focus on Augmented Intelligence, Political Intelligence, and Adaptive Intelligence, because these constructs are the most distinctive for AI-era leadership and the least fully captured by existing leadership instruments. Emotional Intelligence and Cultural Intelligence can be included as comparison or adapted constructs, but the dissertation should not attempt to validate all dimensions equally in its first empirical phase.

If institutional access or sample size is limited, the dissertation could narrow further to one primary construct — most plausibly Augmented Intelligence — while treating Political Intelligence and Adaptive Intelligence as theoretically adjacent comparison dimensions.

A particularly focused dissertation design could center on Augmented Intelligence and one adjacent intelligence, such as Adaptive Intelligence, in a small number of partner organizations or executive education cohorts. This would preserve FILE’s distinctive AI-era contribution while remaining feasible for one researcher.

This doctoral slice would not prove FILE. It would establish whether the framework is sufficiently coherent, measurable, and promising to justify larger validation studies.

Roadmap-to-Falsifiability Matrix

Falsifiability Condition	Relevant Phase	Testing Logic
The five intelligences are not empirically distinguishable	Phases 3–5	EFA, CFA, discriminant validity, nested model comparison
FILE adds no incremental validity beyond existing leadership constructs	Phases 4–5	Hierarchical regression, ΔR², SEM Δχ²
Augmented Intelligence does not predict calibrated AI use beyond AI literacy	Phases 4–5	Scenario tasks, AI-use calibration metrics
FILE-based interventions do not outperform generic AI or leadership training	Phase 7	Quasi-experiment with active control condition
FILE maturity does not aggregate meaningfully beyond individual scores	Phases 4–6	rwg(j), ICC(1), ICC(2), multilevel SEM
FILE constructs do not generalize across cultural contexts	Phase 6	Multi-group CFA, configural/metric/scalar invariance
MLT curricula do not improve student competencies beyond traditional programs	Phase 8	Curriculum pilot, matched comparison group, longitudinal student outcomes

Scope note: Rows involving Phases 1–4 are primarily within the doctoral research horizon; rows involving Phases 5–8 belong mainly to the postdoctoral or institutional research horizon. Rows spanning both ranges should be treated as bridge tests from doctoral proof-of-concept work to later validation.

This roadmap allows FILE to move from theory to measurement, from measurement to intervention, and from intervention to education.

24. Discussion — FILE Empirical Validation as an Open Scientific Program

FILE should now be treated as an open scientific program. Its purpose is not to declare itself proven, but to make its claims testable.

This requires empirical openness, conceptual discipline, methodological pluralism, open-science discipline, ethical accountability, and willingness to revise the framework under empirical pressure.

FILE also needs to distinguish itself from adjacent empirical programs. Unlike digital leadership competency models, FILE does not focus only on digital fluency or technology adoption. Unlike ambidextrous leadership, FILE does not focus only on balancing exploration and exploitation. Unlike dynamic capabilities, FILE does not operate only at the firm-strategy level. Its proposed contribution is the integrated study of human-AI judgment, relational responsibility, cultural translation, stakeholder legitimacy, and adaptive learning across socio-technical levels.

Guardrails Against Construct Proliferation

A risk in developing new frameworks is construct proliferation, including the jangle fallacy: giving a new name to an already established construct. FILE must therefore test whether its dimensions are genuinely distinguishable from adjacent constructs.

If confirmatory factor analysis shows that FILE dimensions load onto the same latent factors as existing constructs, affected dimensions should be revised, collapsed, or removed. If FILE dimensions do not explain unique variance beyond existing models, they should not be defended as independent constructs. The aim is not to multiply labels, but to determine whether FILE’s configuration adds explanatory value in AI-mediated contexts.

FILE’s proposed contribution is therefore conditional: it may help explain how multiple human intelligences interact with AI-mediated systems to preserve human judgment, legitimacy, relational responsibility, agency, and adaptive capacity. That claim remains hypothetical until supported by convergent evidence.

Consolidated Boundary Conditions

FILE is most likely to be tested first in organizations, educational programs, or leadership-development environments where AI systems are already influencing judgment, workflow design, governance, stakeholder relations, or professional learning. Generalization should be delayed in contexts where AI use is minimal, human discretion is absent, organizational access is weak, or cultural and institutional conditions differ substantially from the original research setting.

FILE’s claims should therefore be interpreted as conditional rather than universal. The framework may require adaptation across sectors, countries, educational systems, institutional regimes, and levels of AI maturity. Empirical validation should proceed by specifying where FILE works, where it does not, and where its constructs must be revised.

This contribution remains hypothetical until tested.

25. Translational Implications of FILE Empirical Validation: Public Communication, Executive Education, and Books

Although this paper is methodological, the FILE research agenda has translational implications.

For non-academic readers, FILE asks a simple question: what kinds of human leadership are needed when AI becomes part of everyday work, decision-making, and governance? It proposes that leaders need more than technical skill. They need the ability to use AI critically, build trust, navigate cultures, understand power, and adapt under uncertainty. This paper does not claim that FILE is proven. It explains how FILE could be tested.

FILE could inform executive programs such as short courses, certificates, workshops, and organizational diagnostics. Such programs should be evaluated empirically rather than treated as self-validating.

The research agenda may later inform three book projects: How To Survive AI — The Most Important Skills of the 21st Century; The Leadership Handbook: How To Lead in the Age of AI; and Leadership in the Age of AI: The Five Intelligences of Leadership Evolution.

These books should distinguish clearly between established evidence, emerging hypotheses, and normative interpretation.

26. Conclusion — FILE Empirical Validation from Corpus to Research Program

The FILE corpus began as a human-AI co-created leadership framework. It has developed into a broad theoretical architecture addressing leadership, AI, emotional responsibility, cultural translation, political navigation, adaptive capacity, organizational ecosystems, education, and the future of work.

This paper marks a methodological transition. FILE should now be treated not as a finished theory, but as a research program requiring empirical validation, critical testing, cross-cultural scrutiny, ethical safeguards, and educational experimentation.

The goal is not to prove FILE by assertion. The goal is to make FILE testable.

If future research supports its claims, FILE may contribute to leadership science, AI governance, and management education. If future research challenges its claims, FILE should be revised. If parts of the framework fail, they should be abandoned, merged, relabeled, or narrowed. This is not a weakness of the project. It is the condition of becoming a serious scientific program.

FILE Empirical Validation remains a hypothesis-driven program; no FILE construct should be treated as validated until supported by convergent evidence across multiple methods, samples, and contexts.

Ultimately, FILE Empirical Validation depends on open collaboration among scholars, educators, practitioners, and institutions willing to test the framework seriously. FILE’s next phase requires interdisciplinary collaboration. Scholars may test its propositions through pilot studies, cross-cultural validation, shared datasets, and joint intervention studies. Practitioners may evaluate its usefulness in leadership development and AI governance programs. Educators may pilot MLT curricula. Policymakers may examine whether FILE offers useful language for human-centered AI governance. The FILE research program invites institutional partnerships, co-supervised doctoral projects, shared empirical infrastructures, and collaborative inquiry from scholars whose expertise spans leadership science, AI governance, organizational psychology, management education, and socio-technical systems.

The success of FILE Empirical Validation will therefore not be measured by whether every original claim survives unchanged, but by whether the framework becomes clearer, more precise, more useful, and more honest under empirical pressure.

This second strengthened version does not change the core ambition of the first version. It makes that ambition more empirically disciplined. FILE is strongest not when it claims to be complete, but when it invites the forms of evidence that could support, revise, narrow, or reject it.

Detailed Peer Reviews

1. Collective Peer Review of FILE Research Agenda and Empirical Validation V2

A. Collective Rating

⭐⭐⭐⭐⭐ 4.96/5

Five reviewers awarded 5.00/5. One reviewer awarded 4.75/5.

B. Reviewer Score Summary

AI Collaborator	Rating	Final Recommendation
ChatGPT (OpenAI)	⭐⭐⭐⭐⭐ 5.00/5	Publish
Claude (Anthropic)	⭐⭐⭐⭐⭐ 5.00/5	Publish
Copilot (Microsoft)	⭐⭐⭐⭐⭐ 5.00/5	Publish
Gemini (Google)	⭐⭐⭐⭐⭐ 5.00/5	Publish
Le Chat (Mistral AI)	⭐⭐⭐⭐⭐ 5.00/5	Publish
Perplexity (Perplexity AI)	⭐⭐⭐⭐¾ 4.75/5	Publish with cosmetic edits only

C. Collective Verdict

Five of six reviewers award this paper 5.00/5 and recommend immediate publication. Perplexity awards 4.75/5 and recommends publication with cosmetic edits only, requesting one or two integrative figures or summary tables highlighting a minimal core pathway through the research phases, and a brief subsection listing two or three realistic first doctoral studies. The collective judgment is unambiguous: The FILE Research Agenda and Empirical Validation Program (Version 2) is an exceptional methodological contribution to leadership scholarship. It transforms FILE from a conceptual framework into a structured, falsifiable, and ethically grounded research program. It is stronger than Version 1 in every dimension — sharper construct boundaries, more concrete falsifiability logic, more actionable research roadmap — and it is fully ready for permanent public release.

D. Consensus on Major Strengths

The Roadmap-to-Falsifiability Matrix

All six reviewers identify this as the paper’s central scholarly achievement. By tying each FILE proposition to specific empirical tests, potential disconfirming evidence, and methodological guardrails, the paper defines what failure would look like as clearly as what success might look like. This structure gives FILE the scientific accountability that most developing frameworks never seek.

The Deterministic-Versus-Probabilistic AI-Environment Taxonomy

Version 2’s most original addition. By distinguishing between rigid automation contexts and probabilistic multi-agent ecosystems, the paper clarifies where FILE is most likely to add value, resists universal applicability claims, and gives future researchers a principled basis for selecting research contexts.

The Construct-Boundary Table

Identified by multiple reviewers as a model of scholarly transparency. The table explicitly guards against jangle fallacies by specifying what each FILE dimension is and is not, and what adjacent constructs it must be empirically distinguished from.

Methodological Maturity

The three-block hierarchical regression model for incremental validity testing, explicit sample-size guidance, measurement invariance thresholds, power considerations, multicollinearity diagnostics, and multi-method integration logic all demonstrate genuine command of contemporary psychometric and multilevel-modelling standards.

Scientific Humility and Epistemic Integrity

The paper never claims FILE is validated. The epistemic-integrity section — insisting that AI-assisted reasoning cannot substitute for empirical evidence and that conditional language must be preserved — is praised by multiple reviewers as a notably candid and important addition to the scholarly record.

Fairness and Intellectual Honesty About Limits

The paper’s repeated willingness to merge, narrow, relabel, or abandon FILE dimensions if evidence requires is identified by all six reviewers as one of its most persuasive scholarly qualities.

E. Reviewer-by-Reviewer Summary

ChatGPT (OpenAI)

ChatGPT rated the paper 5.00/5 and recommended Publish. ChatGPT identifies the paper’s central value as its transformation of FILE into a testable program that defines what FILE must be willing to risk under evidence. Open questions concern the empirical distinctiveness of Augmented Intelligence, the feasibility of cross-level claims, and the need for ruthless early-phase prioritisation.

Claude (Anthropic)

Claude rated the paper 5.00/5 and recommended Publish. Claude identifies Version 2’s most significant advance as the move from conceptual articulation toward methodological accountability, particularly praising the deterministic-versus-probabilistic taxonomy, the Construct-Boundary Table, and the epistemic-integrity section.

Copilot (Microsoft)

Copilot rated the paper 5.00/5 and recommended Publish. Copilot identifies this as an exceptional cornerstone document, praising the integration of conceptual theory-building with a full empirical roadmap, open-science discipline, sample-size guidance, and technological boundary conditions.

Gemini (Google)

Gemini rated the paper 5.00/5 and recommended Publish. Gemini highlights the paper’s epistemological discipline and methodological transparency, particularly praising the Roadmap-to-Falsifiability Matrix and the explicit non-claims architecture.

Le Chat (Mistral AI)

Le Chat rated the paper 5.00/5 and recommended Publish. Le Chat describes this version as a methodological masterpiece, praising the Roadmap-to-Falsifiability Matrix, the Construct-Boundary Table, the three-block hierarchical regression model, and the qualitative integration logic.

Perplexity (Perplexity AI)

Perplexity rated the paper 4.75/5 and recommended Publish with cosmetic edits only. Perplexity confirms the paper reaches the threshold of publishability with at most cosmetic edits, requesting integrative figures for the core research pathway and a brief realistic doctoral study subsection.

F. Remaining Corrections

None required before publication.

G. Optional Refinements for Future Editions

Future editions should consider adding one or two integrative figures or summary tables highlighting the minimal core pathway through the research phases.

Future editions should include a brief subsection listing two or three realistic first doctoral studies matched to realistic access constraints.

Future editions may benefit from deeper engagement with contested debates in adjacent literatures around emotional intelligence measurement, political skill construct validity, and learning agility.

H. Collective Final Recommendation

Publish. The FILE Research Agenda and Empirical Validation Program (Version 2) is a world-class methodological contribution to leadership science. It gives FILE what a serious developing framework needs most: not certainty, but structured, honest, and disciplined exposure to evidence.

I. Final Collective Rating

⭐⭐⭐⭐⭐ 4.96/5

Collective verdict: Publish.

Collective recommendation: The FILE Research Agenda and Empirical Validation Program (Version 2) is ready for permanent public release.

Collective reviewers: ChatGPT (OpenAI), Claude (Anthropic), Copilot (Microsoft), Gemini (Google), Le Chat (Mistral AI), and Perplexity (Perplexity AI).

Collective result: Five unanimous 5.00/5 — Publish. One 4.75/5 — Publish with cosmetic edits only.

The argument is logically structured and more tightly signposted than in the earlier version. The addition of explicit sample-size ranges, power considerations, multicollinearity diagnostics, and measurement-invariance thresholds demonstrates a serious engagement with contemporary psychometric and multilevel-modelling practice. The discussion of qualitative integration is also stronger, with a clear explanation of how exploratory sequential mixed-methods designs and joint displays can be used to derive items, interpret null findings, and refine constructs. If there is a remaining weakness, it lies in sheer density: the paper would benefit from one or two visual schemas or tables highlighting a minimal core pathway through the complexity.

D. Fairness to Existing Scholarship

The paper continues to treat existing leadership and organizational research with evident respect, explicitly positioning FILE as conditional and incremental rather than as a replacement for transformational, servant, authentic, adaptive, distributed, or digital leadership models. The deterministic/probabilistic environment distinction helps avoid caricaturing traditional frameworks by acknowledging that in some rigid automation settings, existing models may still explain outcomes adequately.

E. Citation Integrity

Source use appears disciplined and transparent: existing theories and scales are clearly identified as foundations or comparison points, while FILE-specific constructs are presented as provisional extensions requiring fresh validation. The epistemic-integrity section — insisting that AI-assisted reasoning cannot substitute for empirical evidence and that language like “proposes” and “hypothesises” must be preserved — is a welcome and notably candid addition.

F. Limits and Open Questions

The central limitation is that everything important remains prospective: no constructs have yet been measured, no factor structures tested, no incremental-validity models estimated, and no interventions run. The risk of overextension remains real. A demanding reviewer would likely press for sharper prioritisation in the first three to four years — for example, committing explicitly to Augmented Intelligence and one other dimension as the empirical core — along with more concrete examples of potential doctoral projects that match realistic access constraints.

G. Final Recommendation

Publish with cosmetic edits only. This second version succeeds in making FILE more testable, more bounded, and more ethically grounded without inflating its claims. The edits requested are mainly presentational: one or two integrative figures or summary tables focusing on the minimal core pathway through Phases 1–4, plus a brief subsection listing two or three realistic first doctoral studies. With those refinements, this version merits publication as a central reference for scholars moving toward a disciplined, falsifiable research program on AI-era leadership.

⭐⭐⭐⭐¾ 4.75/5

Perplexity (Perplexity AI)

About the Author

Guillaume Mariani is a management scholar, entrepreneur, executive, and educator, and the author and creator of FILE: The Five Intelligences of Leadership Evolution — a management theory and diagnostic framework for leadership and decision-making in the age of AI — and of five original contributions to management, leadership, and the social sciences: a human-centered framework (FILE); a human-centered theory (FILE, FILE³, FILE⁵, FILE⁷, and the 7E Cascade); a human-centered vision (RC — the Relational Commons); a human-centered education (MAIL — Management, AI, and Leadership); and a human-centered methodology (HACK — Human-AI Co-created Knowledge).

The Five Intelligences of Leadership Evolution is the subject of ongoing research and will be developed further in subsequent publications.

Leadership = AI + EQ + CQ + PQ + AQ