A Dynamic Fusion Model for Consistent Crisis Response

Published in The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025), 2025

Overview

This paper presents a dynamic fusion framework that integrates multiple Large Language Models (LLMs) to generate consistent and high-quality crisis responses.
Our approach addresses a critical gap in crisis communication: ensuring that AI-generated responses maintain uniform professionalism, actionability, and relevance across diverse crisis needs and user queries.

We introduce:

  • A new evaluation metric — Consistency, to measure stylistic stability.
  • A fusion-based response generation pipeline that combines outputs from Instructional Prompts and Retrieval-Augmented Generation (RAG).
  • Empirical validation across multiple LLMs (LLaMA 3.1 8B and Mistral 8B) and crisis datasets.

Motivation

During disasters, affected individuals often rely on social media for real-time guidance and verified information.
While LLMs can generate informative responses, they often vary in style and tone:

  • Some are professional and actionable, others vague or inconsistent.
  • Inconsistent responses erode trust and reduce usability in high-stakes scenarios.

Our goal:
➡️ Guarantee stylistic consistency across all responses, so every user receives the same quality of help — regardless of their query or crisis type.


Methodology

We propose a two-stage fusion framework:

  1. Candidate Generation:
    • Instructional Prompting: Zero-shot structured prompts for general reasoning.
    • Retrieval-Augmented Generation (RAG): Injects verified knowledge from FEMA’s official documents (e.g., Individual Assistance Guide).
  2. Fusion Mechanism:
    • Evaluates candidate responses across three dimensions: Professionalism, Actionability, Relevance.
    • Synthesizes a final response using weighted evaluation guidance, ensuring balanced optimization.

We test several fusion variants:

  • Fusion w/o Eval
  • Fusion w/ Eval
  • Fusion w/ Eval & Instruct
  • Fusion w/ Eval & Weight Instruct (Best-performing)

Consistency Metric

We define Consistency as the inverse of variance across the three communicative dimensions:

\[\text{Consistency Score} = 1 - \frac{1}{3}\big(\mathrm{Var}_{\text{prof}} + \mathrm{Var}_{\text{act}} + \mathrm{Var}_{\text{rel}}\big)\]

Higher score → more uniform style and quality.


Experiments

Dataset:

  • 540 need-related tweets from hurricanes Harvey, Irma, and Maria.
  • Additional experiments on CrisisBench (earthquakes, typhoons).

Models:

  • LLaMA 3.1 8B-Instruct
  • Mistral 8B-Instruct
  • GPT-4o-mini (for evaluation)

Evaluation Metrics:

  • Professionalism
  • Actionability
  • Relevance
  • Consistency
  • Human Preference (qualitative study)

Results

ModelMethodProfessionalismActionabilityRelevanceConsistency
LLaMAInstructional Prompt0.740.520.800.76
LLaMARAG0.960.630.800.84
LLaMAFusion w/ Eval & Weight Instruct0.990.990.790.94

Fusion outperforms all baselines, delivering the most consistent and highest-quality responses.
Cross-crisis generalization (earthquake, typhoon): maintains >0.95 overall quality.
Human evaluations show 86% preference for fused responses.


Key Findings

  • Mid-range temperature (0.6) yields optimal consistency.
  • Fusion with evaluation guidance essential for stable outputs.
  • Few-shot learning helps but fusion is more scalable and generalizable.
  • Sentiment and linguistic style affect consistency (neutral and formal queries yield higher stability).

Impact

  • Improves trust in AI-assisted crisis communication.
  • Provides uniform-quality responses across diverse users.
  • Applicable to emergency management agencies and NGO communication platforms.
  • Framework can extend to health misinformation, public safety, and customer support.

Resources


BibTeX

@article{song2025dynamic,
  title={A Dynamic Fusion Model for Consistent Crisis Response},
  author={Song, Xiaoying and Anik, Anirban Saha and Blanco, Eduardo and Frias-Martinez, Vanessa and Hong, Lingzi},
  journal={arXiv preprint arXiv:2509.01053},
  year={2025}
}

Recommended citation: Song, Xiaoying, Anirban Saha Anik, Eduardo Blanco, Vanessa Frias-Martinez, and Lingzi Hong. "A Dynamic Fusion Model for Consistent Crisis Response." arXiv preprint arXiv:2509.01053 (2025).
Download Paper