Add Question Answering feature with display formatting and command integration

This commit is contained in:
Cyril 2025-10-12 15:24:55 +02:00
parent b990f80263
commit 8115bd1eb7
8 changed files with 585 additions and 12 deletions

View File

@ -1,14 +1,14 @@
# 🧠 AI Lab Transformers CLI Playground
> A **pedagogical and technical project** designed for AI practitioners and students to experiment with Hugging Face Transformers through an **interactive Command-Line Interface (CLI)**.
> This playground provides ready-to-use NLP pipelines (Sentiment Analysis, Named Entity Recognition, Text Generation, Fill-Mask, Question Answering, Moderation, etc.) in a modular, extensible, and educational codebase.
---
## 📚 Overview
The **AI Lab Transformers CLI Playground** allows you to explore multiple natural language processing tasks directly from the terminal.
Each task (e.g., sentiment, NER, text generation, question answering) is implemented as a **Command Module**, which interacts with a **Pipeline Module** built on top of the `transformers` library.
The lab is intentionally structured to demonstrate **clean software design for ML codebases** — with strict separation between configuration, pipelines, CLI logic, and display formatting.
@ -32,7 +32,8 @@ src/
│   ├── fillmask.py      # Masked token prediction command
│   ├── textgen.py       # Text generation command
│   ├── ner.py           # Named Entity Recognition command
│   ├── moderation.py    # Toxicity / content moderation command
│   └── qa.py            # Question Answering command
├── pipelines/           # Machine learning logic (Hugging Face Transformers)
│   ├── __init__.py
@ -41,7 +42,8 @@ src/
│   ├── fillmask.py
│   ├── textgen.py
│   ├── ner.py
│   ├── moderation.py
│   └── qa.py            # Question Answering pipeline
└── config/
    ├── __init__.py
@ -104,7 +106,7 @@ python -m src.main
poetry run python src/main.py
```
You'll see an interactive menu listing the available commands:
```
Welcome to AI Lab - Transformers CLI Playground
@ -114,6 +116,7 @@ Available commands:
• textgen      Generate text from a prompt
• ner          Extract named entities from text
• moderation   Detect toxic or unsafe content
• qa           Question Answering on given text context
```
### Example Sessions
@ -152,6 +155,14 @@ Available commands:
- California (LOC)
```
#### 🔹 Question Answering
```text
💬 Context: Albert Einstein was born in 1879 in Germany. He developed the theory of relativity.
❓ Question: When was Einstein born?
→ Answer: 1879 (confidence: 0.95)
```
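The highlighted answer shown in sessions like the one above comes from simple character-offset slicing over the pipeline result. A minimal sketch of that step (the helper name is illustrative; `start`/`end` are the character offsets the Hugging Face QA pipeline returns):

```python
def highlight_answer(context: str, answer: str, start: int, end: int) -> str:
    """Wrap the answer span in **bold** markers using the pipeline's character offsets."""
    if start < 0 or end > len(context):
        return context  # offsets out of range: leave context untouched
    return context[:start] + f"**{answer}**" + context[end:]

ctx = "Albert Einstein was born in 1879 in Germany."
print(highlight_answer(ctx, "1879", 28, 32))
# → Albert Einstein was born in **1879** in Germany.
```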
#### 🔹 Moderation
```text
@ -173,13 +184,13 @@ The internal structure follows a clean **Command ↔ Pipeline ↔ Display** patt
┌─────────────────┐
│ Command Layer   │ ← e.g. sentiment.py, qa.py
│ (user commands) │
└───────┬─────────┘
┌─────────────────┐
│ Pipeline Layer  │ ← e.g. pipelines/sentiment.py, pipelines/qa.py
│ (ML logic)      │
└───────┬─────────┘
@ -195,8 +206,8 @@ The internal structure follows a clean **Command ↔ Pipeline ↔ Display** patt
| Layer        | Description                                                                |
| ------------ | -------------------------------------------------------------------------- |
| **CLI**      | Manages user input/output, help menus, and navigation between commands.    |
| **Command**  | Encapsulates a single user-facing operation (e.g., run sentiment, QA).     |
| **Pipeline** | Wraps Hugging Face's `transformers.pipeline()` to perform inference.       |
| **Display**  | Handles clean console rendering (colored output, tables, JSON formatting). |
| **Config**   | Centralizes model names, limits, and global constants.                     |
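The layer contract above can be sketched with stand-in classes (the names here are illustrative, not the project's actual base classes):

```python
class EchoPipeline:
    """Pipeline layer stand-in: would normally wrap transformers.pipeline()."""
    def run(self, text: str) -> dict:
        return {"input": text, "label": "DEMO"}

class EchoCommand:
    """Command layer stand-in: one user-facing operation delegating to a pipeline."""
    name = "echo"

    def __init__(self):
        self.pipeline = EchoPipeline()

    def run(self, text: str) -> str:
        result = self.pipeline.run(text)
        # A Display layer would normally format this output.
        return f"{result['input']} -> {result['label']}"

print(EchoCommand().run("hello"))
# → hello -> DEMO
```

Keeping the ML call behind the pipeline object is what lets commands stay testable without loading a model.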
@ -215,7 +226,8 @@ class Config:
"fillmask": "bert-base-uncased", "fillmask": "bert-base-uncased",
"textgen": "gpt2", "textgen": "gpt2",
"ner": "dslim/bert-base-NER", "ner": "dslim/bert-base-NER",
"moderation":"unitary/toxic-bert" "moderation":"unitary/toxic-bert",
"qa": "distilbert-base-cased-distilled-squad"
} }
MAX_LENGTH = 512 MAX_LENGTH = 512
BATCH_SIZE = 8 BATCH_SIZE = 8
@ -260,6 +272,7 @@ Recommended structure:
tests/
├── test_sentiment.py
├── test_textgen.py
├── test_qa.py
└── ...
```
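One possible shape for `tests/test_qa.py` is to unit-test the pure confidence-bucketing logic without loading a model; the standalone function below mirrors `_get_confidence_level` from `pipelines/qa.py` (kept free of the pipeline object so it runs offline — an illustrative sketch, not the project's actual tests):

```python
def confidence_level(score: float) -> str:
    """Mirror of QuestionAnsweringSystem._get_confidence_level, kept pure for testing."""
    if score >= 0.8:
        return "Very High"
    elif score >= 0.6:
        return "High"
    elif score >= 0.4:
        return "Medium"
    elif score >= 0.2:
        return "Low"
    return "Very Low"

def test_confidence_buckets():
    assert confidence_level(0.95) == "Very High"
    assert confidence_level(0.5) == "Medium"
    assert confidence_level(0.05) == "Very Low"

test_confidence_buckets()
```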

View File

@ -190,3 +190,78 @@ class DisplayFormatter:
            output.append(f"{entity} ({count}x)")
        return "\n".join(output)
    @staticmethod
    def format_qa_result(result: Dict[str, Any]) -> str:
        """Format Question Answering result for display"""
        if "error" in result:
            return f"❌ {result['error']}"

        output = []
        output.append(f"❓ Question: {result['question']}")

        # Confidence indicator
        confidence = result['confidence']
        confidence_emoji = "✅" if result['is_confident'] else "⚠️"
        confidence_bar = "█" * int(confidence * 10)

        output.append(f"{confidence_emoji} Answer: {result['answer']}")
        output.append(f"📊 Confidence: {result['confidence_level']} ({confidence:.1%}) {confidence_bar}")
        if not result['is_confident']:
            output.append("⚠️ Low confidence - answer might not be reliable")
        output.append(f"\n📍 Position: characters {result['start_position']}-{result['end_position']}")
        output.append(f"📄 Context with answer highlighted:")
        output.append(f"   {result['highlighted_context']}")
        return "\n".join(output)
    @staticmethod
    def format_qa_context_analysis(analysis: Dict[str, Any]) -> str:
        """Format QA context analysis for display"""
        if "error" in analysis:
            return f"❌ {analysis['error']}"

        output = []
        output.append("✅ Context set successfully!")
        output.append(f"📊 Context Statistics:")
        stats = analysis['context_stats']
        output.append(f"   • Words: {stats['word_count']}")
        output.append(f"   • Sentences: ~{stats['sentence_count']}")
        output.append(f"   • Characters: {stats['character_count']}")
        if analysis['suggested_questions']:
            output.append(f"\n💡 Suggested question types:")
            for suggestion in analysis['suggested_questions']:
                output.append(f"{suggestion}")
        if analysis['tips']:
            output.append(f"\n📝 Tips for good questions:")
            for tip in analysis['tips']:
                output.append(f"{tip}")
        return "\n".join(output)
    @staticmethod
    def format_qa_multiple_result(result: Dict[str, Any]) -> str:
        """Format multiple QA results for display"""
        if "error" in result:
            return f"❌ {result['error']}"

        output = []
        output.append(f"📊 Multiple Questions Analysis")
        output.append("=" * 50)
        output.append(f"Total Questions: {result['total_questions']}")
        output.append(f"Successfully Processed: {result['processed_questions']}")
        output.append(f"Confident Answers: {result['confident_answers']}")
        output.append(f"Average Confidence: {result['average_confidence']:.1%}")
        output.append(f"\n📋 Results:")
        for qa_result in result['results']:
            confidence_emoji = "✅" if qa_result['is_confident'] else "⚠️"
            output.append(f"\n{qa_result['question_number']}. {qa_result['question']}")
            output.append(f"   {confidence_emoji} {qa_result['answer']} ({qa_result['confidence']:.1%})")
        return "\n".join(output)

View File

@ -6,5 +6,6 @@ from .fillmask import FillMaskCommand
from .textgen import TextGenCommand
from .moderation import ModerationCommand
from .ner import NERCommand
from .qa import QACommand

__all__ = ['SentimentCommand', 'FillMaskCommand', 'TextGenCommand', 'ModerationCommand', 'NERCommand', 'QACommand']

src/commands/qa.py Normal file
View File

@ -0,0 +1,214 @@
from src.cli.base import CLICommand
from src.cli.display import DisplayFormatter
from src.pipelines.qa import QuestionAnsweringSystem


class QACommand(CLICommand):
    """Interactive Question Answering command"""

    def __init__(self):
        self.qa_system = None
        self.current_context = None
        self.session_questions = []

    @property
    def name(self) -> str:
        return "qa"

    @property
    def description(self) -> str:
        return "Question Answering - Ask questions about a given text"

    def _initialize_qa_system(self):
        """Lazy initialization of the QA system"""
        if self.qa_system is None:
            print("🔄 Loading Question Answering model...")
            self.qa_system = QuestionAnsweringSystem()
            DisplayFormatter.show_success("QA model loaded!")

    def _show_instructions(self):
        """Show usage instructions and examples"""
        print("\n❓ Question Answering System")
        print("Ask questions about a text context and get precise answers.")
        print("\n📝 How it works:")
        print("   1. First, provide a context (text containing information)")
        print("   2. Then ask questions about that context")
        print("   3. The system extracts answers directly from the text")
        print("\n💡 Example context:")
        print("   'Albert Einstein was born in 1879 in Germany. He developed the theory of relativity.'")
        print("💡 Example questions:")
        print("   - When was Einstein born?")
        print("   - Where was Einstein born?")
        print("   - What theory did Einstein develop?")
        print("\n🎛️ Commands:")
        print("   'back' - Return to main menu")
        print("   'help' - Show these instructions")
        print("   'context' - Set new context")
        print("   'multi' - Ask multiple questions at once")
        print("   'session' - Review session history")
        print("   'settings' - Adjust confidence threshold")
        print("-" * 70)
    def _set_context(self):
        """Allow user to set or change the context"""
        print("\n📄 Set Context")
        print("Enter the text that will serve as context for your questions.")
        print("You can enter multiple lines. Type 'done' when finished.")
        print("-" * 50)
        lines = []
        while True:
            line = input("📝 ").strip()
            if line.lower() == 'done':
                break
            if line:
                lines.append(line)
        if not lines:
            DisplayFormatter.show_warning("No context provided")
            return False

        self.current_context = " ".join(lines)
        # Analyze context
        analysis = self.qa_system.interactive_qa(self.current_context)
        if "error" in analysis:
            DisplayFormatter.show_error(analysis["error"])
            return False

        formatted_analysis = DisplayFormatter.format_qa_context_analysis(analysis)
        print(formatted_analysis)
        return True
    def _ask_single_question(self):
        """Ask a single question about the current context"""
        if not self.current_context:
            DisplayFormatter.show_warning("Please set a context first using 'context' command")
            return

        question = input("\n❓ Your question: ").strip()
        if not question:
            DisplayFormatter.show_warning("Please enter a question")
            return

        DisplayFormatter.show_loading("Finding answer...")
        result = self.qa_system.answer(question, self.current_context)
        if "error" not in result:
            self.session_questions.append(result)
        formatted_result = DisplayFormatter.format_qa_result(result)
        print(formatted_result)
    def _multi_question_mode(self):
        """Allow asking multiple questions at once"""
        if not self.current_context:
            DisplayFormatter.show_warning("Please set a context first using 'context' command")
            return

        print("\n❓ Multiple Questions Mode")
        print("Enter your questions one by one. Type 'done' when finished.")
        print("-" * 50)
        questions = []
        while True:
            question = input(f"Question #{len(questions)+1}: ").strip()
            if question.lower() == 'done':
                break
            if question:
                questions.append(question)
        if not questions:
            DisplayFormatter.show_warning("No questions provided")
            return

        DisplayFormatter.show_loading(f"Processing {len(questions)} questions...")
        result = self.qa_system.answer_multiple(questions, self.current_context)
        if "error" not in result:
            self.session_questions.extend(result["results"])
        formatted_result = DisplayFormatter.format_qa_multiple_result(result)
        print(formatted_result)
    def _show_session_history(self):
        """Show the history of questions asked in this session"""
        if not self.session_questions:
            DisplayFormatter.show_warning("No questions asked in this session yet")
            return

        print(f"\n📚 Session History ({len(self.session_questions)} questions)")
        print("=" * 60)
        for i, qa in enumerate(self.session_questions, 1):
            confidence_emoji = "✅" if qa["is_confident"] else "⚠️"
            print(f"\n{i}. {qa['question']}")
            print(f"   {confidence_emoji} {qa['answer']} (confidence: {qa['confidence']:.1%})")
    def _adjust_settings(self):
        """Allow user to adjust QA settings"""
        current_threshold = self.qa_system.confidence_threshold
        print(f"\n⚙️ Current Settings:")
        print(f"Confidence threshold: {current_threshold:.2f}")
        print("\nLower threshold = more answers accepted (less strict)")
        print("Higher threshold = fewer answers accepted (more strict)")
        try:
            new_threshold = input(f"Enter new threshold (0.0-1.0, current: {current_threshold}): ").strip()
            if new_threshold:
                threshold = float(new_threshold)
                self.qa_system.set_confidence_threshold(threshold)
                DisplayFormatter.show_success(f"Threshold set to {threshold:.2f}")
        except ValueError:
            DisplayFormatter.show_error("Invalid threshold value")
    def run(self):
        """Run interactive Question Answering"""
        self._initialize_qa_system()
        self._show_instructions()
        while True:
            if self.current_context:
                context_preview = (self.current_context[:50] + "...") if len(self.current_context) > 50 else self.current_context
                prompt = f"\n💬 [{context_preview}] Ask a question: "
            else:
                prompt = "\n💬 Enter command or set context first: "

            user_input = input(prompt).strip()

            if user_input.lower() == 'back':
                break
            elif user_input.lower() == 'help':
                self._show_instructions()
                continue
            elif user_input.lower() == 'context':
                self._set_context()
                continue
            elif user_input.lower() == 'multi':
                self._multi_question_mode()
                continue
            elif user_input.lower() == 'session':
                self._show_session_history()
                continue
            elif user_input.lower() == 'settings':
                self._adjust_settings()
                continue

            if not user_input:
                DisplayFormatter.show_warning("Please enter a question or command")
                continue

            # If we have a context and user input is not a command, treat it as a question
            if self.current_context:
                DisplayFormatter.show_loading("Finding answer...")
                result = self.qa_system.answer(user_input, self.current_context)
                if "error" not in result:
                    self.session_questions.append(result)
                formatted_result = DisplayFormatter.format_qa_result(result)
                print(formatted_result)
            else:
                DisplayFormatter.show_warning("Please set a context first using 'context' command")

View File

@ -19,6 +19,7 @@ class Config:
"textgen": "gpt2", "textgen": "gpt2",
"moderation": "unitary/toxic-bert", "moderation": "unitary/toxic-bert",
"ner": "dbmdz/bert-large-cased-finetuned-conll03-english", "ner": "dbmdz/bert-large-cased-finetuned-conll03-english",
"qa": "distilbert-base-cased-distilled-squad",
} }
# Interface # Interface

View File

@ -13,6 +13,7 @@ from src.commands import (
    FillMaskCommand,
    ModerationCommand,
    NERCommand,
    QACommand,
    SentimentCommand,
    TextGenCommand,
)
@ -31,6 +32,7 @@ def main():
        TextGenCommand,
        ModerationCommand,
        NERCommand,
        QACommand,
    ]
    for command in commands_to_register:
        cli.register_command(command())

View File

@ -6,6 +6,7 @@ from .fillmask import FillMaskAnalyzer
from .textgen import TextGenerator
from .moderation import ContentModerator
from .ner import NamedEntityRecognizer
from .qa import QuestionAnsweringSystem
from .template import TemplatePipeline

__all__ = ['SentimentAnalyzer', 'FillMaskAnalyzer', 'TextGenerator', 'ContentModerator', 'NamedEntityRecognizer', 'QuestionAnsweringSystem', 'TemplatePipeline']

src/pipelines/qa.py Normal file
View File

@ -0,0 +1,266 @@
from transformers import pipeline
from typing import Dict, List, Optional, Tuple
from src.config import Config
import re


class QuestionAnsweringSystem:
    """Question Answering system using transformers"""

    def __init__(self, model_name: Optional[str] = None):
        """
        Initialize the question-answering pipeline

        Args:
            model_name: Name of the model to use (optional)
        """
        self.model_name = model_name or Config.get_model("qa")
        print(f"Loading Question Answering model: {self.model_name}")
        self.pipeline = pipeline("question-answering", model=self.model_name)
        print("QA model loaded successfully!")
        # Default confidence threshold
        self.confidence_threshold = 0.1
    def answer(self, question: str, context: str, max_answer_len: int = 50) -> Dict:
        """
        Answer a question based on the given context

        Args:
            question: Question to answer
            context: Context text containing the answer
            max_answer_len: Maximum length of the answer

        Returns:
            Dictionary with answer, score, and position information
        """
        if not question.strip():
            return {"error": "Empty question"}
        if not context.strip():
            return {"error": "Empty context"}
        try:
            result = self.pipeline(
                question=question,
                context=context,
                max_answer_len=max_answer_len
            )
            confidence_level = self._get_confidence_level(result["score"])
            highlighted_context = self._highlight_answer_in_context(
                context, result["answer"], result["start"], result["end"]
            )
            return {
                "question": question,
                "context": context,
                "answer": result["answer"],
                "confidence": round(result["score"], 4),
                "confidence_level": confidence_level,
                "start_position": result["start"],
                "end_position": result["end"],
                "highlighted_context": highlighted_context,
                "is_confident": result["score"] >= self.confidence_threshold
            }
        except Exception as e:
            return {"error": f"QA processing error: {str(e)}"}
    def _get_confidence_level(self, score: float) -> str:
        """
        Convert numerical score to confidence level

        Args:
            score: Confidence score (0-1)

        Returns:
            Confidence level description
        """
        if score >= 0.8:
            return "Very High"
        elif score >= 0.6:
            return "High"
        elif score >= 0.4:
            return "Medium"
        elif score >= 0.2:
            return "Low"
        else:
            return "Very Low"
    def _highlight_answer_in_context(self, context: str, answer: str, start: int, end: int) -> str:
        """
        Highlight the answer within the context

        Args:
            context: Original context
            answer: Extracted answer
            start: Start position of answer
            end: End position of answer

        Returns:
            Context with highlighted answer
        """
        if start < 0 or end > len(context):
            return context
        before = context[:start]
        highlighted_answer = f"**{answer}**"
        after = context[end:]
        return before + highlighted_answer + after
    def answer_multiple(self, questions: List[str], context: str, max_answer_len: int = 50) -> Dict:
        """
        Answer multiple questions for the same context

        Args:
            questions: List of questions to answer
            context: Context text
            max_answer_len: Maximum length of answers

        Returns:
            Dictionary with all answers and summary statistics
        """
        if not questions:
            return {"error": "No questions provided"}
        if not context.strip():
            return {"error": "Empty context"}

        results = []
        confident_answers = 0
        total_confidence = 0
        for i, question in enumerate(questions, 1):
            result = self.answer(question, context, max_answer_len)
            if "error" not in result:
                results.append({
                    "question_number": i,
                    **result
                })
                if result["is_confident"]:
                    confident_answers += 1
                total_confidence += result["confidence"]
        if not results:
            return {"error": "No valid questions processed"}

        average_confidence = total_confidence / len(results) if results else 0
        return {
            "context": context,
            "total_questions": len(questions),
            "processed_questions": len(results),
            "confident_answers": confident_answers,
            "average_confidence": round(average_confidence, 4),
            "confidence_threshold": self.confidence_threshold,
            "results": results
        }
    def interactive_qa(self, context: str) -> Dict:
        """
        Prepare context for interactive Q&A session

        Args:
            context: Context text for questions

        Returns:
            Context analysis and preparation info
        """
        if not context.strip():
            return {"error": "Empty context"}

        # Basic context analysis
        word_count = len(context.split())
        sentence_count = len([s for s in context.split('.') if s.strip()])
        char_count = len(context)

        # Suggest question types based on content
        suggested_questions = self._generate_question_suggestions(context)

        return {
            "context": context,
            "context_stats": {
                "word_count": word_count,
                "sentence_count": sentence_count,
                "character_count": char_count
            },
            "suggested_questions": suggested_questions,
            "tips": [
                "Ask specific questions about facts mentioned in the text",
                "Use question words: Who, What, When, Where, Why, How",
                "Keep questions clear and focused",
                "The answer should be present in the provided context"
            ]
        }
    def _generate_question_suggestions(self, context: str) -> List[str]:
        """
        Generate suggested questions based on context analysis

        Args:
            context: Context text

        Returns:
            List of suggested question templates
        """
        suggestions = []
        # Check for common patterns and suggest relevant questions
        if re.search(r'\b\d{4}\b', context):  # Years
            suggestions.append("When did [event] happen?")
        if re.search(r'\b[A-Z][a-z]+ [A-Z][a-z]+\b', context):  # Names
            suggestions.append("Who is [person name]?")
        if re.search(r'\b(founded|created|established|built)\b', context, re.IGNORECASE):
            suggestions.append("Who founded/created [organization]?")
        if re.search(r'\b(located|situated|based)\b', context, re.IGNORECASE):
            suggestions.append("Where is [place/organization] located?")
        if re.search(r'\b(because|due to|reason)\b', context, re.IGNORECASE):
            suggestions.append("Why did [event] happen?")
        if re.search(r'\b(how|method|process)\b', context, re.IGNORECASE):
            suggestions.append("How does [process] work?")
        if not suggestions:
            suggestions = [
                "What is the main topic of this text?",
                "Who are the key people mentioned?",
                "What important events are described?"
            ]
        return suggestions[:5]  # Limit to 5 suggestions
    def set_confidence_threshold(self, threshold: float):
        """
        Set the confidence threshold for answers

        Args:
            threshold: Threshold between 0 and 1
        """
        if 0 <= threshold <= 1:
            self.confidence_threshold = threshold
        else:
            raise ValueError("Threshold must be between 0 and 1")
    def answer_batch(self, qa_pairs: List[Tuple[str, str]], max_answer_len: int = 50) -> List[Dict]:
        """
        Process multiple question-context pairs

        Args:
            qa_pairs: List of (question, context) tuples
            max_answer_len: Maximum length of answers

        Returns:
            List of QA results
        """
        return [
            self.answer(question, context, max_answer_len)
            for question, context in qa_pairs
        ]