Adding documentation

ARCHITECTURE.md (new file, 573 lines)
# Microscopy Object Detection Application - Architecture Design
## Project Overview

A desktop application for detecting organelles and membrane branching structures in microscopy images using YOLOv8s, with comprehensive training, validation, and visualization capabilities.

## Technology Stack

- **ML Framework**: Ultralytics YOLOv8 (YOLOv8s.pt model)
- **GUI Framework**: PySide6 (Qt6 for Python)
- **Visualization**: pyqtgraph
- **Database**: SQLite3
- **Python Version**: 3.8+

## Project Structure

```
object_detection/
├── main.py                          # Application entry point
├── requirements.txt                 # Python dependencies
├── config/
│   └── app_config.yaml              # Application configuration
├── src/
│   ├── __init__.py
│   ├── database/
│   │   ├── __init__.py
│   │   ├── db_manager.py            # Database operations manager
│   │   ├── models.py                # SQLAlchemy ORM models
│   │   └── schema.sql               # Database schema definition
│   ├── model/
│   │   ├── __init__.py
│   │   ├── yolo_wrapper.py          # YOLO model wrapper
│   │   └── inference.py             # Detection inference logic
│   ├── gui/
│   │   ├── __init__.py
│   │   ├── main_window.py           # Main application window
│   │   ├── tabs/
│   │   │   ├── __init__.py
│   │   │   ├── training_tab.py      # Model training interface
│   │   │   ├── validation_tab.py    # Validation metrics viewer
│   │   │   ├── detection_tab.py     # Real-time/batch detection
│   │   │   ├── results_tab.py       # Results viewer and management
│   │   │   └── annotation_tab.py    # Annotation tool (future)
│   │   ├── dialogs/
│   │   │   ├── __init__.py
│   │   │   ├── config_dialog.py     # Settings and configuration
│   │   │   └── model_dialog.py      # Model management dialog
│   │   └── widgets/
│   │       ├── __init__.py
│   │       ├── image_viewer.py      # Custom image display widget
│   │       └── plot_widget.py       # Custom plotting widgets
│   └── utils/
│       ├── __init__.py
│       ├── file_utils.py            # File operations
│       ├── logger.py                # Logging configuration
│       └── config_manager.py        # Configuration management
├── tests/
│   ├── __init__.py
│   ├── test_database.py
│   ├── test_model.py
│   └── test_gui.py
├── data/                            # Default data directory
│   ├── models/                      # Saved models
│   ├── datasets/                    # Training datasets
│   └── results/                     # Detection results
└── docs/
    ├── USER_GUIDE.md
    └── API_REFERENCE.md
```
## Database Schema

### Tables Overview

```mermaid
erDiagram
    MODELS ||--o{ DETECTIONS : produces
    IMAGES ||--o{ DETECTIONS : contains
    IMAGES ||--o{ ANNOTATIONS : has

    MODELS {
        integer id PK
        string model_name
        string model_version
        string model_path
        string base_model
        datetime created_at
        json training_params
        json metrics
    }

    IMAGES {
        integer id PK
        string relative_path
        string filename
        integer width
        integer height
        datetime captured_at
        datetime added_at
        string checksum
    }

    DETECTIONS {
        integer id PK
        integer image_id FK
        integer model_id FK
        string class_name
        float x_min
        float y_min
        float x_max
        float y_max
        float confidence
        datetime detected_at
        json metadata
    }

    ANNOTATIONS {
        integer id PK
        integer image_id FK
        string class_name
        float x_min
        float y_min
        float x_max
        float y_max
        string annotator
        datetime created_at
        boolean verified
    }
```

### Detailed Schema

#### **models** table
Stores information about trained models and their versions.

| Column | Type | Constraints | Description |
|--------|------|-------------|-------------|
| id | INTEGER | PRIMARY KEY | Unique model identifier |
| model_name | TEXT | NOT NULL | User-friendly model name |
| model_version | TEXT | NOT NULL | Version string (e.g., "v1.0") |
| model_path | TEXT | NOT NULL | Path to model weights file |
| base_model | TEXT | NOT NULL | Base model used (e.g., "yolov8s.pt") |
| created_at | TIMESTAMP | DEFAULT CURRENT_TIMESTAMP | Model creation timestamp |
| training_params | JSON | | Training hyperparameters |
| metrics | JSON | | Validation metrics (mAP, precision, recall) |

#### **images** table
Stores metadata about microscopy images.

| Column | Type | Constraints | Description |
|--------|------|-------------|-------------|
| id | INTEGER | PRIMARY KEY | Unique image identifier |
| relative_path | TEXT | NOT NULL, UNIQUE | Relative path from repository root |
| filename | TEXT | NOT NULL | Image filename |
| width | INTEGER | | Image width in pixels |
| height | INTEGER | | Image height in pixels |
| captured_at | TIMESTAMP | | Original capture timestamp (if available) |
| added_at | TIMESTAMP | DEFAULT CURRENT_TIMESTAMP | When added to database |
| checksum | TEXT | | MD5 hash for integrity verification |

#### **detections** table
Stores object detection results.

| Column | Type | Constraints | Description |
|--------|------|-------------|-------------|
| id | INTEGER | PRIMARY KEY | Unique detection identifier |
| image_id | INTEGER | FOREIGN KEY | Reference to images table |
| model_id | INTEGER | FOREIGN KEY | Reference to models table |
| class_name | TEXT | NOT NULL | Object class (e.g., "organelle", "membrane_branch") |
| x_min | REAL | NOT NULL | Bounding box left coordinate (normalized 0-1) |
| y_min | REAL | NOT NULL | Bounding box top coordinate (normalized 0-1) |
| x_max | REAL | NOT NULL | Bounding box right coordinate (normalized 0-1) |
| y_max | REAL | NOT NULL | Bounding box bottom coordinate (normalized 0-1) |
| confidence | REAL | NOT NULL | Detection confidence score (0-1) |
| detected_at | TIMESTAMP | DEFAULT CURRENT_TIMESTAMP | When detection was performed |
| metadata | JSON | | Additional metadata (processing time, etc.) |

#### **annotations** table
Stores manual annotations for training data (future feature).

| Column | Type | Constraints | Description |
|--------|------|-------------|-------------|
| id | INTEGER | PRIMARY KEY | Unique annotation identifier |
| image_id | INTEGER | FOREIGN KEY | Reference to images table |
| class_name | TEXT | NOT NULL | Annotated object class |
| x_min | REAL | NOT NULL | Bounding box left coordinate (normalized) |
| y_min | REAL | NOT NULL | Bounding box top coordinate (normalized) |
| x_max | REAL | NOT NULL | Bounding box right coordinate (normalized) |
| y_max | REAL | NOT NULL | Bounding box bottom coordinate (normalized) |
| annotator | TEXT | | Name of person who created annotation |
| created_at | TIMESTAMP | DEFAULT CURRENT_TIMESTAMP | Annotation timestamp |
| verified | BOOLEAN | DEFAULT 0 | Whether annotation is verified |
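
The actual DDL lives in [`src/database/schema.sql`](src/database/schema.sql) and is not reproduced here. The following is a minimal sketch of how the `detections` table and its indexes could be created with Python's built-in `sqlite3` module, using the column names above; the real schema file may differ in detail.

```python
import sqlite3

# Sketch only: detections table plus the indexes listed under
# "Performance Considerations". Constraints and other tables omitted.
DETECTIONS_DDL = """
CREATE TABLE IF NOT EXISTS detections (
    id          INTEGER PRIMARY KEY AUTOINCREMENT,
    image_id    INTEGER NOT NULL REFERENCES images(id),
    model_id    INTEGER NOT NULL REFERENCES models(id),
    class_name  TEXT    NOT NULL,
    x_min       REAL    NOT NULL,
    y_min       REAL    NOT NULL,
    x_max       REAL    NOT NULL,
    y_max       REAL    NOT NULL,
    confidence  REAL    NOT NULL,
    detected_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
    metadata    TEXT                -- JSON stored as text in SQLite
);
CREATE INDEX IF NOT EXISTS idx_detections_image_id    ON detections(image_id);
CREATE INDEX IF NOT EXISTS idx_detections_model_id    ON detections(model_id);
CREATE INDEX IF NOT EXISTS idx_detections_class_name  ON detections(class_name);
CREATE INDEX IF NOT EXISTS idx_detections_detected_at ON detections(detected_at);
"""


def init_db(db_path: str = "data/detections.db") -> None:
    """Create the detections table and indexes if they do not exist."""
    with sqlite3.connect(db_path) as conn:
        conn.executescript(DETECTIONS_DDL)
```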
## Application Architecture
### Component Diagram

```mermaid
graph TB
    subgraph "GUI Layer"
        MW[Main Window]
        TT[Training Tab]
        VT[Validation Tab]
        DT[Detection Tab]
        RT[Results Tab]
        AT[Annotation Tab]
    end

    subgraph "Business Logic"
        YW[YOLO Wrapper]
        INF[Inference Engine]
        DBM[Database Manager]
        CM[Config Manager]
    end

    subgraph "Data Layer"
        DB[(SQLite Database)]
        FS[File System]
        YOLO[YOLOv8 Model]
    end

    MW --> TT
    MW --> VT
    MW --> DT
    MW --> RT
    MW --> AT

    TT --> YW
    VT --> YW
    DT --> INF
    RT --> DBM
    AT --> DBM

    YW --> YOLO
    YW --> DBM
    INF --> YOLO
    INF --> DBM
    DBM --> DB
    CM --> FS

    YW -.reads.-> FS
    INF -.reads.-> FS
```

### Key Components

#### 1. **YOLO Wrapper** ([`src/model/yolo_wrapper.py`](src/model/yolo_wrapper.py))
Encapsulates YOLOv8 operations:
- Load the pre-trained YOLOv8s model
- Fine-tune on custom microscopy datasets
- Export trained models
- Provide training progress callbacks
- Calculate validation metrics

**Key Methods:**
```python
class YOLOWrapper:
    def __init__(self, model_path: str = "yolov8s.pt")
    def train(self, data_yaml: str, epochs: int, callbacks: dict)
    def validate(self, data_yaml: str) -> dict
    def predict(self, image_path: str, conf: float) -> list
    def export_model(self, format: str, output_path: str)
```
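
Internally the wrapper is expected to delegate to the Ultralytics `YOLO` API. Below is a minimal sketch of how `train` and `predict` might be implemented; the concrete code in `yolo_wrapper.py` may differ, and everything beyond the standard Ultralytics calls is illustrative.

```python
from ultralytics import YOLO


class YOLOWrapper:
    """Sketch only: thin wrapper around the Ultralytics YOLO API."""

    def __init__(self, model_path: str = "yolov8s.pt"):
        # Downloads yolov8s.pt automatically on first use.
        self.model = YOLO(model_path)

    def train(self, data_yaml: str, epochs: int, callbacks: dict = None):
        # Ultralytics callbacks are registered per event name,
        # e.g. {"on_train_epoch_end": update_plots}.
        for event, fn in (callbacks or {}).items():
            self.model.add_callback(event, fn)
        return self.model.train(data=data_yaml, epochs=epochs, imgsz=640, batch=16)

    def predict(self, image_path: str, conf: float = 0.25) -> list:
        detections = []
        for result in self.model.predict(source=image_path, conf=conf):
            for box in result.boxes:
                # xyxyn gives normalized 0-1 coordinates, matching the detections table.
                x_min, y_min, x_max, y_max = box.xyxyn[0].tolist()
                detections.append({
                    "class_name": result.names[int(box.cls)],
                    "confidence": float(box.conf),
                    "bbox": (x_min, y_min, x_max, y_max),
                })
        return detections
```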
#### 2. **Database Manager** ([`src/database/db_manager.py`](src/database/db_manager.py))
Handles all database operations:
- Connection management
- CRUD operations for all tables
- Query builders for filtering and searching
- Transaction management
- Database migrations

**Key Methods:**
```python
class DatabaseManager:
    def __init__(self, db_path: str)
    def add_detection(self, detection_data: dict) -> int
    def add_image(self, image_data: dict) -> int
    def add_model(self, model_data: dict) -> int
    def get_detections(self, filters: dict) -> list
    def export_detections(self, format: str, output_path: str)
```
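
As a sketch, `add_detection` can be a single parameterized insert. The snippet below assumes the plain `sqlite3` module and the column names above; the actual implementation may go through SQLAlchemy instead.

```python
import json
import sqlite3


def add_detection(conn: sqlite3.Connection, detection_data: dict) -> int:
    """Insert one detection row and return its id (sketch only)."""
    cur = conn.execute(
        """
        INSERT INTO detections
            (image_id, model_id, class_name, x_min, y_min, x_max, y_max,
             confidence, metadata)
        VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)
        """,
        (
            detection_data["image_id"],
            detection_data["model_id"],
            detection_data["class_name"],
            detection_data["x_min"],
            detection_data["y_min"],
            detection_data["x_max"],
            detection_data["y_max"],
            detection_data["confidence"],
            json.dumps(detection_data.get("metadata", {})),
        ),
    )
    conn.commit()
    return cur.lastrowid
```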
#### 3. **Inference Engine** ([`src/model/inference.py`](src/model/inference.py))
Manages detection operations:
- Real-time single image detection
- Batch processing with progress tracking
- Result formatting and storage
- Confidence thresholding

**Key Methods:**
```python
class InferenceEngine:
    def __init__(self, model_path: str, db_manager: DatabaseManager)
    def detect_single(self, image_path: str, conf: float) -> dict
    def detect_batch(self, image_paths: list, conf: float, callback: callable)
    def save_results(self, results: list, model_id: int)
```
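
A sketch of how `detect_batch` might tie prediction and the progress callback together (illustrative only; error handling and the database writes are omitted):

```python
class InferenceEngine:
    """Sketch only: batch inference with a progress callback."""

    def __init__(self, model_path: str, db_manager):
        self.yolo = YOLOWrapper(model_path)  # wrapper sketched in section 1
        self.db = db_manager

    def detect_batch(self, image_paths: list, conf: float, callback: callable = None):
        results = []
        for index, image_path in enumerate(image_paths, start=1):
            detections = self.yolo.predict(image_path, conf=conf)
            results.append({"image_path": image_path, "detections": detections})
            if callback:
                callback(index, len(image_paths))  # e.g. drive a progress bar
        return results
```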
#### 4. **GUI Components**

##### **Main Window** ([`src/gui/main_window.py`](src/gui/main_window.py))
- Tab container for the different functional areas
- Menu bar with File, Tools, and Help menus
- Status bar with progress indicators
- Global settings and repository configuration

##### **Training Tab** ([`src/gui/tabs/training_tab.py`](src/gui/tabs/training_tab.py))
Features:
- Dataset path selection (YOLO format)
- Training parameter configuration (epochs, batch size, learning rate)
- Start/stop training controls
- Real-time training progress with pyqtgraph plots:
  - Loss curves (training and validation)
  - mAP progression
- Training log display

##### **Validation Tab** ([`src/gui/tabs/validation_tab.py`](src/gui/tabs/validation_tab.py))
Features:
- Model selection for validation
- Validation dataset selection
- Metrics visualization:
  - Confusion matrix
  - Precision-recall curves
  - mAP by class
- Results table with per-class metrics

##### **Detection Tab** ([`src/gui/tabs/detection_tab.py`](src/gui/tabs/detection_tab.py))
Features:
- Model selection dropdown
- Detection modes:
  - Single image detection
  - Batch folder processing
  - Real-time camera feed (future)
- Confidence threshold slider
- Image viewer with bounding box overlay
- Detection results panel
- Save-to-database button

##### **Results Tab** ([`src/gui/tabs/results_tab.py`](src/gui/tabs/results_tab.py))
Features:
- Searchable detection history table
- Filters by date, model, class, and confidence
- Image preview with detection overlay
- Statistics dashboard with pyqtgraph:
  - Detections-per-class histogram
  - Confidence distribution
  - Detection timeline
- Export options (CSV, JSON, Excel), sketched below
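
The export options above map naturally onto pandas writers (openpyxl supplies the Excel backend already listed in the dependencies). A sketch of what the export helper might look like:

```python
import pandas as pd


def export_detections(rows: list, fmt: str, output_path: str) -> None:
    """Sketch: write detection rows (list of dicts) to CSV, JSON, or Excel."""
    df = pd.DataFrame(rows)
    if fmt == "csv":
        df.to_csv(output_path, index=False)
    elif fmt == "json":
        df.to_json(output_path, orient="records", indent=2)
    elif fmt == "excel":
        df.to_excel(output_path, index=False)  # requires openpyxl
    else:
        raise ValueError(f"Unsupported export format: {fmt}")
```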
##### **Annotation Tab** ([`src/gui/tabs/annotation_tab.py`](src/gui/tabs/annotation_tab.py))
Future feature for manual annotation:
- Image browser and viewer
- Drawing tools for bounding boxes
- Class label assignment
- Annotation export to YOLO format
- Annotation statistics

## Workflow Diagrams

### Training Workflow

```mermaid
sequenceDiagram
    participant User
    participant TrainingTab
    participant YOLOWrapper
    participant DBManager
    participant Database

    User->>TrainingTab: Configure training parameters
    User->>TrainingTab: Select dataset (YOLO format)
    User->>TrainingTab: Click "Start Training"
    TrainingTab->>YOLOWrapper: train(data_yaml, epochs, callbacks)

    loop Every epoch
        YOLOWrapper-->>TrainingTab: Progress callback (loss, metrics)
        TrainingTab->>TrainingTab: Update plots
    end

    YOLOWrapper-->>TrainingTab: Training complete (model_path)
    TrainingTab->>DBManager: add_model(model_info)
    DBManager->>Database: INSERT INTO models
    Database-->>DBManager: model_id
    TrainingTab->>User: Show completion message
```

### Detection Workflow

```mermaid
sequenceDiagram
    participant User
    participant DetectionTab
    participant InferenceEngine
    participant YOLOWrapper
    participant DBManager
    participant Database

    User->>DetectionTab: Select model
    User->>DetectionTab: Choose image/folder
    User->>DetectionTab: Set confidence threshold
    User->>DetectionTab: Click "Detect"

    DetectionTab->>InferenceEngine: detect_batch(images, conf)

    loop For each image
        InferenceEngine->>YOLOWrapper: predict(image_path, conf)
        YOLOWrapper-->>InferenceEngine: detections list
        InferenceEngine->>DBManager: add_image(image_info)
        DBManager->>Database: INSERT INTO images
        Database-->>DBManager: image_id

        loop For each detection
            InferenceEngine->>DBManager: add_detection(detection_info)
            DBManager->>Database: INSERT INTO detections
        end

        InferenceEngine-->>DetectionTab: Progress update
    end

    InferenceEngine-->>DetectionTab: All detections complete
    DetectionTab->>User: Display results
```

## Configuration Management

### Application Configuration ([`config/app_config.yaml`](config/app_config.yaml))

```yaml
database:
  path: "data/detections.db"

image_repository:
  base_path: ""  # Set by user
  allowed_extensions: [".jpg", ".jpeg", ".png", ".tif", ".tiff"]

models:
  default_base_model: "yolov8s.pt"
  models_directory: "data/models"

training:
  default_epochs: 100
  default_batch_size: 16
  default_imgsz: 640
  default_patience: 50

detection:
  default_confidence: 0.25
  max_batch_size: 100

visualization:
  colors:
    organelle: "#FF6B6B"
    membrane_branch: "#4ECDC4"
  bbox_thickness: 2

export:
  formats: ["csv", "json", "excel"]
  default_format: "csv"

logging:
  level: "INFO"
  file: "logs/app.log"
```
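
A sketch of how [`src/utils/config_manager.py`](src/utils/config_manager.py) might load and save this file with PyYAML; the real class may expose a different interface.

```python
import yaml
from pathlib import Path


class ConfigManager:
    """Sketch only: load/save config/app_config.yaml with PyYAML."""

    def __init__(self, config_path: str = "config/app_config.yaml"):
        self.config_path = Path(config_path)
        self.config = yaml.safe_load(self.config_path.read_text()) or {}

    def get(self, *keys, default=None):
        # e.g. get("training", "default_epochs") -> 100
        node = self.config
        for key in keys:
            if not isinstance(node, dict) or key not in node:
                return default
            node = node[key]
        return node

    def save(self) -> None:
        self.config_path.write_text(yaml.safe_dump(self.config, sort_keys=False))
```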
## Data Flow

### Image Repository Structure
```
image_repository/
├── train/
│   ├── images/
│   │   ├── img001.png
│   │   └── ...
│   └── labels/
│       ├── img001.txt (YOLO format)
│       └── ...
├── val/
│   ├── images/
│   └── labels/
└── test/
    └── images/
```

### YOLO Format Labels
Each label file contains one line per object in the format:
```
<class_id> <x_center> <y_center> <width> <height>
```
All coordinates are normalized to the 0-1 range.
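For example, a hypothetical label line `0 0.500000 0.432000 0.120000 0.085000` describes a class-0 box centered at 50.0% / 43.2% of the image, 12% wide and 8.5% tall. Converting a pixel-space bounding box into this form is a small calculation; a sketch:

```python
def bbox_to_yolo(x_min, y_min, x_max, y_max, img_w, img_h, class_id):
    """Convert a pixel-space box (x_min, y_min, x_max, y_max) to a YOLO label line."""
    x_center = (x_min + x_max) / 2 / img_w
    y_center = (y_min + y_max) / 2 / img_h
    width = (x_max - x_min) / img_w
    height = (y_max - y_min) / img_h
    return f"{class_id} {x_center:.6f} {y_center:.6f} {width:.6f} {height:.6f}"


# Example: bbox_to_yolo(100, 150, 220, 300, 640, 480, 0)
# -> "0 0.250000 0.468750 0.187500 0.312500"
```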
## Error Handling Strategy

1. **Database Errors**: Graceful degradation, show an error dialog, log to file
2. **Model Loading Errors**: Validate the model before loading, give clear error messages
3. **File Access Errors**: Check permissions, validate paths, inform the user
4. **Training Errors**: Catch Ultralytics exceptions, save partial progress
5. **Memory Errors**: Enforce batch processing limits, monitor memory usage

## Performance Considerations

1. **Database**:
   - Indexed columns: image_id, model_id, class_name, detected_at
   - Batch inserts for multiple detections
   - Connection pooling

2. **Image Processing**:
   - Lazy loading for large datasets
   - Thumbnail generation for previews
   - Async operations for batch processing

3. **GUI**:
   - Worker threads for long operations (training, batch detection), sketched below
   - Progress signals for UI updates
   - Image caching for faster display
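
A minimal sketch of the worker-thread pattern mentioned above, using a PySide6 `QThread` subclass that emits a progress `Signal`; the real tabs may organize this differently.

```python
from PySide6.QtCore import QThread, Signal


class DetectionWorker(QThread):
    """Sketch: run batch detection off the GUI thread and report progress."""

    progress = Signal(int, int)   # (processed, total)
    finished_ok = Signal(list)    # list of result dicts

    def __init__(self, engine, image_paths, conf):
        super().__init__()
        self.engine = engine
        self.image_paths = image_paths
        self.conf = conf

    def run(self):
        results = self.engine.detect_batch(
            self.image_paths,
            conf=self.conf,
            callback=lambda done, total: self.progress.emit(done, total),
        )
        self.finished_ok.emit(results)


# Usage in a tab (sketch):
# worker = DetectionWorker(engine, paths, conf=0.25)
# worker.progress.connect(update_progress_bar)
# worker.finished_ok.connect(show_results)
# worker.start()
```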
## Testing Strategy

1. **Unit Tests**:
   - Database operations
   - YOLO wrapper methods
   - Utility functions

2. **Integration Tests**:
   - End-to-end training workflow
   - Detection and storage pipeline
   - Export functionality

3. **GUI Tests**:
   - User interaction flows
   - Signal/slot connections
   - Widget state management

## Future Enhancements

1. **Phase 1** (Core Features):
   - Basic training, validation, detection
   - Results storage and visualization
   - Model management

2. **Phase 2** (Enhanced Features):
   - Annotation tool implementation
   - Advanced filtering and search
   - Model comparison tools
   - Batch exports

3. **Phase 3** (Advanced Features):
   - Real-time camera detection
   - Multi-model ensemble
   - Cloud storage integration
   - Collaborative annotation

## Dependencies

### Python Packages
```
ultralytics>=8.0.0
PySide6>=6.5.0
pyqtgraph>=0.13.0
numpy>=1.24.0
opencv-python>=4.8.0
Pillow>=10.0.0
pyyaml>=6.0
sqlalchemy>=2.0.0
pandas>=2.0.0
openpyxl>=3.1.0
```

### System Requirements
- Python 3.8+
- CUDA-capable GPU (recommended for training)
- 8GB RAM minimum
- 2GB disk space for models and data

IMPLEMENTATION_GUIDE.md (new file, 1223 lines; diff suppressed because it is too large)

QUICKSTART.md (new file, 261 lines)
# Quick Start Guide
This guide will help you get the Microscopy Object Detection Application up and running quickly.

## Prerequisites

- Python 3.8 or higher
- pip package manager
- (Optional) CUDA-capable GPU for faster training and inference

## Installation

### 1. Clone or Navigate to Project Directory

```bash
cd /home/martin/code/object_detection
```

### 2. Create Virtual Environment

```bash
python3 -m venv venv
source venv/bin/activate  # On Linux/Mac
# or
venv\Scripts\activate     # On Windows
```

### 3. Install Dependencies

```bash
pip install -r requirements.txt
```

This will install:
- ultralytics (YOLOv8)
- PySide6 (GUI framework)
- pyqtgraph (visualization)
- OpenCV and Pillow (image processing)
- and the remaining dependencies listed in requirements.txt

**Note:** The first run will automatically download the yolov8s.pt model (~22 MB).

### 4. Verify Installation

```bash
python -c "import PySide6; import ultralytics; print('Installation successful!')"
```

## First Run

### Launch the Application

```bash
python main.py
```

The application window should open with 5 tabs:
- **Detection**: For running object detection
- **Training**: For training custom models (placeholder)
- **Validation**: For validating models (placeholder)
- **Results**: For viewing detection history (placeholder)
- **Annotation**: Future feature (placeholder)
## Configuration

### Set Up Image Repository

1. Go to **File → Settings**
2. On the "General" tab, click "Browse..." next to "Base Path"
3. Select your microscopy images directory
4. Click "Save"

This tells the application where your images are stored.

### Adjust Detection Settings

In the Settings dialog:
- **Detection tab**: Adjust the default confidence threshold (default: 0.25)
- **Training tab**: Configure default training parameters
- All settings are saved to [`config/app_config.yaml`](config/app_config.yaml)

## Running Detection

### Single Image Detection

1. Go to the **Detection** tab
2. Select a model from the dropdown (default: the base yolov8s.pt model)
3. Adjust the confidence threshold with the slider
4. Click "Detect Single Image"
5. Select an image file
6. View results in the results panel

### Batch Detection

1. Go to the **Detection** tab
2. Select a model
3. Click "Detect Batch (Folder)"
4. Select a folder containing images
5. Confirm the number of images to process
6. Wait for processing to complete
7. Results are automatically saved to the database

## Understanding the Results

Detection results include:
- **Image path**: Location of the processed image
- **Detections**: Number of objects found
- **Class names**: Types of objects detected (e.g., organelle, membrane_branch)
- **Confidence scores**: Detection confidence (0-1)
- **Bounding boxes**: Object locations (stored in the database)

All results are stored in the SQLite database at [`data/detections.db`](data/detections.db).

## Database

The application uses SQLite to store:
- **Models**: Information about trained models
- **Images**: Metadata about processed images
- **Detections**: All detection results with bounding boxes
- **Annotations**: Manual annotations (future feature)

### View Database Statistics

Go to **Tools → Database Statistics** to see:
- Total number of detections
- Detections per class
- Average confidence scores
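
The same kind of summary can also be pulled straight out of the SQLite file with a few lines of Python, using the table and column names described in [`ARCHITECTURE.md`](ARCHITECTURE.md); a sketch:

```python
import sqlite3

# Sketch: detections per class with average confidence.
with sqlite3.connect("data/detections.db") as conn:
    rows = conn.execute(
        """
        SELECT class_name, COUNT(*) AS n, ROUND(AVG(confidence), 3) AS avg_conf
        FROM detections
        GROUP BY class_name
        ORDER BY n DESC
        """
    ).fetchall()

for class_name, n, avg_conf in rows:
    print(f"{class_name}: {n} detections, avg confidence {avg_conf}")
```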
## Next Steps

### Training Custom Models (Coming Soon)

The Training tab will allow you to:
1. Select a YOLO-format dataset
2. Configure training parameters
3. Monitor training progress
4. Save trained models

### Preparing Your Dataset

For training, you'll need data in YOLO format:
```
dataset/
├── train/
│   ├── images/
│   │   ├── img001.png
│   │   └── ...
│   └── labels/
│       ├── img001.txt
│       └── ...
├── val/
│   ├── images/
│   └── labels/
└── data.yaml
```
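
The `data.yaml` file tells Ultralytics where the splits live and what the classes are. A minimal illustrative example (the class names shown are placeholders and must match your label files):

```yaml
# dataset/data.yaml (example)
path: /absolute/path/to/dataset   # dataset root
train: train/images
val: val/images

nc: 2                             # number of classes
names:
  0: organelle
  1: membrane_branch
```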
See [`README.md`](README.md) for detailed dataset format information.
## Troubleshooting

### Application Won't Start

**Error: Module not found**
```bash
# Make sure the virtual environment is activated
source venv/bin/activate
pip install -r requirements.txt
```

**Error: Qt platform plugin**
```bash
# Install system Qt dependencies (Linux)
sudo apt-get install libxcb-xinerama0
```

### Detection Not Working

**No models available**
- The base YOLOv8s model will be downloaded automatically on first use
- Make sure you have an internet connection for the first run

**Images not found**
- Verify the image repository path in Settings
- Check that image files have supported extensions (.jpg, .png, .tif, etc.)

### Performance Issues

**Slow detection**
- Use a GPU if available (CUDA is auto-detected)
- Reduce the batch size in Settings
- Process fewer images at once

**Out of memory**
- Reduce the batch size
- Use the CPU instead of the GPU for large images
- Process images sequentially rather than in a batch

## File Locations

- **Database**: `data/detections.db`
- **Configuration**: `config/app_config.yaml`
- **Logs**: `logs/app.log`
- **Models**: `data/models/`
- **Results**: Stored in the database

## Command Line Tips

### Check GPU Availability

```bash
python -c "import torch; print(f'CUDA available: {torch.cuda.is_available()}')"
```

### View Logs

```bash
tail -f logs/app.log
```

### Backup Database

```bash
cp data/detections.db data/detections_backup_$(date +%Y%m%d).db
```

## Getting Help

- Read the full documentation in [`README.md`](README.md)
- Check architecture details in [`ARCHITECTURE.md`](ARCHITECTURE.md)
- Review the implementation guide in [`IMPLEMENTATION_GUIDE.md`](IMPLEMENTATION_GUIDE.md)
- Check logs in `logs/app.log` for error messages

## What's Implemented

✅ **Core Infrastructure**
- Project structure and configuration
- Database schema and operations
- YOLO model wrapper
- Inference engine with batch processing
- Configuration management
- Logging system

✅ **GUI Components**
- Main window with menu bar
- Settings dialog
- Detection tab with single/batch detection
- Placeholder tabs for future features

🚧 **In Progress / Planned**
- Training tab implementation
- Validation tab with metrics
- Results browser and visualization
- Annotation tool
- Advanced filtering and export
- Real-time camera detection

---

**Happy detecting! 🔬🔍**

For questions or issues, please refer to the documentation or check the application logs.