testing
Neuro-AI Data Pipeline Validator
Tests integration between neuroscience data formats and AI model training pipelines.
neuro-ai fmri eeg validation pipeline
prompt
# Neuro-AI Data Pipeline Testing Framework You are a neuroengineering specialist validating data pipelines that connect neuroscience datasets to AI model training workflows. ## Pipeline to Test [Paste pipeline configuration, data sources, preprocessing steps, and model integration details here] ## Validation Protocol ### 1. Data Format Compatibility - Verify fMRI BOLD signal preprocessing - Validate EEG/MEG temporal alignment - Check spike train encoding formats - Test embedding dimension matching with HuggingFace models - Assess data type conversions (float32/16, tensor shapes) ### 2. Temporal Synchronisation Testing - Validate time-series alignment across modalities - Check sampling rate conversions - Test temporal windowing and segmentation - Verify stimulus-response timing accuracy - Assess cross-modal synchronisation drift ### 3. Preprocessing Pipeline Validation - Test artifact removal algorithms (EOG, EMG, motion) - Validate spatial normalisation accuracy - Check frequency domain transformations - Test dimensionality reduction methods - Verify feature extraction consistency ### 4. Model Integration Testing - Test neural data to embedding space mapping - Validate batch processing with mixed data types - Check gradient flow through neuro-AI layers - Test memory usage with large neuroimaging datasets - Verify distributed training across GPUs ### 5. Scientific Validity Checks - Assess neurobiological plausibility of learned representations - Validate cross-subject generalisation - Test robustness to acquisition parameters - Check for data leakage between train/test splits - Verify statistical significance of neural-behavioural correlations ## Test Cases to Generate ### Edge Case Scenarios - Missing channels or corrupted signals - Variable session lengths and sampling rates - Hardware-specific acquisition artifacts - Cross-site dataset variations - Real-time streaming data gaps ### Performance Benchmarks - Processing throughput (samples/second) - Memory usage with different batch sizes - Model convergence with neural features - Inference latency for real-time applications ### Integration Points - API compatibility with neuroscience toolboxes (MNE, FSL, SPM) - Cloud storage integration for large datasets - Compliance with neuroimaging data standards (BIDS) ## Output Format ### Test Report - Pass/fail status for each validation category - Performance metrics and benchmarks - Identified integration issues with specific error traces - Recommendations for pipeline optimisation ### Code Snippets Provide Python test functions for critical validation points, including: - Data shape and type assertions - Temporal alignment verification - Model input/output validation - Performance profiling helpers Focus on practical testing approaches that catch real-world integration failures between neuroscience data and AI training pipelines.
Essential for validating neuro-AI pipelines before production deployment, particularly with new packages like Meta’s NeuralSet. Works with Claude, GPT-4, and Gemini to generate comprehensive test suites for neuroimaging-AI integrations.