Gospel Command Line Interface Reference

This comprehensive reference covers all Gospel CLI commands, options, and usage patterns for genomic analysis workflows.

Table of Contents

  1. Command Overview
  2. Global Options
  3. Analyze Command
  4. Query Command
  5. Visualize Command
  6. Knowledge Base Commands
  7. LLM Commands
  8. Configuration
  9. Workflow Examples

Command Overview

Gospel provides a comprehensive CLI for genomic analysis and AI-powered interpretation:

gospel --help

Main Commands

Command Purpose Primary Use Case
analyze Process genomic data and extract insights Core genomic analysis
query Interactive AI-powered queries Explore analysis results
visualize Generate charts and network visualizations Data presentation
kb Manage knowledge base Build and query scientific databases
llm Work with language models Train and query domain-specific AI

Quick Start

# Basic analysis across all domains
gospel analyze --vcf genome.vcf --output results/

# Domain-specific analysis
gospel analyze --vcf genome.vcf --domains fitness --output fitness_results/

# Interactive exploration
gospel query --interactive --results results/

# Generate visualizations
gospel visualize --results results/ --output charts/

Global Options

These options are available for all Gospel commands:

gospel [GLOBAL_OPTIONS] COMMAND [COMMAND_OPTIONS]

Common Global Options

--version                    # Show Gospel version
--config PATH               # Specify configuration file
--verbose, -v               # Enable verbose output
--quiet, -q                 # Suppress non-error output
--log-level LEVEL           # Set logging level (DEBUG, INFO, WARN, ERROR)
--threads N                 # Number of parallel threads
--memory-limit SIZE         # Memory usage limit (e.g., "8GB")

Configuration File

Specify a custom configuration:

gospel --config ~/.gospel/custom_config.yaml analyze --vcf genome.vcf

Analyze Command

The analyze command is Gospel’s core genomic analysis engine.

Basic Syntax

gospel analyze --vcf INPUT.vcf [OPTIONS]

Input Options

# Required
--vcf PATH                  # Input VCF file

# Optional input files
--reference PATH            # Reference genome (default: GRCh38)
--annotation PATH           # Custom annotation file
--pedigree PATH             # Family pedigree file
--phenotype PATH            # Phenotype data file

Domain Selection

# All domains (default)
--domains all

# Specific domains
--domains fitness
--domains pharmacogenetics  
--domains nutrition

# Multiple domains
--domains fitness,pharmacogenetics

Analysis Parameters

# Quality filters
--min-quality N             # Minimum variant quality score (default: 30)
--min-depth N               # Minimum read depth (default: 10)
--max-allele-freq FLOAT     # Maximum population frequency (default: 0.05)

# Analysis scope
--include-regulatory        # Include regulatory region variants
--include-structural        # Include structural variants
--include-cnvs              # Include copy number variations

# Population parameters
--population CODE           # Population ancestry (EUR, AFR, AMR, EAS, SAS)
--custom-frequencies PATH   # Custom allele frequency database

Scoring Options

# Scoring weights
--functional-weight FLOAT   # Weight for functional impact (default: 0.4)
--conservation-weight FLOAT # Weight for conservation (default: 0.3)
--frequency-weight FLOAT    # Weight for population frequency (default: 0.2)
--literature-weight FLOAT   # Weight for literature evidence (default: 0.1)

# Confidence thresholds
--min-confidence FLOAT      # Minimum confidence for reporting (default: 0.6)
--high-confidence FLOAT     # Threshold for high confidence (default: 0.8)

Output Options

# Output directory and format
--output PATH               # Output directory (default: ./results)
--format FORMAT             # Output format (html, json, csv, all)

# Report customization
--detailed-annotations      # Include detailed variant annotations
--include-networks          # Generate protein interaction networks
--include-pathways          # Include pathway enrichment analysis
--generate-plots            # Create visualization plots

# File naming
--prefix STRING             # Prefix for output files
--timestamp                 # Add timestamp to output files

Performance Options

# Computational resources
--threads N                 # Number of CPU cores (default: auto)
--memory-limit SIZE         # Memory limit (e.g., "16GB")
--cache-dir PATH            # Directory for caching (default: ~/.gospel/cache)

# Processing mode
--streaming                 # Stream large VCF files
--chunk-size N              # Process variants in chunks of size N
--parallel-domains          # Process domains in parallel

Examples

Basic Analysis

# Simple analysis with default settings
gospel analyze --vcf sample.vcf --output basic_analysis/

# Analysis with quality filters
gospel analyze --vcf sample.vcf \
    --min-quality 50 \
    --min-depth 20 \
    --max-allele-freq 0.01 \
    --output high_quality_analysis/

Domain-Specific Analysis

# Fitness domain only
gospel analyze --vcf athlete.vcf \
    --domains fitness \
    --include-regulatory \
    --output fitness_profile/

# Pharmacogenetics with drug focus
gospel analyze --vcf patient.vcf \
    --domains pharmacogenetics \
    --drugs "warfarin,clopidogrel,simvastatin" \
    --output pharma_analysis/

# Nutritional genomics
gospel analyze --vcf genome.vcf \
    --domains nutrition \
    --nutrients "folate,vitamin_d,caffeine" \
    --output nutrition_profile/

Advanced Analysis

# Comprehensive analysis with all features
gospel analyze --vcf genome.vcf \
    --domains all \
    --include-regulatory \
    --include-structural \
    --include-cnvs \
    --include-networks \
    --include-pathways \
    --population EUR \
    --min-confidence 0.7 \
    --generate-plots \
    --format all \
    --threads 8 \
    --output comprehensive_analysis/

Family Analysis

# Trio analysis (parents + child)
gospel analyze --vcf family.vcf \
    --pedigree family.ped \
    --inheritance-mode recessive \
    --domains all \
    --output family_analysis/

Query Command

The query command provides AI-powered exploration of genomic analysis results.

Basic Syntax

gospel query [OPTIONS]

Input Sources

# Query analysis results
--results PATH              # Directory containing analysis results
--vcf PATH                  # Direct VCF file query
--variant VARIANT           # Specific variant (e.g., "rs1234567")

# Knowledge base query
--kb-dir PATH               # Query knowledge base directly
--pubmed-search             # Search PubMed for additional context

Query Modes

# Interactive mode
--interactive               # Start interactive query session

# Single query mode
--query "QUESTION"          # Ask specific question

# Batch query mode
--query-file PATH           # File containing multiple queries

AI Model Options

# Model selection
--model MODEL_NAME          # Specify AI model (default: llama3)
--temperature FLOAT         # Model temperature (default: 0.1)
--max-tokens N              # Maximum response tokens (default: 2000)

# Context options
--include-literature        # Include literature context
--include-pathways          # Include pathway information
--include-population        # Include population genetics context

Output Options

# Response format
--format FORMAT             # Response format (text, json, markdown)
--save-session PATH         # Save query session to file
--export-results PATH       # Export all responses to file

Example Queries

Interactive Mode

# Start interactive session
gospel query --interactive --results analysis_results/

# Example session:
> What are my genetic advantages for endurance sports?
> Which medications should I be cautious about?
> How does my MTHFR variant affect folate metabolism?
> Show me genes connected to ACTN3 in my network
> What supplements might benefit my genetic profile?

Direct Queries

# Specific genetic question
gospel query --results analysis/ \
    --query "What does my APOE genotype mean for cardiovascular health?"

# Drug interaction query
gospel query --results analysis/ \
    --query "Is it safe for me to take warfarin based on my genetics?"

# Training optimization
gospel query --results fitness_analysis/ \
    --query "What type of training would be most effective for my genetic profile?"

Batch Queries

# Create query file
cat > queries.txt << EOF
What are my top 5 genetic risk factors?
Which domains show the highest scores?
What lifestyle modifications are recommended?
Are there any drug-gene interactions I should know about?
EOF

# Run batch queries
gospel query --results analysis/ --query-file queries.txt

Visualize Command

Generate comprehensive visualizations of genomic analysis results.

Basic Syntax

gospel visualize --results RESULTS_DIR [OPTIONS]

Input Options

# Source data
--results PATH              # Analysis results directory
--variants PATH             # Variant data file
--scores PATH               # Score data file
--networks PATH             # Network data file

Visualization Types

# Chart types
--score-distributions       # Domain score distributions
--variant-impacts           # Variant impact plots
--pathway-enrichment        # Pathway enrichment charts
--population-comparisons    # Population frequency comparisons

# Network visualizations
--protein-networks          # Protein interaction networks
--pathway-networks          # Biological pathway networks
--cross-domain-networks     # Cross-domain gene networks

# Specialized plots
--fitness-radar             # Fitness profile radar chart
--pharma-heatmap            # Pharmacogenetic heatmap
--nutrition-wheel           # Nutritional requirements wheel

Output Options

# Output settings
--output PATH               # Output directory
--format FORMAT             # Image format (png, svg, pdf, html)
--resolution N              # Image resolution (DPI)
--theme THEME               # Visualization theme (light, dark, publication)

# Interactive features
--interactive               # Generate interactive HTML plots
--include-tooltips          # Add detailed tooltips
--enable-zoom               # Enable plot zooming

Example Visualizations

# Basic visualization suite
gospel visualize --results analysis/ \
    --score-distributions \
    --variant-impacts \
    --protein-networks \
    --output charts/

# Publication-quality figures
gospel visualize --results analysis/ \
    --score-distributions \
    --pathway-enrichment \
    --format pdf \
    --resolution 300 \
    --theme publication \
    --output figures/

# Interactive web report
gospel visualize --results analysis/ \
    --interactive \
    --include-tooltips \
    --enable-zoom \
    --format html \
    --output web_report/

Knowledge Base Commands

Manage Gospel’s scientific knowledge base for enhanced AI queries.

Build Knowledge Base

gospel kb build --pdf-dir PDFS/ --output-dir KB/ [OPTIONS]

Options

# Input sources
--pdf-dir PATH              # Directory containing PDF papers
--pubmed-ids FILE           # File with PubMed IDs to download
--text-dir PATH             # Directory with text files

# Processing options
--model MODEL_NAME          # Model for text processing (default: llama3)
--chunk-size N              # Text chunk size (default: 1000)
--overlap N                 # Chunk overlap (default: 200)

# Output options
--output-dir PATH           # Knowledge base output directory
--index-name NAME           # Vector index name
--metadata-format FORMAT    # Metadata format (json, csv)

Examples

# Build from PDF collection
gospel kb build \
    --pdf-dir research_papers/ \
    --output-dir knowledge_base/ \
    --model llama3

# Build with custom parameters
gospel kb build \
    --pdf-dir papers/ \
    --pubmed-ids pubmed_list.txt \
    --output-dir kb/ \
    --chunk-size 1500 \
    --overlap 300

Query Knowledge Base

gospel kb query --kb-dir KB_DIR --query "QUESTION" [OPTIONS]

Options

# Query parameters
--kb-dir PATH               # Knowledge base directory
--query STRING              # Query string
--top-k N                   # Number of results (default: 5)
--similarity-threshold FLOAT # Minimum similarity (default: 0.7)

# Output options
--include-sources           # Include source citations
--format FORMAT             # Output format (text, json)

Examples

# Query specific topic
gospel kb query \
    --kb-dir knowledge_base/ \
    --query "ACTN3 variants and sprint performance" \
    --top-k 10

# Query with citations
gospel kb query \
    --kb-dir kb/ \
    --query "CYP2D6 pharmacogenetics" \
    --include-sources \
    --format json

LLM Commands

Train and deploy domain-specific language models.

Train Domain Model

gospel llm train --kb-dir KB_DIR --output-dir MODEL_DIR [OPTIONS]

Options

# Training data
--kb-dir PATH               # Knowledge base directory
--base-model MODEL          # Base model to fine-tune
--training-examples PATH    # Additional training examples

# Training parameters
--epochs N                  # Training epochs (default: 3)
--learning-rate FLOAT       # Learning rate (default: 1e-5)
--batch-size N              # Batch size (default: 4)

# Output options
--output-dir PATH           # Model output directory
--model-name NAME           # Custom model name

Examples

# Train fitness-focused model
gospel llm train \
    --kb-dir fitness_kb/ \
    --output-dir fitness_model/ \
    --base-model llama3 \
    --epochs 5

# Train pharmacogenetics model
gospel llm train \
    --kb-dir pharma_kb/ \
    --output-dir pharma_model/ \
    --base-model mistral \
    --learning-rate 2e-5

Query Domain Model

gospel llm query --model-dir MODEL_DIR [OPTIONS]

Options

# Model parameters
--model-dir PATH            # Trained model directory
--temperature FLOAT         # Sampling temperature
--max-tokens N              # Maximum response tokens

# Query options
--query STRING              # Single query
--interactive               # Interactive mode
--context PATH              # Additional context file

Examples

# Single query
gospel llm query \
    --model-dir fitness_model/ \
    --query "Optimize training for ACTN3 RX genotype"

# Interactive session
gospel llm query \
    --model-dir pharma_model/ \
    --interactive

Configuration

Gospel uses YAML configuration files for customizing analysis parameters.

Configuration File Structure

# ~/.gospel/config.yaml
database:
  path: ~/.gospel/databases
  cache_size: 1000MB
  update_frequency: weekly

analysis:
  default_domains: [fitness, pharmacogenetics, nutrition]
  variant_filters:
    min_quality: 30
    min_depth: 10
    max_allele_frequency: 0.05
  
  scoring:
    weights:
      functional_impact: 0.4
      conservation: 0.3
      population_frequency: 0.2
      literature_evidence: 0.1
    
    thresholds:
      high_confidence: 0.8
      medium_confidence: 0.6
      low_confidence: 0.4

domains:
  fitness:
    focus_traits: [sprint, endurance, power, recovery]
    include_injury_risk: true
    training_recommendations: true
  
  pharmacogenetics:
    drug_classes: [cardiovascular, psychiatric, oncology, pain]
    include_dosing: true
    include_interactions: true
  
  nutrition:
    include_sensitivities: true
    supplement_recommendations: true
    diet_optimization: true

output:
  default_format: html
  include_visualizations: true
  detailed_annotations: true
  compress_results: false

ai:
  default_model: llama3
  temperature: 0.1
  max_tokens: 2000
  include_literature_context: true

performance:
  threads: auto
  memory_limit: 8GB
  cache_enabled: true
  parallel_domains: true

Environment Variables

# Core settings
export GOSPEL_CONFIG_DIR=~/.gospel
export GOSPEL_DATABASE_PATH=~/.gospel/databases
export GOSPEL_CACHE_DIR=~/.gospel/cache

# AI model settings
export OLLAMA_HOST=localhost:11434
export GOSPEL_MODEL=llama3

# Performance settings
export GOSPEL_THREADS=8
export GOSPEL_MEMORY_LIMIT=16GB

Workflow Examples

Complete Analysis Workflow

#!/bin/bash
# complete_analysis.sh

# 1. Run comprehensive analysis
gospel analyze \
    --vcf genome.vcf \
    --domains all \
    --include-regulatory \
    --include-networks \
    --population EUR \
    --output analysis_results/ \
    --format all

# 2. Generate visualizations
gospel visualize \
    --results analysis_results/ \
    --interactive \
    --output visualizations/

# 3. Explore results interactively
gospel query \
    --interactive \
    --results analysis_results/

Athlete Performance Analysis

#!/bin/bash
# athlete_analysis.sh

# Focus on fitness domain with enhanced features
gospel analyze \
    --vcf athlete_genome.vcf \
    --domains fitness \
    --include-regulatory \
    --include-networks \
    --training-optimization \
    --injury-risk-assessment \
    --output athlete_profile/

# Generate sport-specific recommendations
gospel query \
    --results athlete_profile/ \
    --query "What sports and training methods suit my genetic profile?" \
    --include-literature

# Create athlete report visualizations
gospel visualize \
    --results athlete_profile/ \
    --fitness-radar \
    --training-recommendations \
    --format pdf \
    --output athlete_report/

Clinical Pharmacogenetics Workflow

#!/bin/bash
# clinical_pharma.sh

# Pharmacogenetic analysis for clinical use
gospel analyze \
    --vcf patient.vcf \
    --domains pharmacogenetics \
    --clinical-guidelines \
    --drug-interactions \
    --dosing-recommendations \
    --output pharma_analysis/

# Generate clinical report
gospel query \
    --results pharma_analysis/ \
    --query "Provide clinical pharmacogenetic recommendations" \
    --format clinical-report \
    --output clinical_pharma_report.pdf

# Create pharmacist reference
gospel visualize \
    --results pharma_analysis/ \
    --pharma-heatmap \
    --drug-response-table \
    --format html \
    --output pharmacist_reference/

Research Cohort Analysis

#!/bin/bash
# cohort_analysis.sh

# Process multiple samples in parallel
for vcf in cohort/*.vcf; do
    sample=$(basename "$vcf" .vcf)
    gospel analyze \
        --vcf "$vcf" \
        --domains all \
        --population EUR \
        --output "cohort_results/$sample/" &
done
wait

# Aggregate results
gospel aggregate \
    --input-dir cohort_results/ \
    --output cohort_summary/ \
    --generate-statistics

# Perform population analysis
gospel population-analysis \
    --cohort-dir cohort_results/ \
    --output population_genetics/ \
    --include-gwas

This comprehensive CLI reference provides all the tools needed to leverage Gospel’s full genomic analysis capabilities. For specific use cases and detailed examples, see the Examples section.

Next: Explore the API Reference for programmatic usage or check out Examples for real-world scenarios.