Supported File Formats¶

The AI Imaging Agent supports a wide range of file formats for medical and scientific imaging, as well as general data files.

Image Formats¶

Standard Images¶

Format	Extensions	Description
PNG	`.png`	Portable Network Graphics - lossless compression
JPEG	`.jpg`, `.jpeg`	Joint Photographic Experts Group - lossy compression

Currently, only PNG and JPEG are accepted for standard images. Other web formats (e.g. WebP, BMP, GIF) should be converted to PNG or JPEG before upload. Best for: General photographs, screenshots, web images

Medical Imaging Formats¶

DICOM¶

Format	Extensions	Description
DICOM	`.dcm`, `.dicom`	Digital Imaging and Communications in Medicine

Features:

Industry standard for medical imaging
Contains rich metadata (patient info, acquisition parameters)
Supports multiple modalities (CT, MRI, X-ray, etc.)
Can store 2D images or 3D volumes

Metadata Extracted:

Patient ID, Study Instance UID
Modality (CT, MR, CR, DX, etc.)
Image dimensions and spacing
Acquisition date/time
Manufacturer and model

Example Usage:

Upload a CT DICOM file and ask:
"Segment the lungs from this scan"

NIfTI¶

Format	Extensions	Description
NIfTI	`.nii`, `.nii.gz`	Neuroimaging Informatics Technology Initiative

Features:

Standard for neuroimaging research
Supports 3D and 4D (time-series) volumes
Compact storage with optional gzip compression
Contains spatial orientation information

Metadata Extracted:

Volume dimensions (x, y, z, time)
Voxel spacing
Data type and bit depth
Orientation matrix

Example Usage:

Upload a brain MRI NIfTI file:
"Register this brain scan to MNI space"

Scientific Imaging Formats¶

TIFF/TIFF Stacks¶

Format	Extensions	Description
TIFF	`.tif`, `.tiff`	Tagged Image File Format

Features:

Supports multi-page/multi-frame images
Common in microscopy and scientific imaging
Can store extensive metadata
Lossless compression options

Metadata Extracted:

Number of pages/frames (for stacks)
Dimensions (width, height, channels)
Color mode (RGB, grayscale, etc.)
Compression method
DPI/resolution information

Example Usage:

Upload a microscopy TIFF stack:
"Analyze cell structures in this z-stack"

Data Formats¶

Format Detection¶

The agent automatically detects file formats using:

File Extension: Primary detection method
Magic Bytes: Header inspection for validation
Content Analysis: Fallback for ambiguous cases

Metadata Extraction¶

What Gets Extracted¶

For each uploaded file, the agent extracts:

Image Metadata¶

Dimensions: Width, height, depth (for volumes)
Channels: Grayscale, RGB, RGBA
Data Type: uint8, int16, float32, etc.
File Size: Storage size

Medical Image Metadata¶

Modality: CT, MRI, X-ray, Ultrasound, PET, etc.
Patient Info: Anonymized IDs
Study Info: Study UID, dates
Acquisition Parameters: Slice thickness, spacing, orientation
Equipment: Manufacturer, model, software version

Format-Specific Metadata¶

DICOM Tags: Full DICOM header information
NIfTI Header: Spatial orientation, timing information
TIFF Tags: IFD entries, compression, photometric interpretation

Why Metadata Matters¶

Metadata is used for:

Format Matching: Recommend tools that support your file format
Compatibility Scoring: Prioritize tools that work with your specific format
Context Understanding: Help VLM understand image characteristics
Demo Execution: Ensure tools can process your data

Preview Generation¶

Automatic Conversion¶

Medical and scientific images are converted to PNG previews for VLM analysis:

Original Format	Preview Generation
DICOM (2D)	Single-frame converted to PNG
DICOM / NIfTI 3D volumes	Orthogonal 3‑view composite PNG (axial, sagittal, coronal) using middle slices and/or maximum intensity projections (MIPs)
NIfTI 4D (time series)	Middle timepoint volume rendered as an orthogonal 3‑view composite (middle slices and/or MIPs)
TIFF Stack	Orthogonal 3‑view composite for 3D stacks; otherwise contact sheet or animated GIF preview when appropriate
Standard Images	Single-view PNG (content preserved; may be resized/normalized)

Important: Preview generation is for visual analysis only. Original format metadata is preserved and used for compatibility matching.

Multi-Slice Handling¶

For 3D volumes, the agent typically builds an orthogonal 3‑view composite preview:

Axial: Horizontal slices (z-axis)
Sagittal: Side view (x-axis)
Coronal: Front view (y-axis)

Each view may combine the middle slice with a maximum intensity projection (MIP) to capture both anatomical context and bright structures. When a 3‑view composite cannot be generated (e.g., unusual stack layout), the agent may fall back to a contact sheet or an animated GIF preview of multiple slices.

Format Compatibility Matching¶

How It Works¶

The retrieval system adds format tokens to your query:

Original query: "segment lungs"
Enhanced query: "segment lungs format:DICOM format:3D"

Tools are matched based on:

Direct Format Support: Tool explicitly supports your format
Format Category: Tool supports format family (e.g., medical imaging)
Conversion Capability: Tool can convert from your format

IO Compatibility Scoring¶

The VLM considers:

Input Format Match: Can the tool read your file?
Output Format: What format does the tool produce?
Dimension Compatibility: 2D tool for 2D images, 3D for volumes
Modality Specificity: CT tools for CT images, MRI for MRI

Unsupported Formats¶

Currently not supported:

Proprietary Formats: Manufacturer-specific formats (e.g., .PAR/.REC)
Video Processing: Limited video analysis capability
Raw Data: Unformatted binary dumps without headers

Format Best Practices¶

Use Standard Formats

Stick to standard formats (DICOM, NIfTI, PNG, TIFF) for best tool compatibility.

Include Metadata

Use formats that preserve metadata (DICOM, NIfTI) rather than exporting to PNG/JPEG.

Check Compatibility

If a tool doesn't work, check the format compatibility in the recommendation metadata.

Convert When Needed

Some tools prefer specific formats. Convert using standard tools (ITK-SNAP, 3D Slicer) before upload.

Example Workflows by Format¶

DICOM Workflow¶

1. Upload: chest_ct.dcm
2. Query: "Segment lungs"
3. Agent detects: DICOM, CT modality, 3D volume
4. Results: CT-compatible lung segmentation tools

NIfTI Workflow¶

1. Upload: brain_mri.nii.gz
2. Query: "Skull stripping"
3. Agent detects: NIfTI, 3D volume, likely MRI
4. Results: Brain extraction tools supporting NIfTI

TIFF Stack Workflow¶

1. Upload: microscopy_stack.tif
2. Query: "Cell counting"
3. Agent detects: Multi-frame TIFF, 3D stack
4. Results: Microscopy analysis tools

Next Steps¶

Learn about Understanding Recommendations
Explore Running Demos
Check Advanced Features