Quick Start
Command Line Usage
Profile a model from a configuration file:Programmatic Usage
Profiling Report
Report Structure
The profiling report is a JSON file containing:Layer-wise Parameters
Each layer reports:Memory Analysis
Parameter Memory
Memory requirements vary by data type:| Data Type | Bytes per Parameter | Example (100K params) |
|---|---|---|
| float32 | 4 | 0.381 MB |
| float16 | 2 | 0.191 MB |
| int8 | 1 | 0.095 MB |
Activation Memory
Activation memory depends on batch size and layer widths:Summary Table
Configuration-based Profiling
Create a configuration fileconfig.py:
Advanced Usage
Custom Model Classes
Profile any model with alayer_sizes attribute:
Activation Memory Details
Access detailed activation memory breakdown:Memory Optimization Tips
Reduce Precision
Use lower precision for inference:Adjust Batch Size
Activation memory scales linearly with batch size:Layer Size Impact
Parameter memory scales with layer dimensions:Integration with Benchmarking
Enable profiling during benchmarks:Output Files
Profiling generates JSON reports in the specified output directory:Next Steps
Benchmarking
Run performance benchmarks
Hardware Simulation
Simulate hardware constraints