S3 Provisioner Tool documentationยถ
Welcome to the S3 Provisioner Tool documentation. This tool helps you provision and manage AWS S3 buckets with best practices built-in.
๐ Getting Started (Read First)ยถ
Start here if youโre new to the S3 Provisioner.
- README
Quick start and overview - Your first stop. Covers what the tool does, why it exists, quick start commands, and common workflows. Read this first to understand the toolโs value proposition and get running in 15 minutes.
- USER_GUIDE
Complete command reference - Comprehensive guide covering all 15 actions, deployment patterns (shared vs dedicated buckets), volume mounts, AWS credentials, and best practices. Essential reading for daily operations.
๐ Core Documentation (Essential)ยถ
Core references youโll use frequently.
- CONFIGURATION
Configuration parameters reference - Detailed explanation of every YAML parameter (client, environment, s3 sections), complete examples for different scenarios (production, compliance, development), lifecycle policy profiles, and validation rules.
- S3_FOLDERS
Complete folder structure reference - Technical documentation of the 130+ folder hierarchy created by the tool, organized by ML pipeline phase (data, models, notebooks, artifacts, code, config). Use this to understand where to place your ML assets.
- ML_LIFECYCLE_POLICIES
Lifecycle policy details - Deep dive into the 4 lifecycle profiles (ml-optimized, compliance, development, none), cost analysis, transition timelines, and custom policy implementation guidance. Essential for cost optimization.
๐ง Operations (Day-to-Day Use)ยถ
Operational guides for managing deployed infrastructure.
- BACKUP_RECOVERY
Backup and recovery procedures - Disaster recovery strategies, backup procedures for configurations and data, recovery workflows, and RTO/RPO guidelines. Critical for production environments.
- MONITORING_HEALTH_CHECKS
Monitoring and health checks - CloudWatch metrics, S3 event notifications, health check procedures, alerting strategies, and operational dashboards. Essential for production monitoring.
- UPDATE_PROCEDURES
Update and maintenance procedures - How to update configurations, modify existing buckets, add new solutions, and perform maintenance tasks without downtime.
- COST_OPTIMIZATION
Cost optimization strategies - Storage class comparisons, lifecycle policy cost analysis, Pattern A vs Pattern B cost breakdown, and recommendations for reducing S3 spend.
- PERFORMANCE_TUNING
Performance tuning guide - S3 request rate optimization, multipart upload tuning, transfer acceleration, and performance considerations for ML workloads.
๐ Security & Complianceยถ
Security, permissions, and compliance documentation.
- IAM_PERMISSIONS
Required AWS permissions - Complete IAM policy requirements for S3, CloudFormation, Lambda, and VPC operations. Includes least-privilege policies and role-based access examples.
- SECURITY
Security overview - High-level security architecture, encryption at rest/in transit, access control patterns, and security best practices.
- SECURITY_GUIDELINES
Detailed security guidelines - Comprehensive security implementation guide covering bucket policies, encryption, access logging, VPC endpoints, and compliance requirements.
- GOVERNANCE_COMPLIANCE
Enterprise governance framework - Complete reference architecture for implementing governance, compliance, and audit capabilities. Includes ready-to-use JSON schemas for audit logs, data lineage, compliance metadata, RBAC examples, and multi-framework compliance support (GDPR, HIPAA, SOC 2, ISO 27001, CCPA).
๐ Troubleshooting & Supportยถ
Problem resolution and getting help.
- TROUBLESHOOTING
Common issues and solutions - Comprehensive troubleshooting guide covering AWS credentials, configuration errors, lifecycle policy issues, CloudFormation errors, Docker errors, deletion errors, and solution deployment errors. Includes quick diagnostics and advanced troubleshooting techniques.
- SUPPORT
Support information - How to get help, support channels, escalation procedures, and what information to include in support requests.
๐ Advanced Topics (Optional)ยถ
Deep dives for advanced users and specific use cases.
- APPLICATION_ARCHITECTURE
Complete architecture documentation - Enterprise-grade technical documentation covering system architecture, design decisions, CloudFormation implementation, Lambda functions, security architecture, and deployment patterns. For architects and advanced users.
- INTEGRATION_EXAMPLES
Integration examples - How to integrate provisioned S3 buckets with SageMaker, Lambda, Glue, and CI/CD pipelines.
- MIGRATION_GUIDE
Migration guide - Migrating from manual S3 setup, Terraform, or CDK to the S3 Provisioner. Includes data migration strategies by size.
๐ Reference (As Needed)ยถ
Version history, roadmap, feedback, and examples.
- RELEASE_NOTES
Version history and changes - Release history, new features, bug fixes, breaking changes, and upgrade notes for each version.
- ROADMAP
Future features and plans - Planned features, enhancement requests, and product roadmap. See whatโs coming next.
- FEEDBACK
Feedback and feature requests - Submit feature requests, vote on planned features, and provide documentation feedback.
๐ Getting Started:
- README
- User Guide
- Table of Contents
- Prerequisites
- Step 1: Understand Bucket Naming
- Choose Deployment Strategy
- Quick Start
- Deployment Patterns
- Configuration
- Commands Reference
- Common Workflows
- Volume Mounts
- AWS Credentials
- Best Practices
- Command Summary
- Troubleshooting
- Whatโs Next
- Support
- Configuration Reference
- IAM Permissions
- Frequently Asked Questions
๐ Core Documentation:
- Configuration Reference
- Table of Contents
- Quick Reference
- Configuration File Structure
- Section 1: Client Configuration
- Section 2: Environment Configuration
- Section 3: S3 Configuration
- Usage Assumptions File
- Complete Configuration Examples
- Bucket Naming Convention
- Folder Structure Created
- Configuration Validation
- Configuration Best Practices
- Lifecycle Policy Cost Comparison
- YAML Syntax Tips
- Troubleshooting Configuration Issues
- Additional Resources
- Configuration File Locations
- Naming Conventions
- S3 Folder Structure Reference
- Table of Contents
- Key Design Principles
- Bucket Naming Convention
- Folder Structure (solutions only)
- Folder Structure (master-solution one level down)
- Folder Structure (master-solution two levels down)
- Compacted Folder Structure (master-solution all nodes, folders only)
- Complete Folder Structure (folders and example files)
- ML Pipeline Lifecycle Policies
๐ง Operations:
- Backup and Recovery Procedures
- Monitoring and Health Checks
- Update Procedures
- Cost Optimization
- Performance Tuning Guide
๐ Security & Compliance:
- IAM Permissions
- Table of Contents
- Overview
- Quick Start - Generate IAM Policy
- Minimum Required Permissions
- Permissions by Action
- Permission Scoping Best Practices
- IAM Policy Examples
- IAM Role for EC2/ECS
- IAM User Setup
- Additional Permissions for Optional Features
- Security Best Practices
- Troubleshooting Permission Issues
- Permission Summary by Action
- Additional Resources
- Security
- Security Guidelines
- Governance, Compliance, and Audit Capabilities
๐ Troubleshooting and Support:
- Troubleshooting
- Table of Contents
- Quick Diagnostics
- Common Pitfalls
- Common Errors
- Performance Issues
- VPC Endpoint Issues
- Advanced Troubleshooting
- Getting Help
- Additional Resources
- Support
- Feedback
๐ Advanced Topics:
- Application Architecture
- Table of Contents
- Executive Summary
- 1. Introduction & Context
- 2. Architectural Representation
- 3. Technical Strategy & Decisions
- 3.1 Technology Stack
- 3.2 Architecture Decision Records (ADRs)
- ADR-001: CloudFormation vs. Direct Boto3 Resource Creation
- ADR-002: Configuration-Driven Architecture
- ADR-003: Three-Tier Subnet Architecture
- ADR-004: Cython Compilation for Code Protection
- ADR-005: Docker Containerization
- ADR-006: High-Availability NAT Gateway (Optional)
- ADR-007: Waiter Pattern for Async Operations
- ADR-008: Naming Convention with Tenant ID
- 3.3 Design Patterns
- 4. Component Architecture
- 5. Data Architecture
- 6. Security Architecture
- 7. Deployment Architecture
- 8. Quality Attributes
- 9. Integration Architecture
- 10. Operational Architecture
- 11. Future Roadmap
- 12. Appendices
- Integration Examples
- Migration Guide
๐ Reference:
- Sample Reports
- Release Notes
- Roadmap
- Table of Contents
- ๐ณ๏ธ Feature Voting
- ๐ฏ High Priority Features
- ๐ Medium Priority Features
- ๐ฎ Future Considerations
- ๐ Delivery Approach
- โ Completed Features (v1.0.0 - 2026-04-01)
- ๐ฌ Customer Advisory Board
- ๐ข Stay Updated
- ๐ก Feature Request Guidelines
- ๐ Roadmap Updates
- ๐ Backward Compatibility
- API Reference
- Core Modules
- Configuration
- Utilities
- License
- CLI Interface
S3ProvisionerCLIS3ProvisionerCLI.__init__()S3ProvisionerCLI.is_action_required()S3ProvisionerCLI.get_actions()S3ProvisionerCLI.get_actions_help()S3ProvisionerCLI.get_manager_class()S3ProvisionerCLI.get_example_usage()S3ProvisionerCLI.add_custom_arguments()S3ProvisionerCLI.create_manager_instance()S3ProvisionerCLI.requires_force()
main()
๐ Recommended Reading Orderยถ
For New Usersยถ
README - Overview and quick start
User Guide - Complete command reference
Configuration Reference - Configuration parameters
Troubleshooting - Common issues
For Production Deploymentยถ
Configuration Reference - Production configuration
IAM Permissions - Security setup
Security Guidelines - Security hardening
Monitoring and Health Checks - Monitoring setup
Backup and Recovery Procedures - Disaster recovery planning
For Compliance/Governanceยถ
Governance, Compliance, and Audit Capabilities - Governance framework
Security Guidelines - Security compliance
ML Pipeline Lifecycle Policies - Data retention policies
For Architects and Advanced Usersยถ
Application Architecture - Technical architecture
S3 Folder Structure Reference - Folder structure design
Governance, Compliance, and Audit Capabilities - Enterprise patterns
๐๏ธ S3 Architecture Patternsยถ
The S3 Provisioner implements a comprehensive, standardized folder structure for ML workloads:
- ML-Optimized Folder Hierarchy
A 130+ folder structure organized by ML pipeline phases (data, models, notebooks, artifacts, code, config). This standardized layout supports the complete ML lifecycle from data ingestion through model deployment. See S3 Folder Structure Reference for the complete hierarchy and Configuration Reference for deployment examples.
๐ Documentation Conventionsยถ
UPPERCASE_WITH_UNDERSCORES.md - Main documentation files
lowercase-with-hyphens.html - Example reports
All paths are relative to the docs/ directory
Code examples use bash syntax unless otherwise noted
AWS resource names follow the pattern: {company_prefix}-{env}-{account_alias}-{region}
๐ Documentation Updatesยถ
This documentation is version-controlled and updated with each release. See Release Notes for documentation changes in each version.
Last Updated: 2025-04-01
ยฉ 2026 Axon Tech Labs. All rights reserved.