Deployment Process

Step-by-step procedures for deploying services to staging and production environments.

Overview

All deployments follow a consistent process to ensure reliability, traceability, and the ability to rollback if needed.

Pre-Deployment Checklist

Before any deployment:

All tests passing in CI
Code review approved and merged
No known critical bugs in the release
Staging environment tested and validated
Release notes prepared
Database migrations tested (if applicable)
Monitoring and alerts configured
Rollback plan documented

Deployment to Staging

Staging deployments happen automatically on merge to main.

Process

Merge PR to main

# Via GitHub UI or:
git checkout main
git pull origin main
# Changes are now in main

CI/CD automatically triggers
- Tests run
- Build created
- Deploy to staging environment
- Smoke tests execute

Verify deployment

# Check deployment status in GitHub Actions
# Visit staging URL
curl https://staging.example.com/health

# Check logs
# Verify key functionality

Monitor for issues
- Check Uptime Kuma dashboard
- Review error logs
- Test critical user paths
- Verify integrations (APIs, databases, etc.)

Staging Validation

Before promoting to production, validate:

Application loads successfully
Authentication works
Key user flows function correctly
API endpoints respond as expected
Database migrations applied successfully
External integrations working
No console errors or warnings
Performance acceptable (load times, API response times)

Deployment to Production

Production deployments require manual approval and careful coordination.

Timing Considerations

Best times to deploy:

Weekdays during business hours (when team is available)
Low-traffic periods (if applicable)
After QA sign-off
When team can monitor post-deployment

Avoid deploying:

Fridays or before weekends (limited support availability)
During high-traffic events
Before holidays
When key team members unavailable

Process

1. Prepare Release

# Ensure main branch is up to date
git checkout main
git pull origin main

# Review changes since last release
git log --oneline v1.2.0..HEAD

# Create release notes
# Document breaking changes, new features, bug fixes

2. Create Release Tag

# Tag the release
git tag -a v1.2.1 -m "Release v1.2.1: Feature X and Bug Y fix"
git push origin v1.2.1

Or use GitHub's release UI:

Go to repository → Releases
Click "Draft a new release"
Choose or create tag (e.g., v1.2.1)
Add release title and notes
Publish release

3. Trigger Production Deployment

Via GitHub Actions UI:

Go to Actions tab
Select "Release to Production" workflow
Click "Run workflow"
Enter version (e.g., v1.2.1)
Click "Run workflow"

Via CLI:

gh workflow run release.yml -f version=v1.2.1

4. Approve Deployment

Production deployments require manual approval:

Workflow pauses at approval gate
Designated approvers receive notification
Review deployment details
Approve or reject deployment

Approvers check:

Staging validation completed
Release notes reviewed
No critical issues reported
Team ready to monitor

5. Monitor Deployment

Watch the deployment progress:

# Monitor workflow
gh run watch

# Check application health
curl https://example.com/health

# Watch logs (if using Digital Ocean)
ssh user@production-server
docker logs -f app --tail=100

# Monitor metrics
# - Uptime Kuma: https://status.buildersintl.org
# - PostHog: https://app.posthog.com
# - Error tracking dashboard

6. Validate Production

After deployment completes:

7. Communicate Status

Update team on deployment status:

✅ v1.2.1 deployed to production successfully

Changes:
- Added feature X
- Fixed bug Y
- Updated dependency Z

Validation:
- Health checks passing
- Smoke tests passed
- No errors in logs
- Response times normal

Monitoring:
- Uptime Kuma: ✅
- Error rate: Normal
- Performance: ✅

Post in:

Slack #deployments channel
Update status page if needed

Rollback Procedures

If issues are discovered post-deployment, rollback immediately.

When to Rollback

Rollback if:

Critical functionality broken
Significant increase in error rates
Performance degradation impacting users
Security vulnerability introduced
Data integrity issues

Rollback Process

Vercel Deployments

Instant rollback via dashboard:

Go to Vercel project dashboard
Find previous successful deployment
Click "..." → "Promote to Production"
Confirm promotion

Or via CLI:

vercel rollback https://previous-deployment-url.vercel.app --prod

Digital Ocean Deployments

Rollback to previous Docker image:

# SSH to server
ssh user@production-server

# Stop current container
docker stop app
docker rm app

# Run previous version
docker run -d \
  --name app \
  -p 3000:3000 \
  --env-file /opt/app/.env.production \
  registry.digitalocean.com/myregistry/app:PREVIOUS_SHA

# Verify health
curl http://localhost:3000/health

Database Rollback

If migrations were applied:

# Run down migration
npm run migrate:down

# Or restore from backup (if needed)
# Contact database administrator

Post-Rollback

After rollback:

Verify rollback successful
- Application working correctly
- Error rates normal
- Users can access service
Investigate issue
- Review logs
- Identify root cause
- Document findings

Create hotfix or revert

# Revert the problematic commit
git revert <commit-sha>
git push origin main

# Or create hotfix branch
git checkout -b hotfix/fix-critical-issue
# Make fixes
git commit -m "fix: resolve production issue"
# Create PR and fast-track review

Communicate to team

⚠️ Production rollback executed

Reason: Critical bug in feature X
Rolled back to: v1.2.0
Status: Service restored, investigating issue

Next steps:
- Root cause analysis
- Hotfix in progress
- ETA for fix: 2 hours

Database Migrations

When deployments include database changes:

Before Deployment

Test migrations

# Test on staging database
npm run migrate:up

# Verify data integrity
npm run migrate:verify

# Test rollback
npm run migrate:down
npm run migrate:up

Backup production database

# Automatic backups should be configured
# Verify recent backup exists
# Document backup ID for quick restore if needed

During Deployment

Apply migrations before code deployment
- Ensures database ready for new code
- Use backward-compatible migrations when possible
Zero-downtime migrations
- Add new columns (don't remove old ones immediately)
- Deploy code that uses both old and new schema
- Remove old columns in subsequent deployment

After Deployment

Verify migrations

# Check migration status
npm run migrate:status

# Verify data integrity
npm run test:data-integrity

Monitor query performance
- Watch for slow queries
- Check index usage
- Verify no lock contention

Emergency Procedures

Complete Service Outage

Immediate Actions
- Trigger incident response
- Page on-call engineer
- Update status page
- Rollback if deployment-related

Communication

🚨 INCIDENT: Service outage detected

Status: Investigating
Impact: All users unable to access service
Started: 2024-03-26 14:32 UTC
Team: Investigating

Updates: Will provide update in 15 minutes

Resolution
- Identify root cause
- Implement fix or rollback
- Verify service restored
- Post-mortem within 24 hours

Partial Degradation

Assess Impact
- Identify affected functionality
- Determine user impact
- Evaluate severity
Decide on Action
- If minor: Monitor and fix in next deployment
- If moderate: Accelerate hotfix
- If major: Rollback
Communicate
- Update team
- Update status page if user-facing
- Set expectations for fix timeline

Deployment Checklist

Use this checklist for every production deployment:

Pre-Deployment

Deployment

Post-Deployment

If Issues

Deployment Metrics

Track these metrics for each deployment:

Deployment frequency: How often we deploy
Lead time: Time from commit to production
Change failure rate: % of deployments causing issues
Mean time to recovery (MTTR): Time to recover from failures

Goals:

Deploy at least daily to staging
Deploy to production multiple times per week
Change failure rate under 5 percent
MTTR under 30 minutes

Resources

CI/CD Workflows
Environment Configuration
Infrastructure Monitoring
Incident Response Playbook (TODO: Create this document)

Overview​

Pre-Deployment Checklist​

Deployment to Staging​

Process​

Staging Validation​

Deployment to Production​

Timing Considerations​

Process​

1. Prepare Release​

2. Create Release Tag​

3. Trigger Production Deployment​

4. Approve Deployment​

5. Monitor Deployment​

6. Validate Production​

7. Communicate Status​

Rollback Procedures​

When to Rollback​

Rollback Process​

Vercel Deployments​

Digital Ocean Deployments​

Database Rollback​

Post-Rollback​

Database Migrations​

Before Deployment​

During Deployment​

After Deployment​

Emergency Procedures​

Complete Service Outage​

Partial Degradation​

Deployment Checklist​

Pre-Deployment​

Deployment​

Post-Deployment​

If Issues​

Deployment Metrics​

Resources​

Overview

Pre-Deployment Checklist

Deployment to Staging

Process

Staging Validation

Deployment to Production

Timing Considerations

Process

1. Prepare Release

2. Create Release Tag

3. Trigger Production Deployment

4. Approve Deployment

5. Monitor Deployment

6. Validate Production

7. Communicate Status

Rollback Procedures

When to Rollback

Rollback Process

Vercel Deployments

Digital Ocean Deployments

Database Rollback

Post-Rollback

Database Migrations

Before Deployment

During Deployment

After Deployment

Emergency Procedures

Complete Service Outage

Partial Degradation

Deployment Checklist

Pre-Deployment

Deployment

Post-Deployment

If Issues

Deployment Metrics

Resources