Mastering Microservices: A Complete Guide to Modern Deployment and Release Patterns
In the rapidly evolving landscape of cloud-native applications, choosing the right deployment and release patterns can make the difference between seamless user experiences and catastrophic outages. This comprehensive guide explores modern deployment strategies, from traditional Blue-Green deployments to cutting-edge progressive delivery patterns with service mesh integration.
Blue-Green Deployment Strategy
Blue-Green deployment maintains two identical production environments, providing zero-downtime deployments with instant rollback capabilities. While resource-intensive, it remains the gold standard for mission-critical applications.
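Conceptually, the cutover is nothing more than repointing the stable Service at the other environment, which is also why rollback is near-instant. The sketch below models that switch in TypeScript; the names and shapes are illustrative, not part of any Kubernetes client API.
// Illustrative sketch of a Blue-Green cutover: the active Service selector is
// flipped from one color to the other; rollback is the same flip in reverse.
type Color = "blue" | "green";

interface ServiceSelector {
  app: string;
  color: Color;
}

// Returns the selector the active Service should use after cutover.
function cutoverSelector(current: ServiceSelector): ServiceSelector {
  const next: Color = current.color === "blue" ? "green" : "blue";
  return { ...current, color: next };
}

const active: ServiceSelector = { app: "user-service", color: "blue" };
console.log(cutoverSelector(active)); // { app: 'user-service', color: 'green' }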
Architecture Overview
graph TB
subgraph "Load Balancer"
LB[Load Balancer/Ingress]
end
subgraph "Blue Environment (Production)"
B1[App v1.0]
B2[App v1.0]
B3[App v1.0]
BDB[(Database)]
end
subgraph "Green Environment (Staging)"
G1[App v2.0]
G2[App v2.0]
G3[App v2.0]
GDB[(Database)]
end
subgraph "Shared Services"
Cache[(Redis Cache)]
Queue[Message Queue]
end
Users --> LB
LB -->|100% Traffic| B1
LB -->|100% Traffic| B2
LB -->|100% Traffic| B3
LB -.->|0% Traffic| G1
LB -.->|0% Traffic| G2
LB -.->|0% Traffic| G3
B1 --> BDB
B2 --> BDB
B3 --> BDB
G1 --> GDB
G2 --> GDB
G3 --> GDB
B1 --> Cache
B2 --> Cache
B3 --> Cache
G1 --> Cache
G2 --> Cache
G3 --> Cache
classDef blue fill:#e3f2fd
classDef green fill:#e8f5e8
classDef shared fill:#fff3e0
class B1,B2,B3,BDB blue
class G1,G2,G3,GDB green
class Cache,Queue shared
Kubernetes Implementation
Here’s a practical Blue-Green deployment using Kubernetes and Argo Rollouts:
# blue-green-rollout.yaml
apiVersion: argoproj.io/v1alpha1
kind: Rollout
metadata:
name: user-service
namespace: production
spec:
replicas: 6
strategy:
blueGreen:
activeService: user-service-active
previewService: user-service-preview
autoPromotionEnabled: false
scaleDownDelaySeconds: 30
prePromotionAnalysis:
templates:
- templateName: success-rate
args:
- name: service-name
value: user-service-preview
postPromotionAnalysis:
templates:
- templateName: success-rate
args:
- name: service-name
value: user-service-active
selector:
matchLabels:
app: user-service
template:
metadata:
labels:
app: user-service
spec:
containers:
- name: user-service
image: myregistry/user-service:v2.0.0
ports:
- containerPort: 8080
resources:
requests:
cpu: 100m
memory: 128Mi
limits:
cpu: 500m
memory: 512Mi
readinessProbe:
httpGet:
path: /health/ready
port: 8080
initialDelaySeconds: 10
periodSeconds: 5
livenessProbe:
httpGet:
path: /health/live
port: 8080
initialDelaySeconds: 30
periodSeconds: 10
---
# Active service (Blue environment)
apiVersion: v1
kind: Service
metadata:
name: user-service-active
namespace: production
spec:
selector:
app: user-service
ports:
- port: 80
targetPort: 8080
type: ClusterIP
---
# Preview service (Green environment)
apiVersion: v1
kind: Service
metadata:
name: user-service-preview
namespace: production
spec:
selector:
app: user-service
ports:
- port: 80
targetPort: 8080
type: ClusterIP
---
# Analysis template for automated testing
apiVersion: argoproj.io/v1alpha1
kind: AnalysisTemplate
metadata:
name: success-rate
namespace: production
spec:
args:
- name: service-name
metrics:
- name: success-rate
successCondition: result[0] >= 0.95
provider:
prometheus:
address: http://prometheus.monitoring:9090
query: |
sum(rate(http_requests_total{service="{{args.service-name}}", status=~"2.."}[5m])) /
sum(rate(http_requests_total{service="{{args.service-name}}"}[5m]))
Deployment Workflow
sequenceDiagram
participant Dev as Developer
participant Git as Git Repository
participant CI as CI/CD Pipeline
participant Argo as Argo Rollouts
participant K8s as Kubernetes
participant Monitor as Monitoring
Dev->>Git: Push v2.0.0 code
Git->>CI: Trigger build pipeline
CI->>CI: Run tests & security scans
CI->>CI: Build container image
CI->>Git: Update deployment manifest
Git->>Argo: Detect manifest change
Argo->>K8s: Deploy to Green environment
K8s->>Monitor: Start health checks
Monitor->>Argo: Health checks passing
Argo->>Argo: Run pre-promotion analysis
Argo->>Dev: Request manual approval
Dev->>Argo: Approve promotion
Argo->>K8s: Switch traffic to Green
K8s->>Monitor: Monitor post-deployment
alt Success
Monitor->>Argo: All metrics healthy
Argo->>K8s: Scale down Blue environment
else Failure
Monitor->>Argo: Metrics degraded
Argo->>K8s: Rollback to Blue
Argo->>Dev: Send failure notification
end
Canary Deployment with Traffic Splitting
Canary deployments reduce risk by gradually exposing new versions to increasing percentages of users while monitoring key metrics. This approach is ideal for high-traffic applications where Blue-Green would be cost-prohibitive.
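The core control loop is simple: observe the canary's metrics over an interval, then either step the traffic weight up, hold, or roll back. The TypeScript sketch below captures that decision logic under assumed thresholds; the Flagger configuration later in this section automates the same loop.
// Minimal sketch of a canary decision step. Thresholds and step sizes are
// illustrative assumptions, not defaults of any particular tool.
interface CanaryState {
  weight: number;   // percentage of traffic currently routed to the canary
  failures: number; // consecutive failed metric checks
}

type Decision = CanaryState | "promote" | "rollback";

function nextStep(
  state: CanaryState,
  successRate: number, // observed over the last interval, 0..1
  opts = { stepWeight: 10, maxWeight: 50, minSuccessRate: 0.99, failureThreshold: 5 }
): Decision {
  if (successRate < opts.minSuccessRate) {
    const failures = state.failures + 1;
    return failures >= opts.failureThreshold ? "rollback" : { ...state, failures };
  }
  if (state.weight >= opts.maxWeight) return "promote"; // held at max weight and healthy
  return { weight: Math.min(state.weight + opts.stepWeight, opts.maxWeight), failures: 0 };
}

console.log(nextStep({ weight: 10, failures: 0 }, 0.997)); // { weight: 20, failures: 0 }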
Service Mesh Architecture
graph TB
subgraph "Istio Service Mesh"
subgraph "Gateway"
IGW[Istio Gateway]
end
subgraph "Virtual Service"
VS[Traffic Splitting Rules]
end
subgraph "Destination Rules"
DR[Version Subsets]
end
subgraph "Production Pods"
subgraph "Version 1 (90%)"
V1P1[Pod v1]
V1P2[Pod v1]
V1P3[Pod v1]
V1P4[Pod v1]
V1P5[Pod v1]
V1P6[Pod v1]
end
subgraph "Version 2 Canary (10%)"
V2P1[Pod v2]
V2P2[Pod v2]
end
end
subgraph "Monitoring"
Prom[Prometheus]
Graf[Grafana]
Jaeger[Jaeger Tracing]
end
subgraph "Automation"
Flagger[Flagger Controller]
end
end
Users --> IGW
IGW --> VS
VS -->|90%| V1P1
VS -->|90%| V1P2
VS -->|90%| V1P3
VS -->|90%| V1P4
VS -->|90%| V1P5
VS -->|90%| V1P6
VS -->|10%| V2P1
VS -->|10%| V2P2
V1P1 --> Prom
V1P2 --> Prom
V2P1 --> Prom
V2P2 --> Prom
Flagger --> VS
Flagger --> Prom
Prom --> Graf
classDef v1 fill:#e3f2fd
classDef v2 fill:#ffebee
classDef control fill:#f3e5f5
class V1P1,V1P2,V1P3,V1P4,V1P5,V1P6 v1
class V2P1,V2P2 v2
class VS,DR,Flagger control
Istio Configuration
# Gateway configuration
apiVersion: networking.istio.io/v1beta1
kind: Gateway
metadata:
name: user-service-gateway
namespace: production
spec:
selector:
istio: ingressgateway
servers:
- port:
number: 80
name: http
protocol: HTTP
hosts:
- api.example.com
---
# Destination rule defining version subsets
apiVersion: networking.istio.io/v1beta1
kind: DestinationRule
metadata:
name: user-service-dr
namespace: production
spec:
host: user-service
subsets:
- name: v1
labels:
version: v1
- name: v2
labels:
version: v2
trafficPolicy:
loadBalancer:
simple: LEAST_CONN
connectionPool:
tcp:
maxConnections: 100
http:
http1MaxPendingRequests: 50
maxRequestsPerConnection: 2
---
# Virtual service with traffic splitting
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
name: user-service-vs
namespace: production
spec:
hosts:
- api.example.com
gateways:
- user-service-gateway
http:
- match:
- uri:
prefix: /api/users
route:
- destination:
host: user-service
subset: v1
weight: 90
- destination:
host: user-service
subset: v2
weight: 10
timeout: 30s
retries:
attempts: 3
perTryTimeout: 10s
Automated Canary with Flagger
# Flagger canary deployment
apiVersion: flagger.app/v1beta1
kind: Canary
metadata:
name: user-service
namespace: production
spec:
targetRef:
apiVersion: apps/v1
kind: Deployment
name: user-service
progressDeadlineSeconds: 60
service:
port: 80
targetPort: 8080
gateways:
- user-service-gateway
hosts:
- api.example.com
analysis:
interval: 1m
threshold: 5
maxWeight: 50
stepWeight: 10
metrics:
- name: request-success-rate
threshold: 99
interval: 1m
- name: request-duration
threshold: 500
interval: 1m
- name: error-rate
threshold: 1
interval: 1m
webhooks:
- name: load-test
url: http://load-tester.test/
timeout: 5s
metadata:
cmd: "hey -z 10m -q 10 -c 2 http://api.example.com/api/users"
- name: integration-test
url: http://integration-tester.test/
timeout: 30s
metadata:
type: bash
cmd: "curl -s http://api.example.com/api/users/health | grep OK"
provider: istio
Progressive Traffic Shifting
gantt
title Canary Deployment Timeline
dateFormat X
axisFormat %s
section Traffic Split
0% Canary (Baseline) :0, 300
10% Canary (Initial) :300, 600
20% Canary :600, 900
30% Canary :900, 1200
50% Canary (Half Split) :1200, 1500
100% Canary (Complete) :1500, 1800
section Health Checks
Readiness Probes :0, 1800
Success Rate Monitor :300, 1800
Latency Monitor :300, 1800
Error Rate Monitor :300, 1800
section Automated Actions
Initial Deployment :0, 300
Traffic Increment :300, 1500
Promotion Decision :1500, 1650
Cleanup Old Version :1650, 1800
Rolling Updates with Kubernetes
Rolling updates provide the most resource-efficient deployment strategy, gradually replacing old pods with new ones while maintaining service availability.
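The pace of a rolling update is governed by two knobs, maxSurge and maxUnavailable, which bound how many pods can exist above the desired count and how many can be missing at any moment. A quick sketch of that arithmetic, assuming absolute values rather than percentages:
// With replicas: 6, maxSurge: 2, maxUnavailable: 1 the controller keeps the total
// pod count at or below 8 and the available pod count at or above 5 during the rollout.
function rollingUpdateBounds(replicas: number, maxSurge: number, maxUnavailable: number) {
  return {
    maxTotalPods: replicas + maxSurge,           // upper bound while old and new overlap
    minAvailablePods: replicas - maxUnavailable, // lower bound of available pods
  };
}

console.log(rollingUpdateBounds(6, 2, 1)); // { maxTotalPods: 8, minAvailablePods: 5 }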
Rolling Update Flow
sequenceDiagram
participant User as Users
participant LB as Load Balancer
participant K8s as Kubernetes Controller
participant RS1 as ReplicaSet v1
participant RS2 as ReplicaSet v2
participant Pods1 as Pods v1
participant Pods2 as Pods v2
Note over User,Pods2: Initial State: 6 pods running v1
User->>LB: Traffic (100%)
LB->>Pods1: Route to 6 v1 pods
Note over K8s,Pods2: Rolling Update Initiated
K8s->>RS2: Create ReplicaSet v2
K8s->>RS2: Scale up to 2 pods (maxSurge=2)
RS2->>Pods2: Create 2 v2 pods
Note over Pods2: Wait for readiness probes
Pods2->>K8s: Ready signals
K8s->>RS1: Scale down by 1 pod
RS1->>Pods1: Terminate 1 v1 pod
Note over User,Pods2: Traffic: 5 v1 + 2 v2 pods
User->>LB: Traffic (100%)
LB->>Pods1: Route to 5 v1 pods
LB->>Pods2: Route to 2 v2 pods
K8s->>RS2: Scale up to 4 pods
RS2->>Pods2: Create 2 more v2 pods
Pods2->>K8s: Ready signals
K8s->>RS1: Scale down by 2 pods
RS1->>Pods1: Terminate 2 v1 pods
Note over User,Pods2: Traffic: 3 v1 + 4 v2 pods
K8s->>RS2: Scale up to 6 pods
RS2->>Pods2: Create 2 more v2 pods
Pods2->>K8s: Ready signals
K8s->>RS1: Scale down to 0
RS1->>Pods1: Terminate remaining v1 pods
Note over User,Pods2: Final State: 6 pods running v2
User->>LB: Traffic (100%)
LB->>Pods2: Route to 6 v2 pods
Advanced Rolling Update Configuration
# Deployment with rolling update strategy
apiVersion: apps/v1
kind: Deployment
metadata:
name: user-service
namespace: production
labels:
app: user-service
version: v2.0.0
spec:
replicas: 6
strategy:
type: RollingUpdate
rollingUpdate:
maxSurge: 2 # Allow 2 extra pods during update
maxUnavailable: 1 # Max 1 pod unavailable at a time
minReadySeconds: 30 # Pod must stay ready for 30s before it counts as available
progressDeadlineSeconds: 600 # 10min timeout for rollout
revisionHistoryLimit: 5 # Keep 5 previous versions
selector:
matchLabels:
app: user-service
template:
metadata:
labels:
app: user-service
version: v2.0.0
spec:
containers:
- name: user-service
image: myregistry/user-service:v2.0.0
ports:
- containerPort: 8080
name: http
resources:
requests:
cpu: 200m
memory: 256Mi
limits:
cpu: 1000m
memory: 1Gi
env:
- name: ENV
value: "production"
- name: DATABASE_URL
valueFrom:
secretKeyRef:
name: database-credentials
key: url
# Comprehensive health checks
readinessProbe:
httpGet:
path: /health/ready
port: 8080
httpHeaders:
- name: Custom-Header
value: health-check
initialDelaySeconds: 15
periodSeconds: 10
timeoutSeconds: 5
successThreshold: 2
failureThreshold: 3
livenessProbe:
httpGet:
path: /health/live
port: 8080
initialDelaySeconds: 60
periodSeconds: 30
timeoutSeconds: 10
failureThreshold: 3
# Startup probe for slow-starting applications
startupProbe:
httpGet:
path: /health/startup
port: 8080
initialDelaySeconds: 10
periodSeconds: 5
timeoutSeconds: 3
failureThreshold: 12 # 60 seconds total
# Graceful shutdown
lifecycle:
preStop:
exec:
command: ["/bin/sh", "-c", "sleep 15"]
# Security context
securityContext:
runAsNonRoot: true
runAsUser: 1000
allowPrivilegeEscalation: false
readOnlyRootFilesystem: true
capabilities:
drop:
- ALL
# Pod security and scheduling
securityContext:
fsGroup: 1000
terminationGracePeriodSeconds: 30
affinity:
podAntiAffinity:
preferredDuringSchedulingIgnoredDuringExecution:
- weight: 100
podAffinityTerm:
labelSelector:
matchExpressions:
- key: app
operator: In
values:
- user-service
topologyKey: kubernetes.io/hostname
---
# Service for rolling updates
apiVersion: v1
kind: Service
metadata:
name: user-service
namespace: production
labels:
app: user-service
spec:
selector:
app: user-service
ports:
- port: 80
targetPort: 8080
protocol: TCP
name: http
type: ClusterIP
---
# Pod Disruption Budget
apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
name: user-service-pdb
namespace: production
spec:
minAvailable: 4 # Always keep at least 4 pods running
selector:
matchLabels:
app: user-service
Health Check Implementation
// Node.js health check endpoints
const express = require("express");
const app = express();
let isReady = false;
let isLive = true;
let startupComplete = false;
// Simulate application startup
setTimeout(() => {
startupComplete = true;
isReady = true;
}, 10000); // 10 second startup time
// Startup probe - for slow-starting applications
app.get("/health/startup", (req, res) => {
if (startupComplete) {
res.status(200).json({
status: "started",
timestamp: new Date().toISOString(),
});
} else {
res.status(503).json({
status: "starting",
timestamp: new Date().toISOString(),
});
}
});
// Readiness probe - ready to receive traffic
app.get("/health/ready", (req, res) => {
// Check dependencies (database, external services)
const checks = {
database: checkDatabase(),
redis: checkRedis(),
externalAPI: checkExternalAPI(),
};
const allHealthy = Object.values(checks).every(check => check);
if (isReady && allHealthy) {
res.status(200).json({
status: "ready",
checks,
timestamp: new Date().toISOString(),
});
} else {
res.status(503).json({
status: "not ready",
checks,
timestamp: new Date().toISOString(),
});
}
});
// Liveness probe - application is healthy
app.get("/health/live", (req, res) => {
if (isLive) {
res.status(200).json({
status: "alive",
uptime: process.uptime(),
memory: process.memoryUsage(),
timestamp: new Date().toISOString(),
});
} else {
res.status(503).json({
status: "unhealthy",
timestamp: new Date().toISOString(),
});
}
});
function checkDatabase() {
// Implement actual database connectivity check
return Math.random() > 0.1; // 90% success rate for demo
}
function checkRedis() {
// Implement actual Redis connectivity check
return Math.random() > 0.05; // 95% success rate for demo
}
function checkExternalAPI() {
// Implement actual external API check
return Math.random() > 0.15; // 85% success rate for demo
}
// Graceful shutdown handler
process.on("SIGTERM", () => {
console.log("SIGTERM received, shutting down gracefully");
isReady = false; // Stop accepting new requests
setTimeout(() => {
isLive = false; // Mark as unhealthy
process.exit(0);
}, 15000); // Wait 15 seconds for existing requests
});
app.listen(8080, () => {
console.log("Health check server running on port 8080");
});
Feature Flags and Progressive Delivery
Feature flags enable runtime control over feature availability, allowing teams to separate deployment from release and implement sophisticated rollout strategies.
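In practice this means the new code path ships to production dark and is only exercised when its flag evaluates true for a given user. A minimal, self-contained sketch (the flag rule and function names are hypothetical; a full evaluation service follows later in this section):
// Deployment vs. release: both code paths are deployed, but only flagged users hit the new one.
type User = { id: string };

// Stand-in for a real flag evaluation call (see the FeatureFlagService below).
async function isEnabled(flagKey: string, user: User): Promise<boolean> {
  return flagKey === "new-checkout-flow" && user.id.endsWith("7"); // placeholder rollout rule
}

async function checkout(user: User): Promise<string> {
  if (await isEnabled("new-checkout-flow", user)) {
    return "processed by new checkout flow"; // released to a slice of users
  }
  return "processed by legacy checkout flow"; // everyone else keeps the existing path
}

checkout({ id: "user-1337" }).then(console.log); // "processed by new checkout flow"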
Feature Flag Architecture
graph TB
subgraph "Client Applications"
Web[Web App]
Mobile[Mobile App]
API[API Gateway]
end
subgraph "Feature Flag Service"
FFS[Feature Flag Server]
Admin[Admin Dashboard]
SDK[Client SDKs]
end
subgraph "Configuration Store"
DB[(Flag Database)]
Cache[(Redis Cache)]
CDN[Edge Cache/CDN]
end
subgraph "User Context"
UserDB[(User Database)]
Segments[User Segments]
Analytics[Analytics Engine]
end
subgraph "Application Services"
UserSvc[User Service]
PaymentSvc[Payment Service]
OrderSvc[Order Service]
end
subgraph "Monitoring & Observability"
Metrics[Metrics Store]
Logs[Log Aggregation]
Alerts[Alert Manager]
end
Web --> SDK
Mobile --> SDK
API --> SDK
SDK --> FFS
Admin --> FFS
FFS --> DB
FFS --> Cache
SDK --> CDN
FFS --> UserDB
FFS --> Segments
FFS --> Analytics
UserSvc --> SDK
PaymentSvc --> SDK
OrderSvc --> SDK
FFS --> Metrics
FFS --> Logs
FFS --> Alerts
classDef client fill:#e3f2fd
classDef service fill:#e8f5e8
classDef data fill:#fff3e0
classDef monitor fill:#fce4ec
class Web,Mobile,API client
class FFS,Admin,SDK service
class DB,Cache,CDN,UserDB data
class Metrics,Logs,Alerts monitor
Progressive Rollout Implementation
// Feature flag service implementation
interface FeatureFlag {
key: string;
name: string;
description: string;
enabled: boolean;
rolloutPercentage: number;
userSegments: string[];
environmentRules: Record<string, any>;
constraints: Constraint[];
createdAt: Date;
updatedAt: Date;
}
interface User {
id: string;
email: string;
segments: string[];
attributes: Record<string, any>;
}
interface Constraint {
type: "user_id" | "segment" | "attribute" | "percentage";
operator: "in" | "not_in" | "equals" | "greater_than" | "less_than";
values: any[];
}
class FeatureFlagService {
private flags: Map<string, FeatureFlag> = new Map();
private cache: Map<string, boolean> = new Map();
constructor(
private database: Database,
private analytics: AnalyticsService,
private logger: Logger
) {}
async isEnabled(
flagKey: string,
user: User,
context?: any
): Promise<boolean> {
const cacheKey = `${flagKey}:${user.id}`;
// Check cache first
if (this.cache.has(cacheKey)) {
return this.cache.get(cacheKey)!;
}
const flag = await this.getFlag(flagKey);
if (!flag || !flag.enabled) {
this.cache.set(cacheKey, false);
return false;
}
const result = await this.evaluateFlag(flag, user, context);
// Cache result for 5 minutes
this.cache.set(cacheKey, result);
setTimeout(() => this.cache.delete(cacheKey), 5 * 60 * 1000);
// Track flag evaluation
this.analytics.track("feature_flag_evaluated", {
flagKey,
userId: user.id,
result,
timestamp: new Date(),
});
return result;
}
private async evaluateFlag(
flag: FeatureFlag,
user: User,
context?: any
): Promise<boolean> {
// Check constraints
for (const constraint of flag.constraints) {
if (!this.evaluateConstraint(constraint, user, context)) {
return false;
}
}
// Check user segments
if (flag.userSegments.length > 0) {
const hasSegment = flag.userSegments.some(segment =>
user.segments.includes(segment)
);
if (!hasSegment) {
return false;
}
}
// Check rollout percentage
if (flag.rolloutPercentage < 100) {
const userHash = this.hashUser(user.id, flag.key);
const userPercentile = userHash % 100;
return userPercentile < flag.rolloutPercentage;
}
return true;
}
private evaluateConstraint(
constraint: Constraint,
user: User,
context?: any
): boolean {
switch (constraint.type) {
case "user_id":
return this.evaluateOperator(
constraint.operator,
user.id,
constraint.values
);
case "segment":
return this.evaluateOperator(
constraint.operator,
user.segments,
constraint.values
);
case "attribute":
const attributeValue = user.attributes[constraint.values[0]];
return this.evaluateOperator(
constraint.operator,
attributeValue,
constraint.values.slice(1)
);
case "percentage":
const userHash = this.hashUser(user.id, "percentage");
const percentile = userHash % 100;
return percentile < constraint.values[0];
default:
return false;
}
}
private evaluateOperator(
operator: string,
userValue: any,
constraintValues: any[]
): boolean {
switch (operator) {
case "in":
return constraintValues.includes(userValue);
case "not_in":
return !constraintValues.includes(userValue);
case "equals":
return userValue === constraintValues[0];
case "greater_than":
return userValue > constraintValues[0];
case "less_than":
return userValue < constraintValues[0];
default:
return false;
}
}
private hashUser(userId: string, salt: string): number {
// Simple hash function for consistent user bucketing
let hash = 0;
const str = userId + salt;
for (let i = 0; i < str.length; i++) {
const char = str.charCodeAt(i);
hash = (hash << 5) - hash + char;
hash = hash & hash; // Convert to 32-bit integer
}
return Math.abs(hash);
}
async updateRolloutPercentage(
flagKey: string,
percentage: number
): Promise<void> {
const flag = await this.getFlag(flagKey);
if (!flag) throw new Error(`Flag ${flagKey} not found`);
flag.rolloutPercentage = percentage;
flag.updatedAt = new Date();
await this.database.updateFlag(flag);
// Clear cache to force re-evaluation
this.cache.clear();
this.logger.info(
`Updated rollout percentage for ${flagKey} to ${percentage}%`
);
}
private async getFlag(flagKey: string): Promise<FeatureFlag | null> {
if (this.flags.has(flagKey)) {
return this.flags.get(flagKey)!;
}
const flag = await this.database.getFlag(flagKey);
if (flag) {
this.flags.set(flagKey, flag);
}
return flag;
}
}
Feature Flag Rollout Strategy
gantt
title Progressive Feature Rollout Strategy
dateFormat X
axisFormat %d/%m
section Development
Feature Development :0, 7
Unit Testing :5, 10
Integration Testing :8, 12
section Internal Rollout
Team Testing (0.1%) :12, 15
Beta Users (1%) :15, 18
Power Users (5%) :18, 21
section Progressive Rollout
Early Adopters (10%) :21, 25
Segment A (25%) :25, 28
Segment B (50%) :28, 32
Full Rollout (100%) :32, 35
section Monitoring
Error Rate Monitoring :12, 40
Performance Monitoring :12, 40
User Feedback Collection :12, 40
Business Metrics Tracking :21, 40
GitOps Patterns with ArgoCD and Flux
GitOps treats Git repositories as the single source of truth for declarative infrastructure and application configuration, enabling automated, auditable deployments.
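At its core, every GitOps controller runs the same reconciliation loop: read the desired state from Git, read the actual state from the cluster, and compute the actions that converge one toward the other. The sketch below illustrates that idea with plain TypeScript objects; it is not ArgoCD or Flux code.
// Illustrative reconciliation: desired state (from Git) vs. actual state (from the cluster).
interface Workload {
  name: string;
  image: string;
  replicas: number;
}

function reconcile(desired: Workload[], actual: Workload[]): string[] {
  const actions: string[] = [];
  const actualByName = new Map(actual.map(w => [w.name, w]));

  for (const want of desired) {
    const have = actualByName.get(want.name);
    if (!have) {
      actions.push(`create ${want.name}`);
    } else if (have.image !== want.image || have.replicas !== want.replicas) {
      actions.push(`update ${want.name} -> ${want.image} x${want.replicas}`);
    }
    actualByName.delete(want.name);
  }
  // Anything running in the cluster but absent from Git gets pruned (cf. prune: true below).
  for (const orphan of actualByName.keys()) actions.push(`prune ${orphan}`);
  return actions;
}

console.log(
  reconcile(
    [{ name: "user-service", image: "user-service:v2.0.0", replicas: 6 }],
    [{ name: "user-service", image: "user-service:v1.0.0", replicas: 6 }]
  )
); // [ 'update user-service -> user-service:v2.0.0 x6' ]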
GitOps Architecture Overview
graph TB
subgraph "Git Repositories"
AppRepo[Application Code]
ConfigRepo[Configuration Repo]
InfraRepo[Infrastructure Repo]
EnvRepo[Environment Configs]
end
subgraph "CI/CD Pipeline"
CI[CI Pipeline]
Registry[Container Registry]
Scanner[Security Scanner]
end
subgraph "GitOps Controllers"
ArgoCD[ArgoCD Controller]
Flux[Flux Controller]
Tekton[Tekton Pipelines]
end
subgraph "Kubernetes Clusters"
subgraph "Development"
DevCluster[Dev Cluster]
DevApps[Applications]
end
subgraph "Staging"
StageCluster[Staging Cluster]
StageApps[Applications]
end
subgraph "Production"
ProdCluster[Prod Cluster]
ProdApps[Applications]
end
end
subgraph "Observability"
Prometheus[Prometheus]
Grafana[Grafana]
AlertManager[Alert Manager]
end
AppRepo --> CI
CI --> Scanner
CI --> Registry
CI --> ConfigRepo
ConfigRepo --> ArgoCD
ConfigRepo --> Flux
InfraRepo --> ArgoCD
EnvRepo --> ArgoCD
ArgoCD --> DevCluster
ArgoCD --> StageCluster
ArgoCD --> ProdCluster
Flux --> DevCluster
Flux --> StageCluster
DevApps --> Prometheus
StageApps --> Prometheus
ProdApps --> Prometheus
Prometheus --> Grafana
Prometheus --> AlertManager
classDef git fill:#f9f9f9
classDef ci fill:#e3f2fd
classDef gitops fill:#e8f5e8
classDef cluster fill:#fff3e0
classDef monitor fill:#fce4ec
class AppRepo,ConfigRepo,InfraRepo,EnvRepo git
class CI,Registry,Scanner ci
class ArgoCD,Flux,Tekton gitops
class DevCluster,StageCluster,ProdCluster cluster
class Prometheus,Grafana,AlertManager monitor
ArgoCD Application Configuration
# ArgoCD Application for multi-environment deployment
apiVersion: argoproj.io/v1alpha1
kind: Application
metadata:
name: user-service
namespace: argocd
finalizers:
- resources-finalizer.argocd.argoproj.io
spec:
project: default
source:
repoURL: https://github.com/company/k8s-configs
targetRevision: HEAD
path: applications/user-service/overlays/production
kustomize:
images:
- myregistry/user-service:v2.0.0
destination:
server: https://kubernetes.default.svc
namespace: production
syncPolicy:
automated:
prune: true
selfHeal: true
allowEmpty: false
syncOptions:
- CreateNamespace=true
- PrunePropagationPolicy=foreground
- PruneLast=true
- ServerSideApply=true
retry:
limit: 5
backoff:
duration: 5s
factor: 2
maxDuration: 3m
revisionHistoryLimit: 10
---
# ArgoCD AppProject for RBAC and resource restrictions
apiVersion: argoproj.io/v1alpha1
kind: AppProject
metadata:
name: user-services
namespace: argocd
spec:
description: User service applications
sourceRepos:
- "https://github.com/company/k8s-configs"
- "https://charts.bitnami.com/bitnami"
destinations:
- namespace: "user-*"
server: https://kubernetes.default.svc
- namespace: "production"
server: https://kubernetes.default.svc
clusterResourceWhitelist:
- group: ""
kind: Namespace
- group: rbac.authorization.k8s.io
kind: ClusterRole
- group: rbac.authorization.k8s.io
kind: ClusterRoleBinding
namespaceResourceWhitelist:
- group: ""
kind: Service
- group: ""
kind: ConfigMap
- group: ""
kind: Secret
- group: apps
kind: Deployment
- group: apps
kind: ReplicaSet
- group: networking.k8s.io
kind: Ingress
roles:
- name: developer
description: Developers can sync and view applications
policies:
- p, proj:user-services:developer, applications, sync, user-services/*, allow
- p, proj:user-services:developer, applications, get, user-services/*, allow
- p, proj:user-services:developer, applications, action/*, user-services/*, allow
groups:
- company:developers
- name: admin
description: Admins have full access
policies:
- p, proj:user-services:admin, applications, *, user-services/*, allow
- p, proj:user-services:admin, repositories, *, *, allow
groups:
- company:platform-team
---
# Multi-source application for complex deployments
apiVersion: argoproj.io/v1alpha1
kind: Application
metadata:
name: user-service-complete
namespace: argocd
spec:
project: user-services
sources:
- repoURL: https://github.com/company/k8s-configs
targetRevision: HEAD
path: applications/user-service/base
- repoURL: https://github.com/company/helm-charts
targetRevision: HEAD
path: user-service
helm:
valueFiles:
- $values/applications/user-service/values-production.yaml
- repoURL: https://github.com/company/k8s-configs
targetRevision: HEAD
path: .
ref: values
destination:
server: https://kubernetes.default.svc
namespace: production
syncPolicy:
automated:
prune: true
selfHeal: true
syncOptions:
- CreateNamespace=true
- ServerSideApply=true
Flux Configuration
# Flux GitRepository source
apiVersion: source.toolkit.fluxcd.io/v1beta2
kind: GitRepository
metadata:
name: user-service-config
namespace: flux-system
spec:
interval: 1m
url: https://github.com/company/k8s-configs
ref:
branch: main
secretRef:
name: git-credentials
verify:
mode: head
secretRef:
name: git-gpg-keys
---
# Flux Kustomization
apiVersion: kustomize.toolkit.fluxcd.io/v1beta2
kind: Kustomization
metadata:
name: user-service
namespace: flux-system
spec:
interval: 5m
path: "./applications/user-service/overlays/production"
prune: true
sourceRef:
kind: GitRepository
name: user-service-config
targetNamespace: production
healthChecks:
- apiVersion: apps/v1
kind: Deployment
name: user-service
namespace: production
dependsOn:
- name: infrastructure
postBuild:
substitute:
cluster_name: "production"
cluster_region: "us-west-2"
patches:
- patch: |
- op: replace
path: /spec/replicas
value: 6
target:
kind: Deployment
name: user-service
---
# Flux HelmRepository
apiVersion: source.toolkit.fluxcd.io/v1beta2
kind: HelmRepository
metadata:
name: bitnami
namespace: flux-system
spec:
interval: 1h
url: https://charts.bitnami.com/bitnami
---
# Flux HelmRelease
apiVersion: helm.toolkit.fluxcd.io/v2beta1
kind: HelmRelease
metadata:
name: postgresql
namespace: production
spec:
interval: 5m
chart:
spec:
chart: postgresql
version: "12.x.x"
sourceRef:
kind: HelmRepository
name: bitnami
namespace: flux-system
values:
auth:
postgresPassword: ${postgres_password}
database: userservice
primary:
persistence:
enabled: true
size: 100Gi
storageClass: ssd
metrics:
enabled: true
serviceMonitor:
enabled: true
dependsOn:
- name: user-service
namespace: flux-system
GitOps Workflow
sequenceDiagram
participant Dev as Developer
participant AppRepo as App Repository
participant CI as CI Pipeline
participant ConfigRepo as Config Repository
participant ArgoCD as ArgoCD
participant K8s as Kubernetes
participant Monitor as Monitoring
Dev->>AppRepo: Push application code
AppRepo->>CI: Trigger build
CI->>CI: Run tests
CI->>CI: Build container image
CI->>CI: Security scanning
CI->>CI: Push to registry
CI->>ConfigRepo: Update image tag
ConfigRepo->>ArgoCD: Detect changes
ArgoCD->>ArgoCD: Compare desired vs actual state
ArgoCD->>K8s: Apply changes
K8s->>K8s: Rolling update
K8s->>Monitor: Send metrics
alt Deployment Success
Monitor->>ArgoCD: Healthy status
ArgoCD->>ConfigRepo: Update sync status
else Deployment Failure
Monitor->>ArgoCD: Unhealthy status
ArgoCD->>ArgoCD: Trigger rollback
ArgoCD->>K8s: Revert to previous version
ArgoCD->>Dev: Send failure notification
end
CI/CD Pipeline Best Practices
Modern CI/CD pipelines emphasize security, efficiency, and reliability through automation, comprehensive testing, and progressive deployment strategies.
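A useful mental model is that every stage ends with a gate, and a build is only promoted when all gates pass. The sketch below shows such a gate check with assumed thresholds; it mirrors the Quality and Security stages in the architecture that follows.
// Illustrative promotion gate: thresholds here are assumptions, not tool defaults.
interface BuildReport {
  coverage: number;                // 0..1 from unit tests
  criticalVulnerabilities: number; // from dependency/container scans
  qualityGatePassed: boolean;      // e.g. reported by SonarQube
}

function canPromote(report: BuildReport, minCoverage = 0.8): { ok: boolean; reasons: string[] } {
  const reasons: string[] = [];
  if (report.coverage < minCoverage) reasons.push(`coverage ${report.coverage} below ${minCoverage}`);
  if (report.criticalVulnerabilities > 0) reasons.push(`${report.criticalVulnerabilities} critical vulnerabilities`);
  if (!report.qualityGatePassed) reasons.push("quality gate failed");
  return { ok: reasons.length === 0, reasons };
}

console.log(canPromote({ coverage: 0.86, criticalVulnerabilities: 0, qualityGatePassed: true }));
// { ok: true, reasons: [] }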
Comprehensive Pipeline Architecture
graph TB
subgraph "Source Control"
Git[Git Repository]
PR[Pull Request]
Main[Main Branch]
end
subgraph "CI Pipeline"
Trigger[Webhook Trigger]
Checkout[Code Checkout]
subgraph "Build Stage"
Test[Unit Tests]
Lint[Code Linting]
Build[Application Build]
Package[Container Build]
end
subgraph "Security Stage"
SAST[Static Analysis]
Deps[Dependency Scan]
Secrets[Secret Scan]
Container[Container Scan]
end
subgraph "Quality Stage"
Coverage[Code Coverage]
SonarQube[Quality Gate]
Performance[Performance Tests]
end
end
subgraph "Artifact Management"
Registry[Container Registry]
Artifacts[Artifact Store]
Signing[Image Signing]
end
subgraph "CD Pipeline"
subgraph "Deployment Stages"
Dev[Development]
Integration[Integration Tests]
Staging[Staging Deploy]
E2E[E2E Tests]
Production[Production Deploy]
end
subgraph "Deployment Strategies"
BlueGreen[Blue-Green]
Canary[Canary]
Rolling[Rolling Update]
end
end
subgraph "Observability"
Logs[Centralized Logging]
Metrics[Metrics Collection]
Traces[Distributed Tracing]
Alerts[Alert Management]
end
Git --> Trigger
PR --> Trigger
Trigger --> Checkout
Checkout --> Test
Checkout --> Lint
Test --> Build
Lint --> Build
Build --> Package
Package --> SAST
Package --> Deps
Package --> Secrets
Package --> Container
SAST --> Coverage
Deps --> Coverage
Secrets --> Coverage
Container --> Coverage
Coverage --> SonarQube
SonarQube --> Performance
Performance --> Registry
Performance --> Artifacts
Registry --> Signing
Signing --> Dev
Dev --> Integration
Integration --> Staging
Staging --> E2E
E2E --> Production
Production --> BlueGreen
Production --> Canary
Production --> Rolling
Dev --> Logs
Staging --> Logs
Production --> Logs
Logs --> Metrics
Metrics --> Traces
Traces --> Alerts
classDef source fill:#f9f9f9
classDef ci fill:#e3f2fd
classDef security fill:#ffebee
classDef cd fill:#e8f5e8
classDef observe fill:#fce4ec
class Git,PR,Main source
class Test,Lint,Build,Package,SAST,Deps,Secrets,Container,Coverage,SonarQube,Performance ci
class SAST,Deps,Secrets,Container security
class Dev,Integration,Staging,E2E,Production,BlueGreen,Canary,Rolling cd
class Logs,Metrics,Traces,Alerts observe
GitHub Actions Pipeline
# .github/workflows/ci-cd.yml
name: CI/CD Pipeline
on:
push:
branches: [main, develop]
pull_request:
branches: [main]
env:
REGISTRY: ghcr.io
IMAGE_NAME: ${{ github.repository }}
jobs:
# Static analysis and testing
test:
runs-on: ubuntu-latest
strategy:
matrix:
node-version: [18, 20]
steps:
- name: Checkout code
uses: actions/checkout@v4
with:
fetch-depth: 0 # Full history for SonarQube
- name: Setup Node.js
uses: actions/setup-node@v4
with:
node-version: ${{ matrix.node-version }}
cache: "npm"
- name: Install dependencies
run: npm ci
- name: Run linting
run: npm run lint
- name: Run type checking
run: npm run type-check
- name: Run unit tests
run: npm run test:coverage
- name: Upload coverage to Codecov
uses: codecov/codecov-action@v3
with:
file: ./coverage/lcov.info
flags: unittests
name: codecov-umbrella
- name: SonarQube Scan
uses: SonarSource/sonarqube-scan-action@master
env:
SONAR_TOKEN: ${{ secrets.SONAR_TOKEN }}
SONAR_HOST_URL: ${{ secrets.SONAR_HOST_URL }}
# Security scanning
security:
runs-on: ubuntu-latest
needs: test
steps:
- name: Checkout code
uses: actions/checkout@v4
- name: Run Trivy vulnerability scanner
uses: aquasecurity/trivy-action@master
with:
scan-type: "fs"
scan-ref: "."
format: "sarif"
output: "trivy-results.sarif"
- name: Upload Trivy scan results
uses: github/codeql-action/upload-sarif@v2
with:
sarif_file: "trivy-results.sarif"
- name: Dependency Review
uses: actions/dependency-review-action@v3
- name: Secret Scan
uses: trufflesecurity/trufflehog@main
with:
path: ./
base: main
head: HEAD
# Build and push container image
build:
runs-on: ubuntu-latest
needs: [test, security]
if: github.ref == 'refs/heads/main'
outputs:
image: ${{ steps.image.outputs.image }}
digest: ${{ steps.build.outputs.digest }}
steps:
- name: Checkout code
uses: actions/checkout@v4
- name: Setup Docker Buildx
uses: docker/setup-buildx-action@v3
- name: Login to Container Registry
uses: docker/login-action@v3
with:
registry: ${{ env.REGISTRY }}
username: ${{ github.actor }}
password: ${{ secrets.GITHUB_TOKEN }}
- name: Extract metadata
id: meta
uses: docker/metadata-action@v5
with:
images: ${{ env.REGISTRY }}/${{ env.IMAGE_NAME }}
tags: |
type=ref,event=branch
type=ref,event=pr
type=semver,pattern={{version}}
type=semver,pattern={{major}}.{{minor}}
type=sha,prefix=sha-
- name: Build and push
id: build
uses: docker/build-push-action@v5
with:
context: .
platforms: linux/amd64,linux/arm64
push: true
tags: ${{ steps.meta.outputs.tags }}
labels: ${{ steps.meta.outputs.labels }}
cache-from: type=gha
cache-to: type=gha,mode=max
provenance: true
sbom: true
- name: Set image output
id: image
run: echo "image=${{ env.REGISTRY }}/${{ env.IMAGE_NAME }}:sha-${{ github.sha }}" >> $GITHUB_OUTPUT
- name: Install Cosign
uses: sigstore/cosign-installer@v3
- name: Sign container image
run: |
cosign sign --yes ${{ env.REGISTRY }}/${{ env.IMAGE_NAME }}@${{ steps.build.outputs.digest }}
# Deploy to development
deploy-dev:
runs-on: ubuntu-latest
needs: build
environment: development
steps:
- name: Checkout GitOps repo
uses: actions/checkout@v4
with:
repository: company/k8s-configs
token: ${{ secrets.GITOPS_TOKEN }}
path: gitops
- name: Update development image
run: |
cd gitops
yq eval '.images[0].newTag = "${{ github.sha }}"' -i applications/user-service/overlays/development/kustomization.yaml
git config user.name "GitHub Actions"
git config user.email "actions@github.com"
git add .
git commit -m "Update user-service dev image to ${{ github.sha }}"
git push
# Integration tests
integration-test:
runs-on: ubuntu-latest
needs: deploy-dev
steps:
- name: Checkout code
uses: actions/checkout@v4
- name: Setup Node.js
uses: actions/setup-node@v4
with:
node-version: "20"
cache: "npm"
- name: Install dependencies
run: npm ci
- name: Wait for deployment
run: |
timeout 300 bash -c 'until curl -f http://dev.api.company.com/health; do sleep 5; done'
- name: Run integration tests
run: npm run test:integration
env:
API_BASE_URL: http://dev.api.company.com
# Deploy to staging
deploy-staging:
runs-on: ubuntu-latest
needs: integration-test
environment: staging
steps:
- name: Checkout GitOps repo
uses: actions/checkout@v4
with:
repository: company/k8s-configs
token: ${{ secrets.GITOPS_TOKEN }}
path: gitops
- name: Update staging image
run: |
cd gitops
yq eval '.images[0].newTag = "${{ github.sha }}"' -i applications/user-service/overlays/staging/kustomization.yaml
git config user.name "GitHub Actions"
git config user.email "actions@github.com"
git add .
git commit -m "Update user-service staging image to ${{ github.sha }}"
git push
# E2E tests
e2e-test:
runs-on: ubuntu-latest
needs: deploy-staging
steps:
- name: Checkout code
uses: actions/checkout@v4
- name: Setup Node.js
uses: actions/setup-node@v4
with:
node-version: "20"
cache: "npm"
- name: Install dependencies
run: npm ci
- name: Install Playwright
run: npx playwright install
- name: Wait for deployment
run: |
timeout 300 bash -c 'until curl -f http://staging.api.company.com/health; do sleep 5; done'
- name: Run E2E tests
run: npm run test:e2e
env:
API_BASE_URL: http://staging.api.company.com
- name: Upload test results
uses: actions/upload-artifact@v3
if: failure()
with:
name: playwright-report
path: playwright-report/
# Deploy to production
deploy-production:
runs-on: ubuntu-latest
needs: e2e-test
environment: production
if: github.ref == 'refs/heads/main'
steps:
- name: Checkout GitOps repo
uses: actions/checkout@v4
with:
repository: company/k8s-configs
token: ${{ secrets.GITOPS_TOKEN }}
path: gitops
- name: Update production image
run: |
cd gitops
yq eval '.images[0].newTag = "${{ github.sha }}"' -i applications/user-service/overlays/production/kustomization.yaml
git config user.name "GitHub Actions"
git config user.email "actions@github.com"
git add .
git commit -m "Deploy user-service to production: ${{ github.sha }}"
git push
- name: Create GitHub Release
uses: actions/create-release@v1
env:
GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
with:
tag_name: v${{ github.run_number }}
release_name: Release v${{ github.run_number }}
body: |
Automated release of user-service
- **Commit**: ${{ github.sha }}
- **Image**: ${{ needs.build.outputs.image }}
- **Digest**: ${{ needs.build.outputs.digest }}
Deployed to production via GitOps.
draft: false
prerelease: false
# Smoke tests in production
smoke-test:
runs-on: ubuntu-latest
needs: deploy-production
steps:
- name: Checkout code
uses: actions/checkout@v4
- name: Setup Node.js
uses: actions/setup-node@v4
with:
node-version: "20"
cache: "npm"
- name: Install dependencies
run: npm ci
- name: Wait for deployment
run: |
timeout 600 bash -c 'until curl -f https://api.company.com/health; do sleep 10; done'
- name: Run smoke tests
run: npm run test:smoke
env:
API_BASE_URL: https://api.company.com
- name: Notify success
if: success()
uses: 8398a7/action-slack@v3
with:
status: success
text: "✅ User service v${{ github.run_number }} deployed successfully to production!"
webhook_url: ${{ secrets.SLACK_WEBHOOK }}
- name: Notify failure
if: failure()
uses: 8398a7/action-slack@v3
with:
status: failure
text: "❌ User service v${{ github.run_number }} deployment failed in production smoke tests!"
webhook_url: ${{ secrets.SLACK_WEBHOOK }}
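The pipeline above delegates the actual checks to npm scripts such as test:smoke. As a rough sketch of what such a script might look like (the endpoint, timings, and exit behavior are assumptions, not part of the workflow definition):
// Hypothetical smoke test: poll the health endpoint until it responds, then exit 0/1.
const BASE_URL = process.env.API_BASE_URL ?? "https://api.company.com";

async function waitForHealthy(timeoutMs = 60_000, intervalMs = 5_000): Promise<void> {
  const deadline = Date.now() + timeoutMs;
  while (Date.now() < deadline) {
    try {
      const res = await fetch(`${BASE_URL}/health`);
      if (res.ok) return; // service reports healthy
    } catch {
      // network error: keep retrying until the deadline
    }
    await new Promise(resolve => setTimeout(resolve, intervalMs));
  }
  throw new Error(`service did not become healthy within ${timeoutMs}ms`);
}

waitForHealthy()
  .then(() => console.log("smoke test passed"))
  .catch(err => {
    console.error(err.message);
    process.exit(1);
  });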
Multi-Environment Strategies
Effective multi-environment strategies ensure consistent deployment processes while maintaining appropriate isolation and security boundaries between different stages of the software lifecycle.
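The pattern underneath all of the Kustomize files below is layering: a base configuration plus a small per-environment overlay that overrides only what differs. The same idea in TypeScript, with illustrative values:
// Base + overlay configuration: the overlay only specifies what differs per environment.
interface AppConfig {
  logLevel: "debug" | "info" | "warn";
  databasePoolSize: number;
  rateLimitEnabled: boolean;
}

const base: AppConfig = { logLevel: "info", databasePoolSize: 10, rateLimitEnabled: true };

const overlays: Record<string, Partial<AppConfig>> = {
  development: { logLevel: "debug", databasePoolSize: 5, rateLimitEnabled: false },
  production: { logLevel: "warn", databasePoolSize: 20 },
};

function configFor(env: string): AppConfig {
  return { ...base, ...(overlays[env] ?? {}) }; // overlay values win over base values
}

console.log(configFor("production"));
// { logLevel: 'warn', databasePoolSize: 20, rateLimitEnabled: true }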
Environment Architecture
graph TB
subgraph "Development Environments"
subgraph "Developer Workspaces"
Local[Local Development]
DevPod[Development Pods]
end
subgraph "Shared Development"
DevShared[Shared Dev Environment]
Feature[Feature Branches]
end
end
subgraph "Testing Environments"
subgraph "Automated Testing"
Integration[Integration Tests]
Performance[Performance Tests]
Security[Security Tests]
end
subgraph "Manual Testing"
QA[QA Environment]
UAT[User Acceptance Testing]
end
end
subgraph "Pre-Production"
Staging[Staging Environment]
LoadTest[Load Testing]
Rehearsal[Deployment Rehearsal]
end
subgraph "Production"
subgraph "Production Deployment"
Blue[Blue Environment]
Green[Green Environment]
end
subgraph "Production Traffic"
Canary[Canary Deployment]
MainTraffic[Main Traffic]
end
end
subgraph "Configuration Management"
ConfigRepo[Configuration Repository]
Secrets[Secret Management]
FeatureFlags[Feature Flags]
end
subgraph "Observability"
Monitoring[Monitoring Stack]
Logging[Centralized Logging]
Alerting[Alert Management]
end
Local --> DevShared
DevPod --> DevShared
DevShared --> Feature
Feature --> Integration
Integration --> Performance
Performance --> Security
Security --> QA
QA --> UAT
UAT --> Staging
Staging --> LoadTest
LoadTest --> Rehearsal
Rehearsal --> Blue
Rehearsal --> Green
Blue --> Canary
Green --> Canary
Canary --> MainTraffic
ConfigRepo --> DevShared
ConfigRepo --> QA
ConfigRepo --> Staging
ConfigRepo --> Blue
ConfigRepo --> Green
Secrets --> QA
Secrets --> Staging
Secrets --> Blue
Secrets --> Green
FeatureFlags --> QA
FeatureFlags --> Staging
FeatureFlags --> Blue
FeatureFlags --> Green
DevShared --> Monitoring
QA --> Monitoring
Staging --> Monitoring
Blue --> Monitoring
Green --> Monitoring
Monitoring --> Logging
Logging --> Alerting
classDef dev fill:#e3f2fd
classDef test fill:#fff3e0
classDef preprod fill:#f3e5f5
classDef prod fill:#e8f5e8
classDef config fill:#fce4ec
class Local,DevPod,DevShared,Feature dev
class Integration,Performance,Security,QA,UAT test
class Staging,LoadTest,Rehearsal preprod
class Blue,Green,Canary,MainTraffic prod
class ConfigRepo,Secrets,FeatureFlags,Monitoring,Logging,Alerting config
Environment Configuration Strategy
# Base configuration (kustomization.yaml)
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization
metadata:
name: user-service-base
resources:
- deployment.yaml
- service.yaml
- configmap.yaml
- ingress.yaml
commonLabels:
app: user-service
component: api
images:
- name: user-service
newName: myregistry/user-service
newTag: latest
configMapGenerator:
- name: user-service-config
files:
- config/app.properties
- config/logging.properties
secretGenerator:
- name: user-service-secrets
env: secrets/.env
---
# Development overlay (overlays/development/kustomization.yaml)
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization
metadata:
name: user-service-development
namespace: development
resources:
- ../../base
patchesStrategicMerge:
- deployment-patch.yaml
- ingress-patch.yaml
configMapGenerator:
- name: user-service-config
behavior: merge
literals:
- ENVIRONMENT=development
- LOG_LEVEL=debug
- DATABASE_POOL_SIZE=5
- CACHE_TTL=300
- RATE_LIMIT_ENABLED=false
secretGenerator:
- name: user-service-secrets
behavior: merge
literals:
- DATABASE_URL=postgresql://dev-db:5432/userservice_dev
- REDIS_URL=redis://dev-redis:6379
- JWT_SECRET=dev-secret-key
images:
- name: user-service
newTag: development
replicas:
- name: user-service
count: 2
---
# Staging overlay (overlays/staging/kustomization.yaml)
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization
metadata:
name: user-service-staging
namespace: staging
resources:
- ../../base
patchesStrategicMerge:
- deployment-patch.yaml
- ingress-patch.yaml
- hpa-patch.yaml
configMapGenerator:
- name: user-service-config
behavior: merge
literals:
- ENVIRONMENT=staging
- LOG_LEVEL=info
- DATABASE_POOL_SIZE=10
- CACHE_TTL=600
- RATE_LIMIT_ENABLED=true
- METRICS_ENABLED=true
secretGenerator:
- name: user-service-secrets
behavior: merge
literals:
- DATABASE_URL=postgresql://staging-db:5432/userservice_staging
- REDIS_URL=redis://staging-redis:6379
images:
- name: user-service
newTag: staging
replicas:
- name: user-service
count: 4
---
# Production overlay (overlays/production/kustomization.yaml)
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization
metadata:
name: user-service-production
namespace: production
resources:
- ../../base
patchesStrategicMerge:
- deployment-patch.yaml
- ingress-patch.yaml
- hpa-patch.yaml
- pdb-patch.yaml
- networkpolicy-patch.yaml
configMapGenerator:
- name: user-service-config
behavior: merge
literals:
- ENVIRONMENT=production
- LOG_LEVEL=warn
- DATABASE_POOL_SIZE=20
- CACHE_TTL=1800
- RATE_LIMIT_ENABLED=true
- METRICS_ENABLED=true
- TRACING_ENABLED=true
- SECURITY_HEADERS_ENABLED=true
secretGenerator:
- name: user-service-secrets
behavior: merge
literals:
- DATABASE_URL=postgresql://prod-db-cluster:5432/userservice_prod
- REDIS_URL=redis://prod-redis-cluster:6379
images:
- name: user-service
newTag: production
replicas:
- name: user-service
count: 8
# Production-specific patches
patches:
- target:
kind: Deployment
name: user-service
patch: |-
- op: add
path: /spec/template/spec/containers/0/resources
value:
requests:
cpu: 500m
memory: 1Gi
limits:
cpu: 2000m
memory: 4Gi
- op: add
path: /spec/template/spec/affinity
value:
podAntiAffinity:
requiredDuringSchedulingIgnoredDuringExecution:
- labelSelector:
matchExpressions:
- key: app
operator: In
values:
- user-service
topologyKey: kubernetes.io/hostname
Database Migration Patterns
Zero-downtime database migrations are crucial for maintaining service availability during deployments. The expand-and-contract pattern provides a systematic approach to schema evolution.
Expand-and-Contract Migration Flow
graph TB
subgraph "Phase 1: Expand"
subgraph "Database Schema"
OldSchema[Old Schema v1]
NewColumns[Add New Columns]
NewTables[Add New Tables]
NewIndexes[Add New Indexes]
end
subgraph "Application"
AppV1[Application v1]
DualWrite[Dual Write Logic]
end
end
subgraph "Phase 2: Migrate"
subgraph "Data Migration"
Backfill[Backfill Data]
Validation[Data Validation]
Consistency[Consistency Checks]
end
subgraph "Application Update"
AppV2[Application v2]
ReadNew[Read New Schema]
WriteNew[Write New Schema]
end
end
subgraph "Phase 3: Contract"
subgraph "Cleanup"
RemoveOld[Remove Old Columns]
DropTables[Drop Old Tables]
CleanupCode[Remove Migration Code]
end
subgraph "Final State"
FinalSchema[Final Schema v2]
AppV3[Application v3]
end
end
OldSchema --> NewColumns
NewColumns --> NewTables
NewTables --> NewIndexes
AppV1 --> DualWrite
DualWrite --> Backfill
Backfill --> Validation
Validation --> Consistency
Consistency --> AppV2
AppV2 --> ReadNew
ReadNew --> WriteNew
WriteNew --> RemoveOld
RemoveOld --> DropTables
DropTables --> CleanupCode
CleanupCode --> FinalSchema
FinalSchema --> AppV3
classDef expand fill:#e3f2fd
classDef migrate fill:#fff3e0
classDef contract fill:#e8f5e8
class OldSchema,NewColumns,NewTables,NewIndexes,AppV1,DualWrite expand
class Backfill,Validation,Consistency,AppV2,ReadNew,WriteNew migrate
class RemoveOld,DropTables,CleanupCode,FinalSchema,AppV3 contract
Migration Implementation Example
-- Phase 1: Expand - Add new columns and tables
-- Migration 001: Add new user profile columns
ALTER TABLE users
ADD COLUMN profile_data JSONB,
ADD COLUMN last_login_at TIMESTAMP WITH TIME ZONE,
ADD COLUMN created_by_id UUID;
-- Create new user_profiles table for normalized data
CREATE TABLE user_profiles (
id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
user_id UUID NOT NULL REFERENCES users(id) ON DELETE CASCADE,
first_name VARCHAR(100),
last_name VARCHAR(100),
bio TEXT,
avatar_url VARCHAR(500),
preferences JSONB DEFAULT '{}',
created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),
updated_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),
CONSTRAINT unique_user_profile UNIQUE(user_id)
);
-- Add indexes for performance
CREATE INDEX idx_user_profiles_user_id ON user_profiles(user_id);
CREATE INDEX idx_users_last_login_at ON users(last_login_at);
CREATE INDEX idx_users_profile_data_gin ON users USING GIN(profile_data);
-- Phase 2: Migrate - Backfill data and update application logic
-- Data migration script (run in batches)
DO $$
DECLARE
batch_size INT := 1000;
user_record RECORD;
BEGIN
LOOP
-- Process users in batches
FOR user_record IN
SELECT id, email, full_name, bio, avatar_url
FROM users
WHERE profile_data IS NULL
ORDER BY id
LIMIT batch_size -- no OFFSET: already-migrated rows drop out of the WHERE filter
LOOP
-- Migrate data to new structure
UPDATE users SET
profile_data = jsonb_build_object(
'full_name', user_record.full_name,
'bio', user_record.bio,
'avatar_url', user_record.avatar_url,
'migrated_at', NOW()
),
last_login_at = COALESCE(last_login_at, created_at)
WHERE id = user_record.id;
-- Create normalized profile record
INSERT INTO user_profiles (user_id, first_name, last_name, bio, avatar_url)
SELECT
user_record.id,
split_part(user_record.full_name, ' ', 1),
split_part(user_record.full_name, ' ', 2),
user_record.bio,
user_record.avatar_url
ON CONFLICT (user_id) DO NOTHING;
END LOOP;
-- Check if we processed all records
IF NOT FOUND THEN
EXIT;
END IF;
-- Add delay to avoid overwhelming the database
PERFORM pg_sleep(0.1);
END LOOP;
END $$;
-- Phase 3: Contract - Remove old columns and cleanup
-- Migration 003: Remove old columns (after application deployment)
ALTER TABLE users
DROP COLUMN full_name,
DROP COLUMN bio,
DROP COLUMN avatar_url;
-- Add constraints that were deferred
ALTER TABLE user_profiles
ADD CONSTRAINT check_names_not_empty
CHECK (length(trim(first_name)) > 0 AND length(trim(last_name)) > 0);
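The batched backfill above can also be driven from the application side, walking the table with an id cursor so repeated runs stay cheap and resumable. A sketch, assuming a generic db.query client; table and column names follow the SQL example:
// Batched backfill sketch: walk the table by id cursor, pausing briefly between batches.
interface Db {
  query(sql: string, params?: unknown[]): Promise<{ rows: any[] }>;
}

async function backfillProfiles(db: Db, batchSize = 1000): Promise<void> {
  let lastId = "00000000-0000-0000-0000-000000000000"; // UUID cursor
  for (;;) {
    const { rows } = await db.query(
      `SELECT id, full_name, bio, avatar_url FROM users
       WHERE profile_data IS NULL AND id > $1
       ORDER BY id LIMIT $2`,
      [lastId, batchSize]
    );
    if (rows.length === 0) break; // nothing left to migrate

    for (const u of rows) {
      await db.query(
        `UPDATE users
         SET profile_data = jsonb_build_object(
           'full_name', $2::text, 'bio', $3::text, 'avatar_url', $4::text)
         WHERE id = $1`,
        [u.id, u.full_name, u.bio, u.avatar_url]
      );
    }
    lastId = rows[rows.length - 1].id;
    await new Promise(resolve => setTimeout(resolve, 100)); // brief pause between batches
  }
}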
Application Code Evolution
// Phase 1: Dual Write Implementation
class UserService {
async updateUserProfile(
userId: string,
profileData: UserProfile
): Promise<User> {
const transaction = await this.db.transaction();
try {
// Write to old format (backward compatibility)
await transaction.query(
`
UPDATE users SET
full_name = $2,
bio = $3,
avatar_url = $4,
updated_at = NOW()
WHERE id = $1
`,
[userId, profileData.fullName, profileData.bio, profileData.avatarUrl]
);
// Write to new format (dual write)
await transaction.query(
`
UPDATE users SET
profile_data = $2,
updated_at = NOW()
WHERE id = $1
`,
[userId, JSON.stringify(profileData)]
);
// Upsert to new normalized table
await transaction.query(
`
INSERT INTO user_profiles (user_id, first_name, last_name, bio, avatar_url, preferences)
VALUES ($1, $2, $3, $4, $5, $6)
ON CONFLICT (user_id) DO UPDATE SET
first_name = EXCLUDED.first_name,
last_name = EXCLUDED.last_name,
bio = EXCLUDED.bio,
avatar_url = EXCLUDED.avatar_url,
preferences = EXCLUDED.preferences,
updated_at = NOW()
`,
[
userId,
profileData.firstName,
profileData.lastName,
profileData.bio,
profileData.avatarUrl,
JSON.stringify(profileData.preferences),
]
);
await transaction.commit();
return this.getUserById(userId);
} catch (error) {
await transaction.rollback();
throw error;
}
}
// Phase 2: Read from new format, fallback to old
async getUserProfile(userId: string): Promise<UserProfile | null> {
const user = await this.db.query(
`
SELECT
u.id,
u.email,
u.profile_data,
u.full_name, -- Fallback for unmigrated records
u.bio, -- Fallback for unmigrated records
u.avatar_url, -- Fallback for unmigrated records
up.first_name,
up.last_name,
up.bio as profile_bio,
up.avatar_url as profile_avatar,
up.preferences
FROM users u
LEFT JOIN user_profiles up ON u.id = up.user_id
WHERE u.id = $1
`,
[userId]
);
if (!user) return null;
// Prefer new format, fallback to old format
if (user.profile_data) {
const profileData = JSON.parse(user.profile_data);
return {
firstName: user.first_name || profileData.first_name,
lastName: user.last_name || profileData.last_name,
bio: user.profile_bio || profileData.bio,
avatarUrl: user.profile_avatar || profileData.avatar_url,
preferences: user.preferences || profileData.preferences || {},
};
} else {
// Fallback to old format
return {
firstName: user.full_name?.split(" ")[0] || "",
lastName: user.full_name?.split(" ")[1] || "",
bio: user.bio,
avatarUrl: user.avatar_url,
preferences: {},
};
}
}
// Phase 3: Clean implementation using only new schema
async updateUserProfileFinal(
userId: string,
profileData: UserProfile
): Promise<User> {
await this.db.query(
`
UPDATE user_profiles SET
first_name = $2,
last_name = $3,
bio = $4,
avatar_url = $5,
preferences = $6,
updated_at = NOW()
WHERE user_id = $1
`,
[
userId,
profileData.firstName,
profileData.lastName,
profileData.bio,
profileData.avatarUrl,
JSON.stringify(profileData.preferences),
]
);
return this.getUserById(userId);
}
}
Migration Monitoring
# Kubernetes CronJob for migration monitoring
apiVersion: batch/v1
kind: CronJob
metadata:
name: migration-monitor
namespace: production
spec:
schedule: "*/15 * * * *" # Every 15 minutes
jobTemplate:
spec:
template:
spec:
containers:
- name: migration-monitor
image: postgres:15
env:
- name: PGHOST
value: "production-db.example.com"
- name: PGUSER
valueFrom:
secretKeyRef:
name: db-credentials
key: username
- name: PGPASSWORD
valueFrom:
secretKeyRef:
name: db-credentials
key: password
- name: PGDATABASE
value: "userservice"
command:
- /bin/bash
- -c
- |
# Check migration progress
TOTAL_USERS=$(psql -t -c "SELECT COUNT(*) FROM users;")
MIGRATED_USERS=$(psql -t -c "SELECT COUNT(*) FROM users WHERE profile_data IS NOT NULL;")
PROFILE_RECORDS=$(psql -t -c "SELECT COUNT(*) FROM user_profiles;")
MIGRATION_PERCENTAGE=$(( (MIGRATED_USERS * 100) / TOTAL_USERS ))
echo "Migration Progress Report:"
echo "Total Users: $TOTAL_USERS"
echo "Migrated Users: $MIGRATED_USERS"
echo "Profile Records: $PROFILE_RECORDS"
echo "Migration Percentage: $MIGRATION_PERCENTAGE%"
# Check for data consistency
INCONSISTENT_RECORDS=$(psql -t -c "
SELECT COUNT(*) FROM users u
LEFT JOIN user_profiles up ON u.id = up.user_id
WHERE u.profile_data IS NOT NULL AND up.user_id IS NULL;
")
if [ "$INCONSISTENT_RECORDS" -gt 0 ]; then
echo "WARNING: Found $INCONSISTENT_RECORDS inconsistent records!"
# Send alert to monitoring system
curl -X POST "$SLACK_WEBHOOK" -d "{\"text\": \"Migration inconsistency detected: $INCONSISTENT_RECORDS records\"}"
fi
# Export metrics to Prometheus
cat << EOF > /tmp/migration-metrics.prom
# HELP user_migration_total Total number of users
# TYPE user_migration_total gauge
user_migration_total $TOTAL_USERS
# HELP user_migration_completed Number of migrated users
# TYPE user_migration_completed gauge
user_migration_completed $MIGRATED_USERS
# HELP user_migration_percentage Percentage of migration completion
# TYPE user_migration_percentage gauge
user_migration_percentage $MIGRATION_PERCENTAGE
# HELP user_migration_inconsistent Number of inconsistent records
# TYPE user_migration_inconsistent gauge
user_migration_inconsistent $INCONSISTENT_RECORDS
EOF
# Push metrics to Pushgateway
curl -X POST "http://pushgateway.monitoring:9091/metrics/job/migration-monitor" \
--data-binary @/tmp/migration-metrics.prom
restartPolicy: OnFailure
Conclusion
Modern deployment and release patterns have evolved to address the complex requirements of cloud-native applications: zero downtime, rapid iteration, risk mitigation, and operational simplicity. The strategies outlined in this guide provide a comprehensive toolkit for implementing robust deployment pipelines.
Key Takeaways
- Choose the Right Pattern: Blue-Green for instant rollback, Canary for risk mitigation, Rolling Updates for resource efficiency
- Embrace GitOps: Declarative configurations provide audit trails, consistency, and automated reconciliation
- Implement Progressive Delivery: Feature flags and gradual rollouts reduce blast radius and enable data-driven decisions
- Prioritize Observability: Comprehensive monitoring, logging, and alerting are essential for confident deployments
- Automate Everything: From testing to deployment to rollback, automation reduces human error and accelerates delivery
- Plan for Data: Database migrations require careful planning and execution to maintain zero downtime
Future Trends
As we move forward, emerging patterns like service mesh-native deployments, AI-driven canary analysis, and platform engineering approaches will further enhance deployment capabilities. The foundation built with these proven patterns will enable teams to adopt new technologies while maintaining reliability and operational excellence.
The journey to mastering deployment patterns is ongoing, but with these tools and techniques, teams can build resilient, scalable systems that deliver value to users while maintaining the agility needed in today’s competitive landscape.
This guide provides practical, production-ready examples for implementing modern deployment patterns. Remember to adapt these patterns to your specific requirements, constraints, and organizational context.