Welcome back, Prateek
Shift oversight — validate logs, approve swaps, execute handovers.
Total Shift Logs: 15
Unresolved: 10
Pending Validation: 1
Pending Swap Requests: 2
Shift Handover Protocol
Compile and submit the outgoing handover note.
Handover from Arjun Sharma
Due: 5:48:39 AM
1 P1 active (Stripe webhook failures in Checkout). Search latency spike in Browse still under investigation. Profile 503s being monitored.
Shift Swap Requests
Approve or reject peer swap requests.
Chaitanya Lakshmikumar Addepalli → Brahmateja Kanchibhotla
Family event — need to swap Sat shift.
Rajeev Kumar → Sonam Bhardwaj
Medical appointment in the morning.
Payment service timeout — Stripe webhook failures
Payment confirmations not arriving. Stripe webhook retries exhausting. Escalated to vendor and GAPINC Retail Payments team.
buy-next-prod JS error rate >5.0 — HUI excluding ApplePay/PayPal
JS error rate exceeded 5.0 threshold on buy-next-prod for 5 consecutive minutes (HUI). Reviewed GCP logs, New Relic alerts and PT-Webapps Runbooks. RCA in progress.
Profile service 503 errors — login degraded
Customer login failing intermittently. Profile microservice returning 503 on ~15% of requests. Auth token refresh also impacted.
Vault Service mismatch — AKS properties missing in Chartis
Root cause identified: Vault Service version mismatch between Chartis env and AKS config. Missing properties are causing auth token failures for 30% of payment requests. Escalated to Platform team.
Search API latency spike — product browse degraded
P99 search latency increased from 120ms to 850ms. Investigating root cause. Suspect upstream Elasticsearch connection pool saturation.
Buy flow — rule engine throwing TypeError on promo stacking
TypeError in checkout rule engine when two or more promotions are stacked. Affects ~12% of checkout sessions with active promo codes. Draft PR raised; review comments are being addressed.
DAM asset sync job delayed — CDN propagation lag >15 min
Scheduled asset sync job took 47 min instead of the expected 30 min. CDN propagation lag observed across 3 edge nodes. Investigating network bottleneck on DAM → CDN pipeline.
Apple Pay token validation — intermittent 40% error rate
Apple Pay token validation returning HTTP 422 for ~40% of requests. Impacting mobile checkout. Apple Pay gateway was down for 40 min; converted from P1. Root cause under investigation.
Ecom FUI — story FUI-6211 validation bugs observed
While validating FUI-6211, identified 3 UI regression bugs in the cart summary component. Shared observations with the dev team. Awaiting fix.
SSL certificate renewal for Core API staging
Renewed SSL cert for api-staging.core.gapinc.com. Cert was expiring in 3 days. Renewed via Let's Encrypt automation.
BRFC — Savings calculator showing 20% instead of 25%
Savings calculator in bag/checkout showing incorrect 20% savings figure instead of 25%. Root cause: percentage constant not updated after the campaign config change. Hotfix deployed.
Scheduled DB vacuum job delayed on reporting cluster
Nightly vacuum job started 45 min late due to long-running analytics query. No data loss. Job completed successfully.
Customer profile cache TTL misconfiguration — stale data on refresh
Cache TTL was set to 1h instead of 5m. Customer profile updates not reflecting immediately. TTL updated and cache cleared. No data loss.
KTLO — PWFO-2126 feature flag cleanup reviewed
Reviewed KTLO ticket PWFO-2126 for feature flag cleanup. Change is low-risk; PR to be raised in the upcoming sprint. No production impact.
Email campaign cron skipped two dispatch cycles
Email dispatch cron skipped 02:00 and 04:00 IST runs. Manually triggered at 05:30. Emails reconciled. Root cause: memory limit exceeded on worker pod.