The warehouse problem nobody talks about
Walk into any mid-sized Indian warehouse — pharma, FMCG, auto-ancillary, e-commerce — and you will find the same setup: a row of DVR monitors in the security cabin, a guard watching six screens simultaneously at 11 PM, and a hard disk overwriting footage every 15 days. Cameras everywhere. Intelligence: nowhere.
This is not a technology gap. It is a deployment gap. The cameras exist. The network infrastructure often exists. What is missing is the intelligence layer that converts raw video into actionable alerts — before the forklift clips a pedestrian, before the PPE violation triggers an OSHA-equivalent show-cause, before the ₹4 lakh carton walks out of the loading dock at 2 AM.
Industry Signal · Accenture, Feb 2026
Accenture's 2026 supply chain report found that AI-powered warehousing systems can deliver up to 15% higher inventory accuracy, up to 22% lower warehousing costs, and up to 20% greater productivity — driven by smarter automation and real-time analytics. Accenture
India is not behind on this curve. 55% of Indian C-suite executives expect AI to deliver more than 10% revenue uplift over the next three years — the most bullish of any country surveyed globally, according to McKinsey's 2025 AI workplace study. McKinsey The question for warehouse operators is no longer whether to deploy AI, but which use cases to start with, which models to run, and how to do it without exposing themselves to regulatory risk under India's new Digital Personal Data Protection (DPDP) Rules.
What does AI actually "see" in a warehouse?
Let's be concrete. An AI surveillance system in a warehouse is not magic. It is a trained computer vision model analysing a camera stream at 15–30 frames per second, comparing what it sees against learned patterns, and triggering an alert when something matches a rule. The intelligence lives in the model, the speed lives in the hardware, and the compliance lives in where the data stays.
Here are the eight use cases that matter most for Indian warehouses right now:
PPE Compliance Detection
Helmet, safety vest, steel-toe boots, gloves — detected per worker, per zone, per shift. Alert fires within seconds of a violation.
High priority · DGFASLIForklift & Pedestrian Safety
Trajectory-prediction models track forklift paths and flag near-miss zones before collision. Works in narrow aisles and blind corners.
OSHA-equivalent · P0 alertPilferage & Concealment Detection
Unusual item handling, garment concealment, and exit-without-scan events are flagged automatically at loading docks and exits.
ROI-critical · High value zonesUnauthorised Access Control
Restricted zone intrusion — stockrooms, server rooms, racking areas above safe height — detected and logged in real time.
Compliance · ISO 27001 alignedFire & Smoke Detection
Vision-based smoke and flame detection that works on existing CCTV, faster than many standalone smoke detectors in open warehouse volumes.
Life safety · Fire NOC alignedSlip, Trip & Fall Detection
Sudden postural changes and falls are detected within milliseconds. Critical for wet loading zones, uneven floors, and ramp areas.
Labour welfare · ESG complianceAfter-Hours Intrusion
Any movement in defined zones outside shift hours triggers immediate alerts. Perimeter and interior zones configured separately.
Security · Insurance coverageCrowd & Queue Analytics
Congestion at picking stations, dock queues, and entry/exit zones — monitored for operational efficiency and throughput optimisation.
Ops efficiency · Labour planningWhich AI models run these use cases — and how
This is where most AI vendor presentations go vague. Here, we will be specific. The AI models running in production warehouse deployments in 2026 are not mysterious black boxes — they are well-documented, open-weight architectures fine-tuned on domain-specific data. What matters for warehouse managers is knowing which model handles which job, and what hardware it needs.
| AI Model | Use Case(s) | Why This Model | Edge Requirement | Status |
|---|---|---|---|---|
| YOLOv8 / YOLO11 | PPE detection, forklift/pedestrian zone, fire, intrusion | Best-in-class real-time object detection. Runs at 30+ FPS on edge GPU. Ultralytics-supported, widely fine-tuned on industrial datasets. | GPU ≥ 4GB VRAM (e.g. Jetson Orin) | Production |
| DeepSORT | Forklift tracking, person trajectory, loitering | Multi-object tracker that pairs with YOLO detectors. Maintains unique IDs across frames to predict movement paths, not just positions. | CPU-capable (light) | Production |
| ResNet-50 / EfficientDet | PPE classification, behaviour category | Heavier classification after YOLO flags a region of interest. Used to confirm helmet/vest/glove type with higher accuracy in ambiguous lighting. | Mid-range GPU | Production |
| MediaPipe Pose | Slip/fall detection, overreach from height | Google's lightweight skeleton-tracking model. Detects body posture events — falls, overreaching, crouching in hazard zones — in real time. | CPU-capable | Production |
| Vision Transformer (ViT) — fine-tuned | Concealment / pilferage, anomaly behaviour | Attention-based model that catches subtle garment bulges, item concealment, and unusual body language patterns missed by YOLO's bounding boxes. | GPU ≥ 6GB VRAM | Selective Deploy |
| OpenCV + Background Subtraction (MOG2/KNN) | After-hours intrusion, zone boundary trigger | Non-deep-learning baseline that runs on CPU at near-zero cost. Ideal for simple perimeter tripwires and after-hours motion detection where GPU is not available. | CPU only | Production |
| FireNet / Custom CNN (fire/smoke) | Fire and smoke detection in warehouse volumes | Purpose-built CNNs trained on smoke-in-open-volume datasets. Faster than traditional ionisation detectors in high-ceiling warehouses where smoke takes longer to reach ceiling sensors. | GPU ≥ 4GB VRAM | Testing / Pilot |
| LPR / ANPR Model | Vehicle entry/exit at docks, GRN/GDN reconciliation | Optical character recognition specialised for Indian number plates — including high-security registration plates (HSRP). Matches vehicle registrations against pre-approved lists automatically. | CPU-capable (dedicated stream) | Production |
News · Ultralytics · Feb 2025
YOLO11's latest release reports significant accuracy improvement for industrial safety use cases, with the model now handling forklift–pedestrian conflict prediction as a first-class task — combining bounding-box detection with trajectory extrapolation. Recent News
The DPDP reality every warehouse manager must know
India's Digital Personal Data Protection Rules 2025 were notified on 13 November 2025 by MeitY. They apply — directly and explicitly — to warehouse operations. Deloitte India If your CCTV system captures footage of identifiable workers, drivers, or visitors, you are processing personal data. The moment that footage is transmitted to a cloud server — even for AI inference — you become a data fiduciary with obligations around consent, retention, purpose limitation, and breach notification.
"Warehouse staff cannot opt out of attendance systems, and surveillance data must remain proportionate, tied to a clear purpose and retained only as long as required."
KS&A Legal Analysis · DPDP Act for Logistics, Feb 2026The practical implication: cloud-based AI CCTV processing is a regulatory liability in 2026 India. Every frame transmitted off-premises must be justified under a lawful basis. Consent is nearly impossible in employment contexts. Operational necessity has limits. And penalties under the DPDP Act reach up to ₹250 crore per instance of failure to implement reasonable security safeguards.
On-premise edge AI eliminates this exposure entirely. When inference runs locally — on an edge compute unit in the warehouse itself — no biometric or personally identifiable frame ever leaves the facility. The AI model processes, alerts, and discards. What is retained (according to your defined policy) stays on-premise, encrypted, and under your control.
A real scenario: What happens at a typical ₹500 crore Indian distribution warehouse
Illustrative Scenario — Composite of typical deployments
A 35,000 sq ft ambient distribution warehouse in Bhiwandi, Mumbai — handling FMCG SKUs for a national brand — had three incidents in the previous financial year: one forklift injury (worker on medical leave for 6 weeks), two pilferage events totalling ₹9.2 lakh in missing goods, and a Factories Act notice for recurring PPE non-compliance.
After deploying edge AI across 16 existing IP cameras, within the first 30 days: 83 PPE violations were logged and resolved with supervisor alerts, one forklift near-miss was caught that the on-ground team had not reported, and dock-area anomaly detection flagged three after-hours gate events that turned out to be a contractor accessing the premises without sign-in. No pilferage event occurred in the following quarter.
How edge AI works in practice: the EdgeBox architecture
IndoAI's EdgeBox is an on-premise AI compute unit that sits at the warehouse network edge — connected to the existing IP CCTV infrastructure via RTSP streams. It runs AI inference locally, with no cloud dependency, no frame-level data leaving the facility, and no per-camera licensing fees.
The architecture is straightforward: cameras stream to EdgeBox over the local network (LAN/PoE), EdgeBox runs the configured AI model pipeline per camera, generates structured alert events (JSON payloads with timestamp, camera ID, zone, event type, thumbnail), and pushes notifications to your preferred channel — WhatsApp, email, Telegram, or your existing WMS via API.
What EdgeBox does not do: it does not store raw video in the cloud, it does not send facial recognition data off-premise, and it does not require an internet connection for core AI inference (internet is needed only for remote monitoring dashboards and software updates).
Industry Signal · PwC 2025
PwC's 2025 Digital Trends in Operations Survey found that 57% of Indian operations executives have integrated AI into selected functions — but only those with on-premise or hybrid architectures reported meaningful progress on compliance-sensitive deployments. PwC
Why MeitY STQC matters for your procurement decision
If your warehouse serves government clients, operates in a sensitive sector (pharma, defence supply chain, critical infrastructure), or participates in GeM procurement — the equipment origin and software certification of your AI CCTV system matters as much as its features. MeitY's Electronic Regulations (ER) framework and the STQC certification process require that AI surveillance equipment used in government or regulated environments meets security and data integrity standards.
Chinese-origin surveillance hardware faces specific compliance barriers under the 2026 regulatory environment, consistent with broader NDAA-aligned restrictions and MeitY STQC guidance. IndoAI EdgeBox is designed and built in India, aligning with Make in India, Atmanirbhar Bharat, and applicable MeitY ER requirements.
Starting right: the AIRI assessment before you deploy
The single biggest reason Indian warehouse AI deployments underperform is wrong expectations at entry — operators assume their existing cameras are AI-ready, or that any model will work in their specific lighting and layout. An honest readiness assessment before deployment prevents 80% of post-deployment problems.
IndoAI's AIRI (AI Readiness Index) provides a structured assessment across five dimensions: camera infrastructure quality, network capacity, lighting and environmental conditions, zone mapping completeness, and alert workflow readiness. A warehouse scoring below threshold on any dimension gets a remediation plan before model deployment begins — not a rushed go-live that underperforms and erodes confidence in AI.
Industry Signal · McKinsey State of AI 2025
McKinsey's 2025 AI report identifies a critical "scaling gap" — only one-third of organisations successfully scale AI across the enterprise. The differentiator is not the model, it is the readiness of the operational environment. Structured pre-deployment validation is the single most correlated factor with successful AI ROI. McKinsey
The deployment pathway: what to expect, step by step
A typical warehouse AI deployment with IndoAI follows a five-step pathway designed to produce measurable outcomes within the first 30 days — not promises about future quarters.
Step 1 — AIRI Assessment (Days 1–3): Site survey, camera audit, zone mapping, and scoring. Output is a formal readiness report with a recommended model configuration and remediation items (if any).
Step 2 — EdgeBox Installation and Camera Integration (Days 4–7): EdgeBox is installed on-premise. Existing IP cameras are mapped via RTSP. No new cabling typically required for cameras that already stream over the existing LAN.
Step 3 — Model Configuration and Alert Workflow Setup (Days 7–10): Selected AI models are configured per zone. Alert thresholds, escalation paths, and notification channels are set. Initial false-positive tuning run.
Step 4 — 30-Day Pilot (Days 10–40): Live deployment. Daily alert logs reviewed. Model fine-tuning based on your specific warehouse environment. ROI events (PPE violations caught, near-misses flagged, anomalies detected) are documented.
Step 5 — Full-Scale Validation and Handover: AIRI post-deployment score, detection accuracy report, and recommended expansion zones. Ongoing AMC includes model updates and 8-hour alert SLA.
Frequently asked questions
Is your warehouse AI-ready?
Book a no-obligation AIRI (AI Readiness Index) assessment. We will map your existing cameras, score your infrastructure, and tell you exactly which AI use cases are deployable today — no vague timelines, no overselling.