The truth table — every dataset, classified

Datasets

About half of DRISHTI's surface runs on real RTGS data; the other half is synthetic because the RTGS drop contains no PII and no enforcement records. Every table here carries a REAL, DERIVED, or SIMULATED badge.

Live tables

TableKindRowsSourceDescription
retailersREAL4,899Retailer Info.xlsxActive outlet master — Code, Name, Address, District, Circle, Depot, Lat/Lon, Status, VendorType
brandsREAL1,457Brand & Supplier Info.xlsxBrand catalogue with SKU, MRP, basic price, supplier, distillery
labelsREAL1,373Label Approvals_2025_2026.xlsxApproved label registry FY26 with front/back image URLs
sales_inflowREAL6,32,025Retailer Wise Sales(in).xlsxDaily inflow ledger by vendor — date, vendorId, district, cases, bottles, sale value
sales_yearwiseREAL6,42,955Retailer Sales Year wise -1.xlsxYear-wise retail sales by shop — date, retailer, depot, cases, bottles, sale value
distillery_quotaREAL42Statement Pharma Molasses and Distillery March 26.xlsxPer-distillery quota: allotted vs lifted PL/BL, with AC office grouping
seizuresREAL430MONTH WISE DATA ID NDPL DPL GANJA …xlsxAggregated month-wise seizure counts 2019-2026 (ID/NDPL/DPL/Spurious/Ganja)
excise_stationsSIMULATED208etl/simulate/excise_stations.py (seed=1947)208 stations allocated proportional to outlet density, K-means centroids per district. Real boundaries arrive at pilot.
retailer_stationDERIVED4,460haversine(retailer, station_centroid) within districtOutlet → nearest station mapping (depends on simulated stations, real outlets)
officersSIMULATED409etl/simulate/officers.py (TODO)1–3 officers per station with rank, tenure, arrest history
offendersSIMULATED12,030etl/simulate/offenders.py (TODO)~12,000 offender personas; ~400 organized into 25 networks
vehiclesSIMULATED214etl/simulate/vehicles_phones.py (TODO)Vehicle plates linked to offender networks
phonesSIMULATED7,913etl/simulate/vehicles_phones.py (TODO)Phone numbers linked to offender networks
firsSIMULATED68,482etl/simulate/firs_inspections.py (TODO)~50,000 FIR records calibrated to real per-district seizure totals
inspection_reportsSIMULATED27,316etl/simulate/firs_inspections.py (TODO)~120,000 routine inspection reports with findings text
gps_tripsSIMULATED6,000etl/simulate/gps_trips.py (TODO — extends SPIRIT Lock-2 to ~6,000 trips)Synthetic depot→retailer GPS polylines with deviation/dwell/speed features
resource_usageSIMULATED1,92,312etl/simulate/resource_usage.py (TODO)Village × month molasses/sugar/jaggery offtake — correlated with real ID-seizure intensity for hotspot model