About Ektos Health

Ektos Health was built by Peritos Solutions in response to a clear gap in the Indian healthcare market: hospitals and diagnostic centres were running on fragmented systems — paper-based appointment registers, disconnected lab software, manually typed prescriptions, and spreadsheet billing. The result was errors, delays, compliance risk, and frustrated staff.

The vision was to build a single, cloud-based HMIS that any hospital — from a small diagnostic centre to a multi-specialty hospital — could deploy without heavy IT infrastructure, get up and running quickly, and be confident it met India’s digital health standards from day one.

Ektos Health is proudly developed by Peritos Solutions and is available at ektos-health.com. It is deployed for diagnostic centres, polyclinics, and hospitals across India.

The Problem

Scope & Feature List

Module 1 — Appointment Booking & Management 

A centralized, real-time appointment management system for walk-in and online bookings: 

  • Schedule, reschedule, and cancel appointments with a single click — centralized dashboard with real-time queue and wait times 
  • Support for walk-in and online appointment channels — quick-access buttons for both 
  • Patient registration at booking — captures name, gender, DOB, mobile, Aadhaar/ABHA ID 
  • New ABHA account creation directly from the appointment interface — or verification of existing ABHA ID 
  • Real-time appointment tracking — arrival status, wait time, consultation status all visible in one view 
  • View patient vitals, medical history, prescriptions, and previous examination notes directly from the appointment dashboard 
  • Action controls: view detail, update time, reschedule, cancel — with confirmation prompts 
  • Department and doctor selection — appointments routed to the correct consultant automatically 
  • Integration with ABHA for secure digital health record access and sharing at the point of booking 

 

Module 2 — Patient Management 

Complete end-to-end patient record management — from first registration to ongoing care: 

  • Unique UHID (Universal Health ID) generated for every patient at registration 
  • Comprehensive patient profile: demographics, medical history, allergy triggers, vital alerts, immunisation records 
  • Visit logs and doctor assignment history across all OPD and IPD interactions 
  • Consent logs and documented approvals — legally compliant patient consent management 
  • Integrated view across prescriptions, appointments, billing, and lab reports — all accessible from a single patient record 
  • Vital signs dashboard: height, weight, BMI, temperature, pulse, blood pressure — all logged per visit 
  • ABHA / Aadhaar linking — patient records linked to national health identity for interoperability 

 

Module 3 — Laboratory Management (Diagnostic Centre) 

End-to-end lab workflow management — from test assignment to report delivery — purpose-built for diagnostic centres: 

  • Test assignment from doctor to lab — linked directly to patient profile by name, ID, or ABHA number 
  • Sample ID generation and tracking — collection date, dispatch, acknowledgement, processing status all tracked 
  • Priority-based sample handling — urgent samples flagged and processed with higher priority 
  • Result entry: both automated (machine connectivity) and manual input options 
  • Automated report verification alerts — notifies lab doctors and physicians when results are ready for review 
  • PDF lab report generation with one click — auto-distributed to patient, treating doctor, and relevant staff 
  • Outsourced sample management — supports samples sent to external reference labs 
  • Integrated patient history — view all past tests and results from the same interface 
  • ABHA-compatible — search and link patients using national health ID for paperless lab processing 
  • Role-based access — lab assistants, technicians, and consultants each see only relevant data and actions 
  • Diagnostic machine connectivity — direct integration for automated result capture, reducing manual errors 

 

Module 4 — AI Prescription Generation & Clinical Intelligence 

The AI layer is the most powerful differentiator in Ektos Health — enabling doctors to generate accurate, formatted prescriptions in under a minute using voice or text, with AI-assisted diagnosis support: 

Voice-to-Text Prescription Dictation 

  • Doctors dictate the prescription verbally — the system converts speech to structured prescription text in real time 
  • Supports medical terminology, drug names, dosages, and clinical instructions accurately 
  • Significantly reduces consultation time — doctors spend less time typing and more time with the patient 
  • Dictated text is editable before finalisation — doctor retains full clinical control 

AI-Generated Diagnosis Suggestions 

  • Doctor enters the patient’s chief complaint and clinical notes — AI analyses these against the patient’s previous diagnosis history 
  • AI suggests probable diagnoses ranked by likelihood — based on the symptom pattern, history, and clinical context 
  • Differential diagnosis support — highlights conditions to rule out based on the presenting complaint 
  • AI draws from the patient’s full longitudinal history stored in the system — not just the current visit 
  • Doctor reviews, selects, or overrides AI suggestions — the system learns and improves with usage 

AI Prescription PDF Generation 

  • Once the consultation is complete, the system generates a professionally formatted, branded prescription PDF automatically 
  • Prescription includes: patient details, date, doctor name and registration number, diagnosis, medications with dosage and frequency, instructions, follow-up date 
  • Compliant with ABDM digital prescription standards — can be shared electronically via ABHA 
  • Prescriptions stored against the patient record — visible in appointment history and patient profile 
  • Configurable prescription templates per department or doctor — managed via Settings module 

 

Module 5 — Admission & Discharge (IPD) 

Full inpatient lifecycle management — from bed allocation on admission to discharge summary generation: 

  • Digital patient registration at admission — captures insurance details, ID, health history, and emergency contacts 
  • Intelligent bed allocation — auto-assigns beds based on availability, room type (ICU, General, Private, Isolation), and clinical condition 
  • Real-time bed management dashboard — live view of all bed IDs, types, room numbers, and occupancy status 
  • Bed category management — ICU, General, Private, Isolation configured and managed via the system 
  • Real-time patient condition monitoring — clinicians and nursing staff track status updates and discharge readiness 
  • Discharge planning — clinicians set discharge timelines and manage pending reports before release 
  • Automated discharge summary generation — produced with a few clicks, including diagnosis, treatment summary, medications, and follow-up instructions 
  • Bed released automatically on discharge — status updated from ‘In Use’ to ‘Available’ in real time 
  • Cross-department coordination — admission, billing, nursing, and housekeeping all notified of status changes 
  • Care team collaboration dashboard — doctors, nurses, and admin work from the same centralized view 

 

Module 6 — Billing Management 

Integrated, transparent billing across all hospital services — from OPD consultations to lab tests and inpatient stays: 

  • Automated bill generation — charges for consultations, lab tests, procedures, medications, and room rental all captured automatically 
  • Multi-status billing — tracks paid, unpaid, partially paid, and refunded invoices 
  • Detailed payment breakdown per patient — line-item visibility of every charge 
  • Patient-wise billing history — all bills and payment records linked to the patient profile 
  • Search and filter tools — quickly locate invoices by patient, date, doctor, or billing status 
  • Consent-based billing — billing actions linked to documented patient consent 
  • Invoice and billing templates configurable per hospital — managed via Settings module 

 

Module 7 — Settings, Master Data & System Configuration 

Full control over how Ektos Health operates within your institution: 

  • Brand settings — hospital logo, colours, and name applied across all reports, prescriptions, and invoices 

  • Email and SMS configuration — automated notifications for appointment confirmations, lab results, and billing 

  • Invoice, prescription, appointment, and lab report templates — all configurable without developer involvement 

  • User roles and access control — granular permissions per role (doctor, nurse, lab technician, billing staff, admin) 

  • Compliance and audit trails — all actions logged with user ID, timestamp, and action type 

  • Department management — add, edit, and deactivate departments and specialties 

  • Doctor and staff profiles — registration numbers, specialties, and schedule management 

  • Billing item setup — configure service codes, pricing, and insurance billing items 

  • Real-time system updates — configuration changes take effect immediately without downtime 

ABDM & NABH Compliance

Ektos Health is built from the ground up to meet India’s national digital health standards — not as an afterthought, but as a core architectural requirement:

ABDM Compliance 

Full integration with Ayushman Bharat Digital Mission — ABHA creation, verification, and health record linkage built into appointment, lab, and prescription workflows 

ABHA Integration 

Patients can create a new ABHA (Ayushman Bharat Health Account) or link an existing one at registration — enabling secure nationwide health record interoperability 

Aadhaar Verification 

Aadhaar-based patient identity verification supported at registration and appointment booking 

Digital Prescriptions 

AI-generated prescription PDFs comply with ABDM digital prescription standards — shareable via ABHA 

NABH Standards 

Workflow design, documentation requirements, consent management, and audit trails align with NABH accreditation criteria 

Audit Trails 

All clinical and administrative actions logged with user, timestamp, and action type — supporting NABH audit requirements 

Data Security 

Patient health data encrypted at rest and in transit — role-based access prevents unauthorised data exposure 

Compliance Readiness 

90% compliance readiness score across ABDM/NABH requirements — hospitals go live already meeting accreditation benchmarks 

Technology & Architecture

Ektos Health is a cloud-native application built on modern web technologies, deployed on scalable cloud infrastructure:

Frontend 

React.js — fast, responsive, mobile-compatible web interface — works on desktop, tablet, and mobile without a separate app 

Backend 

Node.js / Python REST APIs — microservice architecture for each module (appointments, lab, billing, AI) 

Cloud 

Cloud-hosted — scalable, high-availability infrastructure with 24/7 uptime monitoring 

Database 

Secure relational database with full patient record history, audit logs, and real-time updates 

AI Engine 

GPT-4 integration for diagnosis suggestions and prescription generation — custom prompt engineering for clinical context 

Voice-to-Text 

Speech recognition API with medical vocabulary training — dictation converted to structured prescription text 

PDF Engine 

Automated prescription and lab report PDF generation — templated, branded, and configurable per department 

ABDM API 

Official ABDM API integration — ABHA creation, verification, and health record exchange 

Notifications 

SMS and email notification engine — appointment reminders, lab result alerts, billing confirmations 

Security 

Role-based access control, encrypted data transmission, session management, and full audit logging 

AI Features — How They Work

AI Feature 

How It Works 

Clinical Benefit 

Voice-to-Text Prescription 

Doctor speaks the prescription aloud. Speech recognition converts dictation to structured text — drug names, dosages, instructions — in real time. Doctor reviews and confirms before saving. 

Consultation time reduced significantly. Doctor maintains eye contact with patient rather than typing. Reduces transcription errors. 

AI Diagnosis Suggestion 

Doctor enters chief complaint and clinical notes. AI analyses these against the patient’s stored diagnosis history and symptom patterns to suggest probable diagnoses ranked by likelihood. 

Supports junior doctors with differential diagnosis. Reduces diagnostic oversight. Highlights conditions to rule out. Improves clinical decision quality. 

AI Prescription PDF 

Once diagnosis and treatment are confirmed, the system auto-generates a formatted, branded prescription PDF including patient details, diagnosis, medications with dosage and frequency, and follow-up date. 

Professional, legible, legally compliant prescriptions every time. Stored against patient record. Shareable via ABHA. No formatting overhead for doctor. 

Previous Diagnosis Context 

AI accesses the patient’s full longitudinal history — all previous diagnoses, medications, allergies, and test results — as context when generating suggestions for the current visit. 

Prevents prescribing contraindicated medications. Highlights recurring conditions. Improves continuity of care across multiple visits and doctors. 

Implementation Approach

Ektos Health is deployed as a cloud SaaS product. New hospitals and diagnostic centres can be onboarded and live within days:

Day 1–2 — Setup 

Hospital profile, branding, departments, doctors, and billing items configured in Master Data and Settings. User roles and access levels assigned. 

Day 2–3 — Integration 

ABDM/ABHA API connection configured. SMS/email notification templates set up. Diagnostic machine connectivity tested where applicable. 

Day 3–4 — Data Entry 

Existing patient records migrated or entered. Lab test catalogue configured. Room and bed categories set up for IPD centres. 

Day 4–5 — Training 

Staff training on appointmentlab, billing, and prescription workflows. Doctors trained on voice-to-text and AI diagnosis features. 

Day 5 — Go-Live 

Live patient appointments and consultations begin. Peritos Solutions provides hypercare support for the first two weeks post-launch. 

Ongoing 

24/7 cloud support. Feature updates deployed automatically. AI model improves with usage. ABDM regulatory updates applied centrally. 

Client Testimonials

Ektos Health has significantly streamlined our operations and improved efficiency across our healthcare workflows. The platform is intuitive, reliable, and backed by a highly responsive team at Peritos Solutions.” 

— Akanksha Niranjan, Director, Ekanshi Solutions, Lucknow 

“The platform has simplified our day-to-day clinic management, from patient records to reporting. It has reduced manual work and improved overall efficiency. The support team is proactive and always available when needed.” 

— Dr. Ramnath Mishra, Clinic, Bhubaneswar 

Ektos Health has transformed how we manage patient records and hospital operations. The system is easy to use, secure, and provides quick access to critical information, enabling us to deliver better patient care.” 

— Dr. Anil Chowdhary, Lal Kothi Hospital, Jaipur 

Key Benefits

Ready to Transform Your Hospital’s Operations? 

Book a free demo of Ektos Health HMIS — and see AI prescription generation, voice-to-text, lab management, and ABDM compliance in action. Deployed and live within days. 
info@ektos-health.comektos-health.comPeritos Solutions | www.peritossolutions.com

About the Client

Bayleys Real Estate is New Zealand’s largest full-service real estate agency, operating across residential, commercial, rural, and property management services across New Zealand and Australia. With over 150 property management agents generating 1,800 to 2,000 rental appraisals per month, the manual documentation workload was significant, inconsistent across regions, and a major drain on agent productivity.

The leadership team — including the National Director and Financial Director — identified AI-driven automation as the strategic priority, with a clear mandate: cut appraisal time, improve consistency and accuracy, and free agents to focus on client relationships rather than documentation.

Project Background

Bayleys was already running a modern AWS-hosted property search web application built by Peritos Solutions — giving agents a cloud-based way to search and manage property records from any device. The next phase was to add AI intelligence on top of that foundation.

Each rental appraisal required an agent to manually research comparable properties, write a property description, calculate a market rent range, pull together agent branding and contact details, and format everything into a professional PDF. This 30 to 45 minute process, repeated thousands of times per month, was the single biggest time drain in the property management workflow.

Peritos Solutions was engaged to design and build the AI upgrade: automating the appraisal document, training a mathematical valuation model on historical data, deploying an AI copywriting engine for property descriptions, and building the AskKen AI chatbot for on-demand knowledge access.

Requirements

Scope & Feature List

Module 1 — Automated Rental Appraisal Report 

A property manager enters or confirms the property address. The system generates a complete, professionally formatted, Bayleys-branded rental appraisal report automatically: 

  • Property address triggers automatic data retrieval from CoreLogic API — bedrooms, bathrooms, floor area, carparks, property type 
  • Property image pulled from Bayleys listing API if the property is currently listed for sale — ensuring brand-consistent presentation 
  • Agent profile, photo, and contact details auto-populated via Office 365 SSO — the system knows who is logged in 
  • EMV rental range calculated and displayed — powered by the Random Forest valuation model (see Module 2) 
  • AI-generated property description inserted automatically — see Module 3 
  • Rental details section populated: current market value range, property type, report date
  • Back-page advert customisable by region or individual agent — uploaded via application settings 
  • Multiple property types supported — houses, apartments, units, minor dwellings, home and income 
  • Manual entry available for new-build and off-plan properties — feeds directly into the Bayleys Data Lake 
  • Output: professional PDF matching Bayleys national branding guidelines — consistent across all 150+ agents and all regions 

Module 2 — EMV (Estimated Market Rent) Valuation Engine 

The EMV engine is the mathematical centrepiece of the platform. Peritos evaluated multiple modelling approaches and selected a Random Forest Regression model, trained on data from the Bayleys Data Lake, as the optimal fit for rental valuation. 

The model was refined through three stages to achieve ±3% accuracy against experienced agent assessments: 

  • Initial model (basic property features only) — margin of error exceeded ±15%, predictions unreliable at extremes 
  • After geometric mean aggregation of decision tree outputs — outlier influence dampened, error narrowed to ±8–10% 
  • Final model with full feature set, Pearson R local correlation between capital value and existing rental price weighting, and property subjective rating (manual label by agents in the training data) — ±3% accuracy nationwide

Key technical components: 

  • Bootstrap aggregation (bagging) — many de-correlated decision trees built from different samples of the Data Lake, predictions averaged to reduce variance 
  • Geometric mean aggregation — dampens the influence of extreme outliers across individual tree predictions, particularly effective in large heterogeneous suburbs such as Remuera 
  • Suburb and street-level Pearson R correlation analysis — calculates the relationship between property Capital Value (CV) and actual rent for every suburb in New Zealand, dynamically weighting CV’s influence in the regression per location (e.g. R=0.93 in Epsom = CV carries very high weight; lower R suburbs rely more on physical features) 
  • Resolves the classic valuation dilemma of ‘best house on the worst street vs. worst house on the best street’ by varying feature weights by suburb rather than applying a national average 
  • New property data entered manually by agents (for off-plan/new builds) feeds back into the Data Lake — improving model accuracy over time, especially in newly developed suburbs 

Module 3 — AI-Generated Property Descriptions 

Property descriptions are generated automatically by an AI model trained specifically on Bayleys‘ historical appraisal data. Rather than producing generic text, the model writes in the tone and style of an experienced Bayleys property manager. 

  • The AI engine analyses the property address to identify local amenities, school zones, and suburb characteristics — incorporating these naturally into the description 
  • Property type, bedroom count, bathroom count, floor area, and key features are woven into the description based on data retrieved from CoreLogic
  • Output is inserted directly into the appraisal report — agents can review and edit if needed, but in most cases the AI description is used as-is 
  • New property data entered manually also contributes to future description training — the system improves with every appraisal generated 

Example output: ‘Large 5-bedroom, 3-bathroom home with triple car garaging, heated swimming pool and spa pool… Located close to Kohimarama Beach and top schools — Kohimarama School and Selwyn College. Whole house (5 beds): $1,650–$1,800pw’ 

Module 4 — AskKen AI Legal & Market Chatbot 

AskKen AI is a purpose-built real estate intelligence assistant, powered by OpenAI GPT-4O with a custom AI engine layered on top to control output quality and data sourcing. It is accessible via mobile and desktop and requires zero training to use. 

Architecture — Retrieval-Augmented Generation (RAG): 

  • Proprietary documents — the Residential Tenancies Act, related legislation, tenancy tribunal cases, suburb profiles, maintenance cost databases, and vendor checklists — are ingested, indexed in a vector store, and retrieved at query time 
  • 54,000 tenancy tribunal cases ingested — giving the model deep contextual and interpretive legal capability, not just raw legislative text 
  • GPT-4O  responses are grounded in Bayleys‘ controlled data sources — not the open internet; external web search is disabled by default 
  • Filtering and guardrails layer reviews all outputs against compliance checklists, strips unsupported assertions, and flags uncertain answers for human review 
  • Knowledge management layer tracks document version and effective date — new legislation or tribunal decisions can be re-indexed automatically 

AskKen AI handles queries across: 

  • Residential Tenancies Act legislation and compliance obligations 
  • Tenancy tribunal precedents and case outcomes — with specific case references 
  • Comparable property market analysis and recent rental data 
  • Suburb profiles, school zones, and local amenity information 
  • Rental market reports and suburb-level rent trend analysis 
  • Maintenance cost estimates, depreciation calculations, and IRD schedules 
  • Yield calculations — factoring in rent, management fees, rates, insurance, vacancy, and mortgage interest 
  • Checklists — fixed-term tenancy breaks, property inspections, new tenant onboarding 

Solution Architecture

Technology & Architecture

The AI platform is built cloud-native on AWS, with OpenAI -GPT-4O powering the language generation layer. All AI responses are grounded in controlled proprietary data — not the open internet.

Layer 

Technology / Service 

Role 

Cloud 

AWS (primary) 

Serverless infrastructure, Lambda functions, API Gateway, DynamoDB, S3, SNS, CloudWatch 

AI / LLM 

OpenAI GPT-4O

Base generative and reasoning capability for AskKen AI and property description generation 

AI Orchestration 

AWS Lambda + Node.js/Python 

Microservice orchestrating: user input → RAG retrieval → LLM call → answer filtering → UI response 

EMV Model 

Random Forest Regression 

Trained on Bayleys Data Lake — bootstrap aggregation, geometric mean, Pearson R suburb weighting — ±3% accuracy 

RAG Layer 

Vector store + proprietary docs 

54,000 tribunal cases + legislation + market data indexed — retrieved at query time to ground LLM responses 

Data Layer 

Bayleys Data Lake 

Historical appraisals, rental data, property records, new-build manual entries — feeds EMV model training 

Property Data 

CoreLogic API 

Bedrooms, bathrooms, floor area, carparks — retrieved automatically on address entry 

Images 

Bayleys Listings API 

Current listing photos pulled into appraisal report automatically for listed properties 

Auth 

Office 365 SSO 

Single sign-on — agent profile, photo, and contact details auto-populated in every report 

Security 

AWS WAF + token auth 

Web Application Firewall + token-based API endpoints — all property and AI data secured 

Guardrails 

Rules engine 

LLM outputs reviewed against compliance checklists; unsupported assertions stripped before reaching the user 

Application Images

Implementation Approach

The project kicked off in October 2024 and delivered an MVP into UAT for Auckland agents by early March 2025 — on schedule and within budget:

Phase 1 — Discovery 

Requirements workshops, CoreLogic and Bayleys API integration scoping, data lake assessment, RAG document inventory, architecture design on AWS 

Phase 2 — EMV Model 

Initial Random Forest baseline, geometric mean refinement, Pearson R suburb-level correlation weighting — iterated until ±3% accuracy achieved nationally 

Phase 3 — Appraisal Engine 

Automated report generation, CoreLogic integration, Bayleys API image pull, Office 365 SSO, multiple property type handling, 90-day expiry logic 

Phase 4 — AI Descriptions 

GPT-4O fine-tuning on historical Bayleys appraisals, sentiment analysis training, school zone and amenity integration, review and editing workflow 

Phase 5 — AskKen AI 

RAG pipeline build, document ingestion (legislation + 54,000 tribunal cases), guardrails layer, GPT-4O prompt engineering, chat interface on mobile and desktop 

Phase 6 — UAT & Go-Live 

UAT with Auckland property managers, regional disclaimer configuration, performance tuning, cost optimisation on AWS, go-live and hypercare 

Challenges & Solutions

 

EMV accuracy from ±15% to ±3% 

Initial models had very high variance. Three refinement stages — basic features, geometric mean aggregation, then Pearson R local correlation weighting — drove accuracy to ±3% nationwide, matching experienced agent assessments. 

AI descriptions that sound human 

Generic AI property descriptions were immediately identifiable and not fit for purpose. Peritos trained the model on thousands of real historical Bayleys appraisals using sentiment analysis — the output now matches Bayleys‘ own writing style. 

New-build data gaps 

Off-plan and new-build properties lack CoreLogic data. A manual entry flow was built, with all entered data feeding back into the Bayleys Data Lake to improve future EMV accuracy in newly developed areas. 

RAG grounding vs. hallucination 

LLMs are prone to confident but incorrect legal answers. All AskKen AI responses are grounded in indexed proprietary documents with a guardrails layer that strips unsupported assertions before they reach the agent. 

54,000 tribunal case ingestion 

Ingesting and indexing this volume of case law required careful document parsing, metadata tagging by jurisdiction and date, and chunking strategy to ensure relevant cases are retrieved at query time. 

First AI project — high visibility 

As Bayleys‘ inaugural AI initiative this project set the benchmark for all future AI investment. Peritos delivered on time, within budget, with quantifiable ROI — and secured a $20,000 Microsoft contribution recognising it as an industry first. 

Financial Impact

Appraisal time reduction 

From 30–45 minutes per appraisal to under 1 minute — saving approximately 40 minutes per appraisal 

Monthly appraisal volume 

1,800–2,000 appraisals per month (60,000–67,000 annually) 

Saving per appraisal 

NZD $33.33 per appraisal (at NZD $50/hr agent cost) 

Annual appraisal saving 

NZD $720,000–$800,000 per year from appraisal automation alone 

AskKen AI saving 

10 hrs manual research saved per agent/month × 150 agents × NZD $50/hr = NZD $75,000/month 

Annual AskKen saving 

NZD $900,000+ per year in research, compliance checking, and legal advisory time 

Total annual savings 

NZD ~$1.75 million per year — combined appraisal automation + AskKen AI 

Key Benefits

Support & Next Steps

Peritos Solutions provided post-go-live hypercare covering AI output quality monitoring, EMV model tuning, and integration stability. Automated pipelines re-index the RAG knowledge base as new tribunal decisions and legislation changes arrive — the system improves continuously without manual intervention.

Planned next phase:

Looking for a Similar AI Property or Real Estate Technology Solution?

Peritos Solutions specialises in AI-powered applications, machine learning valuation models, RAG chatbots, and cloud-native platforms on AWS — across New Zealand, Australia, USA, and India.

Get in touch: info@peritosolutions.com | +64-212579909 | www.peritossolutions.com

About the Client

Wine-Searcher is the world’s most visited wine marketplace and price comparison platform, connecting consumers, collectors, and trade buyers with wine retailers across more than 180 countries. With over 8 million distinct wine listings and millions of monthly active users, Wine-Searcher depends on accurate, consistent, and richly detailed product data to power search, recommendations, and pricing intelligence.

The editorial and data team had long relied on a combination of contributor-submitted data and manual verification — a process that created bottlenecks as the catalogue scaled and left the majority of listings without historical or regional context. The leadership team identified AI-driven automation as the path to consistent data quality at scale, with a clear brief: recognise the label, extract the facts, and tell the story.

Project Background

Wine-Searcher already operated on AWS — with a modern data pipeline handling listing ingestion, search indexing, and pricing data. The next phase was to add AI intelligence on top of that foundation: automatically reading wine labels, extracting structured metadata, and generating contextualised provenance summaries that elevate the consumer experience.

Each label presents a unique challenge: typography varies wildly across producers and countries, regulatory text overlaps with brand content, multi-language labels require translation before parsing, and the historical context for a given wine may span decades of regional winemaking history. No off-the-shelf model could handle the full pipeline — a purpose-built solution on SageMaker was required.

Peritos Solutions was engaged to architect and build the end-to-end AI pipeline: image ingestion, a custom label recognition model, OCR-based structured extraction, a RAG-grounded Bedrock summarisation layer, and integration back into Wine-Searcher’s live listings platform.

Requirements

Scope & Feature List

Module 1 — Label Recognition Engine (Amazon Rekognition + SageMaker)

A wine label image is submitted via API or batch upload. A custom Convolutional Neural Network (CNN) model — trained on a corpus of 500,000+ labelled wine label images and hosted on a SageMaker real-time endpoint — identifies the winery, wine name, appellation, grape variety, vintage year, and alcohol content from the visual label. The model handles skewed angles, low-resolution mobile uploads, partial label occlusion, and multi-label bottle formats (front + back). A confidence score is assigned to each detected field. Fields below the confidence threshold are flagged for the human-in-the-loop review queue rather than written directly to the listing. The SageMaker endpoint is versioned — A/B testing allows new model versions to serve a percentage of traffic before full promotion, with automatic rollback if accuracy degrades. Model training used SageMaker Automatic Model Tuning to optimise hyperparameters across CNN architecture variants.

Module 2 — OCR Structured Extraction (Amazon Textract)

Following visual label recognition, Amazon Textract performs OCR on the full label image — extracting all printed text with bounding box coordinates and confidence scores. A post-processing Lambda function, running in Node.js, applies a Wine-Searcher–specific parsing schema to the raw Textract output: regulatory text is separated from brand content; dates are resolved using regional wine labelling conventions; certifications (organic, biodynamic, AOC, DOC, DOCG) are identified by keyword matching against a controlled vocabulary; and alcohol percentages are extracted from the mandatory legal text band. Multi-language labels are detected by AWS Comprehend and routed through a translation Lambda before parsing. The structured output is validated against the Wine-Searcher listing schema before being written to DynamoDB and pushed to the listings enrichment queue.

Module 3 — Historical Provenance Summary (Amazon Bedrock + RAG)

Once structured label data is confirmed, a RAG pipeline generates a 150 to 250 word historical provenance summary for each wine. Proprietary knowledge base documents — covering wine regions, appellations, major châteaux and producers, vintage quality guides, and critical scoring context — are ingested, chunked, and indexed in an Amazon OpenSearch vector store. At query time, the structured label data (winery, appellation, vintage, grape variety) is used to retrieve the most relevant knowledge base passages. These passages, together with the structured label metadata, are passed to Amazon Bedrock (Claude) as context. The model generates a provenance summary grounded entirely in the retrieved documents — hallucination guardrails strip any claim not traceable to an indexed source. Example output: ‘Château Margaux 2015 is a First Growth Bordeaux from the Margaux appellation of the Médoc. The 2015 vintage was widely acclaimed as one of the finest of the decade — warm, dry conditions produced exceptional concentration, with the Wine Advocate awarding 100 points. Produced primarily from Cabernet Sauvignon with Merlot, Petit Verdot, and Cabernet Franc, this wine is expected to peak between 2030 and 2060.’

Module 4 — Human-in-the-Loop Review & Continuous Learning

Low-confidence label predictions and any Bedrock summary flagged by the guardrails layer are routed to a Wine-Searcher editorial review queue — a lightweight web UI where specialists can confirm, correct, or reject AI outputs before publication. All validated corrections are written back to the SageMaker feature store. An automated retraining trigger fires when the volume of corrections for a given producer category exceeds a configurable threshold — a SageMaker Pipeline runs the incremental training job on the updated feature set, evaluates accuracy against a holdout set, and promotes the new model version to the endpoint if performance improves. This closed loop ensures the model improves continuously as Wine-Searcher’s catalogue grows and new producers are encountered.

Solution Architecture

The platform is built cloud-native on AWS. All AI inference runs within the Wine-Searcher AWS account — no label image data leaves the AWS environment. The architecture is fully serverless outside of the SageMaker inference endpoint, with pay-per-request Lambda functions handling orchestration, parsing, validation, and API integration.

AWS architecture — API Gateway → Lambda orchestration → SageMaker endpoint (label recognition) → Textract OCR → Bedrock RAG summarisation → DynamoDB enrichment store → listings API push → CloudWatch + Model Monitor

Technology & Architecture

Layer 

Technology / Service 

Role 

Cloud 

AWS (primary) 

Serverless infrastructure — Lambda, API Gateway, DynamoDB, S3, SNS, CloudWatch 

Label Recognition 

Amazon Rekognition + SageMaker 

Custom CNN model trained on wine label images — winery, vintage, appellation, grape variety detection 

OCR Extraction 

Amazon Textract 

Extracts structured text from label scans — wine name, producer, region, alcohol %, certifications 

AI Summaries 

Amazon Bedrock (Claude) 

Generates historical provenance summaries from extracted structured data + retrieved knowledge base context 

ML Training 

Amazon SageMaker 

Model training, versioning, A/B endpoint testing, feature store, and batch inference pipeline 

RAG Layer 

SageMaker + OpenSearch 

Wine knowledge base (regions, châteaux, vintages, critics) indexed and retrieved at query time to ground Bedrock summaries 

Data Pipeline 

AWS Glue + S3 Data Lake 

Label image ingestion, transformation, enrichment, and storage — feeds training pipeline and inference cache 

Listings API 

Wine-Searcher Internal API 

Pushes enriched label metadata and AI summaries back into live product listings 

Auth & Security 

AWS Cognito + WAF 

Secure API access — rate limiting, token auth, IP-based WAF rules protecting the inference endpoint 

Monitoring 

CloudWatch + SageMaker Model Monitor 

Data drift detection, prediction quality tracking, automated retraining triggers when accuracy degrades 

Implementation Approach

Phase 1 — Discovery 

Requirements workshops, label image corpus assessment, Data Lake scoping, AWS architecture design, SageMaker feasibility study, RAG knowledge base inventory 

Phase 2 — Data Pipeline 

Label image ingestion pipeline via AWS Glue, S3 staging and normalisation, Textract OCR extraction, structured JSON output schema definition 

Phase 3 — SageMaker Model 

CNN training on labelled wine image dataset, SageMaker hyperparameter tuning, A/B endpoint deployment, accuracy benchmarking against manual expert classification 

Phase 4 — RAG & Bedrock 

Wine knowledge base build (regions, châteaux, vintages, critics), OpenSearch indexing, Bedrock (Claude) prompt engineering, hallucination guardrails, summary quality evaluation 

Phase 5 — API & Listings Integration 

Wine-Searcher internal API integration, enriched metadata push back to listings, search index update pipeline, caching layer for inference results 

Phase 6 — UAT & Go-Live 

Accuracy UAT with Wine-Searcher editorial team, performance and cost tuning on AWS, SageMaker Model Monitor activation, go-live across 8M+ listings, hypercare 

Challenges & Solutions

Low label image quality 

Many label images in the marketplace were low-resolution, tilted, or partially obscured. SageMaker training used augmentation pipelines (rotation, blur, contrast variation) to make the model robust across real-world upload quality. 

Multi-language label text 

Wine labels appear in French, Italian, German, Spanish, Portuguese, and English. Textract was supplemented with a language-detection Lambda routing non-English extractions through a translation layer before structured parsing. 

Vintage ambiguity 

Older bottles often carry multiple dates (bottling, release, vintage). A custom post-processing rule engine was built to resolve date ambiguity using regional norms (e.g. Bordeaux vs. New World labelling conventions). 

Hallucination in historical summaries 

Early Bedrock outputs included plausible-sounding but fabricated château histories. A RAG grounding layer with a curated wine knowledge base (regions, producers, vintages, critics’ scores) was introduced — all summaries now cite indexed sources. 

Cold-start for rare producers 

Small-production wineries had few training images. A human-in-the-loop review queue was built so that low-confidence predictions are flagged for expert validation and fed back into the SageMaker feature store for incremental retraining. 

Scale of batch enrichment 

Retroactively enriching 8M+ existing listings required a SageMaker batch transform job spread across spot instances — completing in 72 hours at a fraction of on-demand cost, with checkpointing to handle interruptions gracefully. 

Financial Impact

Time saved per label 

15–20 minutes manual research and transcription reduced to under 2 seconds automated 

Monthly new label volume 

60,000+ new SKUs ingested monthly across contributor uploads and editorial additions 

Saving per label 

USD $12–$16 per label at USD $50/hr data team cost (conservative estimate) 

Annual data team saving 

USD $8.6M–$11.5M per year from automated label enrichment alone 

Historical catalogue enrichment 

8M+ existing listings retroactively enriched — equivalent of 1,600+ person-years of manual work completed in 72 hours via batch transform 

Consumer engagement uplift 

Listings with provenance summaries show 34% higher click-through and 22% higher add-to-cart rates vs unenriched listings 

Total annual value 

USD $10M+ combined — data team savings + engagement-driven revenue uplift 

Key Benefits

Support & Next Steps

Peritos Solutions provided post-go-live hypercare covering model accuracy monitoring, Bedrock summary quality review, and AWS infrastructure optimisation. Automated pipelines re-index the wine knowledge base as new vintage guides, regional legislation, and critic publications are added — the RAG layer stays current without manual intervention.

Planned next phase:

Looking for a Similar AI / ML Platform on AWS?

Peritos Solutions specialises in AI-powered applications, machine learning pipelines, RAG chatbots, and cloud-native platforms on AWS — across New Zealand, Australia, USA, and India.

Get in touch: info@peritosolutions.com | +64-212579909 | www.peritossolutions.com

About HR Mind

HR Mind is a global resourcing company founded in 2010 that offers end-to-end recruitment and HR solutions to organisations in domestic and international markets. With deep expertise across multinational and local businesses, HR Mind provides tailored talent acquisition solutions across eight industry verticals: Infrastructure/EPC, Internet/E-Commerce, Renewable Energy, Automotive, FMCG, Information Technology, Healthcare, and Industrial/Manufacturing.

The company serves four distinct client segments — MNCs, SMEs, Startups, and Joint Ventures — placing candidates at senior and middle management levels as well as running global talent acquisition mandates. Beyond recruitment, HR Mind also provides HR solutions including payroll outsourcing, salary analysis, and candidate assessment services.

With a large and growing pipeline of job openings across multiple industries and client types, HR Mind faced a scalability challenge: the volume of resumes received per job opening was growing faster than their recruiter team could manually process. A technology solution was needed to automate the screening step without sacrificing the quality and accuracy that HR Mind’s clients expected.

The Problem

Recruitment at scale is fundamentally a data matching problem — but one that had been solved manually for decades. HR Mind identified several specific pain points driving the need for an AI solution:

Scope & Feature List

Module 1 — Job Description (JD) Parsing & Keyword Extraction 

The platform begins every recruitment workflow by intelligently parsing the Job Description to extract the structured requirements the AI will match resumes against: 

  • NLP-powered JD parsing — the system reads the full Job Description and extracts structured data: required skills, preferred skills, qualifications, years of experience, seniority level, industry context, and location requirements 
  • Keyword taxonomy — extracted keywords are classified into categories: hard skills (technical), soft skills (behavioural), certifications, tools/technologies, domain knowledge, and education requirements 
  • Semantic keyword expansion — the AI expands the keyword list with synonyms and related terms (e.g. ‘Project Manager’ → also matches ‘Programme Manager’, ‘PM’, ‘Delivery Lead’) — preventing missed matches due to vocabulary differences 
  • Weighted keyword scoring — keywords are automatically weighted by importance based on their position and frequency in the JD (e.g. ‘Required’ skills weighted higher than ‘Preferred’) 
  • JD templates — HR Mind recruiters can save and reuse JD keyword profiles for recurring role types, speeding up future searches 
  • Manual keyword adjustment — recruiters can add, remove, or re-weight keywords before running the matching engine — giving full control over the AI’s scoring criteria 

Module 2 — Resume Ingestion & Classification 

Resumes are ingested in bulk from multiple sources and automatically classified before matching begins: 

  • Multi-format ingestion — resumes accepted in PDF, Word (.doc/.docx), and plain text formats — uploaded in bulk or individually 
  • AWS S3 storage — all resumes stored securely in S3 with unique identifiers, version control, and encrypted storage at rest 
  • Automated text extraction — AWS Textract used for PDF parsing (including scanned documents), ensuring structured text is available for NLP processing regardless of resume format 
  • ML Resume Classification — each resume is automatically classified into a role category (e.g. Engineering, Sales, Finance, Technology, Operations, Healthcare) using a trained classification model — enabling filtering before matching 
  • Structured data extraction — the AI parses each resume to extract: candidate name, contact details, current and previous job titles, companies, employment tenure, education level and institution, skills and technologies, certifications, and total years of experience 
  • Duplicate detection — the system identifies duplicate or near-duplicate resumes (same candidate submitted multiple times or with minor edits) and flags them for recruiter review 
  • Candidate database enrichment — parsed resume data is stored in DynamoDB against the candidate profile, building a searchable, structured talent pool over time 

Module 3 — JD-to-Resume Keyword Matching Engine 

The core intelligence of the platform — matching each resume against the parsed JD using multi-layer keyword and semantic analysis: 

  • Exact keyword matching — each resume is scored for exact matches against the extracted JD keyword list (e.g. ‘Python’, ‘AWS’, ‘PMP certification’) 
  • Semantic similarity matching — NLP embeddings identify candidates whose resumes are semantically similar to the JD even without exact keyword matches — reducing false negatives from vocabulary differences 
  • Weighted match scoring — the overall match score is calculated as a weighted sum across keyword categories: hard skills carry the highest weight, followed by experience requirements, then education, then soft skills and location 
  • Skills gap analysis — for each resume, the platform identifies which JD keywords are matched (green), partially matched (amber), and missing (red) — giving the recruiter an instant gap view 
  • Experience relevance scoring — the AI assesses not just whether the candidate has a skill, but how recently and for how long they have used it — recent and sustained experience scores higher 
  • Industry context matching — candidates from the same or closely related industries are scored higher for roles where domain knowledge matters 
  • Multi-JD matching — a single resume pool can be simultaneously matched against multiple open job descriptions — surfacing the best fit role for each candidate in the database 

Module 4 — AI Candidate Ranking & Shortlist Generation 

The ranking engine takes match scores and produces a prioritised, explainable shortlist for the recruiter: 

  • Composite ranking score — each candidate receives a final score (0–100) combining: JD keyword match score, experience relevance, education fit, industry match, and completeness of resume data 
  • Ranked shortlist — candidates are listed in ranked order from highest to lowest match score — recruiter sees the best fits at the top immediately 
  • Score breakdown per candidate — each candidate’s ranking is accompanied by a transparent breakdown: which keywords they matched, which they missed, their experience alignment, and their overall suitability percentage 
  • Configurable shortlist size — recruiters can set the target shortlist size (e.g. top 10, top 20) — the system automatically surfaces the right number 
  • Threshold filtering — recruiters can set a minimum match score threshold — only candidates above the threshold appear in the shortlist, filtering out weak matches automatically 
  • Bias mitigation — ranking is based entirely on skills, experience, and JD alignment — personal information that could introduce bias (gender, age, ethnicity) is not factored into the scoring 
  • Re-ranking on keyword edit — if the recruiter adjusts JD keywords or weights after an initial run, the ranking updates in real time without needing to re-process all resumes 
  • Export to ATS — ranked shortlists exportable as CSV or integrated directly into HR Mind’s applicant tracking workflow for client presentation 

Module 5 — Recruiter Intelligence Dashboard 

centralised interface giving HR Mind recruiters full visibility of the pipeline, AI results, and candidate insights: 

  • Live pipeline view — all active job openings displayed with application counts, shortlist progress, and time-to-shortlist metrics 
  • Candidate comparison view — side-by-side comparison of top-ranked candidates for a single JD — skills match, experience, score breakdown all visible simultaneously 
  • Search and filter — recruiter can search the entire candidate database by keyword, classification, experience range, location, or score — returning results in seconds 
  • Bulk resume upload and processing — drag-and-drop interface for uploading 100+ resumes at once — processing triggered automatically via AWS Lambda 
  • Notification engine — automated alerts when a new high-match candidate is added to an active job opening — recruiter notified without needing to check manually 
  • Client-ready shortlist export — one-click export of ranked shortlists in formatted PDF or CSV for client presentation — with match scores and candidate summaries included 
  • Analytics dashboard — volume of resumes processed, average match scores by role, time-to-shortlist trends, and recruiter productivity metrics 

Solution Architecture

The platform is built entirely on AWS serverless architecture — no servers to manage, automatic scaling with application volume, and pay-per-use pricing that keeps costs proportional to usage:

Layer 

AWS Service / Technology 

Role in HR Mind Platform 

Ingestion 

AWS S3 

Secure storage for all uploaded resumes (PDF, Word, text) — triggers Lambda functions on upload for automatic processing 

Document Parse 

AWS Textract 

Extracts structured text from PDF resumes including scanned documents — feeds clean text to the NLP pipeline 

NLP / AI 

AWS SageMaker + Lambda 

ML models for resume classification, keyword extraction, semantic matching, and candidate scoring — deployed as serverless endpoints 

Text Embeddings 

Amazon Bedrock

NLP entity extraction and semantic understanding — identifies skills, job titles, organisations, and dates from unstructured resume text 

Matching Engine 

AWS Lambda 

Serverless function orchestrating JD parsing, keyword extraction, resume-to-JD matching, and score calculation — triggered per job opening 

API Layer 

Lambda Direct URl

RESTful API endpoints for recruiter dashboard, resume upload, JD submission, shortlist retrieval, and candidate search 

Data Store 

SQL DB

NoSQL database storing structured candidate profiles, match scores, JD keyword profiles, shortlists, and recruiter actions 

Auth 

Auth0

Recruiter and admin authentication — role-based access to platform features and candidate data 

Monitoring 

AWS CloudWatch 

Performance monitoring, error alerting, Lambda invocation metrics, and cost tracking dashboards 

Security 

AWS WAF + IAM 

Web Application Firewall protecting API endpoints; IAM roles enforcing least-privilege access to all AWS resources 

Storage Archive 

S3 Lifecycle Policies 

Automatic archival of old resumes to S3 Glacier — cost management for long-term candidate data retention 

End-to-End Workflow

Step 1 — JD Upload 

Recruiter uploads or pastes the Job Description into the platform. The NLP engine parses it and extracts a structured keyword profile with weighted categories. 

Step 2 — Keyword Review 

Recruiter reviews the extracted keywords, adjusts weights if needed, adds custom terms, and confirms the JD profile. The AI is now primed for matching. 

Step 3 — Resume Upload 

Resumes uploaded in bulk (PDF/Word) via drag-and-drop or sourced from the existing candidate database. AWS S3 stores each file and triggers automatic processing. 

Step 4 — Classification 

Each resume is classified into a role category by the ML model. The NLP pipeline extracts structured data: skills, experience, education, job titles, and tenure. 

Step 5 — Keyword Matching 

Each parsed resume is matched against the JD keyword profile. Exact and semantic matches are scored. Skills gaps are identified. An overall match score (0–100) is calculated. 

Step 6 — Review & Export 

Recruiter reviews the ranked shortlist, filters by threshold if needed, compares top candidates side-by-side, and exports the final shortlist for client presentation. 

Step 7 — Database Update 

All candidate profiles and scores are stored in DynamoDB. As new JDs are posted, stored candidates are automatically re-evaluated — making the talent pool smarter over time. 

Challenges & Solutions

Resume format diversity 

Resumes arrive in PDF, Word, scanned images, and plain text. AWS Textract handles all formats including scanned PDFs — ensuring no resume is unreadable by the system. 

Vocabulary mismatch 

Candidates and JDs use different terms for the same skill. Semantic NLP embeddings via Amazon Comprehend identify conceptually similar terms, preventing valid candidates from being missed. 

Explaining AI ranking to clients 

Clients needed to understand why a candidate ranked where they did. The platform generates a transparent score breakdown per candidate — every ranking decision is explainable. 

Scaling for peak volume 

Recruitment campaigns can generate hundreds of applications overnight. AWS Lambda’s serverless model scales to process any number of resumes in parallel with no infrastructure provisioning required. 

Bias in recruitment 

Automated ranking risks encoding historical bias. The scoring model is built on skills and experience alignment only — personal identifiers are excluded from all scoring calculations. 

Stale candidate database 

Stored resumes quickly became outdated if not re-evaluated. The platform automatically re-scores all existing candidates against each new JD — keeping the talent pool continuously active. 

Multi-industry keyword sets 

HR Mind operates across 8 industry verticals — each with distinct terminology. The keyword taxonomy is industry-aware, with sector-specific skill libraries built for each vertical. 

Key Benefits

Implementation Approach

Peritos Solutions delivered the platform in phases, with HR Mind recruiters involved at every stage to validate AI output quality against their own domain expertise:

Phase 1 — Discovery 

Recruitment workflow analysis, JD structure review across 8 industry verticals, resume format audit, keyword taxonomy design, AWS architecture design 

Phase 2 — Data Pipeline 

AWS S3 ingestion setup, Textract PDF parsing pipeline, resume text extraction and cleaning, DynamoDB schema design for candidate and JD profiles 

Phase 3 — NLP & ML Models 

Resume classification model training, keyword extraction pipeline (Amazon Comprehend + custom), semantic embedding layer for vocabulary mismatch handling 

Phase 4 — Matching Engine 

JD keyword extraction, weighted scoring algorithm, semantic matching layer, composite ranking engine, skills gap analysis module 

Phase 5 — Dashboard 

Recruiter interface build — React.js frontend, API Gateway integration, bulk upload UI, ranked shortlist view, candidate comparison, export functionality 

Phase 6 — Testing & Tuning 

Recruiter validation of AI shortlists vs manual shortlists — model tuning to align AI output with HR Mind domain expertise; bias audit 

Phase 7 — Go-Live 

Production deployment on AWS, CloudWatch monitoring setup, recruiter training, hypercare support period, ongoing model improvement pipeline 

Support & Next Steps

Peritos Solutions manages the AWS environment as an ongoing cloud partner — monitoring costs, model performance, and system stability. The AI models improve continuously as more resumes and JDs are processed through the platform.

Planned next phase enhancements:

Looking for a Similar AI Recruitment or HR Technology Solution?

Peritos Solutions builds AI-powered recruitment platforms, NLP screening engines, and AWS-native HR tech solutions for staffing firms, enterprises, and HR SaaS companies across New Zealand, Australia, India, and USA.

Get in touch: info@peritosolutions.com | +64-212579909 | www.peritossolutions.com

About the Client

Yorker Limited is a New Zealand-based sports-technology company founded to solve one of cricket’s most persistent problems — the inability of grassroots and amateur players to properly track and manage their bowling workloads. Operating from Auckland, Yorker provides digital tools tailored specifically for cricket players, with a focus on reducing overtraining risk and supporting sustainable performance improvement.

Key business drivers included:

Project Background

Cricket bowlers — particularly at grassroots, club, and academy levels — lack access to the sophisticated workload management tools available to professional teams. Without structured tracking, bowlers frequently over-bowl during net sessions, leading to stress fractures, shoulder injuries, and burnout. Coaches rely on manual scorebooks and memory rather than data. Yorker was conceived to democratise performance science for cricket.

Peritos Solutions was engaged to architect and build the full technology stack: a mobile app (Android and iOS), a serverless AWS backend, and an AI layer powered by AWS Bedrock. The goal was to create an intelligent assistant that could answer training questions, suggest improvement pathways, and flag injury risks — all from within the app.

Requirements

Solution Overview

The Yorker platform is built on a fully serverless AWS architecture, with AWS Bedrock at the heart of its AI capabilities. The solution consists of four layers: the mobile application, the API and compute layer, the AI intelligence layer, and the data and notification layer. 

 

Technology & Architecture

Mobile Platform 

Android (Google Play) · iOS 

AI Engine 

AWS Bedrock – Foundation Model (LLM) 

Compute 

AWS Lambda (Serverless Functions) 

API Layer 

Amazon API Gateway (REST & WebSocket) 

Authentication 

Amazon Cognito (User Pools & Identity Pools) 

Database 

Amazon DynamoDB (NoSQL – Player & Session Data) 

Storage 

Amazon S3 (Training Data, Media Assets) 

Notifications 

Amazon SNS (Push Notifications to Mobile) 

Observability 

Amazon CloudWatch (Logs, Metrics, Alarms) 

AI Prompt Design 

Peritos Solutions Custom Prompt Engineering 

Architecture Style 

100% Serverless – No Managed Infrastructure 

Region 

AWS ap-southeast-2 (Sydney) 

Scope & Feature List

Bowling Load Tracker 

Players log each net session by delivery type (pace, swing, spin), intensity, and volume. The app calculates cumulative weekly and monthly loads, tracking acute-to-chronic workload ratios used in professional sports science. 

AI Speed Improvement Suggestions 

AWS Bedrock analyses the player’s logged pace data and bowler profile to generate personalised speed improvement plans — including strength drills, run-up adjustments, and release technique recommendations. 

Injury Risk Intelligence 

The AI monitors bowling load trends and compares them against established thresholds. When a risk is detected, Yorker proactively alerts the player and coach via push notification, with a Bedrock-generated explanation and recommended rest plan. 

Natural Language Q&A 

Players and coaches can type or speak questions such as ‘How do I improve my pace?’ or ‘Is my workload safe this week?’ AWS Bedrock answers in plain language, citing the player’s own data where relevant. 

Coach Dashboard 

Team coaches can view aggregated load data across all squad bowlers, receive AI-generated risk flags, and download session reports — giving coaching staff the oversight previously only available at elite level. 

Push Notification Reminders 

Amazon SNS delivers AI-triggered nudges and reminders: log your session, rest day recommended, review your weekly plan — keeping players engaged and safe between sessions. 

AWS Bedrock Integration Detail

AWS Bedrock is the intelligence layer of Yorker. Lambda functions invoke Bedrock’s foundation model API, passing structured player data and a custom-engineered prompt that contextualises the AI response to cricket-specific training science. Peritos Solutions developed a prompt library covering speed improvement, load management, injury prevention, and performance analytics — each prompt template retrieves the relevant player context from DynamoDB before calling Bedrock.

The Bedrock integration handles four primary AI use cases:

Implementation Approach

Peritos Solutions followed an iterative, sprint-based delivery model across 12 weeks:

Challenges & Solutions

Challenge: AI Prompt Accuracy 

Initial Bedrock responses lacked cricket-specific terminology and context. Peritos developed a structured prompt library with domain-specific context injection, significantly improving response relevance for training and injury queries. 

Challenge: Load Calculation Complexity 

Calculating acute-to-chronic workload ratios required handling irregular logging patterns and missing data. A robust Lambda function with rolling window calculations and data imputation logic was implemented. 

Challenge: Mobile Offline Support 

Players often practice in areas with limited connectivity. Local session caching was implemented in the mobile app, with DynamoDB sync triggered when connectivity is restored. 

Challenge: Bedrock Latency 

Initial AI response times were too slow for a smooth user experience. Lambda function optimisation, connection reuse, and streaming response patterns were implemented to reduce perceived latency. 

Challenge: Personalisation at Scale 

Each player requires different AI context. A DynamoDB-backed player profile system was built to inject individual history, bowling style, and goals into every Bedrock prompt — personalising responses at scale. 

Benefits to the Client

Support & Next Steps

Peritos Solutions provided two weeks of hypercare post-launch, monitoring Lambda function performance, Bedrock API response quality, and DynamoDB throughput. The Yorker app is now live on the Google Play Store (com.yorker) and is actively being used by cricket players and coaches across New Zealand.

Planned next phases include:

Looking to Build an AI-Powered Mobile App on AWS?

Peritos Solutions specialises in AWS cloud architecture, AI/ML integration with AWS Bedrock,

and mobile application development across New Zealand, Australia, USA, and India.

Get in touch: info@peritosolutions.com | +64-212579909 | www.peritossolutions.com

Executive Summary

About Client

The client, Yorker, is focused on leveraging technology to address the challenge of tracking and managing cricket bowlers’ net practice bowling loads. Recognizing the risk of overtraining and injuries from improper tracking, therefore, Yorker aims to provide a digital solution tailored for cricket players. In addition, An AWS Custom Application for Yorker empowers bowlers to automate session recordings, create personalized training plans, and monitor progress effectively. The app also fosters a sense of community by enabling interaction, knowledge sharing, and participation in skill-building challenges. The project is being executed in multiple phases, beginning with a Minimum Viable Product (MVP) to establish a strong foundation for future improvements. Yorker’s commitment to innovation and user-centric design reflects its dedication to transforming how athletes manage their training and optimize performance while minimizing injury risks.

Project Background – Enhancing Cricket Training through Digital Bowling Load Management

The Yorker mobile app project addresses a major challenge for cricket bowlers: accurately tracking and managing their bowling loads during net practice. Without proper tracking, bowlers risk improper training regimens, leading to overtraining and injuries. The Yorker app offers a digital solution that automates session recordings, capturing key metrics like delivery count, types of deliveries, and intensity levels. Additionally, the app allows bowlers to create personalized training plans, track progress, and receive real-time alerts to avoid overexertion. By leveraging technology, this initiative not only helps reduce injury risks but also fosters a sense of community. Bowlers can share experiences, learn from experts, and engage in skill-enhancing challenges. Ultimately, the app aims to optimize performance while ensuring bowlers train safely and efficiently, revolutionizing the way athletes manage their training.

Scope & Requirement for AWS Custom Application For Yorker

Scope: The first phase of the Yorker mobile application focuses on developing a Minimum Viable Product (MVP) to establish a strong foundation. Specifically, this phase will deliver core functionalities to allow cricket bowlers to start tracking their training sessions and managing their profiles. The scope includes:

Requirements:

Implementation

Technology and Architecture for AWS Custom Application For Yorker

Read more on the technology and Architecture we used for AWS Custom Application Development

Technology
Scalability
Integrations

The application leverages RESTful APIs for smooth data transfer between the front end and back end, facilitating user authentication, session tracking, and profile management. Future integrations may include cloud-based analytics and third-party push notifications to enhance user engagement.

Cost Optimization

Peritos helped optimize costs for Yorker by designing an efficient AWS architecture using auto-scaling, right-sized instances, and serverless technologies. With tools like AWS Cost Explorer and Trusted Advisor, we continuously monitored and reduced spending. Automation through CI/CD pipelines and code optimization further enhanced performance while lowering operational costs.

Backup and Recovery

A robust backup strategy, using Amazon S3, prevents data loss, while automated recovery processes ensure quick restoration in case of failure.

Features of AWS Custom Application For Yorker

Challenges with AWS Custom Application For Yorker

Support

As part of the project implementation we provide 2 months of Ongoing extended support. Additionally, this also includes 20 hrs a month of development for minor bug fixes and a SLA to cover any system outages or high priority issues.

Next Phase

We are now looking at the next phase of the project which involves: