• Send Us A Tip
  • Calling all Tech Writers
  • Advertise
Monday, July 7, 2025
  • Login
TechStory
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to
No Result
View All Result
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to
No Result
View All Result
TechStory
No Result
View All Result
Home Business

“Every Frame Is a Decision”: Kirill Starkov on the Realities of Real-Time Object Tracking

by Rohan Mathawan
November 11, 2022
in Business
Reading Time: 4 mins read
0
Photo by Bernd 📷 Dittrich on Unsplash

Photo by Bernd 📷 Dittrich on Unsplash

TwitterWhatsappLinkedin

When we talk about artificial intelligence in computer vision, our minds often jump to familiar concepts — face detection, self-driving cars, surveillance cameras. But behind these technologies lies a deceptively complex task: understanding who or what is moving, and where, in real time. This is the domain of object detection and tracking, a space where milliseconds matter and mistakes have consequences.

You might also like

Elon Musk Launches “America Party” to Challenge U.S. Two-Party Dominance

Aditya Mangla Appointed CEO of Zomato’s Food Ordering and Delivery Business

Nehal Modi Arrested in US for Loan and Pawn Diamond Fraud, Faces Extradition to India

Kirill Starkov knows this field intimately. With a background in machine learning and a particular focus on computer vision, Kirill has spent years working on production-level systems involving object detection and tracking. His experience spans from video analytics for surveillance to automated licence plate recognition systems and pandemic-response tools built under tight deadlines.

“This field looks clean in theory,” Kirill begins, “but in practice, it’s chaotic. You’re working with noise, low-quality streams, overlapping objects, changing light conditions, and unpredictable behaviour. Every frame is a decision — and every mistake is cumulative.”

Understanding the Technical Stack

At its core, real-time object detection and tracking relies on a combination of well-known models and algorithms. “In most practical pipelines, you’ll see something like YOLO for detection, paired with DeepSORT or a custom tracker for the temporal association,” Kirill explains. “YOLOv4 and YOLOv5 have become very popular in recent years because of their speed-to-accuracy ratio. But the tracker is where the real finesse happens.”

He describes tracking as the ‘glue’ that holds the detection together. “Detection tells you what’s in the current frame. Tracking tells you what’s been there — and whether it’s still the same object. Without a reliable tracker, you’re just guessing between frames.”

The DeepSORT algorithm, a refinement over the simpler SORT (Simple Online Realtime Tracking), uses both motion (via Kalman filters) and appearance descriptors to associate detections across frames. Kirill has frequently worked with both. “SORT is faster, and if your objects are distinct and well-separated, it’s enough. But in crowded scenes, or when objects look similar, you need DeepSORT or even your own enhancements on top.”

Real World ≠ Benchmark

Kirill is quick to point out the disparity between academic papers and production deployments. “In benchmarks, you’re running videos from clear, daylight street scenes. In real life, you might be dealing with a night-time car park where half the footage is glare and shadows.”

He recalls one long-term project involving real-time vehicle tracking. “We were building a system to track cars and read their number plates under varying conditions — rain, snow, fog. The detection part was relatively stable, but tracking became unpredictable during certain hours of the day due to lighting changes.”

To solve it, Kirill’s team introduced custom appearance models trained on the specific environment, and even fine-tuned the motion model to better reflect the geometry of the location. “It’s not something you can get from plug-and-play packages. You need to know your data intimately and adapt your tools.”

The project involved a hybrid recognition model for licence plates, which combined transformer layers with an RNN decoder, an unusual architecture at the time. But Kirill highlights that no matter how novel the model, if tracking failed, the whole system lost value. “If your system says this is a new car every five seconds, you can’t build trust or automation on that.”

Challenges and Edge Cases

“Occlusion is the classic problem,” Kirill says. “You’ve got someone walking behind a pillar. Or two cars crossing paths. Your model needs to predict, interpolate, and recover. But even more difficult are long-term consistency and ID switches.”

In real-time systems, identity switches, when a tracker accidentally assigns the wrong ID to an object,  are especially harmful. “If a system is meant to monitor the flow of people through a building, or detect abnormal behaviour, an ID switch creates false narratives. It looks like a person duplicated, disappeared, or teleported.”

To reduce these issues, Kirill’s teams often integrate additional context. “If we know the physical layout of a space, we can model likely paths. If we have other sensors, we can cross-reference. Multimodal systems are one of the ways forward.”

He also highlights a common misconception: that increasing detection frequency always improves tracking. “Sometimes, more frames lead to more noise. If your detection isn’t stable, the tracker can’t compensate. It’s about precision, not just volume.”

Applications and Impact

Kirill’s work has had wide-ranging applications, from smart city initiatives to retail security. During the early stages of the COVID-19 pandemic, his team adapted part of their detection-tracking pipeline for a new purpose: face mask detection in public spaces.

“It was a two-day sprint,” he says. “We built a prototype that could detect whether someone was wearing a mask, and tracked them across a space to ensure compliance over time. The system was deployed in several retail chains. It wasn’t perfect, but it served a public health function at a critical time.”

That project later became a cornerstone of a client’s crisis-response suite, and contributed to client retention during the most uncertain period of their business. “It was a reminder that these systems aren’t just technical. They serve people. They inform decisions.”

Advice for Engineers Entering the Field

When asked what advice he would give to engineers starting in real-time computer vision, Kirill emphasises mindset over tools. “Don’t get too attached to the algorithms. Learn the principles. Learn what makes a system robust. Real-time tracking is about trade-offs:  between accuracy, speed, and complexity. You won’t get all three.”

He encourages young developers to start with existing open-source pipelines like the YOLO + DeepSORT combo, but not to stop there. “Run your models on dirty data. Film your own video in a train station or a parking lot. Break the system. That’s how you’ll learn what matters.”

Looking to the future, Kirill sees growing relevance in multimodal tracking, federated systems, and self-adapting trackers that learn on the edge. “We’re past the phase of isolated pipelines. Integration is the future — between sensors, between modalities, and between the model and its environment.”

Tweet63SendShare18
Previous Post

“I’m leaving my laptop at a random Starbucks for one hour,” Let’s see what happens!

Next Post

Coinbase cuts 60 jobs in latest layoffs

Rohan Mathawan

Content Editor at Techstory Media | Technology | Gadgets | Written more than 5000+ articles about different niches from Tech to online real money gaming for reputed brands and companies. Get in touch Email: [email protected] For Business Enquires related to TechStory [email protected]

Recommended For You

Elon Musk Launches “America Party” to Challenge U.S. Two-Party Dominance

by Harikrishnan A
July 7, 2025
0
Elon Musk Threatens “Thermonuclear Lawsuit” Against Media Watchdog Amid Ad Exodus on X

Elon Musk has officially launched a new political party called the “America Party”, marking a significant departure from his previous political alliances and a bold challenge to the...

Read more

Aditya Mangla Appointed CEO of Zomato’s Food Ordering and Delivery Business

by Rounak Majumdar
July 6, 2025
0
Aditya Mangla Appointed CEO of Zomato’s Food Ordering and Delivery Business

Eternal, formerly known as Zomato, has announced the appointment of Aditya Mangla as the new Chief Executive Officer of its food ordering and delivery business. The board approved...

Read more

Nehal Modi Arrested in US for Loan and Pawn Diamond Fraud, Faces Extradition to India

by Rounak Majumdar
July 6, 2025
0
Nehal Modi Arrested in US for Loan and Pawn Diamond Fraud, Faces Extradition to India

Nehal Modi, the younger brother of fugitive diamond merchant Nirav Modi, has once again found himself in the international spotlight after being arrested in the United States. The...

Read more
Next Post
Coinbase chances of featuring in Fortune 500 in 2023

Coinbase cuts 60 jobs in latest layoffs

Please login to join discussion

Techstory

Tech and Business News from around the world. Follow along for latest in the world of Tech, AI, Crypto, EVs, Business Personalities and more.
reach us at [email protected]

Advertise With Us

Reach out at - [email protected]

BROWSE BY TAG

#Crypto #howto 2024 acquisition AI amazon Apple bitcoin Business China cryptocurrency e-commerce electric vehicles Elon Musk Ethereum facebook flipkart funding Gaming Google India Instagram Investment ios iPhone IPO Market Markets Meta Microsoft News NFT samsung Social Media SpaceX startup startups tech technology Tesla TikTok trend trending twitter US

© 2024 Techstory.in

No Result
View All Result
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to

© 2024 Techstory.in

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?