Audio Detection Accuracy for Smart City Listening

Imagine a world where machines don’t just hear, but listen intently—distinguishing a door creak from a dog bark, or a car alarm from the distant rumble of thunder. That world just edged closer to reality, thanks to the newly unveiled EFAM model, which pushes the boundaries of sound event detection by significantly boosting accuracy and clarity in noisy environments.

Developed through a collaboration between China Jiliang University, Hangzhou Hikvision Digital Technology Co., Ltd, Hangzhou Aihua Intelligent Technology Co., Ltd, and Beihang University, EFAM (Enhanced Feature Attention Model) marks a promising stride toward reliable, real-time acoustic event recognition in smart homes, cities, and industrial ecosystems.

Why Better Sound Detection Matters

Let’s face it—modern life is noisy. Sirens wail, dogs bark, engines rumble, and conversations overlap. For humans, it’s often easy to tune out the irrelevant. Machines, however, struggle with this chaos. That’s where high-precision sound detection enters the frame.

Reliable audio recognition is vital for a spectrum of real-world applications:

  • Smart home assistants that can detect a smoke alarm or a baby crying
  • City surveillance systems that monitor traffic incidents or gunshots
  • Environmental monitoring to track wildlife or measure noise pollution
  • Industrial safety alarms and predictive maintenance tools

“We envision a future where smart assistants, environmental monitors, and emergency systems all benefit from these improvements,” says Prof. Dongping Zhang, a lead researcher on the project. “Better sound detection means faster response times and stronger safety nets.”

With the EFAM model on the scene, the accuracy of recognising overlapping or obscured sound events has taken a notable step forward.

A 12.7% Accuracy Jump

Performance is where EFAM really shines. The system was benchmarked using the DESED dataset, a standard in the field of sound event detection, and the gains were unmissable:

  • PSDS1 Score: Jumped to 0.489, up by 12.7% from the baseline
  • PSDS2 Score: Climbed to 0.771, reflecting enhanced detection and classification
  • Class-balanced F1 Score: Reached 0.567, indicating sharper frame-level recognition

These aren’t minor tweaks. They represent a leap forward in how machines interpret complex acoustic scenes.
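
To make the class-balanced F1 figure concrete, here is a minimal numpy sketch of how a macro-averaged, frame-level F1 score is computed. This is an illustration of the metric only, with toy labels — not the official DESED/PSDS evaluation code:

```python
import numpy as np

def macro_f1(y_true, y_pred, n_classes):
    """Class-balanced (macro) F1: average the per-class F1 over all classes."""
    scores = []
    for c in range(n_classes):
        tp = np.sum((y_pred == c) & (y_true == c))
        fp = np.sum((y_pred == c) & (y_true != c))
        fn = np.sum((y_pred != c) & (y_true == c))
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
        scores.append(f1)
    return float(np.mean(scores))

# Toy frame-level labels for three sound classes
y_true = np.array([0, 0, 1, 1, 2, 2])
y_pred = np.array([0, 1, 1, 1, 2, 0])
print(round(macro_f1(y_true, y_pred, 3), 3))  # → 0.656
```

Because each class contributes equally to the average, rare sound events (a single glass break among hours of traffic noise) weigh as much as common ones — which is why this metric is preferred over plain accuracy in sound event detection.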

Dissecting EFAM

The EFAM model is built on a Mean-Teacher semi-supervised learning framework. This hybrid approach makes clever use of both labelled and unlabelled data—a crucial advantage in a field where sourcing annotated audio clips can be time-consuming and expensive.
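
The core Mean-Teacher idea can be sketched in a few lines: a student model learns from labelled clips as usual, while a teacher model — whose weights are an exponential moving average (EMA) of the student's — supplies targets for a consistency loss on unlabelled clips. The snippet below is a simplified illustration of that update rule, not the authors' training code:

```python
import numpy as np

def ema_update(teacher_w, student_w, alpha=0.999):
    """Teacher weights track an exponential moving average of the student's."""
    return alpha * teacher_w + (1 - alpha) * student_w

def consistency_loss(student_pred, teacher_pred):
    """Mean squared error between student and teacher predictions on the
    same (possibly unlabelled) clip, encouraging the two models to agree."""
    return float(np.mean((student_pred - teacher_pred) ** 2))

# One illustrative training step
rng = np.random.default_rng(0)
student_w = rng.normal(size=4)
teacher_w = student_w.copy()

# ...a gradient step on labelled data would update student_w here...
student_w -= 0.01 * rng.normal(size=4)  # stand-in for a gradient step

teacher_w = ema_update(teacher_w, student_w)
```

Because the consistency loss needs no ground-truth labels, every unlabelled clip still contributes a training signal — the key reason this framework suits a field where annotation is expensive.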

EFAM’s innovation lies in its layered architecture:

  1. Bi-Path Fusion Convolution
    • Captures both low-level details (like pitch) and high-level features (like audio texture) via dual convolution paths.
  2. Channel Attention Mechanism
    • Highlights the audio frequencies that matter, while tuning out irrelevant noise. It works like a fine-tuned filter, giving priority to critical sound patterns.
  3. Dual-head Self-Attention Pooling
    • Refines the focus further by aggregating and distilling predictions across frames, giving the model a clearer sense of the event timeline.
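
As a rough picture of how a channel attention mechanism re-weights frequencies, here is a squeeze-and-excitation-style gate in numpy. The bottleneck sizes and weights are illustrative placeholders; the paper's exact layer design may differ:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(feat, w1, w2):
    """Squeeze-and-excitation-style gate: pool each channel over time,
    pass the summary through a small bottleneck, and rescale channels.
    feat: (channels, frames) feature map."""
    squeeze = feat.mean(axis=1)                  # global average pool per channel
    gate = sigmoid(w2 @ np.tanh(w1 @ squeeze))   # bottleneck excitation, values in (0, 1)
    return feat * gate[:, None]                  # amplify or suppress each channel

rng = np.random.default_rng(1)
feat = rng.normal(size=(8, 16))    # 8 channels, 16 frames (toy feature map)
w1 = rng.normal(size=(2, 8)) * 0.1 # squeeze 8 channels down to 2 units
w2 = rng.normal(size=(8, 2)) * 0.1 # excite back up to 8 gate values
out = channel_attention(feat, w1, w2)
print(out.shape)  # → (8, 16)
```

The gate multiplies each channel by a learned value between 0 and 1, which is the "fine-tuned filter" behaviour described above: informative frequency channels pass through largely intact, while noisy ones are attenuated.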

On top of this, the use of pretrained BEATs (Bidirectional Encoder representations from Audio Transformers) embeddings injects the model with robust prior knowledge, drawn from large volumes of previously analysed audio.

Opening Doors to Next-Gen Audio Applications

These technical strides translate into meaningful real-world improvements. From smarter homes to resilient cities and safer factories, the implications are vast.

  • Consumer Tech: Imagine a voice assistant that not only listens to your commands but understands when your kettle boils or your cat meows at the door.
  • Public Safety: Surveillance systems can now better detect glass breaking, arguments escalating, or emergencies brewing in the background.
  • Industrial IoT: Machines on factory floors can detect early anomalies in sound signatures, alerting operators before failures occur.

These aren’t just incremental upgrades—they’re transformative shifts in how sound data can be leveraged.

Peer-Reviewed and Internationally Recognised

Published in Frontiers of Computer Science in April 2025, the research carries the validation of peer review in one of the field’s most reputable journals. It also highlights China’s growing strength in AI-powered smart sensing technologies, supported by collaborations among academic and industrial heavyweights. From Hangzhou’s AI hotspots to Beijing’s university labs, the synergy is clear.

What This Means for Policymakers and Industry Leaders

The promise of EFAM extends far beyond the lab. For those shaping regulations, urban planning, or designing the next wave of consumer electronics, this technology offers a blueprint for scalable, reliable sound intelligence.

  • Policy: Governments can set more precise safety standards based on verifiable audio data.
  • Infrastructure: Cities can invest in smarter street monitoring and incident detection.
  • Product Design: Manufacturers can build more intuitive, responsive, and safe consumer electronics.

As urban environments grow more complex and interconnected, the need for sound-aware systems becomes not just a luxury but a necessity.

Listening to the Future

EFAM might just be a stepping stone, but it’s a critical one. By significantly enhancing how machines perceive and interpret sound, it opens up a host of possibilities that reach into nearly every corner of modern life.

The leap in detection accuracy, underpinned by semi-supervised learning and smart attention mechanisms, positions EFAM as more than just a research prototype. It’s a foundational technology poised to power the next generation of responsive, audio-aware systems.

In a world that never stops making noise, EFAM teaches machines to listen more wisely—and that could make all the difference.

About The Author

Thanaboon Boonrueng is a next-generation digital journalist specializing in Science and Technology. With an unparalleled ability to sift through vast data streams and a passion for exploring the frontiers of robotics and emerging technologies, Thanaboon delivers insightful, precise, and engaging stories that break down complex concepts for a wide-ranging audience.
