Team Jedi Hackathon¶

Overview¶

This project allows developers to send a single channel sound file to a sound detection pipeline connected to business logic. The business logic parses the inference results as data that is exported to InfluxDB, a time-series database. This time-series data can then be viewed on a Grafana dashboard and customized to send email notifications.

Programming Languages: Python, Golang
Technologies used : Docker, Docker Compose, Make, Conda, BentoML, Telegraf, InfluxDB, Grafana
Intel OpenSource Technologies used: OpenVino, OpenVino Model Server (OVMS), EdgeX

Target System Requirements¶

Disk Space needed

Dependencies:¶

Docker v24.0.0
Docker Compose v2.17.3
Golang v1.20
Conda Environment with Python v3.8
Sample Audio File - must be single channel and .wav format

Microservice descriptions:¶

BentoML Sound Detection Pipeline¶

BentoML is a Unified Model Serving Framework which makes it easy to create ML-powered prediction services that are ready to deploy and scale. BentoML easily wraps the ML pipelines around web services. It is used by Data Scientists and ML Engineers to:

Accelerate and standardize the process of taking ML models to production
Build scalable and high performance prediction services
Continuously deploy, monitor, and operate prediction services in production

Learn more: BentoML

Open Source Repository: GitHub

Get Started on BentoML: Tutorial

Workflow¶

Sound Classification Pipeline python example from the OpenVino Model Zoo is adopted and modified to build the ML pipeline for this use case.
Bentos are created for this sound classification pipeline.
BentoML is now used to build & deploy these Bentos docker containers.
These Bentos can be tested using the Swagger APIs. (Explained in detail below)

This ML pipeline sends the following inference data to the business logic microservice -

{
    'timestamp': '2023-09-13 21:10:31.500582',
    'inputVideo': '${HOME}/team-jedi-hackathon-dev/media/ak47s_gun_sound_mono.wav', 
    'inference': [
        {
        'videoTimestamp': '[0.00-1.00]', 
        'label': 'Gunshot', 
        'accuracy': '100.00%'
        }, 
        {
        'videoTimestamp': '[1.00-2.00]', 
        'label': 'Door knock', 
        'accuracy': '15.64%'
        }   
    ],
    'latency': 9.321213001385331
}

ML Pipeline Service Development & Deployment Workflow

More examples for developing with BentoML can be found here for the AiCSD project

Business Logic Application Service¶

The microservice for the business logic is developed using the the golang-based Edgex app services. Sound Classification microservice sends POST request with the inference data to the business logic microservice. Here the inference results are stored in the InfluxDB which are then queried to display the results on the Grafana dashboard.

How It Works¶

Architecture diagram of the hackathon project showing how each service communicates to move data from the sound detection pipeline to receiving an email notification.

Figure 1: Architecture Diagram

In this project, a single-channel, wav-format sound file is sent over REST from the Swagger UI to the BentoML Sound Detection Pipeline container. This container then calls OpenVino Model Server (OVMS) Docker container over gRPC to get the model status. The Sound Detection Pipeline container will then run the Python inferencing and send the inference results over REST to the business logic application service. The business logic container is then responsible for parsing the inference results and persisting it to the InfluxDB. Initially it was discussed to create a separate data layer using EdgeX for persisting the data to influxDB. However, later decided to make direct call to InfluxDB from business logic microservice to keep it simple as adding additional microservice to this solution was an overhead. Ideally architecture should have separated these out, but complexity of setting up the data layer via EdgeX made us pivot to make direct calls to database. InfluxDB is responsible for storing the time-series data that is then visualized in a Grafana dashboard. The Grafana dashboard is configured for viewing inference results and sending email notifications.

Get Started¶

Provide step-by-step instructions for getting started.

Install the listed dependencies.
Configure DOCKER_INFLUXDB_INIT_PASSWORD and DOCKER_INFLUXDB_INIT_ADMIN_TOKEN in the .env file for InfluxDB.

Build the business logic container.

cd app-sample-service
make docker
cd ..

Create a Conda working environment, configure it, and activate.

conda create -n hackathon_env python=3.8
conda activate hackathon_env
cd pipelines/sound_classification_demo
pip install -r requirements.txt
cd ../..

Note

Use conda list to verify all packages in requirements.txt are installed.

Build the BentoML sound detection pipeline service
```
cd pipelines/sound_classification_demo
make build
```
Note

To build and run the docker image, the build tag from the make build command is required.

The following is the expected output from the build command:

Figure 2: BentoML Build Output
[Optional] Build the BentoML Docker Image from the pipelines/sound_classification_demo directory.
```
make docker-build BENTO_TAG=<bento_image_name>:<bento_image_tag>
```
Note

In the example above, the BENTO_TAG would be BENTO_TAG=sound_classification:r5lystssnwbweb5w.

Run the Application¶

Configure the type of sound you want to get notified with. Update the docker-compose-apps.yml line 18, and add sounds based on the file in here models/aclnet-int8/aclnet_53cl.txt Each item must be in a comma separated:
```
SOUNDS: "Gunshot,Door knock"
```
Run the stack of services from the project root directory.
```
make run
```
[Optional] Run Portainer for container management.
```
make run-portainer
```
Start the BentoML service from the pipelines/sound_classification_demo directory.

Run Method Run Command

Locally make serve

Docker make docker-run BENTO_TAG=<bento_image_name>:<bento_image_tag> PROJECT_REPO_PATH=<project_repo_path>

Note

In the example from setup, the BENTO_TAG would be BENTO_TAG=sound_classification:r5lystssnwbweb5w. The PROJECT_REPO_PATH is the full path to the team-jedi-hackathon-dev project.
Open the Swagger API UI.
Test a POST request to the /classify API by providing the input text as
```
{
  "MediaPath": "${HOME}/team-jedi-hackathon-dev/media/ak47s_gun_sound_mono.wav",
  "ModelPath": "${HOME}/team-jedi-hackathon-dev/models/aclnet/1/aclnet_des_53.xml",
  "LabelPath": "${HOME}/team-jedi-hackathon-dev/models/aclnet/aclnet_53cl.txt", 
  "GatewayIP":"XXX.XXX.X.X", 
  "Port":"9001"
}
```
Success

If the pipeline runs successfully, the Response Code will be 200 and the Response Body will look like Success, inference_results: {'timestamp': '2023-09-13 21:10:31.500582', 'inputVideo': '${HOME}/team-jedi-hackathon-dev/media/ak47s_gun_sound_mono.wav', 'inference': [{'videoTimestamp': '[0.00-1.00]', 'label': 'Gunshot', 'accuracy': '100.00%'}, {'videoTimestamp': '[1.00-2.00]', 'label': 'Door knock', 'accuracy': '15.64%'}], 'latency': 9.321213001385331}
To further verify the pipeline ran successfully, check the logs of the BentoML Pipeline container. The example below shows the logs from [Portainer][def] Figure 3: Portainer container log screenshot
Open influxDB to visualize data as well. Use admin as username and the password set inside the .env file. Go to the dashboard section: Import the file telegraf/template.json Click on Sound Dashboard Once in the dashboard, increase the refresh rate to 10s:
Open the Grafana Dashboard to see the visualization of the inference results.
To setup email notification in Grafana -

a. open grafana dashboard - http://0.0.0.0:3001/grafana (log in as admin, password is admin)

b. Go to Home/Alerting/Contact points

c. Edit the default contact point name, edit option will appear only if logged in as admin

d. Under addresses text box provide - alertgrafanaemail@gmail.com, gmail created for demo which is set in grafana.ini

e. Click Test

f. While clicking send test notification, following success message should be displayed - Test Alert Sent (need to be outside vpn)

g. Now save the contact point.

API Documentation¶

Sound Classification Pipeline - Python Microservice POST http://0.0.0.0:3000/classify
```
{
  "MediaPath": "${HOME}/team-jedi-hackathon-dev/media/ak47s_gun_sound_mono.wav",
  "ModelPath": "${HOME}/team-jedi-hackathon-dev/models/aclnet/1/aclnet_des_53.xml",
  "LabelPath": "${HOME}/team-jedi-hackathon-dev/models/aclnet/aclnet_53cl.txt", 
  "GatewayIP":"XXX.XXX.X.X", 
  "Port":"9001"
}
```
Success

If the pipeline runs successfully, the Response Code will be 200 and the Response Body will look like Success, inference_results: {'timestamp': '2023-09-13 21:10:31.500582', 'inputVideo': '${HOME}/team-jedi-hackathon-dev/media/ak47s_gun_sound_mono.wav', 'inference': [{'videoTimestamp': '[0.00-1.00]', 'label': 'Gunshot', 'accuracy': '100.00%'}, {'videoTimestamp': '[1.00-2.00]', 'label': 'Door knock', 'accuracy': '15.64%'}], 'latency': 9.321213001385331}

Business Logic - Golang Microservice

  POST http://127.0.0.1:59741/api/v1/data
  Payload:  {"inference": [{"videoTimestamp": "[0.00-1.00]", "label": "Gunshot", "accuracy": "100.00%"}, {"videoTimestamp": "[1.00-2.00]", "label": "something", "accuracy": "15.64%"}], "latency": 9.321213001385331}

If request is successfull, then the Response Code will be 200

Testing¶

Unit test for golang microservices using go test.

    cd ${HOME}/team-jedi-hackathon-dev/app-sample-service
    go test ./...

Python microservices can be unit tested using pytest.(TODO)

Golang microservices can be integration tested using Postman by send a POST request

    POST http://127.0.0.1:59741/api/v1/data
    Payload:  {"inference": [{"videoTimestamp": "[0.00-1.00]", "label": "Gunshot", "accuracy": "100.00%"}, {"videoTimestamp": "[1.00-2.00]", "label": "something", "accuracy": "15.64%"}], "latency": 9.321213001385331}

Python AI/ML Pipelines can be integration tested using Swagger UI(steps explained above).

Postman can also be used by sending a POST request

    POST http://0.0.0.0:3000/classify
    Payload:  {"MediaPath": "${HOME}/team-jedi-hackathon-dev/media/ak47s_gun_sound_mono.wav", "ModelPath": "${HOME}/team-jedi-hackathon-dev/models/aclnet/1/aclnet_des_53.xml", "LabelPath": "${HOME}/team-jedi-hackathon-dev/models/aclnet/aclnet_53cl.txt", "GatewayIP":"XXX.XXX.X.X", "Port":"9001"}

Check influxDB to test and visualize the data(steps explained above).

Summary and Next Steps¶

Final Solution¶

This solutions aims to build an infrastructure for remote sensing, making it easy to integrate different AI/ML pipelines with minimal changes. Current solution supports a sound classification pipeline which raises an email alert notification when any sound perceived as a threat (e.g., gunshots, chainsaw etc.) is detected.

Solution Features¶

Fully distributed solution achieved through microservices and containerization
Faster development of application services using open-source framework EdgeX
Seamless interaction between solution components written in different languages – Golang based Business microservices and Python based AI/ML pipelines
Easily integrate a new AI/ML pipeline (e.g. Intel OpenVINO Model Zoo examples) as a microservice using open-source tool – BentoML
Dashboard & alert notifications added using open-source TIG stack – Telegraf, InfluxDB, Grafana
Sound Classification pipeline executed using Intel OpenVINO and Intel OpenVINO Model Server (OVMS) serving model status.
Security enabled via open-source framework EdgeX

Next Steps¶

Following couldn't be finished due to lack of time -

Pytest unit tests for Python microservices.
Grafana email notification feature is integrated, however couldn't finish creating the Grafana rules to send email notification.
Couldn't test the microservices with validation service.
Infer sound using Intel OVMS instead of Intel OpenVINO, contacted OVMS team for help.

Troubleshooting¶

While building the Sound Classification pipeline microservices using BentoMl, several package dependecy issues can be resolved by using conda environment. Make sure all packages are installed before building & running them.
Sound Classification pipeline from OpenVINO model zoo didn't work initially. Raised issue using github to get support on the same from the OpenVINO team.
Access Grafana Dashboard using this link - http://0.0.0.0:3001/grafana. Accessing the link directly from portainer (http://0.0.0.0:3001) results in error as it doesn't append grafana to the link.
For setting up the Grafana email notifications, user need to log in as admin.

Run Method	Run Command
Locally	`make serve`
Docker	`make docker-run BENTO_TAG=<bento_image_name>:<bento_image_tag> PROJECT_REPO_PATH=<project_repo_path>`