Research outputs 2022 to 2026

PyMAiVAR: An open-source Python suit for audio-image representation in human action recognition

Abstract

We present PyMAiVAR, a versatile toolbox that encompasses the generation of image representations for audio data including Wave plots, Spectral Centroids, Spectral Roll Offs, Mel Frequency Cepstral Coefficients (MFCC), MFCC Feature Scaling, and Chromagrams. This wide-ranging toolkit generates rich audio-image representations, playing a pivotal role in reshaping human action recognition. By fully exploiting audio data's latent potential, PyMAiVAR stands as a significant advancement in the field. The package is implemented in Python and can be used across different operating systems.

Keywords

[RSTDPub], Computer vision, Human action recognition, Image representations, Multimodal

Document Type

Journal Article

Date of Publication

9-1-2023

Volume

Publication Title

Software Impacts

Publisher

Elsevier

School

School of Science / School of Engineering

RAS ID

58302

Funding Information

Edith Cowan University / Australia and Higher Education Comission (HEC) of Pakistan / Australian Government

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Related Publications

Shaikh, M. (2023). Multimodal human action recognition using deep learning. Edith Cowan University. Retrieved from https://ro.ecu.edu.au/theses/2749

Comments

Shaikh, M. B., Chai, D., Islam, S. M. S., & Akhtar, N. (2023). PyMAiVAR: An open-source Python suit for audio-image representation in human action recognition. Software Impacts, 17, article 100544. https://doi.org/10.1016/j.simpa.2023.100544

Download

Included in

Computer Sciences Commons, Electrical and Computer Engineering Commons

COinS

Link to publisher version (DOI)

10.1016/j.simpa.2023.100544

Research outputs 2022 to 2026

PyMAiVAR: An open-source Python suit for audio-image representation in human action recognition

Abstract

Keywords

Document Type

Date of Publication

Volume

Publication Title

Publisher

School

RAS ID

Funding Information

Creative Commons License

Related Publications

Comments

Included in

Link to publisher version (DOI)

Search

Links

Browse

Author Information

Article Locations

Research outputs 2022 to 2026

PyMAiVAR: An open-source Python suit for audio-image representation in human action recognition

Authors/Creators

Abstract

Keywords

Document Type

Date of Publication

Volume

Publication Title

Publisher

School

RAS ID

Funding Information

Creative Commons License

Related Publications

Comments

Included in

Share

Link to publisher version (DOI)

Search

Links

Browse

Author Information

Article Locations