Pingchuan Ma

Mr. Pingchuan Ma

Position:

Research Assistant / PhD Student

Email:

Personal website:

https://mpc001.github.io/

Biography

Pingchuan MA received his BSc degree from Beihang University in 2015 and his MSc degree in Machine Learning from Imperial College London in 2017. Currently, he is a PhD student at the iBUG group under the supervision of Prof. Maja Pantic and Dr. Stavros Petridis.

Publications

Journal articles

End-to-end visual speech recognition for small-scale datasets

S. Petridis, Y. Wang, P. Ma, Z. Li, M. Pantic. Pattern Recognition Letters. 131: pp. 421 - 427, 2020.

Bibtex reference [hide]
@article{petridis2020end,
    author = {S. Petridis and Y. Wang and P. Ma and Z. Li and M. Pantic},
    pages = {421--427},
    journal = {Pattern Recognition Letters},
    publisher = {Elsevier},
    title = {End-to-end visual speech recognition for small-scale datasets},
    volume = {131},
    year = {2020},
}
Endnote reference [hide]

Conference articles

Lip-reading with Densely Connected Temporal Convolutional Networks

P. Ma, Y. Wang, J. Shen, S. Petridis, M. Pantic. The IEEE Winter Conference on Applications of Computer Vision (WACV). 2021.

Bibtex reference [hide]
@inproceedings{ma2020lip,
    author = {P. Ma and Y. Wang and J. Shen and S. Petridis and M. Pantic},
    booktitle = {The IEEE Winter Conference on Applications of Computer Vision (WACV)},
    title = {Lip-reading with Densely Connected Temporal Convolutional Networks},
    year = {2021},
}
Endnote reference [hide]

Video-Driven Speech Reconstruction using Generative Adversarial Networks

K. Vougioukas, P. Ma, S. Petridis, M. Pantic. Interspeech. September 2019.

Bibtex reference [hide]
@inproceedings{interspeech_videoDrivenSpeechRecWithGANs,
    author = {K. Vougioukas and P. Ma and S. Petridis and M. Pantic},
    booktitle = {Interspeech},
    month = {September},
    title = {Video-Driven Speech Reconstruction using Generative Adversarial Networks},
    year = {2019},
}
Endnote reference [hide]

Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition

P. Ma, S. Petridis, M. Pantic. Interspeech. September 2019.

Bibtex reference [hide]
@inproceedings{AV_lombard,
    author = {P. Ma and S. Petridis and M. Pantic},
    booktitle = {Interspeech},
    month = {September},
    title = {Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition},
    year = {2019},
}
Endnote reference [hide]

Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture

S. Petridis, T. Stafylakis, P. Ma, G. Tzimiropoulos, M. Pantic. IEEE SLT. December 2018.

Bibtex reference [hide]
@inproceedings{AV_speech_hybrid_CTC_attention,
    author = {S. Petridis and T. Stafylakis and P. Ma and G. Tzimiropoulos and M. Pantic},
    booktitle = {IEEE SLT},
    month = {December},
    title = {Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture},
    year = {2018},
}
Endnote reference [hide]

End-to-End Audiovisual Speech Recognition

S. Petridis, T. Stafylakis, P. Ma, F. Cai, G. Tzimiropoulos, M. Pantic. Accepted to ICASSP. Calgary, Canada, April 2018.

Bibtex reference [hide]
@inproceedings{end2endAVspeech,
    author = {S. Petridis and T. Stafylakis and P. Ma and F. Cai and G. Tzimiropoulos and M. Pantic},
    address = {Calgary, Canada},
    booktitle = {Accepted to ICASSP},
    journal = {Accepted to ICASSP},
    month = {April},
    title = {End-to-End Audiovisual Speech Recognition},
    year = {2018},
}
Endnote reference [hide]