Digital Media Technology Lab
The Digital Media Technology (DMT) Lab specialises in the development of methods for creating, processing, analysing, evaluating and distributing digital media. The Lab has four core themes and its academics, research fellows and PhD students bring together core expertise in digital signal processing and machine learning to collaborate on projects.
- Mixed Reality and HCI
- Digital Image and Video Processing
- Digital Audio Processing
- Digital Media Distribution
Mixed Reality and Human Computer Interaction
The Mixed Reality and Human Computer Interaction (HCI) Group researches a range of new and exciting applications in HCI, Mixed Reality (MR), Augmented Reality (AR) and Virtual Reality(VR). The group specialises in understanding and defining new methods and modes of interacting with computer systems via new sensor systems and current AR and VR hardware. The group also researches into the methods for improving the application and adoption of AR and VR systems into production workflows and holds a core specialism in understanding the usability and plausibility of the end systems. The group consists of a core team of academics who are part of national and international committees (ISMAR) and interest groups (ImmerseUK) in AR, VR and work with some of the world's leading international companies in developing novel AR and VR solutions. The group further works in conjunction with leading UK charity organisations in researching novel interfaces and hardware solutions for users with physical impairments.
Areas of Activity
- Natural interaction in Augmented and Virtual Reality
- User interaction analysis
- Eye Gaze Interaction for Disabled Users
- Freehand Grasping for Mixed Reality
- Freehand Grasping for Virtual Reality
- Mixed Reality for Remote Medical Demonstrations
- Onset pre-visualisation in film production
- Motion capture analysis (user tracking)
- Muadh Al-Kalbani - Thesis titled: Achieving Natural Freehand Interaction in Mixed Reality
- Andreea Blaga - Thesis titled: A new grasping model for Mixed and Virtual Reality
Research Projects in this Group
List of Publications
- M Al-Kalbani, M Frutos-Pascual, I Williams. "Freehand grasping in mixed reality: analysing variation during transition phase of interaction" Proceedings of the 19th ACM International Conference on Multimodal Interfaces, 2017
- M Randall, I Williams, C Athwal. "A Predictive Approach to On-line Time Warping of Motion" IEEE International Symposium on Mixed and Augmented Reality (ISMAR), 2017
- AD Blaga, M Frutos-Pascual, M Al-Kalbani, I Williams. "Usability Analysis of an Off-the-Shelf Hand Posture Estimation Sensor for Freehand Physical Interaction in Egocentric Mixed Reality" IEEE International Symposium on Mixed and Augmented Reality (ISMAR), 2017
- Debattista K., Bashford-Rogers T., Harvey C., Waterfield B., Chalmers A.: Subjective Evaluation of High-Fidelity Virtual Environments for Driving Simulations. IEEE Transactions on Human-Machine Systems. Oct 17.
- Harvey C., Bashford-Rogers T., Debattista K., Doukakis E., Chalmers A.: Olfaction and Selective Rendering. Computer Graphics Forum. 09/2017.
- Doukakis E., Debattista K., Harvey C., Bashford-Rogers T., Chalmers A.: Audio-Visual Resource Allocation for Bi-Modal Virtual Environments. Computer Graphics Forum. 06/2017.
- Harvey C., Debattista K., Bashford-Rogers T., Chalmers A.: Multi-Modal Perception for Selective Rendering. Computer Graphics Forum. 01/2016.
- Minors A., Harvey C.: Influence of active listening on eye movements while viewing images of concert halls. Psychomusicology: Music, Mind, and Brain 01/2015; 25(3):345-354.
- Dong Y., Webb M., Harvey C., Debattista K. and Chalmers, A.: Multisensory Virtual Experience of Tanning in Medieval Coventry. In: EUROGRAPHICS Workshop on Graphics and Cultural Heritage. 27-29 September 2017. Graz, Austria.
- Barnett L., Gatzidis C., Harvey C.: An Investigation into Usability and First Time User Experiences within a Mobile Gaming Context. Edutainment 06/2017. (Best Short Paper Award).
- Bradley T., Debattista K., Bashford-Rogers T., Harvey C., Chalmers A.: Selective BRDFs for High Fidelity Rendering. Computer Graphics & Visual Computing (CGVC) Oct 2016.
- Dhokia A., Debattista K., Harvey C., Bashford-Rogers T., Chalmers A.: A Calibrated Olfactory Display for High Fidelity Virtual Environments. Computer Graphics & Visual Computing (CGVC) Oct 2016.
- Creed, C. (2016) Assistive tools for disability arts: collaborative experiences in working with disabled artists and stakeholders, Journal of Assistive Technologies, 10, 2, p. 121-129.
- Creed, C. (2016) Eye Gaze Interaction for Supporting Creative Work with Disabled Artists. Proceedings of 30th British HCI Group Annual Conference (HCI2016), Bournemouth, UK.
- Al-Kalbani, M. Williams I and Frutos-Pascual, M. “Analysis of Medium Wrap Freehand Virtual Object Grasping in Exocentric Mixed Reality”, Mixed and Augmented Reality (ISMAR), 2016 IEEE International Symposium on, Merida, 2016.
- Creed, C., Beale, R., Cowan, B. (2015) The Impact of an Embodied Agent's Emotional Expressions Over Multiple Interactions, Interacting with Computers, 27, 2, p. 172-188.
- Al-Kalbani, M. Williams, I. “Accuracy assessment of freehand grasping interaction for mixed reality”, Eurographics Workshop on Visual Computing for Biology and Medicine, 2015
- Hough, G.; Williams, I.; Athwal, C., "Fidelity and Plausibility of Bimanual Interaction in Mixed Reality," IEEE Transactions on Visualization and Computer Graphics, ,vol.12, no.99, pp. 1377-1389, 2015.
- G.Hough, I. Williams, C. Athwal. “Measurements of Live Actor Motion in Mixed Reality Interaction”. IEEE International Symposium on Mixed and Augmented Reality. Munich, Germany. 2014.
- G.Hough, I. Williams, C. Athwal. “Measurement of Perceptual Tolerance for Inconsistencies within Mixed Reality Scenes”. IEEE International Symposium on Mixed and Augmented Reality. Munich, Germany. 2014.
- Hough G, Athwal C and Williams I. Advanced Occlusion Handling for Virtual Studios. Lecture Notes in Computer Science, Springer 2012.
- Hough G, Athwal C and Williams I. 'ScaMP: A Head Guided Projection System'. ACM Designing Interactive Systems'12. Newcastle, UK. 2012
Digital Image and Video Processing
The Digital Image and Video Processing group researches a range of new and exciting applications for analysing and processing still image and video data. The research within the group is directly linked to a wide variety of application domains, these include medical image analysis systems, security analysis systems, video post production systems and and image distribution systems. The group has a strong core specialism in the application of low-level image feature extraction, image object classification, image compositing and real-time video processing systems and has strong ties with the Mixed reality and HCI group.
Areas of Activity
- Illumination and colour coherence in video compositing
- Organ motion analysis in MRI data
- 3D surface segmentation for CT and MRI data
- Improved brain tumour contouring using adaptive contour models
- Objective performance evaluation of edge detection methods
- Sam Smith - Thesis titled: Improved surface detection in 3D data
- Richard Finlan - Thesis titled: Segmentation of paediatric brain tumours in MRI data
List of Publications
- A Dolhasz, M Frutos-Pascual, I Williams. "Composite Realism: Effects of Object Knowledge and Mismatched Feature Type on Observer Gaze and Subjective Quality" IEEE International Symposium on Mixed and Augmented Reality (ISMAR), 2017
- Unsupervised morphological segmentation of tissue compartments in histopathological images. Fouad, S., Randell, D., Galton, S., Mehanna, H., Landini, G. 2017 In: PloS one 12 (11), e0188717
- Mereotopological Correction of Segmentation Errors in Histological Imaging. Randell, D., Galton, A., Fouad, S., Mehanna, H., Landini, G. 2017 In: Journal of Imaging 3 (4), 63
- Epithelium and Stroma Identification in Histopathological Images Using Unsupervised and Semi-Supervised Superpixel-Based Segmentation. Fouad, S., Randell, D., Galton, S., Mehanna, H., Landini, G. 2017 In: Journal of Imaging 3 (4), 61
- Unsupervised Superpixel-based Segmentation of Histopathological Images with Consensus Clustering. Fouad, S., Randell, D., Galton, A., Mehanna, H. & Landini, G. 19 Apr 2017. Medical Image Understanding and Analysis (MIUA) 2017. Communications in Computer and Information Science (CCIS) with Springer, p. 767- 779, 2017.
- Model-based Correction of Segmentation Errors in Digitised Histological Images. Randell, D., Galton, A., Fouad, S., Mehanna, H. & Landini, G. 19 Apr 2017. Medical Image Understanding and Analysis (MIUA) 2017. Communications in Computer and Information Science (CCIS) with Springer, p. 719-730, 2017.
- Morphological separation of clustered nuclei in histological images. Fouad, S., Landini, G., Randell, D. & Galton, A. 1 Jul 2016 13th International Conference on Image Analysis and Recognition, ICIAR 2016, Póvoa de Varzim, Portugal, July 13-15, 2016. Proceedings. Springer International Publishing, Vol. 9730, p. 599-60
- Ontological Levels in Histological Imaging. Galton, A., Landini, G., Randell, D. & Fouad, S. 6 Jul 2016 Formal Ontology in Information Systems. IOS Press, Vol. 283, p. 271-284 14 p. 8. (Frontiers in Artificial Intelligence and Applications)
- Automatic thresholding from the gradients of region boundaries. Landini, G., Randell, D., Fouad, S. & Galton, A. 20 Sep 2016 In: Journal of Microscopy
- Classifying Cognitive Profiles Using Machine Learning with Privileged Information in Mild Cognitive Impairment. Alahmadi, H., Shen, Y., Fouad, S., Di Bernardi Luft, C., Bentham, P., Kourtzi, Z. & Tino, P. 17 Nov 2016 In: Frontiers in Computational Neuroscience. 10, 117
- Dolhasz, A. Williams I and Frutos-Pascual, M. “Measuring Observer Response to Object-Scene Disparity in Composites”. Mixed and Augmented Reality (ISMAR), 2016 IEEE International Symposium on, Merida, 2016.
- Smith, S. Williams, I. “A Statistical Method for Surface Detection”, Eurographics Workshop on Visual Computing for Biology and Medicine, 2015
- S. Smith and I. Williams. “A Statistical Method for Improved Surface Detection”. Signal Processing Letters, August 2015 vol. 8, pp 1045-1049
- Williams, I., Bowring, N. & Svoboda, D. (2014) ‘A Performance Evaluation of Statistical Tests for Edge Detection in Textured Images’ in Computer Vision and Image Understanding vol. 122 May 2014 pp. 115-130
- Williams I, Shirvani B and Mourier JM. Measurement of Cold Rolled Steel Sections Using Digital Image Processing. Journal of Key Engineering Materials, vol 473. Trans Tech Publications, 2011
- Williams I, Svoboda D and Bowring N. A Novel Performance Metric for Greyscale Edge Detection. Proceedings of the International Conference on Computer Vision Theory and Applications 2010.
- Bowring N, Williams I, Johnson C and Jaiswal J. Fatigue crack, squat and wheel burn detection by a multi-scale statistical image processing technique. Proceedings of the 33rd Annual General Meeting of the British Institute of Non-Destructive Testing.
- Williams I, Svoboda D, Bowring N and Guest, E. Statistical Edge Detection of Concealed Weapons Using Artificial Neural Networks. Proceedings of SPIE-IS&T Electronic Imaging. Vol. 6812. Bellingham, Washington: SPIE, 2008. p. 68121J-1-12, 12 pp. ISSN 0277-786X.
- Williams I, Svoboda D, Bowring N, and Guest E. Improved Statistical Edge Detection Through Neural Networks. In 10th Conference on Medical Image Understanding and Analysis 2006. ISBN: 1-901727-31-9. P.56-60.
Digital Audio Processing
The Digital Audio Processing Group is a multidisciplinary research group working in the areas of Music Informatics, Digital Signal Processing (DSP), and computational musicology.
It studies technologies for analysing and processing music, using a range of Digital Signal Processing (DSP) techniques. This involves processing musical event data in real-time and the processing of digital audio signals. Our research extends to embrace psychoacoustics and music theory/analysis/synthesis subject areas.
With a wealth of knowledge gained from working alongside industry and in collaboration with other institutions, we are ideally placed to engage in cutting-edge research work. As part of a Faculty with a long history of working alongside creative companies, we use innovative computational techniques to develop a range of outputs that can be employed by forward-thinking businesses.
The Group is a vibrant hub for audio technology research. With close links to research teams in Digital Image and Video Processing as well as the Centre for Music and Performance at Birmingham Conservatoire, the research within this group is focused on using a range of scientific and mathematical techniques and draws on a strong emphasis on DSP. We have expanding opportunities for students wishing to pursue MPhil and PhD study.
Areas of Activity
- Low Latency High Resolution Audio Processing
- Semantic Music Production and Retrieval
- Application of perceptual models to automatic music mixing
- Probabilistic Models for Expressive Musical Performance
- Adaptive Digital Audio Effects
- Acoustic Variance in Musical Performance
- Signal Processing for Folk Music Analysis
Research Projects in this Group
Digital Media Distribution
The Digital Media Distribution Group addresses a range of advanced topics in distribution formats and services. The group specifically focusses on applications in commercial media operation for network provision, authoring and composition of media and presentation of media services for consumption and re-use of media content.
Areas of Activity
- Low Latency and Real-Time Networks
- Object-Based Interactive and Personalised Media
- Media Distribution and Services
- Asset Tagging, Tracking and Protection in Networked Media
Staff working in this group
- Jeremy Foss
- Leo Wang
- Dalia El Banna
List of Publications
- B. Veloso, B. Malheiro, J.C. Burguillo-Rial, J.Foss. “Personalised Fading for Stream Data” SAC '17 Proceedings of the Symposium on Applied Computing, Pages 870-872 , Marrakech, Morocco, April 4-6, 2017
- B. Veloso, J. Foss, B. Malheiro, J.C. Burguillo-Rial, J. Gama. “Personalised Dynamic Viewer Profiling for Streamed Data” In: Rocha Á., Adeli H., Reis L.P., Costanzo S. (eds) Trends and Advances in Information Systems and Technologies. WorldCIST'18 2018. Advances in Intelligent Systems and Computing, vol 746. Springer, Cham
- B. Veloso, J. Foss, B. Malheiro, J.C. Burguillo-Rial. “Improving On-line Genre-based Viewer Profiling” In Proceedings of the In-Programme Personalisation for Broadcast (IPP4B) Workshop, ACM TVX 2017 Conference, Hilversum, The Netherlands, June 2017 (TVX 2017), 6 pages.
- J. Foss, B.Shirley, B.Malheiro, S.Kepplinger, A.Ulisses, M.Armstrong, “In-Programme Personalisation for Broadcast: IPP4B (Workshop Summary)” In Proceedings of the In-Programme Personalisation for Broadcast (IPP4B) Workshop, ACM TVX 2017 Conference, Hilversum, The Netherlands, June 2017 (TVX 2017), 6 pages.
- B. Veloso, J. Foss, B. Malheiro, J.C. Burguillo-Rial. “Product Placement Platform for Personalised Advertising”. New European Media (NEM) Summit 2016.
- J.Foss, A.Ullises, N.Monnoyer. “Object-Media: From Personalisation To A Seamless TV/VR Convergence”. Presentation - IBC Futurzone presentation, Amsterdam 2016
- J.Liebetrau, S.Kepplinger, A.Cristea, J.Foss. “Roadmap for a Comprehensive Evaluation Approach on QoE of Interactive and Personalized TV” 1st International Workshop on Interactive Content Consumption at EuroITV 2013, Como, Italy; June 2013
- B. Malheiro, J. Foss, B. Veloso, J.C. Burguillo-Rial. “B2B Platform for Media Content Personalisation” MUSST: Workshop on Multi-User Services for Social TV, EuroITV 2013, Como, Italy; June 2013
- J.Foss, B.Malheiro, J C Burguillo-Rial. “Personalisation of Networked Video” EuroITV, Berlin, Germany, July 2012
- J.Foss, B.Malheiro, J.C.Burguillo-Rial. “Personalised Placement in Networked Video” MutliA-Pro, Lyon, France, March 2012
- T.Ma, Y.Wang, et al. "Evaluation of Flexilink as Unified Real-Time Protocol for Industrial Networks" 13th IEEE Conference on Industrial Electronics and Applications (ICIEA 2018), To appear and paper accepted.
- R.Ma, Y.Wang et al. "Optimum Design of Multistage Half-band FIR Filter for Audio Conversion using a Simulated Annealing Algorithm" 13th IEEE Conference on Industrial Electronics and Applications (ICIEA 2018) To appear and paper accepted.
- K.Xu, W.Hu, Y.Wang,"An Improved Singer's Formant Extraction Method Based on LPC Algorithm" 2017 IEEE 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI 2017) . 2017
- N.Jillings, Y.Wang et al. "An Intellgient Audio Plugin Framework for the Web Audio API. 2nd Web Audio Conference (WAC 2017). London, United Kingdom. August, 2017
- Oluwaseyi Oginni, Peter Bull and Yonghao Wang. "Constraint-Aware Software-Defined Network for Routing Real-Time Multimedia in Multi-Path Networks". The 15th International Workshop on Real-Time Networks, 27th June, 2017
- Y.Song, Y.Wang, et.al. "Performance Evaluation of a New Flexible Time Division Multiplexing Protocol on Mixed Traffic Types," IEEE International Conference on Advanced Information Networking and Applications (AINA) 27-29 March 2017
- J. Tian, W. Hu, Y. Wang, et al. "A Novel PSO based Task Scheduling Algorithm for Multi-core Systems", Chapter, January 2017, In book: Smart Computing and Communication, pp.62-71
- J. Lu, W. Hu, Y. Wang, et al. "A Hybrid Algorithm Based on Particle Swarm Optimization and Ant Colony Optimization Algorithm", Chapter, January 2017, In book: Smart Computing and Communication, pp.22-31
- X.Zhu, Y. Wang, W. Hu, et al. “Practical considerations on optimising multistage decimation and interpolation processes,” to appear. “IEEE International Conference on Digital Signal Processing”, Beijing, 2016 OCT 16-18.
- N.Jillings, Y.Wang, J.Reiss, R.Stables, "JSAP: A Plugin Standard for the Web Audio API with Intelligent Functionality". 141st Audio Engineering Convention, Los Angeles. 2016 09.
- W.Hu, Q.Shi, Y.Wang, et al. “An efficient task mapping algorithm with power-aware optimization for network on chip,” Journal of Systems Architecture, Elsevier, 24 April 2016, ISSN 1383-7621.
- O. Hawker and Y. Wang, “A Method of Equal Loudness Compensation for Uncalibrated Listening Systems,” in Audio Engineering Society Convention 139, 2015.
- I. Arnaldi, Y. Wang, "A SIMULINK toolbox of Sigma-Delta modulators for high resolution audio conversions", 137th AES Convention Los Angeles, 2014.
- N. Jillings, Y. Wang, "CUDA Accelerated Audio Digital Signal Processing for Real-Time Algorithms", 137th AES Convention Los Angeles, 2014.
- D. El-Banna, Y. Wang, M. Clarke, S. Cox, “The Development of a simulation model for the Flexilink Protocol”, The Journal of TEE – ISSUE1 Volume 1, Spring 2014
- Y. Wang, J. Grant, and J. Foss, "Flexilink: A unified low latency network architecture for multichannel live audio", 133rd AES Convention San Francisco, Oct 2012.
- Y. Wang, J. D. Reiss, "Time domain performance of decimation filter architectures for high resolution sigma delta analogue to digital conversion", 132nd AES Convention, Budapest, Hungary, April 26–29, 2012.
- Y. Wang, X. Zhu, Q. Fu, "A Low Latency Multichannel Audio Processing Evaluation Platform", 132nd AES Convention, Budapest, Hungary, April 26–29, 2012.
- Y. Wang, Engineering Brief: "Latency Measurements of Audio Sigma Delta Analogue to Digital and Digital to Analogue Converts ", 131st AES Convention, New York, USA, Oct 20-23, 2011.
- Y. Wang, R. Stables, J. D. Reiss, "Audio Latency Measurement for Desktop Operating Systems with Onboard Soundcards", 128th AES Convention, London, May 22-25, 2010.