The Robots are Taking Over: How AI Can Help Us Analyse Forensic DNA Profiles

The Robots are Taking Over: How AI Can Help Us Analyse Forensic DNA Profiles

There is an increasing utilisation of Artificial Neural Networks (ANN) in modern society. In the field of forensic DNA profile analysis, the use of ANNs to aid in the classification of peak data within casework electropherograms (EPG) has recently been described. Prior to implementation, an ANN must be created, trained, and adjusted until an acceptable level of performance is obtained. The training of an ANN for the classification of DNA profiles is undertaken using a supervised learning approach. In principle, DNA profile data manually labelled by an analyst comprise a training dataset where the ANN learns the features of EPG data and discerns the ‘correct’ type of peak classification to be assigned. Various factors of the training regimen impact the overall performance of the ANN. These include: the number of epochs (training cycles), the size of the training (and validation) dataset, the types of EPG data used for training, and the composition (different types of profiles) of the training dataset. We present how trained ANNs can assist laboratories streamline the designation of forensic DNA profiles. We describe the importance of the training regimen and how performance metrics can be used to validate the ANN and give examples for DNA profiles generated using the PowerPlex® Fusion 6C profiling kit.

There is an increasing utilisation of Artificial Neural Networks (ANN) in modern society. In the field of forensic DNA profile analysis, the use of ANNs to aid in the classification of peak data within casework electropherograms (EPG) has recently been described. Prior to implementation, an ANN must be created, trained, and adjusted until an acceptable level of performance is obtained. The training of an ANN for the classification of DNA profiles is undertaken using a supervised learning approach. In principle, DNA profile data manually labelled by an analyst comprise a training dataset where the ANN learns the features of EPG data and discerns the ‘correct’ type of peak classification to be assigned. Various factors of the training regimen impact the overall performance of the ANN. These include: the number of epochs (training cycles), the size of the training (and validation) dataset, the types of EPG data used for training, and the composition (different types of profiles) of the training dataset. We present how trained ANNs can assist laboratories streamline the designation of forensic DNA profiles. We describe the importance of the training regimen and how performance metrics can be used to validate the ANN and give examples for DNA profiles generated using the PowerPlex® Fusion 6C profiling kit.

Workshop currently at capacity. A waitlist is available to join on our registration page.

Brought to you by

Worldwide Association of Women Forensic Experts

Meng-Han Lin

Senior Scientist, Institute of Environmental Science and Research (ESR)

Meng-Han Lin is a Senior Scientist at the Institute of Environmental Science and Research (ESR) and has been employed in the STRmix™ team since 2016. He has a background in molecular biology and in his current role is primarily responsible for the development, test, and validation of FaSTR™ DNA analysis software.

Speaker Image

Submit Question to a speaker