Publication
Detection of copy number variants in the human genome: Is long-read sequencing an alternative to genomic microarrays?
| dc.contributor.author | Silva, Catarina | |
| dc.contributor.author | Ferrão, José | |
| dc.contributor.author | Marques, Barbara | |
| dc.contributor.author | Pedro, Sónia | |
| dc.contributor.author | Correia, Hildeberto | |
| dc.contributor.author | Rodrigues, António Sebastião | |
| dc.contributor.author | Vieira, Luís | |
| dc.date.accessioned | 2024-02-27T17:26:22Z | |
| dc.date.available | 2024-02-27T17:26:22Z | |
| dc.date.issued | 2023-11 | |
| dc.description.abstract | Introduction: Copy number variations (CNVs) represent ~13% of the human genome and can harbour important genes and regulatory elements. High-resolution whole genome microarray (MA) analysis is the gold standard tool for detection of CNVs associated with genetic disorders. While short-read sequencing (SRS) can address SV detection, the use of long-read sequencing as proven to overcome SRS mapping inaccuracy in highly repetitive DNA regions and improve genome contiguity. We applied whole genome nanopore sequencing (NS) to call CNVs and compared the results with those obtained by microarray. Methodology: Genomic DNA from 2 cell lines (EOL-1 and 697) were processed using the CytoSan HD Array (Affymetrix) and ChAS software (ThermoFisher). A minimum CNV calling size threshold of 35 Kb was used. DNA was also sequenced on the MinION device (Oxford Nanopore Technologies) following a rapid library preparation method. Sequencing data were basecalled using Guppy, mapped with LRA, and SVs called using both CuteSV and Sniffles2. Sanger sequencing was performed to demonstrate breakpoint positions for 3 CNVs. R packages were used to perform comparisons between MA and NS data. Results: A total of 49 CNVs were confirmed after curated MA analysis in both cell lines, ranging in size from 35 Kb to 79 Mb. From those, 43 CNVs (87.7%) were called in nanopore data by either one (4 CNVs) or both (39 CNVs) callers with a mean whole genome coverage of ~12X. Six of 43 CNVs were called as inversions instead. In 3 CNVs the size of the variant was found to be smaller (ranging from ~5 to 22 Kb) than the threshold of MA analysis. The correlation between CNV sizes obtained with MA and NS was of 0.71 with Sniffles2 and 0.74 with CuteSV, whereas the correlation between callers was of 0.99. The breakpoint precision obtained for NS was much higher (ranging for CuteSV from 2 to 42 bp; and for Sniffles2 from 0 to 87 bp) than the one obtained for MA (ranging from 774 to 7618 bp). Conclusions: NS technology proved to be technically effective in the detection of CNVs of different types and sizes and thus posing itself as an alternative to MA in the detection of pathogenic SVs associated with genetic diseases. However, NS data analysis requires fine-tuning of the analysis conditions as well as the use of different methods, for greater reliability of results in a clinical context. | pt_PT |
| dc.description.sponsorship | Funding: This work is a result of the GenomePT project (POCI-01- 0145-FEDER-022184), supported by COMPETE 2020 – Operational Programme for Competitiveness and Internationalisation (POCI), Lisboa Portugal Regional Operational Programme (Lisboa2020), Algarve Portugal Regional Operational Programme (CRESC Algarve2020), under the PORTUGAL 2020 Partnership Agreement, through the European Regional Development Fund (ERDF), and by Fundação para a Ciência e a Tecnologia (FCT). This work was also supported by Fundos FEDER through the Programa Operacional Factores de Competitividade – COMPETE and by Fundos Nacionais through the FCT within the scope of the project UID/BIM/00009/2019 (Centre for Toxicogenomics and Human Health-ToxOmics | pt_PT |
| dc.description.version | N/A | pt_PT |
| dc.identifier.uri | http://hdl.handle.net/10400.18/9152 | |
| dc.language.iso | eng | pt_PT |
| dc.relation | National Facility for genome sequencing and analysis | |
| dc.relation | Centre for Toxicogenomics and Human Health | |
| dc.subject | Long-Read Sequencing | pt_PT |
| dc.subject | Whole Human Genome Sequencing | pt_PT |
| dc.subject | Structural Variation | pt_PT |
| dc.subject | Bioinformatics | pt_PT |
| dc.subject | Nanopore Sequencing | pt_PT |
| dc.subject | Tecnologias de Análise de DNA | pt_PT |
| dc.title | Detection of copy number variants in the human genome: Is long-read sequencing an alternative to genomic microarrays? | pt_PT |
| dc.type | conference object | |
| dspace.entity.type | Publication | |
| oaire.awardTitle | National Facility for genome sequencing and analysis | |
| oaire.awardTitle | Centre for Toxicogenomics and Human Health | |
| oaire.awardURI | info:eu-repo/grantAgreement/FCT/9444 - RNIIIE/PINFRA%2F22184%2F2016/PT | |
| oaire.awardURI | info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UID%2FBIM%2F00009%2F2019/PT | |
| oaire.citation.conferencePlace | Lisboa, Portugal | pt_PT |
| oaire.citation.title | 3rd GenomePT Symposium 2023 - Symposium of the National Research infrastructure for Genome Sequencing and Analysis, 17 novembro 2023 | pt_PT |
| oaire.fundingStream | 9444 - RNIIIE | |
| oaire.fundingStream | 6817 - DCRRNI ID | |
| person.familyName | Silva | |
| person.familyName | de Oliveira Alves Ferrão | |
| person.familyName | Nunes Lopes Marques | |
| person.familyName | Rodrigues | |
| person.familyName | Vieira | |
| person.givenName | Catarina | |
| person.givenName | José Cândido | |
| person.givenName | Bárbara Sofia | |
| person.givenName | António Sebastião | |
| person.givenName | Luís | |
| person.identifier | A-1930-2013 | |
| person.identifier.ciencia-id | F316-76B8-6216 | |
| person.identifier.ciencia-id | C01F-F31D-F997 | |
| person.identifier.ciencia-id | 581A-57C7-6B58 | |
| person.identifier.ciencia-id | 9012-44BE-F79D | |
| person.identifier.orcid | 0000-0002-0864-2572 | |
| person.identifier.orcid | 0000-0002-2553-7467 | |
| person.identifier.orcid | 0000-0002-4392-4858 | |
| person.identifier.orcid | 0000-0002-8139-4595 | |
| person.identifier.orcid | 0000-0002-7703-1409 | |
| person.identifier.scopus-author-id | 36130557500 | |
| person.identifier.scopus-author-id | 55781566100 | |
| project.funder.identifier | http://doi.org/10.13039/501100001871 | |
| project.funder.identifier | http://doi.org/10.13039/501100001871 | |
| project.funder.name | Fundação para a Ciência e a Tecnologia | |
| project.funder.name | Fundação para a Ciência e a Tecnologia | |
| rcaap.rights | restrictedAccess | pt_PT |
| rcaap.type | conferenceObject | pt_PT |
| relation.isAuthorOfPublication | f24561e7-7c4b-401a-8f1c-262ab3368649 | |
| relation.isAuthorOfPublication | 0a8de695-b13b-44b9-9427-f2356ea01c17 | |
| relation.isAuthorOfPublication | 779bb189-5d52-46db-98db-1642542a4519 | |
| relation.isAuthorOfPublication | 50eaa538-742f-4a74-bb9b-8aede14f0451 | |
| relation.isAuthorOfPublication | ed1872ad-c157-4863-9cb7-2dc8b93d6cce | |
| relation.isAuthorOfPublication.latestForDiscovery | ed1872ad-c157-4863-9cb7-2dc8b93d6cce | |
| relation.isProjectOfPublication | 3b8a803c-37e5-4354-93f5-6410d7fb2af7 | |
| relation.isProjectOfPublication | da24f146-9eb6-459d-8a8d-8adef9929107 | |
| relation.isProjectOfPublication.latestForDiscovery | da24f146-9eb6-459d-8a8d-8adef9929107 |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- Silvaetal_GPT2023.pdf
- Size:
- 1.25 MB
- Format:
- Adobe Portable Document Format
- Description:
- Poster
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description:
