<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.limswiki.org/index.php?action=history&amp;feed=atom&amp;title=Journal%3ABuilding_infrastructure_for_African_human_genomic_data_management</id>
	<title>Journal:Building infrastructure for African human genomic data management - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://www.limswiki.org/index.php?action=history&amp;feed=atom&amp;title=Journal%3ABuilding_infrastructure_for_African_human_genomic_data_management"/>
	<link rel="alternate" type="text/html" href="https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;action=history"/>
	<updated>2026-04-05T05:30:45Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.36.1</generator>
	<entry>
		<id>https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36854&amp;oldid=prev</id>
		<title>Shawndouglas: Finished adding rest of content.</title>
		<link rel="alternate" type="text/html" href="https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36854&amp;oldid=prev"/>
		<updated>2019-11-11T23:05:12Z</updated>

		<summary type="html">&lt;p&gt;Finished adding rest of content.&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 23:05, 11 November 2019&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l19&quot;&gt;Line 19:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 19:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;|download     = [https://datascience.codata.org/articles/10.5334/dsj-2019-047/galley/894/download/ https://datascience.codata.org/articles/10.5334/dsj-2019-047/galley/894/download/] (PDF)&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;|download     = [https://datascience.codata.org/articles/10.5334/dsj-2019-047/galley/894/download/ https://datascience.codata.org/articles/10.5334/dsj-2019-047/galley/894/download/] (PDF)&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;}}&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;}}&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;{{ombox	 &lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;| type      = notice	 &lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;| image     = [[Image:Emblem-important-yellow.svg|40px]]	 &lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;| style     = width: 500px;	 &lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;| text      = This article should be considered a work in progress and incomplete. Consider this article incomplete until this notice is removed.	 &lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;}}&lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==Abstract==&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==Abstract==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Human [[Genomics|genomic]] data are large and complex, and require adequate infrastructure for secure storage and transfer. The [[National Institutes of Health]] (NIH) and The Wellcome Trust have funded multiple projects on genomic research, including the Human Heredity and Health in Africa (H3Africa) initiative, and data are required to be deposited into the public domain. The European Genome-phenome Archive (EGA) is a repository for [[Sequencing|sequence]] and genotype data where data access is controlled by access committees. Access is determined by a formal application procedure for the purpose of secure storage and distribution, which must be in line with the informed consent of the study participants. H3Africa researchers based in Africa and generating their own data can benefit tremendously from the data sharing capabilities of the internet by using the appropriate technologies. The H3Africa Data Archive is an effort between the H3Africa data generating projects, H3ABioNet, and the EGA to store and submit genomic data to public repositories. H3ABioNet maintains the security of the H3Africa Data Archive, ensures ethical security compliance, supports users with data submission, and facilitates data transfers. The goal is to ensure efficient data flow between researchers, the archive, and the EGA or other public repositories. To comply with the H3Africa data sharing and release policy, nine months after the data is in secure storage, H3ABioNet converts the data into an Extensible Markup Language (XML) format ready for submission to EGA. This article describes the infrastructure that has been developed for African human genomic data management.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Human [[Genomics|genomic]] data are large and complex, and require adequate infrastructure for secure storage and transfer. The [[National Institutes of Health]] (NIH) and The Wellcome Trust have funded multiple projects on genomic research, including the Human Heredity and Health in Africa (H3Africa) initiative, and data are required to be deposited into the public domain. The European Genome-phenome Archive (EGA) is a repository for [[Sequencing|sequence]] and genotype data where data access is controlled by access committees. Access is determined by a formal application procedure for the purpose of secure storage and distribution, which must be in line with the informed consent of the study participants. H3Africa researchers based in Africa and generating their own data can benefit tremendously from the data sharing capabilities of the internet by using the appropriate technologies. The H3Africa Data Archive is an effort between the H3Africa data generating projects, H3ABioNet, and the EGA to store and submit genomic data to public repositories. H3ABioNet maintains the security of the H3Africa Data Archive, ensures ethical security compliance, supports users with data submission, and facilitates data transfers. The goal is to ensure efficient data flow between researchers, the archive, and the EGA or other public repositories. To comply with the H3Africa data sharing and release policy, nine months after the data is in secure storage, H3ABioNet converts the data into an Extensible Markup Language (XML) format ready for submission to EGA. This article describes the infrastructure that has been developed for African human genomic data management.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l202&quot;&gt;Line 202:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 197:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Data submitted via the internet is ingested into the Vault from the Landing Area (Table 2, step 1). Likewise, data that is due to be submitted to the EGA is moved from the Vault to the Landing Area. Due to network design, the only method of moving data between the Vault and Landing Area is via the network. It is not currently possible to connect external USB storage to the Landing area. The slowest data transfer speeds are recorded when transferring data from a local server via the internal network to the EGA. This is largely due to the various firewalls and packet inspection tools implemented along the data transfer path. As expected, connecting a high speed USB storage device directly to the Vault server yields higher transfer rates. The data transfer rate shown in Table 2 (step 3) was conducted using a USB 3.0 enabled storage device. The Landing Area is located in the institutional DMZ, as data transfers to and from this server have been optimized for internet based data transfers which yields the fastest transfer speeds.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Data submitted via the internet is ingested into the Vault from the Landing Area (Table 2, step 1). Likewise, data that is due to be submitted to the EGA is moved from the Vault to the Landing Area. Due to network design, the only method of moving data between the Vault and Landing Area is via the network. It is not currently possible to connect external USB storage to the Landing area. The slowest data transfer speeds are recorded when transferring data from a local server via the internal network to the EGA. This is largely due to the various firewalls and packet inspection tools implemented along the data transfer path. As expected, connecting a high speed USB storage device directly to the Vault server yields higher transfer rates. The data transfer rate shown in Table 2 (step 3) was conducted using a USB 3.0 enabled storage device. The Landing Area is located in the institutional DMZ, as data transfers to and from this server have been optimized for internet based data transfers which yields the fastest transfer speeds.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;As is evident from Table 2, data transfers are a challenge when internet speeds are slow and resources are limited. The data archive’s preferred online data transfer mechanism is “Globus Online,” which uses GridFTP &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;(&lt;/del&gt;Foster &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;et al&lt;/del&gt;. 2011&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;)&lt;/del&gt;, while EGA uses Aspera. The Aspera and Globus Online (GO) data transfer applications are optimized to efficiently and securely transfer data between two points on a public or private network. &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;(&lt;/del&gt;Madduri et al. 2014&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;) &lt;/del&gt;Both applications have security and fault recovery built into the system, which sets it apart from traditional data transfer methods such as FTP (file transfer protocol). The fault recovery measures work by setting checkpoints as data is successfully delivered to its destination. In the event of a network failure, when the transfer is restarted, GO or Aspera will pick up from the last checkpoint, compared to FTP, which would require restarting the entire copy.&amp;lt;ref name=&amp;quot;LappalainenTheEuro15&amp;quot; /&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;As is evident from Table 2, data transfers are a challenge when internet speeds are slow and resources are limited. The data archive’s preferred online data transfer mechanism is “Globus Online,” which uses GridFTP&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;ref name=&amp;quot;FosterGlobus11&amp;quot;&amp;gt;{{cite journal |title=Globus Online: Accelerating and Democratizing Science through Cloud-Based Services |journal=IEEE Internet Computing |author=&lt;/ins&gt;Foster&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;, I. |volume=15 |issue=3 |pages=70–73 |year=2011 |doi=10.1109/MIC&lt;/ins&gt;.2011&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;.64}}&amp;lt;/ref&amp;gt;&lt;/ins&gt;, while EGA uses Aspera. The Aspera and Globus Online (GO) data transfer applications are optimized to efficiently and securely transfer data between two points on a public or private network.&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;ref name=&amp;quot;MadduriExperi14&amp;quot;&amp;gt;{{cite journal |title=Experiences Building Globus Genomics: A Next-Generation Sequencing Analysis Service using Galaxy, Globus, and Amazon Web Services |journal=Concurrency and Computing: Practice &amp;amp; Experience |author=&lt;/ins&gt;Madduri&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;, R.K.; Sulakhe, D.; Lacinski, L. &lt;/ins&gt;et al. &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|volume=26 |issue=13 |pages=2266-2279 |year=&lt;/ins&gt;2014 &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|doi=10.1002/cpe.3274 |pmid=25342933 |pmc=PMC4203657}}&amp;lt;/ref&amp;gt; &lt;/ins&gt;Both applications have security and fault recovery built into the system, which sets it apart from traditional data transfer methods such as FTP (file transfer protocol). The fault recovery measures work by setting checkpoints as data is successfully delivered to its destination. In the event of a network failure, when the transfer is restarted, GO or Aspera will pick up from the last checkpoint, compared to FTP, which would require restarting the entire copy.&amp;lt;ref name=&amp;quot;LappalainenTheEuro15&amp;quot; /&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;===Encryption===&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;The primary aim of data encryption is to secure the genomic data.&amp;lt;ref name=&amp;quot;MahdiSecure18&amp;quot;&amp;gt;{{cite journal |title=Secure Similar Patients Query on Encrypted Genomic Data |journal=IEEE Journal of Biomedical and Health Informatics |author=Mahdi, M.S.R.; Aziz, M.M.A.; Alhadidi, D. et al. |volume=23 |issue=6 |pages=2611-2618 |year=2019 |doi=10.1109/JBHI.2018.2881086 |pmid=30442622}}&amp;lt;/ref&amp;gt; Encryption works by encoding data using a secret private key. The data is only accessible or readable when using the matching public key to decrypt it. Without the private key, the data is inaccessible, making it a suitable method of securing data whilst in transit across public networks such as the internet. Encryption to EGA has a separate public key which is built into the EGACryptor tool. A major challenge of encryption is that it takes longer to encrypt or decrypt a file compared to using a standard password to protect the file. It also requires additional storage space, up to three times the size of the raw data. This is not much of an issue when working with small data sets, but for larger data sets, such as genomic data, storage space for encryption becomes an important factor. To encrypt a terabyte of data can take approximately an hour and thirty minutes. This varies depending on the file type and amount of resources used on the server at a particular time.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;===Archive dashboard===&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;The Archive Dashboard is a web application that was built to keep track of data submissions from the data submitters. The dashboard tracks all the progress of the submissions in an intuitive user friendly interface. A user is able to register, log in, and fill in a data submission request form. The HDAT will respond by assisting the data submitter with the submission pack and file formats. Funders or project managers can log in to the dashboard with different access rights to view the progress of data from first engagement to submission to the EGA.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;===Current status of the Archive===&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;At the time of writing, a total of nine data sets have been submitted to the H3Africa Archive and the EGA. The total size of all the data sets submitted is 118.7 terabytes, with the average data set being 13.2 terabytes. There are currently two studies that have been submitted to the EGA. The AWI-Gen Pilot Study (accession number: EGAS00001002482) is accessible via the EGA, and the H3Africa Chip (accession ID: EGAS00001002976) is currently under embargo and expected to be accessible to the greater scientific community soon. More datasets are expected to be submitted to the H3Africa Data Archive in the near future.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;==Discussion and conclusions==&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;In order to implement the H3Africa data sharing policies, we developed what to our knowledge is the first human genomic data archive in Africa. There were many challenges encountered in the development of the infrastructure, most notably in data transfers when moving data around the globe and specifically across the African continent. Common challenges include:&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;* Researchers or data owners not wanting to share their data&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;* Communication issues&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;* Technical issues, such as slow internet speeds or expensive bandwidth&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;* Available server compute resources, available storage space, or familiarity with data transfer technologies&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;* Data governance which restricts the movement of genomic data across borders&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;The H3Africa Archive, though developed to address internal needs of the consortium, provides a useful proof of concept for the possibility of establishing local EGA facilities. It was designed based on the EGA architecture, and it ensures data security and conversion into EGA formats. A similar infrastructure could be used for other genomic data, where an archive of files is required with built-in secure storage, off-site replication, and data transfer procedures. Our experience has demonstrated that significant long-term resources are required for such an infrastructure, including both human and computational. We also recognize the value in data sharing initiatives as researchers and funders move increasingly to an open science ethos. Researchers from the project sites can benefit tremendously from the data sharing capabilities of the internet. In addition to having access to international data sets, by submitting their data to public data archives such as the EGA, they expose their research to the greater scientific community, which in itself holds many benefits.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;==Acknowledgements==&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;===Competing interests===&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;The authors have no competing interests to declare.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==References==&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==References==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key limswiki:diff::1.12:old-36853:rev-36854 --&gt;
&lt;/table&gt;</summary>
		<author><name>Shawndouglas</name></author>
	</entry>
	<entry>
		<id>https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36853&amp;oldid=prev</id>
		<title>Shawndouglas: Saving and adding more.</title>
		<link rel="alternate" type="text/html" href="https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36853&amp;oldid=prev"/>
		<updated>2019-11-11T22:17:52Z</updated>

		<summary type="html">&lt;p&gt;Saving and adding more.&lt;/p&gt;
&lt;a href=&quot;https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;amp;diff=36853&amp;amp;oldid=36848&quot;&gt;Show changes&lt;/a&gt;</summary>
		<author><name>Shawndouglas</name></author>
	</entry>
	<entry>
		<id>https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36848&amp;oldid=prev</id>
		<title>Shawndouglas: Saving and adding more.</title>
		<link rel="alternate" type="text/html" href="https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36848&amp;oldid=prev"/>
		<updated>2019-11-11T21:22:37Z</updated>

		<summary type="html">&lt;p&gt;Saving and adding more.&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 21:22, 11 November 2019&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l31&quot;&gt;Line 31:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 31:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==Introduction==&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==Introduction==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Advances in high-throughput [[Genomics|genomic]] technologies are laying the foundations for the goal of precision medicine to be realized.&amp;lt;ref name=&amp;quot;ChristensenAssessing15&amp;quot;&amp;gt;{{cite journal |title=Assessing the Costs and Cost-Effectiveness of Genomic Sequencing |journal=Journal of Personalized Medicine |author=Christensen, K.D.; Dukhovny, D.; Siebert, U. et al. |volume=5 |issue=4 |pages=470–86 |year=2015 |doi=10.3390/jpm5040470 |pmid=26690481 |pmc=PMC4695866}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;AronsonBuilding15&amp;quot;&amp;gt;{{cite journal |title=Building the foundation for genomics in precision medicine |journal=Nature |author=Aronson, S.J.; Rehm, H.L. |volume=526 |issue=7573 |pages=336–42 |year=2015 |doi=10.1038/nature15816 |pmid=26469044 |pmc=PMC5669797}}&amp;lt;/ref&amp;gt; Decreasing costs and the capacity to generate larger volumes of human genomic data at faster rates are enabling population-level genomics studies to be conducted.&amp;lt;ref name=&amp;quot;GoldfederHuman17&amp;quot;&amp;gt;{{cite journal |title=Human Genome Sequencing at the Population Scale: A Primer on High-Throughput DNA Sequencing and Analysis |journal=American Journal of Epidemiology |author=Goldfeder, R.L.; Wall, D.P.; Khoury, M.J. et al. |volume=186 |issue=8 |pages=1000–1009 |year=2017 |doi=10.1093/aje/kww224 |pmid=29040395 |pmc=PMC6250075}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;ProkopGenome18&amp;quot;&amp;gt;{{cite journal |title=Genome sequencing in the clinic: the past, present, and future of genomic medicine |journal=Physiological Genomics |author=Prokop. J.W.; May, T.; Strong, K. et al. |volume=50 |issue=8 |pages=563–79 |year=2018 |doi=10.1152/physiolgenomics.00046.2018 |pmid=29727589 |pmc=PMC6139636}}&amp;lt;/ref&amp;gt; However, most of the current population-level genomics studies and data generated to date have a significant population representational bias, with the majority of genome sequences being derived from European and North American ancestry, regions that have been early adopters of genomic technologies.&amp;lt;ref name=&amp;quot;ProkopGenome18&amp;quot; /&amp;gt;&amp;lt;ref name=&amp;quot;PopejoyGenomics16&amp;quot;&amp;gt;{{cite journal |title=Genomics is failing on diversity |journal=Nature |author=Popejoy, A.B.; Fullerton, S.M. |volume=538 |issue=7624 |pages=161–64 |year=2016 |doi=10.1038/538161a |pmid=27734877 |pmc=PMC5089703}}&amp;lt;/ref&amp;gt; African researchers, in general, have been late adopters of high-throughput technologies for use in population genomics due to more limited resources and funding. To address this critical gap in scientific knowledge about African genomics and population variation, and inspired by the African Society for Human Genetics, the [[National Institutes of Health]] (NIH) and The Wellcome Trust, through the Human Hereditary and Health in Africa (H3Africa) program, have funded multiple genomics projects led by African investigators.&amp;lt;ref name=&amp;quot;H3AfricaResearch14&amp;quot;&amp;gt;{{cite journal |title=Research capacity. Enabling the genomic revolution in Africa |journal=Science |author=H3Africa Consortium; Rotimi, C.; Abayomi, A. et al. |volume=344 |issue=6190 |pages=1346–8 |year=2014 |doi=10.1126/science.1251546 |pmid=24948725 |pmc=PMC4138491}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;MulderH3Africa18&amp;quot;&amp;gt;{{cite journal |title=H3Africa: Current perspectives |journal=Pharmacogenomics and Personalized Medicine |author=Mulder, N.; Abimiku, A.; Adebamowo, S.N. et al. |volume=11 |pages=59–86 |year=2018 |doi=10.2147/PGPM.S141546 |pmid=29692621 |pmc=PMC5903476}}&amp;lt;/ref&amp;gt; To support the H3Africa projects in terms of provisioning of infrastructure for secure data storage, management, and computing, the NIH has also funded a Pan-African [[Bioinformatics]] Network for H3Africa ([http://www.h3abionet.org H3ABioNet]).&amp;lt;ref name=&amp;quot;MulderH3ABioNet16&amp;quot;&amp;gt;{{cite journal |title=H3ABioNet, a sustainable pan-African bioinformatics network for human heredity and health in Africa |journal=Genome Research |author=Mulder, N.J.; Adebiyi, E.; Alami, R. et al. |volume=26 |issue=2 |pages=271–7 |year=2016 |doi=10.1101/gr.196295.115 |pmid=26627985 |pmc=PMC4728379}}&amp;lt;/ref&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Advances in high-throughput [[Genomics|genomic]] technologies are laying the foundations for the goal of precision medicine to be realized.&amp;lt;ref name=&amp;quot;ChristensenAssessing15&amp;quot;&amp;gt;{{cite journal |title=Assessing the Costs and Cost-Effectiveness of Genomic Sequencing |journal=Journal of Personalized Medicine |author=Christensen, K.D.; Dukhovny, D.; Siebert, U. et al. |volume=5 |issue=4 |pages=470–86 |year=2015 |doi=10.3390/jpm5040470 |pmid=26690481 |pmc=PMC4695866}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;AronsonBuilding15&amp;quot;&amp;gt;{{cite journal |title=Building the foundation for genomics in precision medicine |journal=Nature |author=Aronson, S.J.; Rehm, H.L. |volume=526 |issue=7573 |pages=336–42 |year=2015 |doi=10.1038/nature15816 |pmid=26469044 |pmc=PMC5669797}}&amp;lt;/ref&amp;gt; Decreasing costs and the capacity to generate larger volumes of human genomic data at faster rates are enabling population-level genomics studies to be conducted.&amp;lt;ref name=&amp;quot;GoldfederHuman17&amp;quot;&amp;gt;{{cite journal |title=Human Genome Sequencing at the Population Scale: A Primer on High-Throughput DNA Sequencing and Analysis |journal=American Journal of Epidemiology |author=Goldfeder, R.L.; Wall, D.P.; Khoury, M.J. et al. |volume=186 |issue=8 |pages=1000–1009 |year=2017 |doi=10.1093/aje/kww224 |pmid=29040395 |pmc=PMC6250075}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;ProkopGenome18&amp;quot;&amp;gt;{{cite journal |title=Genome sequencing in the clinic: the past, present, and future of genomic medicine |journal=Physiological Genomics |author=Prokop. J.W.; May, T.; Strong, K. et al. |volume=50 |issue=8 |pages=563–79 |year=2018 |doi=10.1152/physiolgenomics.00046.2018 |pmid=29727589 |pmc=PMC6139636}}&amp;lt;/ref&amp;gt; However, most of the current population-level genomics studies and data generated to date have a significant population representational bias, with the majority of genome &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;[[Sequencing|&lt;/ins&gt;sequences&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;]] &lt;/ins&gt;being derived from European and North American ancestry, regions that have been early adopters of genomic technologies.&amp;lt;ref name=&amp;quot;ProkopGenome18&amp;quot; /&amp;gt;&amp;lt;ref name=&amp;quot;PopejoyGenomics16&amp;quot;&amp;gt;{{cite journal |title=Genomics is failing on diversity |journal=Nature |author=Popejoy, A.B.; Fullerton, S.M. |volume=538 |issue=7624 |pages=161–64 |year=2016 |doi=10.1038/538161a |pmid=27734877 |pmc=PMC5089703}}&amp;lt;/ref&amp;gt; African researchers, in general, have been late adopters of high-throughput technologies for use in population genomics due to more limited resources and funding. To address this critical gap in scientific knowledge about African genomics and population variation, and inspired by the African Society for Human Genetics, the [[National Institutes of Health]] (NIH) and The Wellcome Trust, through the Human Hereditary and Health in Africa (H3Africa) program, have funded multiple genomics projects led by African investigators.&amp;lt;ref name=&amp;quot;H3AfricaResearch14&amp;quot;&amp;gt;{{cite journal |title=Research capacity. Enabling the genomic revolution in Africa |journal=Science |author=H3Africa Consortium; Rotimi, C.; Abayomi, A. et al. |volume=344 |issue=6190 |pages=1346–8 |year=2014 |doi=10.1126/science.1251546 |pmid=24948725 |pmc=PMC4138491}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;MulderH3Africa18&amp;quot;&amp;gt;{{cite journal |title=H3Africa: Current perspectives |journal=Pharmacogenomics and Personalized Medicine |author=Mulder, N.; Abimiku, A.; Adebamowo, S.N. et al. |volume=11 |pages=59–86 |year=2018 |doi=10.2147/PGPM.S141546 |pmid=29692621 |pmc=PMC5903476}}&amp;lt;/ref&amp;gt; To support the H3Africa projects in terms of provisioning of infrastructure for secure data storage, management, and computing, the NIH has also funded a Pan-African [[Bioinformatics]] Network for H3Africa ([http://www.h3abionet.org H3ABioNet]).&amp;lt;ref name=&amp;quot;MulderH3ABioNet16&amp;quot;&amp;gt;{{cite journal |title=H3ABioNet, a sustainable pan-African bioinformatics network for human heredity and health in Africa |journal=Genome Research |author=Mulder, N.J.; Adebiyi, E.; Alami, R. et al. |volume=26 |issue=2 |pages=271–7 |year=2016 |doi=10.1101/gr.196295.115 |pmid=26627985 |pmc=PMC4728379}}&amp;lt;/ref&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The H3Africa Consortium consists of multiple projects and sites distributed across Africa, most of which are generating genomic data linked to clinical data for specific diseases. The principal H3Africa funders (NIH and the Wellcome Trust) require any project data generated to be deposited into a data repository accessible by the scientific community.&amp;lt;ref name=&amp;quot;NIHSharing19&amp;quot;&amp;gt;{{cite web |url=https://grants.nih.gov/policy/sharing.htm |title=NIH Sharing Policies and Related Guidance on NIH-Funded Research Resources |work=Grants &amp;amp; Funding |publisher=National Institutes of Health |date=2019}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;WallcomeData17&amp;quot;&amp;gt;{{cite web |url=https://wellcome.ac.uk/funding/guidance/data-software-materials-management-and-sharing-policy |title=Data, software and materials management and sharing policy |work=Funding |publisher=The Wellcome Trust |date=10 July 2017}}&amp;lt;/ref&amp;gt; In order to facilitate the storage and accessibility of H3Africa genomics data, significant infrastructure, procedures, and policies were established. Part of H3ABioNet’s mandate is to develop processes and implement an infrastructure that will enable the ingestion, validation, annotation, secure storage, and submission of the African genomics data to the controlled access European Genome-phenome Archive (EGA).&amp;lt;ref name=&amp;quot;LappalainenTheEuro15&amp;quot;&amp;gt;{{cite journal |title=The European Genome-phenome Archive of human data consented for biomedical research |journal=Nature Genetics |author=Lappalainen, I.; Almeida-King, J.; Kumanduri, V. et al. |volume=47 |issue=7 |pages=692–5 |year=2015 |doi=10.1038/ng.3312 |pmid=26111507 |pmc=PMC5426533}}&amp;lt;/ref&amp;gt; This has been achieved through the development of the H3Africa Data Archive, which also ensures a copy of the genomic data is securely stored and retained on the African continent.&amp;lt;ref name=&amp;quot;MulderH3ABioNet16&amp;quot; /&amp;gt;&amp;lt;ref name=&amp;quot;MulderDevelop17&amp;quot;&amp;gt;{{cite journal |title=Development of Bioinformatics Infrastructure for Genomics Research |journal=Global Heart |author=Mulder, N.J.; Adebiyi, E.; Adebiyi, M. et al. |volume=12 |issue=2 |pages=91–98 |year=2017 |doi=10.1016/j.gheart.2017.01.005 |pmid=28302555 |pmc=PMC5582980}}&amp;lt;/ref&amp;gt; This article describes the infrastructure that has been developed, which to our knowledge, is the first formalized human genomic data archive on the continent.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The H3Africa Consortium consists of multiple projects and sites distributed across Africa, most of which are generating genomic data linked to clinical data for specific diseases. The principal H3Africa funders (NIH and the Wellcome Trust) require any project data generated to be deposited into a data repository accessible by the scientific community.&amp;lt;ref name=&amp;quot;NIHSharing19&amp;quot;&amp;gt;{{cite web |url=https://grants.nih.gov/policy/sharing.htm |title=NIH Sharing Policies and Related Guidance on NIH-Funded Research Resources |work=Grants &amp;amp; Funding |publisher=National Institutes of Health |date=2019}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;WallcomeData17&amp;quot;&amp;gt;{{cite web |url=https://wellcome.ac.uk/funding/guidance/data-software-materials-management-and-sharing-policy |title=Data, software and materials management and sharing policy |work=Funding |publisher=The Wellcome Trust |date=10 July 2017}}&amp;lt;/ref&amp;gt; In order to facilitate the storage and accessibility of H3Africa genomics data, significant infrastructure, procedures, and policies were established. Part of H3ABioNet’s mandate is to develop processes and implement an infrastructure that will enable the ingestion, validation, annotation, secure storage, and submission of the African genomics data to the controlled access European Genome-phenome Archive (EGA).&amp;lt;ref name=&amp;quot;LappalainenTheEuro15&amp;quot;&amp;gt;{{cite journal |title=The European Genome-phenome Archive of human data consented for biomedical research |journal=Nature Genetics |author=Lappalainen, I.; Almeida-King, J.; Kumanduri, V. et al. |volume=47 |issue=7 |pages=692–5 |year=2015 |doi=10.1038/ng.3312 |pmid=26111507 |pmc=PMC5426533}}&amp;lt;/ref&amp;gt; This has been achieved through the development of the H3Africa Data Archive, which also ensures a copy of the genomic data is securely stored and retained on the African continent.&amp;lt;ref name=&amp;quot;MulderH3ABioNet16&amp;quot; /&amp;gt;&amp;lt;ref name=&amp;quot;MulderDevelop17&amp;quot;&amp;gt;{{cite journal |title=Development of Bioinformatics Infrastructure for Genomics Research |journal=Global Heart |author=Mulder, N.J.; Adebiyi, E.; Adebiyi, M. et al. |volume=12 |issue=2 |pages=91–98 |year=2017 |doi=10.1016/j.gheart.2017.01.005 |pmid=28302555 |pmc=PMC5582980}}&amp;lt;/ref&amp;gt; This article describes the infrastructure that has been developed, which to our knowledge, is the first formalized human genomic data archive on the continent.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l40&quot;&gt;Line 40:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 40:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;===Data submission and access policy===&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;===Data submission and access policy===&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Genomic data associated with phenotype data enables the possibility of re-identification of study participants; hence all human genomic data and its accompanying phenotypic data need to be governed by a controlled access policy.&amp;lt;ref name=&amp;quot;ShabaniControl15&amp;quot;&amp;gt;{{cite journal |title=Controlled Access under Review: Improving the Governance of Genomic Data Access |journal=PLoS Biology |author=Shabani, M.; Dyke, S.O.; Joly, Y. et al. |volume=13 |issue=12 |pages=e1002339 |year=2015 |doi=10.1371/journal.pbio.1002339 |pmid=26720729 |pmc=PMC4697814}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;DykeRegist18&amp;quot;&amp;gt;{{cite journal |title=Registered access: Authorizing data access |journal=European Journal of Human Genetics |author=Dyke, S.O.M.; Linden, M.; Lappalainen, I. et al. |volume=26 |issue=12 |pages=1721-1731 |year=2018 |doi=10.1038/s41431-018-0219-y |pmid=30069064 |pmc=PMC6244209}}&amp;lt;/ref&amp;gt; H3Africa is distinguished as biospecimens are also being collected and stored at one of the three H3Africa biorepositories, so researchers can request access to genomic data and/or biospecimens. As a consortium, H3Africa has developed its own data submission and access policy, which takes into account the genomic and phenotype data generated, as well as policies for the access to and transfer of biospecimens.&amp;lt;ref name=&amp;quot;deVriesTheH3Africa15&amp;quot;&amp;gt;{{cite journal |title=The H3Africa policy framework: negotiating fairness in genomics |journal=Trends in Genetics |author=de Vries, J.; Tindana, P;. Littler, K. et al. |volume=31 |issue=3 |pages=117-9 |year=2015 |doi=10.1016/j.tig.2014.11.004 |pmid=25601285 |pmc=PMC4471134}}&amp;lt;/ref&amp;gt; A single H3Africa Data and Biospecimen Access committee has been established to oversee the secondary use of both the data, which is being deposited in the EGA, and biospecimens in the H3Africa biorepositories. The sharing and access policies and the H3Africa Data and Biospecimen Access Committee guidelines seek to provide a balance between protecting the rights of individuals and their data, while at the same time not acting as a barrier to advancing scientific knowledge. A data requester will need to identify the data in the EGA and apply for data access.&amp;lt;ref name=&amp;quot;LappalainenTheEuro15&amp;quot; /&amp;gt; The data access request is routed to the Data and Biospecimen Access Committee (DBAC), who review it to determine whether the intended research use is inline with the H3Africa data and access policy, and ensure the requester is a bona fide researcher. Once the data request has been reviewed, the H3Africa DBAC will provide a decision to approve or reject it.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Genomic data associated with phenotype data enables the possibility of re-identification of study participants; hence all human genomic data and its accompanying phenotypic data need to be governed by a controlled access policy.&amp;lt;ref name=&amp;quot;ShabaniControl15&amp;quot;&amp;gt;{{cite journal |title=Controlled Access under Review: Improving the Governance of Genomic Data Access |journal=PLoS Biology |author=Shabani, M.; Dyke, S.O.; Joly, Y. et al. |volume=13 |issue=12 |pages=e1002339 |year=2015 |doi=10.1371/journal.pbio.1002339 |pmid=26720729 |pmc=PMC4697814}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;DykeRegist18&amp;quot;&amp;gt;{{cite journal |title=Registered access: Authorizing data access |journal=European Journal of Human Genetics |author=Dyke, S.O.M.; Linden, M.; Lappalainen, I. et al. |volume=26 |issue=12 |pages=1721-1731 |year=2018 |doi=10.1038/s41431-018-0219-y |pmid=30069064 |pmc=PMC6244209}}&amp;lt;/ref&amp;gt; H3Africa is distinguished as biospecimens are also being collected and stored at one of the three H3Africa biorepositories, so researchers can request access to genomic data and/or biospecimens. As a consortium, H3Africa has developed its own data submission and access policy, which takes into account the genomic and phenotype data generated, as well as policies for the access to and transfer of biospecimens.&amp;lt;ref name=&amp;quot;deVriesTheH3Africa15&amp;quot;&amp;gt;{{cite journal |title=The H3Africa policy framework: negotiating fairness in genomics |journal=Trends in Genetics |author=de Vries, J.; Tindana, P;. Littler, K. et al. |volume=31 |issue=3 |pages=117-9 |year=2015 |doi=10.1016/j.tig.2014.11.004 |pmid=25601285 |pmc=PMC4471134}}&amp;lt;/ref&amp;gt; A single H3Africa Data and Biospecimen Access committee has been established to oversee the secondary use of both the data, which is being deposited in the EGA, and biospecimens in the H3Africa biorepositories. The sharing and access policies and the H3Africa Data and Biospecimen Access Committee guidelines seek to provide a balance between protecting the rights of individuals and their data, while at the same time not acting as a barrier to advancing scientific knowledge. A data requester will need to identify the data in the EGA and apply for data access.&amp;lt;ref name=&amp;quot;LappalainenTheEuro15&amp;quot; /&amp;gt; The data access request is routed to the Data and Biospecimen Access Committee (DBAC), who review it to determine whether the intended research use is inline with the H3Africa data and access policy, and ensure the requester is a bona fide researcher. Once the data request has been reviewed, the H3Africa DBAC will provide a decision to approve or reject it.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;===Types of data being accepted===&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;The principal data types being collected for submission to the H3Africa Data Archive and the EGA include genomic sequence data, genotype array files, the associated phenotypes and metadata that is collected along with the samples, and results of any analysis conducted. Genomic sequence data mainly comprises of short DNA sequence reads in FASTQ format.&amp;lt;ref name=&quot;CockTheSanger10&quot;&gt;{{cite journal |title=The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants |journal=Nucleic Acids Research |author=Cock, P.J.; Fields, C.J.; Goto, N. et al. |volume=38 |issue=6 |pages=1767–71 |year=2010 |doi=10.1093/nar/gkp1137 |pmid=20015970 |pmc=PMC2847217}}&amp;lt;/ref&gt;&amp;lt;ref name=&quot;Hsi-YangEffic11&quot;&gt;{{cite journal |title=Efficient storage of high throughput DNA sequencing data using reference-based compression |journal=Genome Research |author=Hsi-Yang Fritz, M.; Leinonen, R.; Cochrane, G. et al. |volume=21 |issue=5 |pages=734–40 |year=2011 |doi=10.1101/gr.114819.110 |pmid=21245279 |pmc=PMC3083090}}&amp;lt;/ref&gt; The types of data and associated files for the H3Africa research projects are summarized in Table 1.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;{| &lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; | STYLE=&quot;vertical-align:top;&quot;|&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;{| class=&quot;wikitable&quot; border=&quot;1&quot; cellpadding=&quot;5&quot; cellspacing=&quot;0&quot; width=&quot;100%&quot;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; |-&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot; colspan=&quot;3&quot;|'''Table 1.''' Description of data types for submission&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; |-&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  ! style=&quot;background-color:#e2e2e2; padding-left:10px; padding-right:10px;&quot; |Exome/Whole Genome Sequence&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  ! style=&quot;background-color:#e2e2e2; padding-left:10px; padding-right:10px;&quot; |16S rRNA Microbiome studies&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  ! style=&quot;background-color:#e2e2e2; padding-left:10px; padding-right:10px;&quot; |Genome Wide Association studies/genotyping arrays&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; |-&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Study type and description	&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Study type and description	&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Study type and description	&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; |-&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Sequencing platform and technology used&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Sequencing platform and technology used&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Genotyping array model/name and description of the software and version used for calling the genotypes&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; |-&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|FASTQ files linked with de-identified participant ID (minus technical reads such as adapters, linkers, barcodes)&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|FASTQ files linked with de-identified participant ID (minus technical reads such as adapters, linkers, barcodes)&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Raw intensity files linked with de-identified participant IDs (IDATs, CELs)&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; |-&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Binary Alignment files (BAMs, de-multiplexed) – linked with participant de-identified ID&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Manifest file describing SNP or probe content on the genotyping array&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; |-&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Associated phenotypic data collected&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Associated phenotypic data collected&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Associated phenotypic data collected&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; |-&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Variant calling files (VCFs)&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Final analyses BIOM files (at minimum must contain OTUs)&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Final reports and analysis files generated&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; |-&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Mapping file indicating the relationship between the submitted files&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Mapping file indicating the relationship between the submitted files&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;  | style=&quot;background-color:white; padding-left:10px; padding-right:10px;&quot;|Mapping file indicating the relationship between the submitted files (completed Array Format template)&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; |-&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|}&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|}&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==References==&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==References==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key limswiki:diff::1.12:old-36847:rev-36848 --&gt;
&lt;/table&gt;</summary>
		<author><name>Shawndouglas</name></author>
	</entry>
	<entry>
		<id>https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36847&amp;oldid=prev</id>
		<title>Shawndouglas: Saving and adding more.</title>
		<link rel="alternate" type="text/html" href="https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36847&amp;oldid=prev"/>
		<updated>2019-11-11T21:05:44Z</updated>

		<summary type="html">&lt;p&gt;Saving and adding more.&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 21:05, 11 November 2019&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l34&quot;&gt;Line 34:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 34:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The H3Africa Consortium consists of multiple projects and sites distributed across Africa, most of which are generating genomic data linked to clinical data for specific diseases. The principal H3Africa funders (NIH and the Wellcome Trust) require any project data generated to be deposited into a data repository accessible by the scientific community.&amp;lt;ref name=&amp;quot;NIHSharing19&amp;quot;&amp;gt;{{cite web |url=https://grants.nih.gov/policy/sharing.htm |title=NIH Sharing Policies and Related Guidance on NIH-Funded Research Resources |work=Grants &amp;amp; Funding |publisher=National Institutes of Health |date=2019}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;WallcomeData17&amp;quot;&amp;gt;{{cite web |url=https://wellcome.ac.uk/funding/guidance/data-software-materials-management-and-sharing-policy |title=Data, software and materials management and sharing policy |work=Funding |publisher=The Wellcome Trust |date=10 July 2017}}&amp;lt;/ref&amp;gt; In order to facilitate the storage and accessibility of H3Africa genomics data, significant infrastructure, procedures, and policies were established. Part of H3ABioNet’s mandate is to develop processes and implement an infrastructure that will enable the ingestion, validation, annotation, secure storage, and submission of the African genomics data to the controlled access European Genome-phenome Archive (EGA).&amp;lt;ref name=&amp;quot;LappalainenTheEuro15&amp;quot;&amp;gt;{{cite journal |title=The European Genome-phenome Archive of human data consented for biomedical research |journal=Nature Genetics |author=Lappalainen, I.; Almeida-King, J.; Kumanduri, V. et al. |volume=47 |issue=7 |pages=692–5 |year=2015 |doi=10.1038/ng.3312 |pmid=26111507 |pmc=PMC5426533}}&amp;lt;/ref&amp;gt; This has been achieved through the development of the H3Africa Data Archive, which also ensures a copy of the genomic data is securely stored and retained on the African continent.&amp;lt;ref name=&amp;quot;MulderH3ABioNet16&amp;quot; /&amp;gt;&amp;lt;ref name=&amp;quot;MulderDevelop17&amp;quot;&amp;gt;{{cite journal |title=Development of Bioinformatics Infrastructure for Genomics Research |journal=Global Heart |author=Mulder, N.J.; Adebiyi, E.; Adebiyi, M. et al. |volume=12 |issue=2 |pages=91–98 |year=2017 |doi=10.1016/j.gheart.2017.01.005 |pmid=28302555 |pmc=PMC5582980}}&amp;lt;/ref&amp;gt; This article describes the infrastructure that has been developed, which to our knowledge, is the first formalized human genomic data archive on the continent.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The H3Africa Consortium consists of multiple projects and sites distributed across Africa, most of which are generating genomic data linked to clinical data for specific diseases. The principal H3Africa funders (NIH and the Wellcome Trust) require any project data generated to be deposited into a data repository accessible by the scientific community.&amp;lt;ref name=&amp;quot;NIHSharing19&amp;quot;&amp;gt;{{cite web |url=https://grants.nih.gov/policy/sharing.htm |title=NIH Sharing Policies and Related Guidance on NIH-Funded Research Resources |work=Grants &amp;amp; Funding |publisher=National Institutes of Health |date=2019}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;WallcomeData17&amp;quot;&amp;gt;{{cite web |url=https://wellcome.ac.uk/funding/guidance/data-software-materials-management-and-sharing-policy |title=Data, software and materials management and sharing policy |work=Funding |publisher=The Wellcome Trust |date=10 July 2017}}&amp;lt;/ref&amp;gt; In order to facilitate the storage and accessibility of H3Africa genomics data, significant infrastructure, procedures, and policies were established. Part of H3ABioNet’s mandate is to develop processes and implement an infrastructure that will enable the ingestion, validation, annotation, secure storage, and submission of the African genomics data to the controlled access European Genome-phenome Archive (EGA).&amp;lt;ref name=&amp;quot;LappalainenTheEuro15&amp;quot;&amp;gt;{{cite journal |title=The European Genome-phenome Archive of human data consented for biomedical research |journal=Nature Genetics |author=Lappalainen, I.; Almeida-King, J.; Kumanduri, V. et al. |volume=47 |issue=7 |pages=692–5 |year=2015 |doi=10.1038/ng.3312 |pmid=26111507 |pmc=PMC5426533}}&amp;lt;/ref&amp;gt; This has been achieved through the development of the H3Africa Data Archive, which also ensures a copy of the genomic data is securely stored and retained on the African continent.&amp;lt;ref name=&amp;quot;MulderH3ABioNet16&amp;quot; /&amp;gt;&amp;lt;ref name=&amp;quot;MulderDevelop17&amp;quot;&amp;gt;{{cite journal |title=Development of Bioinformatics Infrastructure for Genomics Research |journal=Global Heart |author=Mulder, N.J.; Adebiyi, E.; Adebiyi, M. et al. |volume=12 |issue=2 |pages=91–98 |year=2017 |doi=10.1016/j.gheart.2017.01.005 |pmid=28302555 |pmc=PMC5582980}}&amp;lt;/ref&amp;gt; This article describes the infrastructure that has been developed, which to our knowledge, is the first formalized human genomic data archive on the continent.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;==Methods==&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;In order to establish the H3Africa Archive, and to make the submission process seamless, a data storage infrastructure had to be created and new processes and policies developed and adhered to.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;===Data submission and access policy===&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Genomic data associated with phenotype data enables the possibility of re-identification of study participants; hence all human genomic data and its accompanying phenotypic data need to be governed by a controlled access policy.&amp;lt;ref name=&quot;ShabaniControl15&quot;&gt;{{cite journal |title=Controlled Access under Review: Improving the Governance of Genomic Data Access |journal=PLoS Biology |author=Shabani, M.; Dyke, S.O.; Joly, Y. et al. |volume=13 |issue=12 |pages=e1002339 |year=2015 |doi=10.1371/journal.pbio.1002339 |pmid=26720729 |pmc=PMC4697814}}&amp;lt;/ref&gt;&amp;lt;ref name=&quot;DykeRegist18&quot;&gt;{{cite journal |title=Registered access: Authorizing data access |journal=European Journal of Human Genetics |author=Dyke, S.O.M.; Linden, M.; Lappalainen, I. et al. |volume=26 |issue=12 |pages=1721-1731 |year=2018 |doi=10.1038/s41431-018-0219-y |pmid=30069064 |pmc=PMC6244209}}&amp;lt;/ref&gt; H3Africa is distinguished as biospecimens are also being collected and stored at one of the three H3Africa biorepositories, so researchers can request access to genomic data and/or biospecimens. As a consortium, H3Africa has developed its own data submission and access policy, which takes into account the genomic and phenotype data generated, as well as policies for the access to and transfer of biospecimens.&amp;lt;ref name=&quot;deVriesTheH3Africa15&quot;&gt;{{cite journal |title=The H3Africa policy framework: negotiating fairness in genomics |journal=Trends in Genetics |author=de Vries, J.; Tindana, P;. Littler, K. et al. |volume=31 |issue=3 |pages=117-9 |year=2015 |doi=10.1016/j.tig.2014.11.004 |pmid=25601285 |pmc=PMC4471134}}&amp;lt;/ref&gt; A single H3Africa Data and Biospecimen Access committee has been established to oversee the secondary use of both the data, which is being deposited in the EGA, and biospecimens in the H3Africa biorepositories. The sharing and access policies and the H3Africa Data and Biospecimen Access Committee guidelines seek to provide a balance between protecting the rights of individuals and their data, while at the same time not acting as a barrier to advancing scientific knowledge. A data requester will need to identify the data in the EGA and apply for data access.&amp;lt;ref name=&quot;LappalainenTheEuro15&quot; /&gt; The data access request is routed to the Data and Biospecimen Access Committee (DBAC), who review it to determine whether the intended research use is inline with the H3Africa data and access policy, and ensure the requester is a bona fide researcher. Once the data request has been reviewed, the H3Africa DBAC will provide a decision to approve or reject it.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==References==&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==References==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key limswiki:diff::1.12:old-36846:rev-36847 --&gt;
&lt;/table&gt;</summary>
		<author><name>Shawndouglas</name></author>
	</entry>
	<entry>
		<id>https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36846&amp;oldid=prev</id>
		<title>Shawndouglas: Saving and adding more.</title>
		<link rel="alternate" type="text/html" href="https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36846&amp;oldid=prev"/>
		<updated>2019-11-11T20:50:10Z</updated>

		<summary type="html">&lt;p&gt;Saving and adding more.&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 20:50, 11 November 2019&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l32&quot;&gt;Line 32:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 32:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==Introduction==&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==Introduction==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Advances in high-throughput [[Genomics|genomic]] technologies are laying the foundations for the goal of precision medicine to be realized.&amp;lt;ref name=&amp;quot;ChristensenAssessing15&amp;quot;&amp;gt;{{cite journal |title=Assessing the Costs and Cost-Effectiveness of Genomic Sequencing |journal=Journal of Personalized Medicine |author=Christensen, K.D.; Dukhovny, D.; Siebert, U. et al. |volume=5 |issue=4 |pages=470–86 |year=2015 |doi=10.3390/jpm5040470 |pmid=26690481 |pmc=PMC4695866}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;AronsonBuilding15&amp;quot;&amp;gt;{{cite journal |title=Building the foundation for genomics in precision medicine |journal=Nature |author=Aronson, S.J.; Rehm, H.L. |volume=526 |issue=7573 |pages=336–42 |year=2015 |doi=10.1038/nature15816 |pmid=26469044 |pmc=PMC5669797}}&amp;lt;/ref&amp;gt; Decreasing costs and the capacity to generate larger volumes of human genomic data at faster rates are enabling population-level genomics studies to be conducted.&amp;lt;ref name=&amp;quot;GoldfederHuman17&amp;quot;&amp;gt;{{cite journal |title=Human Genome Sequencing at the Population Scale: A Primer on High-Throughput DNA Sequencing and Analysis |journal=American Journal of Epidemiology |author=Goldfeder, R.L.; Wall, D.P.; Khoury, M.J. et al. |volume=186 |issue=8 |pages=1000–1009 |year=2017 |doi=10.1093/aje/kww224 |pmid=29040395 |pmc=PMC6250075}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;ProkopGenome18&amp;quot;&amp;gt;{{cite journal |title=Genome sequencing in the clinic: the past, present, and future of genomic medicine |journal=Physiological Genomics |author=Prokop. J.W.; May, T.; Strong, K. et al. |volume=50 |issue=8 |pages=563–79 |year=2018 |doi=10.1152/physiolgenomics.00046.2018 |pmid=29727589 |pmc=PMC6139636}}&amp;lt;/ref&amp;gt; However, most of the current population-level genomics studies and data generated to date have a significant population representational bias, with the majority of genome sequences being derived from European and North American ancestry, regions that have been early adopters of genomic technologies.&amp;lt;ref name=&amp;quot;ProkopGenome18&amp;quot; /&amp;gt;&amp;lt;ref name=&amp;quot;PopejoyGenomics16&amp;quot;&amp;gt;{{cite journal |title=Genomics is failing on diversity |journal=Nature |author=Popejoy, A.B.; Fullerton, S.M. |volume=538 |issue=7624 |pages=161–64 |year=2016 |doi=10.1038/538161a |pmid=27734877 |pmc=PMC5089703}}&amp;lt;/ref&amp;gt; African researchers, in general, have been late adopters of high-throughput technologies for use in population genomics due to more limited resources and funding. To address this critical gap in scientific knowledge about African genomics and population variation, and inspired by the African Society for Human Genetics, the [[National Institutes of Health]] (NIH) and The Wellcome Trust, through the Human Hereditary and Health in Africa (H3Africa) program, have funded multiple genomics projects led by African investigators.&amp;lt;ref name=&amp;quot;H3AfricaResearch14&amp;quot;&amp;gt;{{cite journal |title=Research capacity. Enabling the genomic revolution in Africa |journal=Science |author=H3Africa Consortium; Rotimi, C.; Abayomi, A. et al. |volume=344 |issue=6190 |pages=1346–8 |year=2014 |doi=10.1126/science.1251546 |pmid=24948725 |pmc=PMC4138491}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;MulderH3Africa18&amp;quot;&amp;gt;{{cite journal |title=H3Africa: Current perspectives |journal=Pharmacogenomics and Personalized Medicine |author=Mulder, N.; Abimiku, A.; Adebamowo, S.N. et al. |volume=11 |pages=59–86 |year=2018 |doi=10.2147/PGPM.S141546 |pmid=29692621 |pmc=PMC5903476}}&amp;lt;/ref&amp;gt; To support the H3Africa projects in terms of provisioning of infrastructure for secure data storage, management, and computing, the NIH has also funded a Pan-African [[Bioinformatics]] Network for H3Africa ([http://www.h3abionet.org H3ABioNet]).&amp;lt;ref name=&amp;quot;MulderH3ABioNet16&amp;quot;&amp;gt;{{cite journal |title=H3ABioNet, a sustainable pan-African bioinformatics network for human heredity and health in Africa |journal=Genome Research |author=Mulder, N.J.; Adebiyi, E.; Alami, R. et al. |volume=26 |issue=2 |pages=271–7 |year=2016 |doi=10.1101/gr.196295.115 |pmid=26627985 |pmc=PMC4728379}}&amp;lt;/ref&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Advances in high-throughput [[Genomics|genomic]] technologies are laying the foundations for the goal of precision medicine to be realized.&amp;lt;ref name=&amp;quot;ChristensenAssessing15&amp;quot;&amp;gt;{{cite journal |title=Assessing the Costs and Cost-Effectiveness of Genomic Sequencing |journal=Journal of Personalized Medicine |author=Christensen, K.D.; Dukhovny, D.; Siebert, U. et al. |volume=5 |issue=4 |pages=470–86 |year=2015 |doi=10.3390/jpm5040470 |pmid=26690481 |pmc=PMC4695866}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;AronsonBuilding15&amp;quot;&amp;gt;{{cite journal |title=Building the foundation for genomics in precision medicine |journal=Nature |author=Aronson, S.J.; Rehm, H.L. |volume=526 |issue=7573 |pages=336–42 |year=2015 |doi=10.1038/nature15816 |pmid=26469044 |pmc=PMC5669797}}&amp;lt;/ref&amp;gt; Decreasing costs and the capacity to generate larger volumes of human genomic data at faster rates are enabling population-level genomics studies to be conducted.&amp;lt;ref name=&amp;quot;GoldfederHuman17&amp;quot;&amp;gt;{{cite journal |title=Human Genome Sequencing at the Population Scale: A Primer on High-Throughput DNA Sequencing and Analysis |journal=American Journal of Epidemiology |author=Goldfeder, R.L.; Wall, D.P.; Khoury, M.J. et al. |volume=186 |issue=8 |pages=1000–1009 |year=2017 |doi=10.1093/aje/kww224 |pmid=29040395 |pmc=PMC6250075}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;ProkopGenome18&amp;quot;&amp;gt;{{cite journal |title=Genome sequencing in the clinic: the past, present, and future of genomic medicine |journal=Physiological Genomics |author=Prokop. J.W.; May, T.; Strong, K. et al. |volume=50 |issue=8 |pages=563–79 |year=2018 |doi=10.1152/physiolgenomics.00046.2018 |pmid=29727589 |pmc=PMC6139636}}&amp;lt;/ref&amp;gt; However, most of the current population-level genomics studies and data generated to date have a significant population representational bias, with the majority of genome sequences being derived from European and North American ancestry, regions that have been early adopters of genomic technologies.&amp;lt;ref name=&amp;quot;ProkopGenome18&amp;quot; /&amp;gt;&amp;lt;ref name=&amp;quot;PopejoyGenomics16&amp;quot;&amp;gt;{{cite journal |title=Genomics is failing on diversity |journal=Nature |author=Popejoy, A.B.; Fullerton, S.M. |volume=538 |issue=7624 |pages=161–64 |year=2016 |doi=10.1038/538161a |pmid=27734877 |pmc=PMC5089703}}&amp;lt;/ref&amp;gt; African researchers, in general, have been late adopters of high-throughput technologies for use in population genomics due to more limited resources and funding. To address this critical gap in scientific knowledge about African genomics and population variation, and inspired by the African Society for Human Genetics, the [[National Institutes of Health]] (NIH) and The Wellcome Trust, through the Human Hereditary and Health in Africa (H3Africa) program, have funded multiple genomics projects led by African investigators.&amp;lt;ref name=&amp;quot;H3AfricaResearch14&amp;quot;&amp;gt;{{cite journal |title=Research capacity. Enabling the genomic revolution in Africa |journal=Science |author=H3Africa Consortium; Rotimi, C.; Abayomi, A. et al. |volume=344 |issue=6190 |pages=1346–8 |year=2014 |doi=10.1126/science.1251546 |pmid=24948725 |pmc=PMC4138491}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;MulderH3Africa18&amp;quot;&amp;gt;{{cite journal |title=H3Africa: Current perspectives |journal=Pharmacogenomics and Personalized Medicine |author=Mulder, N.; Abimiku, A.; Adebamowo, S.N. et al. |volume=11 |pages=59–86 |year=2018 |doi=10.2147/PGPM.S141546 |pmid=29692621 |pmc=PMC5903476}}&amp;lt;/ref&amp;gt; To support the H3Africa projects in terms of provisioning of infrastructure for secure data storage, management, and computing, the NIH has also funded a Pan-African [[Bioinformatics]] Network for H3Africa ([http://www.h3abionet.org H3ABioNet]).&amp;lt;ref name=&amp;quot;MulderH3ABioNet16&amp;quot;&amp;gt;{{cite journal |title=H3ABioNet, a sustainable pan-African bioinformatics network for human heredity and health in Africa |journal=Genome Research |author=Mulder, N.J.; Adebiyi, E.; Alami, R. et al. |volume=26 |issue=2 |pages=271–7 |year=2016 |doi=10.1101/gr.196295.115 |pmid=26627985 |pmc=PMC4728379}}&amp;lt;/ref&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;The H3Africa Consortium consists of multiple projects and sites distributed across Africa, most of which are generating genomic data linked to clinical data for specific diseases. The principal H3Africa funders (NIH and the Wellcome Trust) require any project data generated to be deposited into a data repository accessible by the scientific community.&amp;lt;ref name=&quot;NIHSharing19&quot;&gt;{{cite web |url=https://grants.nih.gov/policy/sharing.htm |title=NIH Sharing Policies and Related Guidance on NIH-Funded Research Resources |work=Grants &amp;amp; Funding |publisher=National Institutes of Health |date=2019}}&amp;lt;/ref&gt;&amp;lt;ref name=&quot;WallcomeData17&quot;&gt;{{cite web |url=https://wellcome.ac.uk/funding/guidance/data-software-materials-management-and-sharing-policy |title=Data, software and materials management and sharing policy |work=Funding |publisher=The Wellcome Trust |date=10 July 2017}}&amp;lt;/ref&gt; In order to facilitate the storage and accessibility of H3Africa genomics data, significant infrastructure, procedures, and policies were established. Part of H3ABioNet’s mandate is to develop processes and implement an infrastructure that will enable the ingestion, validation, annotation, secure storage, and submission of the African genomics data to the controlled access European Genome-phenome Archive (EGA).&amp;lt;ref name=&quot;LappalainenTheEuro15&quot;&gt;{{cite journal |title=The European Genome-phenome Archive of human data consented for biomedical research |journal=Nature Genetics |author=Lappalainen, I.; Almeida-King, J.; Kumanduri, V. et al. |volume=47 |issue=7 |pages=692–5 |year=2015 |doi=10.1038/ng.3312 |pmid=26111507 |pmc=PMC5426533}}&amp;lt;/ref&gt; This has been achieved through the development of the H3Africa Data Archive, which also ensures a copy of the genomic data is securely stored and retained on the African continent.&amp;lt;ref name=&quot;MulderH3ABioNet16&quot; /&gt;&amp;lt;ref name=&quot;MulderDevelop17&quot;&gt;{{cite journal |title=Development of Bioinformatics Infrastructure for Genomics Research |journal=Global Heart |author=Mulder, N.J.; Adebiyi, E.; Adebiyi, M. et al. |volume=12 |issue=2 |pages=91–98 |year=2017 |doi=10.1016/j.gheart.2017.01.005 |pmid=28302555 |pmc=PMC5582980}}&amp;lt;/ref&gt; This article describes the infrastructure that has been developed, which to our knowledge, is the first formalized human genomic data archive on the continent.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==References==&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==References==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key limswiki:diff::1.12:old-36845:rev-36846 --&gt;
&lt;/table&gt;</summary>
		<author><name>Shawndouglas</name></author>
	</entry>
	<entry>
		<id>https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36845&amp;oldid=prev</id>
		<title>Shawndouglas: Saving and adding more.</title>
		<link rel="alternate" type="text/html" href="https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36845&amp;oldid=prev"/>
		<updated>2019-11-11T20:33:22Z</updated>

		<summary type="html">&lt;p&gt;Saving and adding more.&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 20:33, 11 November 2019&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l31&quot;&gt;Line 31:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 31:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==Introduction==&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==Introduction==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Advances in high-throughput [[Genomics|genomic]] technologies are laying the foundations for the goal of precision medicine to be realized.&amp;lt;ref name=&amp;quot;ChristensenAssessing15&amp;quot;&amp;gt;{{cite journal |title=Assessing the Costs and Cost-Effectiveness of Genomic Sequencing |journal=Journal of Personalized Medicine |author=Christensen, K.D.; Dukhovny, D.; Siebert, U. et al. |volume=5 |issue=4 |pages=470–86 |year=2015 |doi=10.3390/jpm5040470 |pmid=26690481 |pmc=PMC4695866}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;AronsonBuilding15&amp;quot;&amp;gt;{{cite journal |title=Building the foundation for genomics in precision medicine |journal=Nature |author=Aronson, S.J.; Rehm, H.L. |volume=526 |issue=7573 |pages=336–42 |year=2015 |doi=10.1038/nature15816 |pmid=26469044 |pmc=PMC5669797}}&amp;lt;/ref&amp;gt; Decreasing costs and the capacity to generate larger volumes of human genomic data at faster rates are enabling population-level genomics studies to be conducted. &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;(&lt;/del&gt;Goldfeder et al. 2017; &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Prokop &lt;/del&gt;et al. 2018&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;) &lt;/del&gt;However, most of the current population-level genomics studies and data generated to date have a significant population representational bias, with the majority of genome sequences being derived from European and North American ancestry, regions that have been early adopters of genomic technologies. &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;(&lt;/del&gt;Popejoy &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;and &lt;/del&gt;Fullerton 2016&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;; Prokop et al&lt;/del&gt;. &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;2018) &lt;/del&gt;African researchers, in general, have been late adopters of high-throughput technologies for use in population genomics due to more limited resources and funding. To address this critical gap in scientific knowledge about African genomics and population variation, and inspired by the African Society for Human Genetics, the [[National Institutes of Health]] (NIH) and The Wellcome Trust, through the Human Hereditary and Health in Africa (H3Africa) program, have funded multiple genomics projects led by African investigators. &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;(&lt;/del&gt;H3Africa Consortium et al. 2014; &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Mulder &lt;/del&gt;et al. 2018&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;) &lt;/del&gt;To support the H3Africa projects in terms of provisioning of infrastructure for secure data storage, management, and computing, the NIH has also funded a Pan-African [[Bioinformatics]] Network for H3Africa ([http://www.h3abionet.org H3ABioNet]). &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;(&lt;/del&gt;Mulder et al. 2016&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;)&lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Advances in high-throughput [[Genomics|genomic]] technologies are laying the foundations for the goal of precision medicine to be realized.&amp;lt;ref name=&amp;quot;ChristensenAssessing15&amp;quot;&amp;gt;{{cite journal |title=Assessing the Costs and Cost-Effectiveness of Genomic Sequencing |journal=Journal of Personalized Medicine |author=Christensen, K.D.; Dukhovny, D.; Siebert, U. et al. |volume=5 |issue=4 |pages=470–86 |year=2015 |doi=10.3390/jpm5040470 |pmid=26690481 |pmc=PMC4695866}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;AronsonBuilding15&amp;quot;&amp;gt;{{cite journal |title=Building the foundation for genomics in precision medicine |journal=Nature |author=Aronson, S.J.; Rehm, H.L. |volume=526 |issue=7573 |pages=336–42 |year=2015 |doi=10.1038/nature15816 |pmid=26469044 |pmc=PMC5669797}}&amp;lt;/ref&amp;gt; Decreasing costs and the capacity to generate larger volumes of human genomic data at faster rates are enabling population-level genomics studies to be conducted.&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;ref name=&amp;quot;GoldfederHuman17&amp;quot;&amp;gt;{{cite journal |title=Human Genome Sequencing at the Population Scale: A Primer on High-Throughput DNA Sequencing and Analysis |journal=American Journal of Epidemiology |author=&lt;/ins&gt;Goldfeder&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;, R.L.; Wall, D.P.; Khoury, M.J. &lt;/ins&gt;et al. &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|volume=186 |issue=8 |pages=1000–1009 |year=&lt;/ins&gt;2017 &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|doi=10.1093/aje/kww224 |pmid=29040395 |pmc=PMC6250075}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;ProkopGenome18&amp;quot;&amp;gt;{{cite journal |title=Genome sequencing in the clinic: the past, present, and future of genomic medicine |journal=Physiological Genomics |author=Prokop. J.W.; May, T.&lt;/ins&gt;; &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Strong, K. &lt;/ins&gt;et al&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;. |volume=50 |issue=8 |pages=563–79 |year=2018 |doi=10.1152/physiolgenomics.00046&lt;/ins&gt;.2018 &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|pmid=29727589 |pmc=PMC6139636}}&amp;lt;/ref&amp;gt; &lt;/ins&gt;However, most of the current population-level genomics studies and data generated to date have a significant population representational bias, with the majority of genome sequences being derived from European and North American ancestry, regions that have been early adopters of genomic technologies.&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;ref name=&amp;quot;ProkopGenome18&amp;quot; /&amp;gt;&amp;lt;ref name=&amp;quot;PopejoyGenomics16&amp;quot;&amp;gt;{{cite journal |title=Genomics is failing on diversity |journal=Nature |author=&lt;/ins&gt;Popejoy&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;, A.B.; &lt;/ins&gt;Fullerton&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;, S.M. |volume=538 |issue=7624 |pages=161–64 |year=&lt;/ins&gt;2016 &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|doi=10&lt;/ins&gt;.&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;1038/538161a |pmid=27734877 |pmc=PMC5089703}}&amp;lt;/ref&amp;gt; &lt;/ins&gt;African researchers, in general, have been late adopters of high-throughput technologies for use in population genomics due to more limited resources and funding. To address this critical gap in scientific knowledge about African genomics and population variation, and inspired by the African Society for Human Genetics, the [[National Institutes of Health]] (NIH) and The Wellcome Trust, through the Human Hereditary and Health in Africa (H3Africa) program, have funded multiple genomics projects led by African investigators.&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;ref name=&amp;quot;H3AfricaResearch14&amp;quot;&amp;gt;{{cite journal |title=Research capacity. Enabling the genomic revolution in Africa |journal=Science |author=&lt;/ins&gt;H3Africa Consortium&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;; Rotimi, C.; Abayomi, A. &lt;/ins&gt;et al. &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|volume=344 |issue=6190 |pages=1346–8 |year=&lt;/ins&gt;2014 &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|doi=10.1126/science.1251546 |pmid=24948725 |pmc=PMC4138491}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;MulderH3Africa18&amp;quot;&amp;gt;{{cite journal |title=H3Africa: Current perspectives |journal=Pharmacogenomics and Personalized Medicine |author=Mulder, N.; Abimiku, A.&lt;/ins&gt;; &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Adebamowo, S.N. &lt;/ins&gt;et al. &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|volume=11 |pages=59–86 |year=&lt;/ins&gt;2018 &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|doi=10.2147/PGPM.S141546 |pmid=29692621 |pmc=PMC5903476}}&amp;lt;/ref&amp;gt; &lt;/ins&gt;To support the H3Africa projects in terms of provisioning of infrastructure for secure data storage, management, and computing, the NIH has also funded a Pan-African [[Bioinformatics]] Network for H3Africa ([http://www.h3abionet.org H3ABioNet]).&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;ref name=&amp;quot;MulderH3ABioNet16&amp;quot;&amp;gt;{{cite journal |title=H3ABioNet, a sustainable pan-African bioinformatics network for human heredity and health in Africa |journal=Genome Research |author=&lt;/ins&gt;Mulder&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;, N.J.; Adebiyi, E.; Alami, R. &lt;/ins&gt;et al. &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|volume=26 |issue=2 |pages=271–7 |year=&lt;/ins&gt;2016 &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;|doi=10.1101/gr.196295.115 |pmid=26627985 |pmc=PMC4728379}}&amp;lt;/ref&amp;gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==References==&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==References==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key limswiki:diff::1.12:old-36844:rev-36845 --&gt;
&lt;/table&gt;</summary>
		<author><name>Shawndouglas</name></author>
	</entry>
	<entry>
		<id>https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36844&amp;oldid=prev</id>
		<title>Shawndouglas: Saving and adding more.</title>
		<link rel="alternate" type="text/html" href="https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36844&amp;oldid=prev"/>
		<updated>2019-11-11T19:41:34Z</updated>

		<summary type="html">&lt;p&gt;Saving and adding more.&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 19:41, 11 November 2019&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l30&quot;&gt;Line 30:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 30:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;'''Keywords''': genomic data, data archive, H3Africa data, African genomic data&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;'''Keywords''': genomic data, data archive, H3Africa data, African genomic data&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;==Introduction==&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Advances in high-throughput [[Genomics|genomic]] technologies are laying the foundations for the goal of precision medicine to be realized.&amp;lt;ref name=&amp;quot;ChristensenAssessing15&amp;quot;&amp;gt;{{cite journal |title=Assessing the Costs and Cost-Effectiveness of Genomic Sequencing |journal=Journal of Personalized Medicine |author=Christensen, K.D.; Dukhovny, D.; Siebert, U. et al. |volume=5 |issue=4 |pages=470–86 |year=2015 |doi=10.3390/jpm5040470 |pmid=26690481 |pmc=PMC4695866}}&amp;lt;/ref&amp;gt;&amp;lt;ref name=&amp;quot;AronsonBuilding15&amp;quot;&amp;gt;{{cite journal |title=Building the foundation for genomics in precision medicine |journal=Nature |author=Aronson, S.J.; Rehm, H.L. |volume=526 |issue=7573 |pages=336–42 |year=2015 |doi=10.1038/nature15816 |pmid=26469044 |pmc=PMC5669797}}&amp;lt;/ref&amp;gt; Decreasing costs and the capacity to generate larger volumes of human genomic data at faster rates are enabling population-level genomics studies to be conducted. (Goldfeder et al. 2017; Prokop et al. 2018) However, most of the current population-level genomics studies and data generated to date have a significant population representational bias, with the majority of genome sequences being derived from European and North American ancestry, regions that have been early adopters of genomic technologies. (Popejoy and Fullerton 2016; Prokop et al. 2018) African researchers, in general, have been late adopters of high-throughput technologies for use in population genomics due to more limited resources and funding. To address this critical gap in scientific knowledge about African genomics and population variation, and inspired by the African Society for Human Genetics, the [[National Institutes of Health]] (NIH) and The Wellcome Trust, through the Human Hereditary and Health in Africa (H3Africa) program, have funded multiple genomics projects led by African investigators. (H3Africa Consortium et al. 2014; Mulder et al. 2018) To support the H3Africa projects in terms of provisioning of infrastructure for secure data storage, management, and computing, the NIH has also funded a Pan-African [[Bioinformatics]] Network for H3Africa ([http://www.h3abionet.org H3ABioNet]). (Mulder et al. 2016)&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==References==&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==References==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key limswiki:diff::1.12:old-36843:rev-36844 --&gt;
&lt;/table&gt;</summary>
		<author><name>Shawndouglas</name></author>
	</entry>
	<entry>
		<id>https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36843&amp;oldid=prev</id>
		<title>Shawndouglas: Created stub. Saving and adding more.</title>
		<link rel="alternate" type="text/html" href="https://www.limswiki.org/index.php?title=Journal:Building_infrastructure_for_African_human_genomic_data_management&amp;diff=36843&amp;oldid=prev"/>
		<updated>2019-11-11T19:16:53Z</updated>

		<summary type="html">&lt;p&gt;Created stub. Saving and adding more.&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;{{Infobox journal article&lt;br /&gt;
|name         = &lt;br /&gt;
|image        = &lt;br /&gt;
|alt          = &amp;lt;!-- Alternative text for images --&amp;gt;&lt;br /&gt;
|caption      = &lt;br /&gt;
|title_full   = Building infrastructure for African human genomic data management&lt;br /&gt;
|journal      = ''Data Science Journal''&lt;br /&gt;
|authors      = Parker, Ziyaad; Maslamoney, Suresh; Meintjes, Ayton; Botha, Gerrit; Panji, Sumir; Hazelhurst, Scott; Mulder, Nicola&lt;br /&gt;
|affiliations = University of Cape Town, University of the Witwatersrand&lt;br /&gt;
|contact      = Email: ziyaad dot parker at uct dot ac dot za&lt;br /&gt;
|editors      = &lt;br /&gt;
|pub_year     = 2019&lt;br /&gt;
|vol_iss      = '''18'''(1)&lt;br /&gt;
|pages        = 47&lt;br /&gt;
|doi          = [http://dx.doi.org/10.5334/dsj-2019-047 10.5334/dsj-2019-047]&lt;br /&gt;
|issn         = 1683-1470&lt;br /&gt;
|license      = [http://creativecommons.org/licenses/by/4.0/ Creative Commons Attribution 4.0 International]&lt;br /&gt;
|website      = [https://datascience.codata.org/articles/10.5334/dsj-2019-047/ https://datascience.codata.org/articles/10.5334/dsj-2019-047/]&lt;br /&gt;
|download     = [https://datascience.codata.org/articles/10.5334/dsj-2019-047/galley/894/download/ https://datascience.codata.org/articles/10.5334/dsj-2019-047/galley/894/download/] (PDF)&lt;br /&gt;
}}&lt;br /&gt;
{{ombox	 &lt;br /&gt;
| type      = notice	 &lt;br /&gt;
| image     = [[Image:Emblem-important-yellow.svg|40px]]	 &lt;br /&gt;
| style     = width: 500px;	 &lt;br /&gt;
| text      = This article should be considered a work in progress and incomplete. Consider this article incomplete until this notice is removed.	 &lt;br /&gt;
}}&lt;br /&gt;
==Abstract==&lt;br /&gt;
Human [[Genomics|genomic]] data are large and complex, and require adequate infrastructure for secure storage and transfer. The [[National Institutes of Health]] (NIH) and The Wellcome Trust have funded multiple projects on genomic research, including the Human Heredity and Health in Africa (H3Africa) initiative, and data are required to be deposited into the public domain. The European Genome-phenome Archive (EGA) is a repository for [[Sequencing|sequence]] and genotype data where data access is controlled by access committees. Access is determined by a formal application procedure for the purpose of secure storage and distribution, which must be in line with the informed consent of the study participants. H3Africa researchers based in Africa and generating their own data can benefit tremendously from the data sharing capabilities of the internet by using the appropriate technologies. The H3Africa Data Archive is an effort between the H3Africa data generating projects, H3ABioNet, and the EGA to store and submit genomic data to public repositories. H3ABioNet maintains the security of the H3Africa Data Archive, ensures ethical security compliance, supports users with data submission, and facilitates data transfers. The goal is to ensure efficient data flow between researchers, the archive, and the EGA or other public repositories. To comply with the H3Africa data sharing and release policy, nine months after the data is in secure storage, H3ABioNet converts the data into an Extensible Markup Language (XML) format ready for submission to EGA. This article describes the infrastructure that has been developed for African human genomic data management.&lt;br /&gt;
&lt;br /&gt;
'''Keywords''': genomic data, data archive, H3Africa data, African genomic data&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
{{Reflist|colwidth=30em}}&lt;br /&gt;
&lt;br /&gt;
==Notes==&lt;br /&gt;
This presentation is faithful to the original, with only a few minor changes to presentation. In some cases important information was missing from the references, and that information was added. The original paper listed references alphabetically; this wiki lists them by order of appearance, by design. The two footnotes were turned into inline references for convenience.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!--Place all category tags here--&amp;gt;&lt;br /&gt;
[[Category:LIMSwiki journal articles (added in 2019)‎]]&lt;br /&gt;
[[Category:LIMSwiki journal articles (all)‎]]&lt;br /&gt;
[[Category:LIMSwiki journal articles on data management and sharing]]&lt;br /&gt;
[[Category:LIMSwiki journal articles on genome informatics]]&lt;br /&gt;
[[Category:LIMSwiki journal articles on health informatics]]&lt;/div&gt;</summary>
		<author><name>Shawndouglas</name></author>
	</entry>
</feed>