Genetic Association with JMP Genomics, Part 4: Basic Linkage Mapping
Our World Statistics Day conversations have been a great reminder of how much statistics can inform our lives. Do you have an example of how statistics has made a difference in your life? Share your story with the Community!
Genetic Association with JMP Genomics, Part 4: Basic Linkage Mapping
Jun 5, 2019 12:38 PM
| Last Modified: Jun 18, 2019 1:01 PM
The Basic Linkage Mapping Workflow in JMP Genomics creates linkage maps using genetic marker data. The workflow includes output from the Recombination and Linkage Groups, Linkage Map Order, and Linkage Map Viewer. This includes calculation of recombination frequencies, testing for segregation distortion, assigning linkage groups to sets of markers, generating visualized linkage maps, and reordering the data according to the linkage map for QTL analysis. This example will use oat data with around 450 genetic markers from 90 recombinant inbred oat lines.
From the Genomics Starter window, select Workflows > Basic > Basic Linkage Mapping Workflow to open the dialogue box.
In the General tab, select po_ingenotypes.sas7bdat as the Input Genotype SAS Data Set.
Type “GMI:” into the List-Style Specification of Marker Variables field to specify every variable beginning with “GMI” as a marker variable.
Select an Output Folder.
In the Options tab, select Numeric Genotypes as the Format of Marker Variables.
The Annotation tab does not require input in this example and is always optional. For data with separate annotation data sets, they can be entered here.
The Framework Map tab is also optional. This tab is used if there are pre-defined linkage groups for a set of markers. In this case, enter oat_framework.sas7bdat as the Framework Map Data Set.
Open the data set and notice that 172 of the markers are already sorted into 22 known framework groups. Each marker is ordered within its corresponding framework group.
Select Marker as the Framework Marker Name Variable, Framework_Groups as the Framework Linkage Group Variable, and Framework_Order as the Framework Order Variable.
In the Linkage Groups tab, Select IRILself as the Cross Type.
This experimental population is an intercrossed recombinant inbred line. A list of cross types and their descriptions is shown in the table below.
For the number of Random Mating Generations Prior to Inbreeding, enter 2. This is the number of generations of random mating that occurred between the initial cross and the start of inbreeding.
If the experimental cross has unique expected segregation ratios, they can be specified in the Expected Segregation Ratios The expected ratios for an RIL self cross like the one in this population is 0.5 : 0 : 0.5 -- which is the default if no other values are entered.
Select FDR as the Multiple Testing Method for Segregation Tests. This minimizes the False Discovery Rate.
Set the Grouping Recombination Fraction Threshold to 25. This will be the maximum recombination fraction for assigning a marker to a linkage group. The lower this number, the greater amount of linkage that needs to occur for markers to be placed within the same group.
On the Linkage Map Order tab, select Haldane as the Map Function. These are different methods for converting recombination fractions into distance in centimorgans (cM).
Check the box to Break linkage groups between markers with large ordered distances. Break the groups up by Genetic Distance and set that Genetic Distance Break Value to 40. This means that if the distance between two markers in a linkage group exceeds 40, that group will be broken into more than one linkage group.
Deselect the box next to Display marker genotype cell color plots. This creates cell plots with each genotype represented for every marker. It is not necessary here and increases computing time.
Check the box next to Run Subset and Reorder Genetic Data process to order marker data for QTL analysis to reorder the data according to its linkage groups.
The resulting data set will be handy later in performing QTL Analysis.
The Linkage Map Viewer tab allows for changes in the color scheme and output layout for the 3D chromosome linkage maps. Leave the default options here.
Select Run to begin the analysis. When the analysis is finished running, the following JMP Journal will appear.
Click the first results button, Recombination and Linkage Groups.
The first output to appear is the heatmap of the Linkage Group Results. Red blocks in the heatmap represent very low recombination frequencies and thus highly linked markers. Each linkage group (27) is listed to the right of the window under LGroup with the number of markers in each group shown in parentheses. Click on any of the linkage groups in this list to display a heatmap for only that group.
The dotplot to the left of the heatmap represents genotype frequencies. When the dots are closer together a homozygous major genotype is more frequent, and when the dots are far apart a homozygous minor genotype is more frequent.
The Segregation and Linkage Groups tab shows a plot of the segregation ratios for each of the linkage groups (created using recombination frequencies) and framework groups (specified in the framework map data set). The linkage groups are represented on the plot with dots, and the framework groups with stars. Note that many of these groups coincide with one another. The dotted red line represents statistical significance for the points found above it. Click any of the framework or linkage groups to highlight them in the plot.
Return to the JMP Journal and select Linkage Map Order. The resulting output is a plot of the genetic distances in cM of the markers in each linkage group. The markers are colored according to their original framework group, and the x-axis has each linkage group. There are now 37 linkage groups as opposed to the original framework map. This is due to the splitting of groups with markers greater than 40 cM apart.
In the Drill Downs menu on the left of the output window, each of the individual linkage groups can be selected for further inspection. Select Linkage Group 17. New windows will open showing a heatmap of each marker in that linkage group and its recombination frequency as well as a dataset with the recombination frequencies between each marker in the group. Each marker’s place in the linkage group can be seen on the horizontal linkage map above the heatmap.
Next, return to the JMP Journal and select the Linkage Map Viewer. The 3D Linkage Map output will appear in a new window. This output gives a visualization of each linkage group, with positions annotated on the left and marker names on the right. The sliders at the top of the window adjust the aesthetics of the maps. Below is the map of linkage group 2.
The last output from the JMP Journal is the Subset and Reorder Genetic Data, which are both a genotype (po_ingenotypes.sas7bdat) and an annotation (po_ingenotypes_lmo_sr.sas7bdat) data set that can be used for QTL analysis. These data sets will be saved to the Output Folder designated in the General tab. Below is the annotation set with 453 markers and linkage data for each.
*The interactive reports generated from this workflow can be viewed on JMP Public.
The Basic Linkage Mapping Workflow performs several useful analyses using recombination frequencies to place markers into linkage groups, calculate their segregation ratios, and display linkage maps. In this example, we began with marker data and a framework map which assigned certain markers to predetermined framework groups. Note that the framework map, as well as annotation data, are not required to run the linkage mapping workflow. Now that each marker has been placed into a linkage group and the distances between each marker has been estimated, QTL analysis can be performed. For information on that process see the post titled Genetic Association with JMP Genomics, Part 5: QTL Single Marker Analysis.