Visual Phasing with a Twist - Segment Phasing (Part 2): Advanced Genetic Genealogy

Video Transcription

(00:00):
Segment phasing is another way you can use visual phasing techniques in order to find out information about your grandparents.

(00:11):
Howdy, I’m Andy Lee with Family History Fanatics where we help you understand your DNA, climb your family tree and write your ancestor stories along the way. In part one of this two part series, we went over the first five steps of segment phasing. There’s three more steps and we’re going to start on them right now. After we have colored our chromosome map, it is now time to assign the recombination points for those recombination points that we have identified. Now there’s a couple of ways that you can assign recombination points. You can visually look to see how recombination points are assigned. So for instance, if I’m looking at this first recombination point here, I can see that there’s a change from a B, and I can see that there’s a change from B, C, and so B is common to both of them. I would assign this to B.

(00:59):
Another way with this table that you can do is look at this table. So for instance, at recombination 0.4 0.4, it’s highlighted too as the likely recombination point of B because B shows up in both of those. So I’m going to assign B here. I’m gonna assign B here. I’m going to assign C here, and then I’m gonna assign B right there, C there B right there. And now I have all of my recombination points assigned. So with this, we’re now going to start assigning grandparent segments. Now I call ’em grandparent segments because these are the segments that came from each of the grandparents, even though right now we don’t know who those grandparents are from. Now this is just using your visual phasing techniques. So identify all the grandparents segments in the section below. I usually wanna start with the person that has the least number of recombination points.

(01:59):
In that case, this is person A. And so I’m just gonna assign A one on the top, B one on the bottom. And I’m gonna go through and I’m going to copy that over the entire chromosome right here because there are no recombination points on A, we’ve already got the grandparent segments on A, they each came from a single grandparent. So there’s two grandparents missing on this chromosome. Let’s go down to B and C and solve for those on B. We know that A and B match right here, so I can just paste that right there. And we know that A and B match right there. Now there happens to be recombination points that are blocking both of those in. So if we go down to C, I know that A and C match right in this area right here, and there’s a hard recombination point right there.

(02:51):
So then B and C, well, B and C, they match right there. That’s good. And B and C, they match right here, which we don’t know which either one of them is right now there are no matches. So we’re ready to make our selection now to make our selection. It doesn’t matter where, but we wanna try to solve the most possible. In this case, I’m going to switch this one right here to A two and this one is going to still be B one. And I’ll just copy that and extend that over because there is no B recombination point right here. So down below. Now I can make this one the same because it matches and that means that this is going to be the same as well. So if I look back up at the top, I’ve solved as much as I can with just this graphic.

(03:44):
There’s one little section right here that I can’t solve because of that, but I’ve been able to solve everything else. So let me go through this process again with another chromosome. I’m going to duplicate my chart. I’m gonna rename this and I’m going to call this chart number 16. So that means here’s number 16 on the chromosome and it is pulled in all of that information. Now you notice this one is not quite as perfect as the other one because every line doesn’t have a blue highlighted. There’s a lot of ’em that do have a blue highlighted, but not all of them have a blue highlight. So I need to go through and I need to identify where these recombination points are. Now, if I look here real carefully at this 1.4 and 1.5, if I combine those together, then they’re really the same one.

(04:32):
So I’m gonna actually put a C on the number five and an R on 1.5. That C is telling me that I’m combining it with this 1.4. It is all the same 0.3. That’s a good recombination point it looks like. Then we jump up to 24.4 and 25.1. Even though they’re a little bit apart, there is two common. And so if we combine them together, then that would look like a recombination point. Okay? So 27.8 is odd because hey, there’s just one in one. 29.6 is close to it, but there’s just one in one there. So let’s go back over and look and we see that, yeah, 27.8 and 29.6 are from the same segment. Now this was back when I could download all the way down to 1 centimorgans and you’ll see that this segment is just 1.40 centimorgans long. This is not a real segment match.

(05:28):
And so I’m actually going to X both of those out because there’d have to be some other match with A here in order for one of these to be a recombination point. And there’s nothing around there that is like that. I go down to 50.2 and 50.5, and there’s another good recombination point 54.3. Looks like a good recombination point 55.2, 57.1 58.28 and we’ll get down to 71. Okay, and then 72 or 71.2 looks a little odd. We got a two and a two, well the closest one is this 71 or this 72.9. So let’s take a look at those over here. This 71.2, okay, that goes to an 84. So that’s not just a little tiny, that looks like a real recombination. And whereas our other 71.2, it’s right here. That’s also a real one. So what about this 71 and the 72.9? Well, here’s my 71 and 72.9.

(06:34):
They’re actually both the exact same thing. And when I start to look at this a little bit more, hey, let me look at this. 50.2 to 71.2 is an ab, and 71.2 to 84.0 is also a. Now it’s possible that there was a break, but it is extremely unlikely. So initially I’m going to assume that this was just an error in the algorithm and that this 71.2 and 71.2, it really just merged together. So I can ignore this 71.2. So then what about this 71 and 72.9? They are both really small segments and as I can see, they both fall right together. I’m going to actually eliminate both of them right now, so they’re not going to be real segments for me.

(07:35):
Now, as I’m eliminating some of these, I want to X them out over here as well. And you can see it graze them out whenever I x them out. So here’s 27.8, 29.6, and that’s so that I don’t start mapping them over in the next portion. So 84, that looks like a good recombination. 86 looks like a good, and that looks like the end. And we have the beginning. So we have a total of 10 recombination points right here. Now I want to put in 10, I got four, so I need to put in six more there. I’ve just put in six more. And I’m just going to copy the formulas that are up there. I’m gonna come down here and I’m gonna copy the formulas that are down here. Now I can go and I can start mapping this out. So we start AB from or, so we have our recombination points of 1.4 and then we have 3.0. We have 24.4, we have 50.2, we have 54.3, we have 55.2, 57.1 58, 84, 86, 86 0.1, and 90.2. So I’m gonna go and I’m gonna resize these columns so that they are better representative of what the length is. And now I can go through and figure out how much centimorgans is in each one of these. So this to 1.5, this is 5.10 cent to Morgan’s. And you can see already, hey, there’s 5.1 which is less than seven. So if I didn’t select that seven, this is one segment that would not have shown up. All right, from 1.4 to 3.0, that is 3.4. There’s another segment that wouldn’t have shown up as well.

(09:38):
Let’s see here, 24.4, we got a 24.4 right there. And 24.4 is merge 25. That’s gonna be 42.9, 24.4 to 50. Well, we’ve got this 62.8 and that’s going to be 11.4. So 50 to 54 we don’t have, but 54 to 55 we do is 2.8 and 57 to 58 is 1.5. And you can see these are two really small segments that you can’t actually find now on Gedmatch with a 3 cent Morgan limit. But I’ll be showing you in a second why you still may want to get down to three centimorgans as well. Okay, between, we already said that 72 was combined between those two. So between 50 and 84 is 26.4 plus 27.7, that’s going to be 53, 54 0.10 cent of organs. And then we have 86.1

(10:58):
290 is 8.3, and that means that between 84 and 90 is 17.5 minus 8.3 is gonna be 9.2. So there we have our mega bases and we have our centimorgans all calculated out. Now we can make our chromosome map A B is yellow there, a C is yellow throughout this section. BC is or AC is green right here. AC is green right there. BC is yellow right there. A B is yellow all the way up to 90. That’s the rest of the length here. A B is green between 50 and 71.2

(12:04):
AB That’s all the way to 84 because we have to include that other one down here where we combine them. A AC is between 54 and 55 is gonna be half matched. And BC between 54 and 55 is half matched AC between 57 and 58 is half matched and BC is also half matched. And then we get down to the last ones. BC is 84.0 to 90.2 bc, 84.0 to 90.2, and a B is going to be 86 to 90 is full. And then our last one is AC is, oh, this one was BC here. I just messed up already. I can see that AC at the bottom is 86,290. This one right here. So there we now have our chromosome map. Now we can assign the recombination points. This first one is going to go to A, this second one is going to C, the third one is going to B. The fourth here is going to A 54 is going to C, C, C, C. And then we have the B and a C, B and a C, and that is our recombination points. We can see that B has two recombination points and A has two recombination points. So when we’re doing our next step of assigning the grandparent segments, we can start with either B or A. I’m going to start with B in this case. So I’ll do A one here and I’ll do B one here

(14:02):
And then I will copy that over to the next recombination point. And that is my starting location. So a B is the same in this section, but then we have BC is opposite

(14:26):
In this section. So that one, that one and that one. Then we have, oh, BC’s also opposite right there. Okay, now we can length out some of these. So for instance, this AB here goes to the end cuz there is no A to stop it. Whereas down here we can actually copy this to here cuz that’s where the C stops. We have these C stops here, but we can also copy this to right there. Now because we have this section right here, you’ll notice it doesn’t match up with any of these. We don’t have the a B there, so we can’t do that. So we’re stuck. Now I’m going to, oh, oh, you got one little other section I just about missed there, right on the end, A has that and B has to match that. So what we should be seeing right now is as much of the green and the red as possible and the places where we don’t have a double green and a double red, it’s because we just haven’t been able to extend it all the way there. Now to extend this, I’m going to extend the B because that’s gonna extend this all the way to the end of the B. So I’m gonna go A two and B one and I’m going to extend that.

(15:56):
And now let’s see what else I can solve. All right, A B is opposite right here. So this is A one and B two and it’s gonna go from there to there, but it’s also gonna go to right there. And that’s where an A recombination point is. So we can see that now a C is green right there. So we can copy that same thing down there and since there’s no C there, we can move that all the way across. Now we can already see that, hey, it looks like there is a little bit of a problem because this BC says that there should be a green or a yellow right here and it’s showing a red right here. We’ll come to that in a little bit and on the other end we don’t have anything that we can solve more. So this is the extent of what we have.

(16:48):
Now, one of the things you might notice here is that we have these little tiny portions of stuff right here that it looks like we’ve solved for A, and we’ve solved for B in this area, but it looks like C changed and C changed again. Now what could have happened here? Well, if we had this change here, well that one would have to stay the same. And then you’d have to have something similar happen here. And let’s just do a b2, a B one, sorry, and an A two. You’d basically have to have two recombination points on the same chromosome happening very close together in this case, 2.80 centimorgans and 1.5 centimorgans. Now, is that possible? Well, this is where we go to the next step. Now the next step is if you have impossible segments, in other words segments that could not exist, then you need to delete them.

(17:48):
Now because we’re do dealing with many segments that are below 7 centimorgans, this is where most of those impossible segments are gonna show up. So we want to remove all of the impossible segments. So let’s look at our impossible segments. Here are these really impossible segments? Could we have two recombinations that close together? If I go back to the setup page, there is this chromosome information and this is where it’s important because what this shows is it shows our starter end location, how long that chromosome is, but then it has this minimum internal segment length. And what I’ve done here is I’ve solicited information from lots of people, several thousand matches, and I’ve looked at all of the interior segments between grandparents and grandchildren to see for each chromosome what is the minimum length that that interior segment could be. Now if we’re looking at this here, we can see that yes, these are interior segments.

(18:54):
These little segments right here are interior segments. So what’s the minimum length? Because right now one of these is 2.80 centimorgans and one of ’em is 1.50 centimorgans on this chart. If I go down to the 16, I can see that for paternal chromosomes the minimum segment length is 38 centimorgans. Well, this is way smaller than that, so maybe it’s a maternal chromosome. Well, the minimum segment length for maternal chromosomes is eight centimorgans. So that means this is much smaller. Now, we can also look at mega bases as well. It’s 12 and six, and if we go back here, we can see that this is just a single megabase if that much between both of them. So these two little segments are false segments. Now, one of the clues that we can use for that also is the fact that we had so many C recombinations all at one point 54, 55, 57, 58, and if we look, they’re all basically the same thing. They’re actually AC and BC of both of them. So we can delete all of these segments in order to do that, just go highlight the columns for those and delete them, and those segments are gone. We need to redo our formulas. Let’s get this resized and change our formulas down here.

(20:33):
And there we go. That is how we can eliminate those impossible segments. Now, really at this point, everything else is just visual phasings. There is a link to this spreadsheet down below and you can duplicate that into your own Google Drive and then you can make adjustments to it and copy it as much as you want. You can figure out ways to improve this or make it better. If you want to learn more about visual phasing, there’s a couple of videos right here and make sure you subscribe to the channel. Leave a comment below about your successes with visual phasing.