Site icon BIOINFORMATICAMENTE

The Illumina sequencing

The Illumina sequencing technique is one of the second generation sequencing techniques which are different but all have an amplification phase of the library fragments prior to their sequencing.

The Illumina sequencing technique, like the other second generation techniques, is based on three main steps:

  1. Construction of a library for next generation techniques (NGS) which involves the addition of specific adapters to the DNA or cDNA fragments to be sequenced. In this regard, it should be noted that the adapters used are different according to the technique used.
  2. Amplification of library fragments. This phase takes place differently depending on the second generation technique used.
  3. Sequencing of the fragments through cycles of biochemical reactions. During the reaction cycles, information is acquired which allows, by specific software, to reconstruct a DNA or cDNA sequence. This phase also varies according to the technique used.

As I have already mentioned in the article "The sequencing", there are several second generation techniques but surely, from what I have noticed from my brief experience, the most used second generation sequencing technique is Illumina, therefore in this article I will try to explain how it works in the simplest way possible but first I would like to introduce you, using the table below, the advantages and disadvantages of the second generation sequencing techniques.

ADVANTAGESDISADVANTAGES
- There is no need to build a library in cloning vectors therefore the transformation phase is not necessary.
- A large number of library fragments (> 96 fragments) can be sequenced in very small spaces.
- You work with very small volumes.
- The costs are low, but it is not advisable to use a 2nd generation NGS technique if we have to sequence a few fragments.
- The length of the sequences obtained following sequencing is reduced, usually the reads obtained by the 2nd generation NGS techniques are maximum 600-700 bp long.
- The accuracy of the obtained reads is 10 times lower than the Sanger sequencing.

Now, as promised, let's talk about the Illumina sequencing technique. This, in fact, is nowadays widely used and allows to obtain long reads 250-500 bp.

The Illumina technique involves three main steps:

We now describe the different phases in detail.

CONSTRUCTION OF THE LIBRARY

There are two steps to build the Illumina sequencing library:

  1. Fragmentation of the extracted genomic DNA or of the cDNA obtained from the transcriptome extracted from the sample organism by sonication or enzymatic restriction to obtain fragments with a maximum size of 1000 bp. It is important to consider that the fragments obtained must not exceed 1000 bp in length otherwise there will be problems during sequencing.
  2. Then bind to the ends of the double-stranded fragments the Illumina adapters. Adapters are of two types and are referred to as adapters A e adapters B. The fragments that will then be used in the later stages of Illumina sequencing will be those equipped with an A adapter at one end and a B adapter at the other end. These correctly formed fragments are isolated in several ways, but one of the most popular methods involves the use of paramagnetic beads, that is marbles that become magnetic when placed inside a magnetic field, and on which molecules of streptavidin are located. In particular, the selection of the fragments is allowed by the binding between streptavidin and biotin, which is linked to one of the two types of adapters, for example adapter B.
Figure 1. DNA fragmentation and isolation of fragments for sequencing using streptavidin-equipped paramagnetic beads.

Let me open a parenthesis on Illumina adapters, in fact, as I mentioned above, there are two categories of these:

Figure 2. ILLUMINA single index adapters.
  1. universal region that pairs to primers for amplification (this region is different between adapters A, which bind to one primer, and adapters B, which bind to another primer).
  2. universal region to which the primer binds for sequencing.
Figure 4. Using barcodes to use multiple sequencing (multiplexing).

AMPLIFICATION OF SELECTED LIBRARY FRAGMENTS BY PCR BRIDGE (bPCR)

Once the fragments of the library equipped with adapter A and B have been selected, we proceed with the amplification of these using a method called bridge PCR, which consists in amplifying fragments on a solid support, called flow cell, on which the amplification primers are arranged which are able to recognize and bind adapters A and B, for example the forward primer binds to adapter A and the reverse to adapter B.

In particular, the PCR bridge takes place in the following way:

Figure 6. Summary diagram of the bPCR. Source: https://en.wikipedia.org/wiki/Illumina_dye_sequencing
Video 1. Bridge PCR sulla flow cell ILLUMINA. Fonte: canale youtube Illumina (https://www.youtube.com/watch?v=pfZp5Vgsbw0)Video 1. PCR bridge on ILLUMINA flow cell. Source: Illumina youtube channel (https://www.youtube.com/watch?v=pfZp5Vgsbw0)Video 1. Bridge PCR sulla flow cell ILLUMINA. Fonte: canale youtube Illumina (https://www.youtube.com/watch?v=pfZp5Vgsbw0)

SEQUENCING OF THE AMPLIFICATION PRODUCT USING THE SEQUENCING-BY-SYNTHESIS METHOD

The sequencing method used is defined sequencing-by-synthesis since it is based on the activity of the DNA polymerase which, starting from a sequencing primer, adds one nucleotide at a time in order to have a complementary read to a specific fragment of the flow cell. Each nucleotide is marked with different fluorescent molecules that emit light of a different color which is then detected by a special camera. Furthermore, these nucleotides have a block on the free hydroxyl group (-OH) so that only one nucleotide can be added per sequencing cycle.

In particular, there are two different sequencing-by-synthesis approaches:

  1. Sequencing by synthesis single read, that is a sequencing that foresees the reading of only one end of the fragments of the library. This sequencing approach usually allows to obtain only the first 250 nitrogenous bases of one end of each fragment of the library therefore it is often indicated with the words 1 × 250 bp.
  2. Sequencing by synthesis paired end, that is a sequencing that foresees the reading of the two ends of the fragments of the library. In this case we obtain the reads relative to both ends of the fragments, in fact in this case the number of nitrogenous bases of the sequence of each fragment that are provided by the sequencing is equal to 500 bp, 250 relative to the read at 3 'and 250 relating to the 5 'read of the fragment. This approach is indicated by the wording 2 × 250 bp and it is very useful as having the reads relative to the two ends of the fragment allows a better assembly of the sequenced DNA or cDNA sequence. It is also necessary to imagine that if the fragments of the library have a size of 500 bp it means that with a paired end sequencing we would be able to sequence the aforementioned fragments in full.

I decided not to tell you about the steps that occur during sequencing by synthesis for fear of making the reading too heavy, as always these are concepts that are difficult to express in a few simple words. However, if you want to know in detail the phases of single end and paired end sequencing, write me in the comments and follow me on Instagram.

But be careful not all that glitters is gold, in fact this sequencing technique also has its drawbacks. These are mainly two:

Video 2. How Illumina Second Generation Sequencing Works. Source: youtube channel ILLLUMINA (https://www.youtube.com/watch?v=fCd6B5HRaZ8)

Now I just have to thank you for reading. As always I hope I have left you some useful information with this article and that you understand how this sequencing technique works in detail.

Bye-bye and see you soon.

Exit mobile version