Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Exercise: DNA sequencing

  • Create a file called dna_sequencing.py

  • A, C, T, G are called bases or nucleotides

  • Accept a sequence on the command line like this: python dna_sequencing.py ACCGXXCXXGTTACTGGGCXTTGTXX

  • Given a sequence such as the one above (some nucleotides mixed up with other elements represented by an X)

  • First return the sequences containing only ACTG. The above string can will be changed to ['ACCG', 'C', 'GTTACTGGGC', 'TTGT'].

  • Then sort them by lenght. Expected result: ['GTTACTGGGC', 'ACCG', 'TTGT', 'C']

  • Create a file called extended_dna_sequencing.py

  • In this case the original string contains more than on type of foreign elements: e.g. 'ACCGXXTXXYYGTTQRACQQTGGGCXTTGTXX'.

  • Expected output: ['TGGGC', 'ACCG', 'TTGT', 'GTT', 'AC', 'T']

  • Ask for a sequence on the Standard Input (STDIN) like this:

python extended_dna_sequencing.py
Please type in a sequence: