Gene Cagg_3132 details


Gene Information

Locus tag: Cagg_3132
Symbol:
ID: 7269881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3792485
End bp: 3809038
Gene Length: 16554 bp
Protein Length: 5517 aa
Translation table: 11
GC content: 58%
IMG OID: 643567953
Product: conserved repeat domain protein
Protein accession: YP_002464426
Protein GI: 219849993
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4932] Predicted outer membrane protein
TIGRFAM ID: [TIGR01451] conserved repeat domain
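
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3792485
END_BP = 3809038
GENE_LENGTH_BP = 16554
PROTEIN_LENGTH_AA = 5517


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length matches End bp - Start bp + 1, and the protein length matches the codon count minus the stop codon.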


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.593801
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGACCA ACATCGCCGC ACTTCGTCGC TCGTTCGTAA GCTTGTTGCT GGTCATCGCT 
GTTGCCTCCA GCGGTTTACA GCCGACGATA GCCTTTGCAA ATGCTACCGT TAGTGTGATG
CTCAGCACTG ATTTTCCCAT TGTTGATAGT GGTCAATGGG CGGTGATTAA GGTTGACTAT
AGTTGTTCGA GTGTGCTCAA TACGCCGTGC GAGAATGCAA CGGTGACCGC TGTTATTCCG
CCTGAGTTGG CCGGCGCCGT CGGTGATGTC CAAGCCCTCG GTGCTGGGAC CTCACCCTCA
TACAATCCGG CAACGCGTAC CGTGACGTGG GCATTTAATG CGCCACTGGG CGCTGGTGAT
TCCGGCCGCC TTGAGTTGCG GGTTCGCTTT CCTGCCGGTT CGACTCCTGA TGGTACGACG
GCTACCTTGC GTGCCGAGAT GCGCTCGACA ACGGCACCAC CTCGCCTCTC CAATCCGTTG
ACCATTACGG CACGCGCCGA ACCACGGGCG GTGGCGACTA AGACGTTTGT GTCCGGTGGT
GCGCCTGATG TGCCGACAAC CTATCAGGTG CAGGTCTGTA TTCCGAATAG TGGTCTGGGG
GCGTTGAATC TCACGAATGT TCAAATTGTT GATACGTTGC CCGCCGGTGT GACGTTCGTG
AGCGCTTCAG ATGGTGGTAC CTACAACAGT AGTACGAATA CGGTGACATG GCCGTCTACG
AGCCTGACCG TGCCGAGCAG TCTCTGCGCT ACCCGCACGG TGACGGTGAT TTTCCCCTCA
ACAACGTTTT CCGTTGGTAC CGAGGTGCGC AATGTCGTCG ATGTGACAGC GCAGGCCGGT
TCGGTTACCT TGACCTTGAC CGATGATGAT GTTCGGCGGA TTCAACCGCC TACGCCTGGC
TTAGGTTCGA GCAAGAATGG CCCAACAACG GCGCTGATCG GTGATACGGT GACCTACACC
CTCAGCGCAG TTAATACCGG TACGACCGCC CTGACCGATG TGGTTATTAC CGATCCGGTG
CCGCCTGAAT TGCAGGTGAC GCGCATTAAC GTTGATGGCG GGAATGTTTC CGGTATCAGG
GTTGCGCTCG AATATACGAC GAATCTTAAT TCGACCTTTA CCGCCGTGCC CGGTAGCCCC
TTTACCACAA CGAACTGTAT CAATATCGCG CCGGCGACCG GTGGTGGGTG TAGTACGTTG
ACGCTTGGCG TCGGCGAGCA GATTACGGCC ATTCGCTGGC GCTATCTTGA TCCGCTGCCG
TTTGGCTTTA GCGCCACCGG TCACGGTTTC AGCGCCGTTG TGACCGCCAG CCCCGTCAAT
GCGGTGATCG TTAATCAGGC GACAAGCGAG TATACGTTTA ACGGCTATAC GTCAACTCGT
ATTAATGAGA CCCGCACCCG TATTATCGAA CCCGGTGCGC GTGCGGTGGT TGGGAAATCG
GTTAATCCAA CCATTGCCTA TGCCGGCGAT ACGGTAGAGT ACACGCTTAC CTTACAAAAT
AATCAAATCG GTACCCCAGC AGCGGCCCTC GTGAATCCGG TGTTGGCCGA TCTGTTGATC
GAGAGTTTGC AGTACGTGCC CGGCAGTTGG TCGGTGGTAT CACGCCCGTC CGGCGCACCC
GATCCGGTGT TTGAGGCAAT AGATAATTAC AACGGTACCG GTCGTATGCT CTTGCGTTGG
CGGTGGGATA GCTATAGCTT GCCACCCGGT CAGTCGTTTA CGATTCGCTT TCAAGCGCGA
ATCAATCCGG CTACGCTTGC CGGGACGATT ACGAATACAG CGTCATTGGC CAGTTTTGCC
AATCCGCCCG GCCAAATCTT TATCGATCAG TGTAGTCAAC AAAACCCTGA TAACTATGAT
TTTGACGGTG ATGGCAATAT CTCCGAATTA ATCTGCTCTT CATCGATTAC CAGCTTGAGT
TTGGCGGCAG CAGCCACCGC TAGCTCGGCC AAGTGGGTGT TGGGGCAGCT TGATACTGAA
TGGACACGCG ATCCTAATGT TGGTCAGACT ACGCCGGGCG GTATGGCCGA CTATACGTTG
ATCATCACCA ATACCAATTC GGTGGCGTTG ACCAATCTGG TCTTGATCGA TATTTTGCCG
TGGGTTGGTG ATGTCGGTGT AGTGCGTTTC AACGATCCGC GCGGCAGTGC GTGGCAGCCC
TATTTGGCCG GCCCGATTAG CGTGCCCGAC GGCGCGACGG TCTATTACAG CACCACTAAC
AATCCTTGTC GCAATCCTGA TTTGGGTCTG ACCGATGTTG ATGGTAATCC AATAGATGCC
CCCGGTTGTG TTGATCCGCA GTGGTCGACC GTACCGCCCG CTGATATTAC AACCGTTCGT
TCGGTGCGGA TCGATTTCGG GAGTCGCGTT CTCTATCCGG GTGATTCGGT CGTTGTGACG
TGGCCGATGC GAGCGCCGGT CGGTGGTACT CCCGACGAGG TGGCGTGGAA TACCTTTGGT
TACCGTGCTT TTACTGTCAA TGGTGATCCG CTCCTTGCTG CTGAACCACC GCGCGTTGGG
ATTCAGCGTG GTGCGGTGTT GCCTCCGTCG TATGGCAATT ATGTGTGGCT CGATGCCAAT
CTAAACGGAG TGCAAGACAC CGGTGAAGTG GGCGTGAACG GTGCTCGGAT CGATTTCTAC
CAAGATAACG ACGGTATAAC CGGGCCGAGT ACCGGTGATC GTTGGGTGGG CTATACCATT
TCCGGCCCCG ATAATGACGG TAACCCCGGT TTCTATCTCT TCTCCGACCC ACTCGATATT
CCGCTCGGCG ATTACTACAT CCGCGTGACA CCGCCGGCCG GCTATGGCTT TACCACGCCG
AATGTGGGAG CGAATGATGC CATCGATTCG GATATTGAGC CGGCAACCCG CTATAGCGCT
GTGACCGGGT TGACTTCTGG TGAGAATGAC GATACGTGGG ATGTCGGTCT GGTGACGGTG
ACGGCAGTTG GTAATTATGT CTGGATTGAC CGCAACGGTA ATGGGATTCA GGATGAACCT
CCCGGTGATG GCGTCAATGG CGTGACCGTG CGGCTCTACC GTTCAGACAA TACACCGGTA
GCGACGACCG TTACCGCTGA TGATTTTCTC GGTAATCCCG GTTACTATCT CTTCAGTGAT
CTAGCTCCCG GTAGCTATTA CATCGAGTTT GTATTGCCCA CCGGCTTTTC CTTTACGACA
GCAGATAGCG GTAGTAGCGA TGAAGCCGAT TCAGATGCAA ATCCGACCAC CGGTCGCACG
ACGGTCTTCA CGCTGGCAGC CAATCAACTC GACCGCAGCC GCGACGCCGG TCTGATTGCG
CCGTCCGGTA CGCTCCGTTT AGGCAATCGG GTCTGGTATG ACCGCGATAA CGATGACCGC
TACGAGCCGG TCAATGGCGA GACCGGCATT AATGGCGTGT CGTTGAGCCT GTTCCGCGAT
TTCAACAACA ATGGCCAGCC TGATCCCGGC GAGCATGTCG GCAATTCGGT GACGATGACG
GTTGGCGGTG AAGCCGGCTA TTACCAGTTC ACCAATTTGG CCGCCGGCGA CTACATTGTG
GTGGTTGATG ACAGCAACTT TGCTCCCGGC GGCGCGTTGT TCGGCATGCG CACCAGCAGC
GGCAATGATC CGGCGCCTGA TCCTGATAAC AATGTTGACC ACGACGACAA CGGCGATCTG
CTCGGCGTCA TTGTGCGCTC GTTGCCGATT ACCCTTTCGG TTGGTAGTGA GCCGGTTGAT
GATGGTGATG ATGCTAACGG CAACCAGACC CTCGATTTCG GCTTTATCCG TGGCGCTGCG
CTTGGCGATC GGGTTTGGTT TGATACCAAC AATAACGGCA TCCAAGATGC CGCCGAACCC
GGCGTACCCG GCGTGACGGT TGAGCTGCTC GATGGCAGTG GCAATCCGAT TGATAGCGAT
CCGAGTACAA CCGGTATTCA GCCAACTATC ACGGTGACCG ACGGTGATGG TCGTTATGGC
TTCACCGATC TAAACGCCGG TACCTACCGG GTGCGCTTTA GTGGTCTGCC GAGTGGCTAT
AGCTTCACCA CGCCCGATCA GGGGTCGAAC GATGCCCTCG ATAGCGATGC TGACGCAACC
GGTCTCACGA CGGTGATTAC GCTGGCCGCC AATCAGACCG ACCTGCGCTG GGATGCCGGT
TTGGTGGCTA CTCCAGCTAG CCTTGGCAAC CGTGTTTGGA ATGATCTGAA CTATAACGGT
ATCCAAGACA CCGGTGAGCC GGGTGTAAGT GGTGTCAGCG TGTCGCTCTT CCGCCCCGGT
TACGATGGTG TCGCCGGTAC TGCCGATGAT GAGTTAGTGG CCACAGATAC GACTGATAGC
AGTGGGAACT ACAGCTTCAC CAACTTGCCC CCCGGTCGCT ACTTTGTCCA GTTTGGCCCG
CCGCCGACTG GGTACGCGAT CACAGCGACC GATCAGGGGA CGGACGATGC TGCCGATAGC
GATGCTGATC TCACCACCCG CCGCACTGTG TTGATCGATC TTGCCCCAGG CGAGAACGAT
CCCGACTGGG ATATGGGTCT GTTTGTCTTC GCCACCATTG GTGATCGGGT CTGGAGTGAT
ACGAACAACA ACGGGATTCA AGATACCGGC GAGCCGGGCG TGAGTGGTGT ACAAGTGCGC
CTCTACCGAC CCGGCAGCAG TGTTCCGGTG GCGATGACCA CCACCAATAG CAGTGGTATC
TATACCTTCA CCAATCTCAT TCCCGATAAT TACTACGTTG AGTTTAGCTT ACCCGGCGGC
TACCGTGCCA GTCCGCGCGA TCAAGGGGGG GATGATACGC TCGACAGCGA TGCCGATCCG
GTCACGCGCC AGACGGCTGC GACCACGCTC GCTCCGGGTG AGAATGATCC AACATGGGAC
TTCGGTATTG TGCCGACGGC CAGCATCGGT GATCGGGTGT GGCTCGATCT CAACGCCAAC
GGTATTCAAG ATGCCAACGA GACGGCCGGC GTGCCCGGTG TGCAGGTCGT GCTTTACGAC
GGCGTAGGTA ATGTACTCAA CACCACCGTC ACCGACGTTG ATGGCCTCTA TCGCTTCGAC
AATCTGTTGG CCGGCAATTA CTATCTGCGC TTCGTTGTGC CTGCCAGTTT TGTGGTGAGC
CCGCAGGATC AGGGTACGAA CGATAATGCG GACTCGGATG TCAATCCAAC GACCCTCTTA
ACCGTGCCGA CCACCCTGAG CGCCGGCGAG AACGATCTGC GCTGGGACCT TGGTCTCTAC
CAGTTGGCCA GCATTGGCGA CCGGGTGTGG CACGACCTCA ATGGCAATGG CCGTCAAGAC
GGCGGCGAAC CCGGCGTGCC CAGCGTATCG GTGTCGCTCT ACCAACCCGG CCCCGACGGG
TTGGCCGGCA CGGCTGATGA TGTGCTGGTT GCCAGTACGA CCACCGATAG CAATGGCTTC
TACCGCTTTG ACAACTTGAC ACCCGGCCGC TACTTCGTGC AGTTCGGAGC GACTCCTGGC
TACAGCCTGC TTAGTCCACG TGATTCGACC GAGGCAACCG ACGAGACCGA CAGCGATGTC
GATGCCAACC GTCGTACACC GATTGTTGAA CTGGTGTCGA GCGCAGTTGA TCTCAGTCTC
GATATGGGTG TGCTCAACCC GGCCAGCCTC GGCAATTACG TCTGGTTTGA TGCTGATGTA
GACGGTGTTC AGGATGCGAC CGAGAGCGGT GTGCAGGGGG TGCGGGTGCG CCTCTTCACG
ACCGGCAGCG CCACGCCGGT GATGACCACA ACGACTGATA TCAACGGCCT GTATCTGTTC
AACAATCTGT TGCCGGGTGA GTATTACGTC GTCTTTGACC AATTGCCCGC TAATCGCTCG
CTCACCCGCG CTGATCAGGG CAGCGACGAT GCGCTCGATA GCGATGCCAA CCCGCTCGAT
GGACGCACCG GCGTTATCCG ACTGGTATCG GGTGACAATA ATCAAACTGT CGATGCCGGT
ATCTTCGAGA CGATCACGGT CGGCGATCGG GTTTGGATTG ACCGCAACGC GAATGGTGTT
CAAGACACTG ACGAAACCAC CGGCGTGCCC GGTGTGCGCG TCGAATTGCT GCGCGACAGC
GACGGCGCAG TGCTCGATGT GACCTACACC GATCTGACCG GTCTCTACCA ATTCACGAAT
CTCTTCCCCG GCACGTATCG CATCCGCTTC AGTGAGATTC CAACCGGCTA TATCCGTAGC
CCACAAGATC GTGGTGGGGA TGATACGCTC GATAGCGATG CCAACAGCAA TTTCGAGACT
GCACCGTTTA CACCGGGATC GGGCAACAAT CTGCAGTACG ACCTTGGTCT CTACCAGTTG
GCGCGCATCG GCAACTACGT CTGGGAAGAC CGCAACGGCA ACGGCCGCCA GGATGCCGGC
GAGCCGGCGA TTTCCGGCGT GACGGTGACA CTGACCGGCA CGACCGGCGC CGGCGGTTCG
GTGACATTGT CGCAGACGAC CGATGTGAAC GGCTTCTATC TCTTCACTGA CCTTGTGCCG
GGTACGTACA CTGTCAGCGT CACGGCGCCA TCTGGCTATG TCTTCACCGC TGCCAATCAA
GGTGACGATC TGGGTGACTC TGACGCCAAT GCCGGCGGTG TGATGGCTTC AACCACGCTC
GAATCGGGTG AAGAAGACCT GACGTGGGAT GCCGGTCTCT ACCGGCCGGC CAGCATTGGT
GACCGGGTCT GGCGTGACAC CAACGGCAAT GGCGTGCAAG ACGCCGGTGA AGCAGGGATT
GATGGTGTCA ATGTAACACT AAATGGAACA ACTGGTGCGG GTGTGGTGGT TAACCAGACC
ACTACCACTG CCGGCGGCGG CCTCTACAGT TTCACCAATC TCGCGCCGGG TACGTACCAG
ATCACCGTCA CGGCGCCGAG TGGTGAGGTC TTTACCTATC GTGACATTCT GGCGAGTGAG
GTGGCCGGGG CGAACGATAC CAACGACAGC GATGCTGACG CTAGCGGCAT AATGATCGCC
ACCACGCTTG AGTCGGGTGA GAATGATCTG ACATGGGATG CCGGTCTGGT CATTCCGGCC
AGCTTAGGTG ATCTGGTCTG GGAAGACCTC AATGGCAACG GTGTGCAAGA GACAGGCGAA
CCCGGCTTCA ACAATGTAAC CGTCGCGCTG ATTGGGGCCG GACGCGACCG CACCTTTGGC
ACCGCTGATG ATACGTCGGC GACGACGACC ACCAATGGCA GCGGCAGCTA TAGCTTTACC
AACCTGCAAC CCGGTCTGTA TCGCGTCCGC TTTACCCGTC CGAACGGTTA CGGCTTTACG
GTGGGTGACG CGGCGGTAGC CACCGATACG ACCGACTCGG ACGTACCCGG TGGGGTGAGC
GCAACAGCCA CCACGATCAC GGTTGATCTG GAGTCGGGTG AGAATGATCC AACGTGGGAC
GCCGGTCTTT ACCAATTGCT CTCATTGGGC AATCGGGTTT GGGACGATGT CAATAACAAT
GGCCTGCTTG ATACCGGTGA GGATGGCATT GACGGCGTGA CCGTGCGTCT CTACCGTGAT
CTCGACGGTG ATGGAGATGT CAACGACTCC GGTGAGACCA CACCCGTGGC AACCACGACT
ACCGGCAACG GTGGCTACTA CCTGTTCAGC GGCTTGGTGC AAGGCGACTA TCTGGCTGAG
GTGGTCTTGC CGAGCGGTTA TGTCTCAAGC ACCGGCACGA ACGGTAGCGC CAGCGGCCCC
TACGAGTCGG CTCCCGATGC CAATACCAAC AATACCGACA GTGATGACAA CGGTACGCAG
AGTGGCAGCG TTGTGCGTAG TACCGTTGTG CAGTTGCGGC CCAGCACCGA ACCAACTAGC
GAGACCGATC CGCTACCCCC TGCAATCAGC GACCCGGCCC GCAACGAAAA TAGCAACCTG
ACGGTTGACT TCGGGCTTTT CCGCCCGGCC AGCCTCGGCA ATCTGGTCTG GTTCGACCGC
GACGCCAACG GTGTGCAGGA CGGTGGTGAT GAGACGGGAG TGAGTGAGGT TCAGGTCCAG
CTCTTCCGTG ACGATGACGG CACACCCGGC CAAAGTGCCG ACGATAGCCT GATTGCCAGT
ACCACCACCG ATGCTTCCGG TGTGTATGGG TTTGGTTATC TGATCCCGGC GAATAACTAC
TATCTGGTCT TCGATCTGCC AACTGGCTAT ATGCGCAGTC TGCCCGATCA GGGCGGTAAT
GATGCAACCG ATAGCGATCC CGACCGGACG AGCGGTGCAA CTGCGTTGAT CACGTTGGTA
GCCGGTCAGA ATGACCAGAC GTGGGATGCC GGCCTCTACC AATTGGTCAA CCTCGGCAAC
CGGGTCTGGA ACGATGTGAA TAACAACGGC CAGCTCGATG ACGGCGAAAG CGGCATTGAC
GGCGTGACCG TCAACCTCTA CTACGACGCG GATCGCAATG GCACGATCGA TCCGGCTGAA
AATACGCCGG TGGCAACGAC CACCACCAGC GGCGGCGGGT TCTATGCCTT TGCCAACCTT
GATCCGGGTA ACTATCTGGT TGAGATCCCG GCCAGCAACT TTGCCGCCGG CGGAGCGCTT
GGCCCGATCG GCACGATGCC GGCCTTCCGC TCTAGCACCG GCACGAATGG CAGCGCCACC
GGCCCCTACG AGGATGCGCC CGATCCCGAT GACAATGTGG ACAATGACGA CAACGGTACC
GACGATGGAA GTGGGAATGT GCAATCGGCG CTCATTACCC TCCGCTCACA AGATGAGCCG
GCCACCGACG GTGACGATAA TAACGGCAAC CTGACCGTTG ACTTCGGTTT CTTCCGCCCG
CTGGCGTTGG GTAACTTCGT CTGGCACGAC TACAACAACA ACCGCGCAGT AAATGCGGGC
GAACCCGGTA TCAATAATGT GACAGTACAG CTCTACCGTG ATGTCAACAG CAACGGCGTC
TACGATCCCG GCACTGATAC GCTGGTGGCG ACAACGACAA CGTCTGGTGG CGGGTCCTAT
CGGTTCGATA ACCTCATCCC CGGTGATTAC CTCGTTGTCA TTCCAGAAGT CAACTTTAGC
GCTAGCGGCG CTCTCCGCTT CTTCCGCAGC AGTGATGGTG GCAATGCCGA TGACGGTGCA
GGTACGCTTG ATCCCGATAA CAATGTTGAT AATGATGACA ACGGTATCGG GCCGAATGTG
GGTGTAGTCA CCGGTACATC AACTGACCGG GTGGTTAGTG CGGCAGTGAC ACTTTCGTTT
GCCGATGAGC CAACGACCGA AGATAGTGAT AGCAACACCA ATCTGACGGT AGATTTTGGC
TTCTACAGCT TGACTGTCGG TAATCGAGTT TGGCTGGATG TCAACAATAA TGGTGTGGTA
GATGGGGGTG AAACTGGTGT CGATGGTGTC ACAGTGCAGT TGCTCTACGA TGCCGATGGT
AGTGACACAA TCAACGGTTT AGAGACGACG GCTGTAGTAA CGGTTACTAC CAGCAACGGC
GGACGCTACT TCTTTGGTGG TCTGCGTGAT GGCGGTACCT ACGCGGTACA AGTGGCTTCA
AATGCGTCGC TGGTGTCGAG CACCGGTATT AACGGTATGG CAAGCGGTCC GTATGAACCG
GGTCTTGATC CCGATGCCGA CCTGACCGAT AACAACGATA ACGGTACCCA AACCGGCAAT
AGCGAACGTA GCCCAGTCTT TGCGGTGCGG GTTGGTTCAC TGCCGACCAA TGAGCCTGAT
ACCACATTAC CGACCGGTAT CACCAATCCG GCCATCGATG CCAACAGCAA TCTGACAATT
GACTTTGGTC TGTTTGACGA TGCACGGCTA GGTGATCGGA TTTGGCTAGA TGTCAACGGT
GATGGGATTC AGGATACCGG CGAAACCACA AATATTGCCG GAATTACCAT CACCATCTAC
GATGCCACGA CCAACCAGCC GCTCGACGGC GATCCGGTAA CGTCGGGCAT CCAGCCGATT
ACGCGCACCT CGACGACGGC AGCGACGAGC TACCTGTTCA ACCGCCTGAT CACCGGTTCG
TACTATCTGG TCTTTAGCGA CATTCCAGCA CAGTACGCCA TTAGCCCGCT TGATCAGGGC
ACTGACAATG CGGTCGACTC GGATGTCGAT CCGGCAACCC TGCGTACTGC AACGGTCACA
TTTGGCACGA GCAATAACAA CAACCTAACG TGGGATCTAG GCCTCTATCC GCGGTTGACA
CTCGGTAATC TGGTCTGGTA CGACACCAAC GATAACGGAG TAGTTGATAG TGGTGAGAGT
GGTATTGCCA ACGTCCGGGT TGAACTCTAC CGTGACAGCA ATAACAATGG TCAGCCTGAT
CTCAGTGAGT TCGTCACCTT CACTACAACC GACGCGAATG GCAACTATCT CTTTACCGGC
CTCGAGCAGG GTGACTACAT TGTGGTCATT CCGGCGAGCA ACTTCGTGCT TGGTCAGCCG
CTCTACCGCC ATCGCTCAAG CACCGGTGCC AATGGTGCGG CCAGCGGCCC GTATGAGCCG
GCGCCCGATC CCGACAATAA CGCTGATAAC GACGACAACG GTACCGATCA GATCGGCGAT
GATGTGGTCA GTGGTGTGAT CACGCTTAAC CCGTCGTTCG AGCCGGCCAC CGATGGCGAT
GACGCCAACG GCAATCTGAC CCTCGATTTC GGCTTCTTCG AGCCATTGAC ACTCGGTGAT
CTGGTATGGA ATGACCTGAA CAACAACGGC CTGTTCGAGA CCGGTGAAAC CGGTATTGAC
GGTGTACGTG TCGAGTTGTA TCTCGATGGC AACGGCAACG ATCAGGTTGA TGCAGGTGAG
TTTGTTGCCT TCACCACCAC AGCGGGCGGC GGTCTCTACA CCTTCAGCGA CTTGATTGAA
GGCAACTACA TCGTGCGTAT TCCGGCCAGC CAGTTCGCGA CGGGCCAACC GTTGGCCGGC
TTCGTCTCGA GCACCGGTAC GAACGGCAGC GCTAATGGTC CCTACGAAGG TACGGCAACG
CCAGACCCCG ATAACAACCT TGATAACGAC GACAACGGCA CGACCGGTAC CAGCGGCAAC
GTCGATAGCC TACCGATCAC GCTAAGCCGT GGTAACGAGC CAACCAATGA TGGTGATGGC
AACAACGGCA ACTTGACGGT TGACTTTGGC TTCTTCCGCC ATGCCCGCCT CGGTGACCGG
GTCTGGCACG ATGTCAACGC TAACGGTCTG CAAGAGGGCG GCGAAAGCGG TATCAACAAC
GTGACTGTCG AGCTGTACAG CGCTGGCGCT GATGGTGTGA TCGGCGGCGG TGACGATGTG
TCGGTGGCGA CCACAACCAC TGACTCTAGC GGGATCTACG GCTTCGGCTA CTTGATCCCC
GGCAACTACT ACGTGCGCTT TGCTCTGCCA ACTGGCTACA CCGACGTAAG CCCACGAGAT
CAGGGCAGCG ACAACGCGAT CGACAGCGAC GCCGATCCGG TTAACCGCCA AACGGTGGTG
ATCACGCTGG CTGCCGGTGA TAACGATCCG ACATGGGATA TGGGTGTCTT TAACCGGGCC
AGCGTCGGCA ACTTCGTCTG GGAAGATCAC GACGGCGACG GTGTGCAAGA CGCCGGCGAG
TCGGGGATTG ATGGCGTGAC GGTGACGCTC TACCGCGCCG ATGGGACGAC TGTTGCGACA
ACTACTACCG CAGCCGATGG TTCGTACAGC TTCACCGGCC TTGTGCCCGG CGAGTACTAT
GTGGTCTTCA GCAACCTGCC CAGCGGCTAT GTCTTCACCG CCGCCGATCA GGGTAGTGAT
AATACGGTTG ACTCGGATGC CAATCCAACC ACCGGCCGCA CTGCAAACTT CACGCTCACG
AGCGGGCAGA CCGACACCAC GTGGGATGCA GGCGCGTATC GGCCGGCCAG CATTGGTAAC
TACGTCTGGG AAGACACAAA CGGCAACGGC GTGCAAGAAA CGGGTGAGTC GGGAGTGGCG
AACGTGACGG TGACGCTCTA CCGCGCGAGC GACAACTCCG TGGCGGGGAC GGCTACGACC
GCGCTCGATG GCTCGTACAG CATCACCAAT CTCGTGCCCG GCGAGTACTA TATCGTCTTC
AGCAACCTGC CGAGCGGCTA TGTCTTCACG GCGGCTGATC AGGGCGATAA TGCGCTCGAT
TCGGACGCTA ATCAAACCAC CGGCCGCACG GCCAACTTCA CGTTGGTCAG TGGTCAGAGT
GACCTGACGT GGGACGCCGG TCTCTACCAG CCGGCCAGCG TGGGCAACTA CGTCTGGGAA
GACACAAACG GCAATGGCGT GCAAGACACG GGTGAGGCGG GAGTGGCGAA CGTGACGGTG
ACGCTCTACC GCGCGAGCGA CAACTCCGTG GCGGGGACGG CTACGACCGT GCTCGATGGC
TCGTACAGCA TCACCAATCT CGTGCCCGGC GAGTACTATA TCGTCTTCAG CAACCTGCCG
AGCGGCTATG TCTTCACGGC GGCTGATCAG GGCGATAATG CGCTCGATTC GGACGCTAAT
CAAACCACCG GCCACACGGC CAACTTCACG TTGGTCAGTG GTCAGACCGA CCTGACGTGG
GACGCGGGTC TGTTCGGTGC GGCCAGCATT GGTGACCGGG TGTGGGAAGA CCTAAATGGT
AATGGCGTGC AAGACGCCGG CGAGCTTGGT GTAGCCAATG TGGAAGTGCG CCTGAGTGGG
ACGACCGGTG TCGGGGCGAC GGTCAATCTG ACGACGACGA CAACCATCAC CGGCTCCTAC
CGGTTCGACA ATCTGGCGCC GGGCACCTAC ACGGTAACGG TGGTGCGACC GAGCGGGTAC
GAGTTCACAG CGGTCAATCA GGGAAGTGAC GATGCAGTGG ACTCGGACGC TGACCCGACC
AGTGGCGCGA TGAGCGCAAC CGTGTTGGTG TCGGGTGAGG AAGACCTGAC GTGGGACGCC
GGTCTCTACC GCCCGGCCAG CATCGGCGAC CGGGTCTGGC GTGATACCAA CGGCAATGGT
GTGCAAGACG CCGGCGAGCT TGGCGTGGCC AATGTGGAAG TGCGCCTCAG CGGGATGACC
GGTGCAGGTG TGTCGGTTAA TCGCACCACG TTCACCGATA GCGATGGCCT CTACCGGTTC
GACGGCCTTG CGCCGGGTGA GTACGCCATC ACTGTGATCG CGCCGCCGAA TGACGCCTTC
ACCGTACCCA ATCAAGGTGG CGATGATGCA CTTGACTCGG ATGCGGACGC AACCGGTGCG
ATGCCGATCA CAACGCTGAC GTCGGGCGAG GAAGACCTGA CGTGGGACGC CGGCCTGTTC
GGTGCGGCCA GCATTGGTGA TCGGGTGTGG GAAGACCTAA ATGGTAATGG CGTGCAAGAC
GCCGGCGAGC CGGGGGTGGT GAATGTTGAA GTGCGGTTGA GCGGGACGAC CGGCGCAGGC
GTGGCAGTCA ACCTGACCAC CATGACCGAT AGCGGTGGCC TCTACCGCTT CGACAACCTC
GCGCCGGGGA CGTACACGGT GACGGTGAGC CGACCGAGCG GGTACGAGTT CACGGCGGCC
AATCAGGGCG CCGACGATGC GGTCGACTCG GATGTGACCA ATTCGGCTAC CGGTGCTATG
AACGCCACAG TGCTGGTGTC GGGCGAGGAA GACCTGACGT GGGACGCGGG TCTCTACCGC
CCGGCCAGCA TCGGCGACCG GGTCTGGGAG GACGTGAACG GGAACGGCGT GCAAGACACG
GGTGAGGCGG GAGTGGCGAA CGTGACGGTG ACGCTCTACC GCGCGAGCGA CAACTCCGTG
GCGGGGACGG CTACGACCGT GCTCGATGGC TCGTACAGCA TCACCAATCT CGTGCCCGGC
GAGTACTATA TCGTCTTCAG CAACCTGCCG AGCGGCTATG TCTTCACGGC GGCTGATCAG
GGCGATAATG CGCTCGATTC GGACGCTAAT CAAACCACCG GCCGCACGGC CAACTTCACG
TTGGTCAGTG GTCAGAGTGA CCTCACGTGG GACGCGGGTC TCTACCGTCC GGCCAGCGTC
GGCAACTTCG TCTGGCGCGA TGTGAACGGC AACGGCGTGC AAGACGCCGG CGAATCCGGT
ATCAGCGGCG TGACCGTCAC TCTGACCGGG ACCGGGCCGG ATGGTGTGTT CGGTACTGCC
GACGACATCA ATCGTACCGT GACAACCGAC AGCAACGGTG AGTACATCTT TGACAACCTG
CCGCCGAGCA ACTACCGGCT GACCTTTAGC GGCATTCCGG CTGAATTAAC CTTCAGTCCA
GCCGATCAAG TCGGCGATGA TACCGCTGAT AGTGATGTCA TCACGAGCGG CGGCGTCACC
GATGTTTTTG CCCTCACCAG TGGTCAGAGC GATCTCACCC GCGATGCCGG ACTCTACCCG
CTGCTCAGTC TGGGTAATCT GGTCTGGATC GATACCAACA ACAACGGTGT CTTTGATAGT
AGCGAGTCGG GCGCAGACGG TGTGCGAGTC CTCCTCTACC GCGACAGCAA CGGCAACGGG
ATGTGGGATG CGGATGATCC ACAGGTTGGT ACAACGTGGA CGGATAGCAA TGGCAATTAC
CGCTTTACCG GCTTGCCACA AGGCAACTAC TTCGTGGTCT TGCCCGGTCG GCAATTCGGA
GTCGACGGCG TCTGGTATGG CTATCGCAGC AGTACTGGCG ACTTCTCGCT AACGGCTGGG
CCGTATGAGC CGGCGCCCAA CGCCAACAAC GATCTCGACA ACGACGACAA CGGTACGCGG
CAGGCTGATG CTGACTCGGA CTTCAACATC GTCAGCGGCT TGATCGAGCT GCGGCCCGAT
AGCGAGCCGG ATACATCCGC TGATGGTGAT GGACGTGATA GCAACTTGAC CATCGACTTC
GGCATCTTCC AGCCGGCGGT GGTGGGCGAC ACAGTTTGGA GTGACCGTAA TGGCAATGGC
CGGCAAGACG CTGAAGAGCC GGGAGTGGCG AATGTGCGCG TCACCCTCTA CTACGTGGGC
ACGGACGGCA TTGCCAATAC GGCTGACGAT ACGTTGGTAG GGACGCGGCT CAGCGATAGC
GATGGCTTCT ACGAGTTTAC CGATCTGCTG CCGGGCGATT ACTATCTGGT CTTCAGCGAA
TTGCCGGTGG GGGCGCGCTT CACCGCTGCC GATCAAGGTG CTGATGACAC GATCGACTCG
GATGCTGATC CGGTCACCGG CGTGACCGCC GTCTTCTCGA TCGAGAGCGG CACGATCACT
TGGGATTGGG ACTCCGGTCT GCTCTTGCCG GCCAGTGTGG GTGATCGGGT GTGGCTGGAT
ACTAACGGGA ACGGGGTGCA AGACAGCGAT GAGAGCGGGA TCGAAGGCGT GACCGTGCGC
CTGACCGGTA CCGACATTGA TGGGAACACG GTTGATCTGA CCAGGGTTAC CGATGCGAAC
GGGAACTACC GGTTCGACAA CGTGCAGCCG GGTACGTACA CGATCACGGT GACGCCACCG
ACCGGTTATA TGATCACAGC AGCGAATCGA GGTAGCAACG ACAGCACCGA TTCGGACATC
GACTCGAGCG GTGCGACCGA TAGCTTCACC GTTCTCGGCG GTGATGTCGT GCTGACGTGG
GATGCCGGTC TCTACCGGCC GGCCAGCATC GGTAACTTCG TCTGGGAGGA CGTGAACGGG
AATGGGGTGC AAGATGCCGG CGAGTCGGGG ATTGATGGTG TGATGGTGAC ATTGAATGGG
ACAACCGGTG CGGGGGAGAC GGTCAACCTG ACCACTACCA CCAGCAGCGG TGGCCTCTAC
CGGTTCGACA ACGTGCAGCC GGGTACGTAC ACGATCACGG TGACGCCACC GACCGGTTAT
GTGATCACGG CAGCGAATCG GGGCAGTGAT GACAACACCG ATTCGGACAT CGATCCGGTC
ACCGGTGCCA CCACCACCTT CGCTCTCACG AGTGGCGCCA CCGATCTAAC GTGGGATGCC
GGATTGTACC AACCGGCGAC GTTGGGCAAT CGAGTCTGGC ACGACAGCAA TGCCAACGGC
ATTGCCGAAA GTGGTGAAGA GGGTGTCAGC GGAGTTACGG TCCGGCTATA TCGGGCTGAC
GGCACACTGG TTGATACCGT CGTGACCGAT AGTAATGGTC GCTACCTGTT CACCAATTTA
CCACCGGGAA GCTACTACCT GGAGTTTGAG CTACCGAGTG GCTGGGTCTT TAGCCCACCG
ATGCAAGGCG GTGACAGGGG ACAGGATAGC GATGTCGATC CGAATACGCA GCGCACAGCA
ATCTTTACGA TCGGTTACGG TGAAACCGAT CTCAGTTGGG GGGCAGGCAT CCACCAGCCG
GCGCCGCCGA CCGCGATCAC GCTGCTCAGC TTCACCGCCG AGCGCCAGAC CAACGGAGTG
TTGCTGCGCT GGGTAACCGG TAGCGAACGG GATACACTCG GTTTCGTGAT CTTGCGCAGC
GCGAGTGGCA ACCGGGCTGA TGCAGTGCAA CTCTTCACGA CACCAATTCC GGCACAAGGT
AGTGCAGGCA GTGGTGCAAG CTATCAGTGG TTCGACCGCA CGGCCCAGCC TGACGTAAGC
TATCGCTACT GGCTGGTTGA AATCGAGTCT GGCGGTGGGC GTAACGAGTT TGCCCTTCAT
AGCCCAGCGC TCCAATCCAC CTACCGTCTG TTGATACCGG TGATCTTGCG TTAA
 
Protein sequence
MLTNIAALRR SFVSLLLVIA VASSGLQPTI AFANATVSVM LSTDFPIVDS GQWAVIKVDY 
SCSSVLNTPC ENATVTAVIP PELAGAVGDV QALGAGTSPS YNPATRTVTW AFNAPLGAGD
SGRLELRVRF PAGSTPDGTT ATLRAEMRST TAPPRLSNPL TITARAEPRA VATKTFVSGG
APDVPTTYQV QVCIPNSGLG ALNLTNVQIV DTLPAGVTFV SASDGGTYNS STNTVTWPST
SLTVPSSLCA TRTVTVIFPS TTFSVGTEVR NVVDVTAQAG SVTLTLTDDD VRRIQPPTPG
LGSSKNGPTT ALIGDTVTYT LSAVNTGTTA LTDVVITDPV PPELQVTRIN VDGGNVSGIR
VALEYTTNLN STFTAVPGSP FTTTNCINIA PATGGGCSTL TLGVGEQITA IRWRYLDPLP
FGFSATGHGF SAVVTASPVN AVIVNQATSE YTFNGYTSTR INETRTRIIE PGARAVVGKS
VNPTIAYAGD TVEYTLTLQN NQIGTPAAAL VNPVLADLLI ESLQYVPGSW SVVSRPSGAP
DPVFEAIDNY NGTGRMLLRW RWDSYSLPPG QSFTIRFQAR INPATLAGTI TNTASLASFA
NPPGQIFIDQ CSQQNPDNYD FDGDGNISEL ICSSSITSLS LAAAATASSA KWVLGQLDTE
WTRDPNVGQT TPGGMADYTL IITNTNSVAL TNLVLIDILP WVGDVGVVRF NDPRGSAWQP
YLAGPISVPD GATVYYSTTN NPCRNPDLGL TDVDGNPIDA PGCVDPQWST VPPADITTVR
SVRIDFGSRV LYPGDSVVVT WPMRAPVGGT PDEVAWNTFG YRAFTVNGDP LLAAEPPRVG
IQRGAVLPPS YGNYVWLDAN LNGVQDTGEV GVNGARIDFY QDNDGITGPS TGDRWVGYTI
SGPDNDGNPG FYLFSDPLDI PLGDYYIRVT PPAGYGFTTP NVGANDAIDS DIEPATRYSA
VTGLTSGEND DTWDVGLVTV TAVGNYVWID RNGNGIQDEP PGDGVNGVTV RLYRSDNTPV
ATTVTADDFL GNPGYYLFSD LAPGSYYIEF VLPTGFSFTT ADSGSSDEAD SDANPTTGRT
TVFTLAANQL DRSRDAGLIA PSGTLRLGNR VWYDRDNDDR YEPVNGETGI NGVSLSLFRD
FNNNGQPDPG EHVGNSVTMT VGGEAGYYQF TNLAAGDYIV VVDDSNFAPG GALFGMRTSS
GNDPAPDPDN NVDHDDNGDL LGVIVRSLPI TLSVGSEPVD DGDDANGNQT LDFGFIRGAA
LGDRVWFDTN NNGIQDAAEP GVPGVTVELL DGSGNPIDSD PSTTGIQPTI TVTDGDGRYG
FTDLNAGTYR VRFSGLPSGY SFTTPDQGSN DALDSDADAT GLTTVITLAA NQTDLRWDAG
LVATPASLGN RVWNDLNYNG IQDTGEPGVS GVSVSLFRPG YDGVAGTADD ELVATDTTDS
SGNYSFTNLP PGRYFVQFGP PPTGYAITAT DQGTDDAADS DADLTTRRTV LIDLAPGEND
PDWDMGLFVF ATIGDRVWSD TNNNGIQDTG EPGVSGVQVR LYRPGSSVPV AMTTTNSSGI
YTFTNLIPDN YYVEFSLPGG YRASPRDQGG DDTLDSDADP VTRQTAATTL APGENDPTWD
FGIVPTASIG DRVWLDLNAN GIQDANETAG VPGVQVVLYD GVGNVLNTTV TDVDGLYRFD
NLLAGNYYLR FVVPASFVVS PQDQGTNDNA DSDVNPTTLL TVPTTLSAGE NDLRWDLGLY
QLASIGDRVW HDLNGNGRQD GGEPGVPSVS VSLYQPGPDG LAGTADDVLV ASTTTDSNGF
YRFDNLTPGR YFVQFGATPG YSLLSPRDST EATDETDSDV DANRRTPIVE LVSSAVDLSL
DMGVLNPASL GNYVWFDADV DGVQDATESG VQGVRVRLFT TGSATPVMTT TTDINGLYLF
NNLLPGEYYV VFDQLPANRS LTRADQGSDD ALDSDANPLD GRTGVIRLVS GDNNQTVDAG
IFETITVGDR VWIDRNANGV QDTDETTGVP GVRVELLRDS DGAVLDVTYT DLTGLYQFTN
LFPGTYRIRF SEIPTGYIRS PQDRGGDDTL DSDANSNFET APFTPGSGNN LQYDLGLYQL
ARIGNYVWED RNGNGRQDAG EPAISGVTVT LTGTTGAGGS VTLSQTTDVN GFYLFTDLVP
GTYTVSVTAP SGYVFTAANQ GDDLGDSDAN AGGVMASTTL ESGEEDLTWD AGLYRPASIG
DRVWRDTNGN GVQDAGEAGI DGVNVTLNGT TGAGVVVNQT TTTAGGGLYS FTNLAPGTYQ
ITVTAPSGEV FTYRDILASE VAGANDTNDS DADASGIMIA TTLESGENDL TWDAGLVIPA
SLGDLVWEDL NGNGVQETGE PGFNNVTVAL IGAGRDRTFG TADDTSATTT TNGSGSYSFT
NLQPGLYRVR FTRPNGYGFT VGDAAVATDT TDSDVPGGVS ATATTITVDL ESGENDPTWD
AGLYQLLSLG NRVWDDVNNN GLLDTGEDGI DGVTVRLYRD LDGDGDVNDS GETTPVATTT
TGNGGYYLFS GLVQGDYLAE VVLPSGYVSS TGTNGSASGP YESAPDANTN NTDSDDNGTQ
SGSVVRSTVV QLRPSTEPTS ETDPLPPAIS DPARNENSNL TVDFGLFRPA SLGNLVWFDR
DANGVQDGGD ETGVSEVQVQ LFRDDDGTPG QSADDSLIAS TTTDASGVYG FGYLIPANNY
YLVFDLPTGY MRSLPDQGGN DATDSDPDRT SGATALITLV AGQNDQTWDA GLYQLVNLGN
RVWNDVNNNG QLDDGESGID GVTVNLYYDA DRNGTIDPAE NTPVATTTTS GGGFYAFANL
DPGNYLVEIP ASNFAAGGAL GPIGTMPAFR SSTGTNGSAT GPYEDAPDPD DNVDNDDNGT
DDGSGNVQSA LITLRSQDEP ATDGDDNNGN LTVDFGFFRP LALGNFVWHD YNNNRAVNAG
EPGINNVTVQ LYRDVNSNGV YDPGTDTLVA TTTTSGGGSY RFDNLIPGDY LVVIPEVNFS
ASGALRFFRS SDGGNADDGA GTLDPDNNVD NDDNGIGPNV GVVTGTSTDR VVSAAVTLSF
ADEPTTEDSD SNTNLTVDFG FYSLTVGNRV WLDVNNNGVV DGGETGVDGV TVQLLYDADG
SDTINGLETT AVVTVTTSNG GRYFFGGLRD GGTYAVQVAS NASLVSSTGI NGMASGPYEP
GLDPDADLTD NNDNGTQTGN SERSPVFAVR VGSLPTNEPD TTLPTGITNP AIDANSNLTI
DFGLFDDARL GDRIWLDVNG DGIQDTGETT NIAGITITIY DATTNQPLDG DPVTSGIQPI
TRTSTTAATS YLFNRLITGS YYLVFSDIPA QYAISPLDQG TDNAVDSDVD PATLRTATVT
FGTSNNNNLT WDLGLYPRLT LGNLVWYDTN DNGVVDSGES GIANVRVELY RDSNNNGQPD
LSEFVTFTTT DANGNYLFTG LEQGDYIVVI PASNFVLGQP LYRHRSSTGA NGAASGPYEP
APDPDNNADN DDNGTDQIGD DVVSGVITLN PSFEPATDGD DANGNLTLDF GFFEPLTLGD
LVWNDLNNNG LFETGETGID GVRVELYLDG NGNDQVDAGE FVAFTTTAGG GLYTFSDLIE
GNYIVRIPAS QFATGQPLAG FVSSTGTNGS ANGPYEGTAT PDPDNNLDND DNGTTGTSGN
VDSLPITLSR GNEPTNDGDG NNGNLTVDFG FFRHARLGDR VWHDVNANGL QEGGESGINN
VTVELYSAGA DGVIGGGDDV SVATTTTDSS GIYGFGYLIP GNYYVRFALP TGYTDVSPRD
QGSDNAIDSD ADPVNRQTVV ITLAAGDNDP TWDMGVFNRA SVGNFVWEDH DGDGVQDAGE
SGIDGVTVTL YRADGTTVAT TTTAADGSYS FTGLVPGEYY VVFSNLPSGY VFTAADQGSD
NTVDSDANPT TGRTANFTLT SGQTDTTWDA GAYRPASIGN YVWEDTNGNG VQETGESGVA
NVTVTLYRAS DNSVAGTATT ALDGSYSITN LVPGEYYIVF SNLPSGYVFT AADQGDNALD
SDANQTTGRT ANFTLVSGQS DLTWDAGLYQ PASVGNYVWE DTNGNGVQDT GEAGVANVTV
TLYRASDNSV AGTATTVLDG SYSITNLVPG EYYIVFSNLP SGYVFTAADQ GDNALDSDAN
QTTGHTANFT LVSGQTDLTW DAGLFGAASI GDRVWEDLNG NGVQDAGELG VANVEVRLSG
TTGVGATVNL TTTTTITGSY RFDNLAPGTY TVTVVRPSGY EFTAVNQGSD DAVDSDADPT
SGAMSATVLV SGEEDLTWDA GLYRPASIGD RVWRDTNGNG VQDAGELGVA NVEVRLSGMT
GAGVSVNRTT FTDSDGLYRF DGLAPGEYAI TVIAPPNDAF TVPNQGGDDA LDSDADATGA
MPITTLTSGE EDLTWDAGLF GAASIGDRVW EDLNGNGVQD AGEPGVVNVE VRLSGTTGAG
VAVNLTTMTD SGGLYRFDNL APGTYTVTVS RPSGYEFTAA NQGADDAVDS DVTNSATGAM
NATVLVSGEE DLTWDAGLYR PASIGDRVWE DVNGNGVQDT GEAGVANVTV TLYRASDNSV
AGTATTVLDG SYSITNLVPG EYYIVFSNLP SGYVFTAADQ GDNALDSDAN QTTGRTANFT
LVSGQSDLTW DAGLYRPASV GNFVWRDVNG NGVQDAGESG ISGVTVTLTG TGPDGVFGTA
DDINRTVTTD SNGEYIFDNL PPSNYRLTFS GIPAELTFSP ADQVGDDTAD SDVITSGGVT
DVFALTSGQS DLTRDAGLYP LLSLGNLVWI DTNNNGVFDS SESGADGVRV LLYRDSNGNG
MWDADDPQVG TTWTDSNGNY RFTGLPQGNY FVVLPGRQFG VDGVWYGYRS STGDFSLTAG
PYEPAPNANN DLDNDDNGTR QADADSDFNI VSGLIELRPD SEPDTSADGD GRDSNLTIDF
GIFQPAVVGD TVWSDRNGNG RQDAEEPGVA NVRVTLYYVG TDGIANTADD TLVGTRLSDS
DGFYEFTDLL PGDYYLVFSE LPVGARFTAA DQGADDTIDS DADPVTGVTA VFSIESGTIT
WDWDSGLLLP ASVGDRVWLD TNGNGVQDSD ESGIEGVTVR LTGTDIDGNT VDLTRVTDAN
GNYRFDNVQP GTYTITVTPP TGYMITAANR GSNDSTDSDI DSSGATDSFT VLGGDVVLTW
DAGLYRPASI GNFVWEDVNG NGVQDAGESG IDGVMVTLNG TTGAGETVNL TTTTSSGGLY
RFDNVQPGTY TITVTPPTGY VITAANRGSD DNTDSDIDPV TGATTTFALT SGATDLTWDA
GLYQPATLGN RVWHDSNANG IAESGEEGVS GVTVRLYRAD GTLVDTVVTD SNGRYLFTNL
PPGSYYLEFE LPSGWVFSPP MQGGDRGQDS DVDPNTQRTA IFTIGYGETD LSWGAGIHQP
APPTAITLLS FTAERQTNGV LLRWVTGSER DTLGFVILRS ASGNRADAVQ LFTTPIPAQG
SAGSGASYQW FDRTAQPDVS YRYWLVEIES GGGRNEFALH SPALQSTYRL LIPVILR
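
Both sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-content calculation; the helper names are hypothetical, and the example string is only the first line of the gene sequence, not the full record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence as printed above (six blocks of 10 bases).
    first_line = (
        "ATGTTGACCA ACATCGCCGC ACTTCGTCGC "
        "TCGTTCGTAA GCTTGTTGCT GGTCATCGCT"
    )
    seq = join_blocks(first_line)
    print(len(seq), round(gc_percent(seq), 1))
```

Applied to the complete gene sequence, the joined length could be compared with the Gene Length field and the GC percentage with the GC content field.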