Gene Tery_3830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3830 
Symbol 
ID4242281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5906307 
End bp5916920 
Gene Length10614 bp 
Protein Length3537 aa 
Translation table11 
GC content40% 
IMG OID638108763 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_723346 
Protein GI113477285 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCTA ATAGCTCAAA ACTCTGCACC ATAAAAGCTC TGAGTTTAAT ACTAGGCACA 
TTAGCCACCA CTCCCGCCAA CTCCCAGCCC ATAACCCCGG CTAAGGACGG TACCAATACT
ACTGTTACTC CACAGGGCCA ACAATTTCAT ATTAAAGGTG GGACTCGCAG TGGTACTAAT
GTATTTCACA GCTTTGATAA GTTTAATGTG CACTCTGGTC AAACTGCCAA CTTTGTCACG
ACCCCGGATA CTCGCAACGT TCTCGGACGA GTTAAGGGAG GCGCTTCTTA TATTAATGGT
CTTATTCAGG TTCTAGGTAG CAGCTCTAAC CTCTTCTTAA TGAACCCCGC AGGCATAATG
TTTGGCCCAA ACGCCAGCCT CAATGTACCT GCGTCATTCA GTGTTACTAC TGCTACTGGT
ATTGGTTTTG ACCAGAATAA TTTCTGGTTC AAGGCTATGG GCACTAATGA CTATTCAAAT
TTGGTCGGAA ACCCTAGTGG TTATAGGTTT GATGTTTCTA AGCCTGGTTC TATTGTTAAT
GAAGGGAATT TAACTTTAAA ACCCCAAGGA AATTTAACTT TGTCGGGTGG AACTGTTGTT
AATACAGGGG AACTCTCTAG CCCTGGGGGT AATATTACTG TCACTGCTGT CGAAGGTGGT
AGTACCCTGA AAATATCTCA ACCAGGACAT TTGTTAAGTT TAGAAGTACC CCTAGAGGAT
GGAGAAAATA TTTCTAATAT TGATCCTCTA TCTTTACCAG AGTTATTAGC TGGTGGAGGA
GATATCGTTG AGGCTACATC TGTCGTTGTT AAAGAGAATG GTGATGTGGT TTTGAATGGT
TCAAATACTA TGGTGGCTGA GACTCCGGGA ACGGCAACTA TTTCGGGAAA GATAGATGTA
TCAACTACTG CTACTTCTCT TAAGGAGAAG GGGCGTGTTC CTAGTCAGAG AAAGAAGGAG
GGTGCTGCTA GTCCAGGAAA GATAGATGTT CCTCCTTCCC TCAAGGAGAA GAGAGATGTT
GCTAAGGCCG GTAAAGTTAA TGTGTGGGGC GATCGCGTGG CTCTTATAGA TACAAATATC
AAGGCTGATG GCAAAAATGG GGGTGGAACT GTATTAATAG GAGGAGATTT TCAAGGGTTG
GGTATAGTTC CTAATTCTCA GCATACTTTT GTTAATAATA ATTCATTTAT TTCTGCGGAT
GCTATCACAA ATGGTGATGG AGGTCAAGTT AGAATTTGGT CGGATGGTAT TACTAATTTT
GCGGGAAATA TTAGTGCTAA AGGCGGCACA TCTTCGGGAA ATGGAGGGTT GGTTAAAATT
GGGGGAAAAG AACAATTAAT CTTTGATGGA AAGGTGGATG TTACTGCGGC TTTAGGAACT
AAAGGTAGGA TTTTATTAGA CCCAGAAAGT GTGACGGTGG GAGAGGATAA TTCGGAAGGT
GAAAAGGAAA TTGTCGAGGA TAATTCGGAG GTTTCTGAGA CAGAAAATAC TGATAATTAT
GATGGGGAAA TTATTGAGGA TAATTCCGAG GTTTCTGAGA CAGAAAATAC TGATAATTAT
AATGGGGAAA TTGTCGAGGA TAATTCCGAG GTTTCTGAGA CCGAAAATAC TGATAATTCC
ACAACAAAAA ATACTGATAA TTTAGAAGAT AAAGAAACAG AAAATCCTCT TGATCCTTTT
GCTGCTGATG AGAATTCTGA TGTGACTATT TCGGCGGATA ATTTAGGGGA ATTATCGGGG
AATGTGATTA TTGATGCTGA TAATGATATT ACGATAAATG AAAGGATAGA AACTGATTCT
TCTGTGGAGT TAAAGGCGGG TCGAAGTATT AATATAAATG CAGATATTGA TACTAAGAGT
GGAAATGGAA ATATTGATTT ATTGGGTAAT AATGATGAGA TGAATTTGGC TAACCGTTCT
GATGGGAAAG CTTCGATAAA TCAATTGGAT GGAACAATTT TAAATGCTGG AAGTGGTGGA
ATTAATATTA AGTTGGGAAG TTTAGGAGAG GTAGGTGATA TAAATCTGGG AAATTTAAGG
ACGACAGGGA AGGTTTTGGT TGATGCAAAT GGGGGAAATA TTGTTAGAGT TTCTGAGAAT
TCTTTGATAA ATGCGGGGAG TGTTTTGTTT AGAACTTCTG GGAATGGAGG TATAGGTTTT
CTTGGTCAAC CTTTGCGGTT AGATGTGCAG AATTTGGAGG CGGTTTCTGG TAGTGGTGGG
GTGTTTTTTG AGTCGTTGAA AAATGTGAAT ATTGGTGGTG TAAGTGAGGA TGTGGATGGT
ATTGCTACTT TTGGAGGAGA TGTTGATATT AAGAGCGCGG GAAATGTGAC TTTAAATGAA
ACTATTTCTA GCAATGAGGT TGTAGAGAAT AATTCTGAGG CGACATCTAC AGAAAATACG
GAGGTTGTTG ATGGAGGTGG TGGGATAAAT ATTGAAGCGG CGGGAGATAT TGTGGCTACG
GGTAGTGGTA TTAAAGGTGG TGGGGAAGCT GTTTCTTTGT CTGGGACTAA TATTAGGATA
AATGATGAAT TTAATGAGAC TTCTGGGGAT GCGGATGTTA AGTTGTCGGC GACAAATGAT
ATTGTTGTTG AGGATATAGA GGATGATGCA TTGGAGTTTA TGCCTGGGAA TGGGGAGATA
GAATTTCGTG CGGATGTGGA TGGGGATGGG TTTGGTCTTG TGGAGATTTT GGATAATAAG
CCGGATGTTG GGAGTAACCC TGATGTTTTT GATAATGGGG CGGATACTAT TAAGACTAAT
GGGCGAGGTT TGACTATTGC TGGTGCTGGT GTGGTTTTGG GGAATGTTGA TACTTCTTGG
TTGCCTGTTT ATTCTGGTGG GGAGCTATTG AAGGCGATCG ATGTGGATAA GGGGGGGGCG
ATACCTCCGG TAGGTACAGA AGGCACTGCA ACATTTACTT TTACTGTAGA TGGTGACTTG
GGAACTATAA AAAATATAGA TGTTCGCTTT TCTGCAAAAC ATACTTACAA TAAGCAGTTA
GACGTTAGTC TGGAATCACC CCAAAAAACG AAAGTACAAT TGTTTGCAGA GGTTGGTAAT
AGGGGAGATA ACTTTCAGGA TACTGTATTG GATGATGATG CTTCTACAAT TATTAGTATG
GGCAGTGTTC CGTTTGATGG TACGTATCAT CCCCAAGGAA GTTTAACAGA TTTTAATGGA
GAAACTCCGA ATGGAAGGTG GAATTTGAAG GTAACTGATA CCTCTGAGGG ATACGATGGC
ACTTTATACA GAGCTGGTGA GCAGGCTCCC TGGGGAACTG CTATAGGTAC ACAACTGTTG
CTGCGTAACC CTCTTGTTCA GAGTGGTGGA GGAAGAGGAA GTGGAAATGG TGGAGCGATA
AACCTAAAAG CAACTCATGG TGATATTAGT GTAGGAAATA TTCGTAGTTT GAGCGAAGCT
GCTAATGGGG GAAGAATTGA CCTGAATGCT AATAACAATA TTATTACTGG TCTGATTAAT
TCTAGTTCTG TGCAAAGGAA TGGAGGAGCA ATTGATTTGG TAGCTGGTGG GAATATCACT
ACACAAGACC TTTATTCCTA TTCCTGGTCC AGCTCAGGCA ACTCAGAAAA TGGAGGAGTA
ATTGATTTAG ATGCTGGTAG CAATATCACT ATACAATCCC TCTATTCCTA TTCCTATTCC
AGCTTAGGCA AATCAGGAGA CGGAGGAGCC ATTAATTTGG ATGCTGATGG CAATATCACT
ACACAATCCC TTAGTTCGAG GTCTTATTCC CCTTCAGGTT CAGGAAATGG AGGAGCAATT
AATTTGGATG CTGGTGGCAA TATTACTATA CAATCCCTCC ATTCCAGGTC CACTGCCAGA
TCATTCTCAG GAAATGGAGG AGCAATTGAT TTGGATGCTG GTGGCAATAT CACTACACAA
GAACTCTATT CCTATTCCTA TTCCAGCTTA GACAACTCAG GAAATGGAGG AGCAATTAGT
TTGGATGCTG GTGGCAATAT CACTACACCA TCTCTCAATG CATATTCTTT TTCCCCCTCA
GGCAACTCAG GAAATGGAGG AGCAATTAGT TTGGATGCTG GTGGCAATAT CACTACACAA
TCCCTCAATT CTTATTCCTA TTTGTCTTCA GAAAAAGGGA ACTCAGGAAA TGGAGGAGCA
ATTAGTTTGG ATGCTGGTGG CAATATCACT ACACAATCCC TCAATTCCTT TTCCTCTTCA
GAAAAAGGTA ACTCAGAAAA TGGAGGAAAA ATTGATTTGG ACGCTGGTGA CTATATCACT
ACACAAGATC TCAAGTCCTA TTCTTTTTCT CCCTCAGGCA ACTCAGGAAA TGGAGGAGCA
ATTAATTTGG ATGCTGGTGA CTATATCACT ACACAAGACC TTGATTCGAG ATCCTCTTCC
AGCTCAGTCT CAGGAAATGG AGGAGCAATT AATTTGGATG CTGGTGACTA TATCACTACA
CAAGGCCTTG ATTCCTATTC GTACTCAGAA AAAGGCAACT CAAGAAATGG AGGAGACATA
AAACTGAATG CCAATAAAAT AAAACCAACC TCGGATAATG AAGAAAAACT AACAATACAT
ACATTTTCAT TCAGGAAAAA TAAGTTTGGG GAAGGAAAAG GTGGAGACGT TAACATTACC
ACTAACAATC TCAGCAAGAC AGAAATATTT ACACTATCTT CCCGCTCAGA ATCAGGAAAA
GTAACCATAG AAAGTAAGAC ACAAGAACCA CTTCAAATCA ATGACTCATC CATTACTACA
AGTGTACAAG TAAGCGTACA AATATGTTCC TCCTGTGGAA CAAGACTAGT TGGTGAAATA
GGAGATACTC GATCAGGCGA AGTCTCTATC AATAGTTCCG GCGACCTCAG CCTCAACAAC
GTCACCATAG AAAGCGACAC CAAAAGTAAC CAAGCCGCAG GAGACGTCAA CATCTACAGC
ATTGGCAACA TTACACTTGA CAAGACGGAT ATCATCAGTA CTACCAACTC CCAAGGCAAA
GCCGGCCAAA TCAACTTAGA AACCAAGCAA AACATAGAAC TCACCAACAA CAGCAAAATC
TTAGCCAACA CAGAAGGAAC CGGAAACGCA GGTCAAATAA ACATAGAAGC TAACAACCTC
ATCCTAGACC AAAACACCAA ACTAATCACA GAAACCGCAA GAGCTGGAAA CCCCGGAGAC
ATAAATATCC AAGCTAACAC CATAGAGATA GGAGAAGGCG CTAAAGCCAG CACCACAGTC
TTAACAGGTT CAACCAGCAC CGGAGAAGGA GGCAACATCA CCATCAACAC AAACGAACTC
AACGTCACAG GAAAACTCGG AATATTTGCC GAAACCGAAG CGAGCAAAAA CGCCGGTATC
CTTCGTATAT CACCCTACAA AAACAACCCC AACCTAGACA TCATCTTCAA AAACGAGGGC
TTCATCTCCG CCTCTACCTC ATCCACAGGC AACGGAGGCA GCATATTCAT ACAAGCCCCA
GAAAACATAA AAATAACAGG CAAAGGATTC ATTGCCACAG AAACATCAGG TACAGGTAAC
GCCGGAATTA TTGATATCAA AACCAACAAC CTGAGAATAT CCAATGGAGT CAAAATCAAT
GCTTCCACAG AGGACCAAGG CAACGCCGGC GAAATCAAAA TCAACACCAC AGACTTCACC
CTAGAAAAAG GAACAAGTCT AACAACAGAA ACCAACAGCG CCGGACTAGC TGGAAACATA
GAAATAAACA CCAAAAAACT CACAATAGGA AAAAACGCCC AAATAAGTGC CACAGCACTA
GAAAGAGCCA GCAACAAAGA AGCAGGCGCA GGCGGCAACA TCACCATCAA CGCCAACAAC
CTAGAAATCT CAGGAAAACT AGGAATATTC GCCGAAACCG CAGGAGAAAG TCCCGCAGGC
ACCCTCACCC TGAAACCATA TAAAAACAAC CCCAACCTAA ATATAGAATT CAAACAACAA
GGCTTCATTT CTGCTCGCAC CAGCTCCAGT GGCAACGGAG GCAACATAAA TATTCAAGCA
CCAGAAAAAA TAAACATTAC AGGAGACGGA AAAATATCAG CGGAAACCAC AGGGAGTGGA
AACGCAGGCA CCATCAATAT CCAAACAGAA AACCTCAACC TATTAGAGCA AGTAGCCATA
AGCGCCGAAA CCAACAGTCA AGGTCAAGCC GGAAATATTG AAATTAACTC CCAAACAGTT
ACCATAGGAA AAGGCACAGA AATAAGTGCG ACAGCCGGAA AAAAAGCAAC AAGCACCGGA
GATGGAGGCA ACATCACTAT CAACACCAAC GACTTAGAAA TCTCAGGAAA ATTAGGAATA
TTTGCTGAAA CTAAAGGAGT ATCAAATGCT GGCACTTTAA CCCTCACCCC TTACAAAACT
AACCCAAACC TCAATATTAC GTTCACCGAC CAAGGCTTCA TTTCTGCTCG CACCAAATCT
CTAGGAAAGG GAGGTGATAT CAATATATTG GCACCAGAAA ACATAAATAT TACAGGAGAT
GGTCAGATAA CAGTTGAAAC CAAAGGTAGT GGAGATGCAG GCATTATTAA CATTGAAACA
GAAAACCTGA AAATAGCCGA AAATACAAAA ATATCAGCAT CCACATCTGA CAGCGGAAAC
GGAGGCGAAA TCAAAATTAA TTCTAGTGAA ACATTCCAAC TGCAAGGCCG AATATTAACA
GAAACCAGAG GAACAGGAGA TGGAGGTATT ATCAATATTG AAGCAGGAGA AATAACTGCG
CCTAACAGTA GGATATCAGC TAAAAGTACC GACGCCGGCA ACGCAGGTAC AATAGATATT
ACCGCCCAAG GAGATATTAC CACAGGAGCT GTTACCTCCG CCGCCAAAAA TGACATGGAA
ACCTCTGACG GAGGTAGTAT CAGTATTACC AGTGAACAAG GAAAAATTAA CGCTACCCGA
GCAATTCAAT CATTTTCAGA AGGAGGGAAC GCCGGAGATG TTACCCTCAA AGCCCAAACA
GACATAACCG CCAAAACCAT AAGTTCTCAC GGAAAGCAAG AGGGTGGTCA AATAACCATC
ACATCAGAAA CCGGAAACAT TGACACCAGT AGTGGTAACC TCTTAGCCAA CTATTCAGGA
GGCGGTGATG CCGGAGACAT TACAATGGAA GCACCCCAAG GAAATATTAC CACAAATGAT
ATCTATTCCT ACGCCGACGG AGACGGAGGT CAAATAAGTA TTAAAGCCGG AAACAATATT
AATATAAAAG TAAACAGTAA TATTATTTCC GCATCAGAAC CACCAAATGA GGGGAACAGT
GACAAACAAG GACAAGGAGG AGATATTACT CTAGAAGCAG GCAACAATAT TAATACTACA
ACAGCTAAAA TATATTCTGG TGCCAACGAG GGCGACACAG GAAAAATTGA TATTACTGCA
GATAACGCAA TAGAAACAGG CAAAATAGAC TTAGCATCAG GTTTTGTCAG ACAACAAGAA
AAAGTCAACG AAAACTTTAC TATTATTCCC AAACCAGAAG GAGAAGCAAC TCAAGGAAAA
GCCAGAGAGA TCCGACTCCG CAGCAGAAAC AGTACAATAG ATACTACAGG CGGCACAATA
AATTCTCGTT CCCCAGACGG TACTGGAGAT ATTATTATCA ATGCCAAAGG AAATATTAGC
ACAGGGAAAT TAGAAGCCAG CGCCCTTAAC CCAGACAAGC CTACCACAGG TGGAGACGTT
AATATTATCA GTGAGCAAGG AGAAATTAAT GCAACTCAAA ACATAGAAAC CTTCTCAGAA
AAAGGTACAG CAGGAGATGT AAATATTACT GCAGTTGGTC AGATCCAGAC AAATAATATT
CTTTCTCAGG GAATGGAACG GGGGGGTAAT ATTCAAGTCA AGAGTGAAGG AGTAGAAATT
CATTCCAGGG GTAATATAGA CTCATATTCA GAACAAGGAA GAGGAGGAAG TGTCAAAGTA
GATGCCCCAG AAAGAGTAAA TTTAGCAAAT GTATCATCCT ACGGTATGAC AGAAAGTGGT
GACTTAATTA TTCAGAGCCA ACAAGCAGAA GTTAATACAG TTAATGTCAC AACACAAGCT
CCGGAGGGAT ATAGTGGTCG TATAGTTATC AACGGAACAG AAGTAGGTAC GGGAAATTTA
AGTTCTATCG CTAGAACAAG TGCCGGGGAA ATTAATGTAG AAGCGACAGA TGGTTCCATT
GCCACCTATG ATATAGAAAT GACATCAGAT GGAACAATAG GAGCTTTAAC ACTGAGAGCA
AGAGAAAATA TTACTACAGA AGATGTTACT CAAAATGCTG GAGAGGGAAA TGTAGATATT
AATATTGAGT CAGGAAGAGA TCAAAATATC GGCAATACTA CTCAAATAGC AAAAACAGGG
GATACTAATA ATGTTCAAAC TGCAGGCCGC GACCAAAATA GTGGGAATAT GAATCAGAAT
TCTGGAGGTA ATGCTAACAA TATTCAAACT GCAGGTCGCG ACCAAAATGG TAGGAATATG
AATCAGAATT CTGGAGGTAA TGCTAACAAT ATTCAAACTG CAGGTCGCGA CCAAAATGGT
AGGAATATGA ATCAAAATTC TGGAGGTAAT GCTAACAATA TTCAAACTGC AGGTCGCGAC
CAAAATATTG AAAATGTGAA TCAAAATTCT GGAGGTAATG CTAACAATAT TCAAACTGCA
GGTCGCGACC AAAATATTGA AAATGTGAAT CAAAATTCTG GAGGTAATGC TAACAATATT
CAAGCTGCAG GTCGCGACCA AAATATTGAA AATGTGAATC AAAATTCTGG AGGTAATGCT
AACAATATTC AAACTGCAGG TCGCGACCAA AATATTGAAA ATGTGAATCA AAATTCTGGA
GGTAATGCTA ACAATATTCA AACTGCAGGT CGTGACCAAA ATATTGAAAA TGTGAATCAA
AATTCTGGAG GTAATGCTAA CAATATTCAA ACTGCAGGTG GGGAGAAAAA TATTGGGCAA
GTCACTCAAA ATGCCGAGAA TAATACCATT AATATTCAAA CCGCAAGTGG GGAGCAAAAT
ATTGAACAAA TTAATCAAAC TTTTGGCAAC GAATCTATCA ACATTCAAGA TCAAGAAATT
AATCAAAGTC TAGATACTAC TTCAGCAATT TCTAATAACA ATGTACCTAA TAATCAAGTA
CTTAATCAAG AAGATTTCTC TAATAATAAC AATAACAATA TCGAAGATAA TTTCACTACA
AATCCCTCAA ATAATAAAAA CATTTTATCA AACCTTACCC AGTCTCAAAG GTCTGAATTA
ATTTCAAATT CTACCCCCAG CAACAATAAC CAAACAACTA ATACAAACAC AGCCCAAGAA
CAAACAGAAT CATCTATTAA TAGTACAACA GATACAAAAA AAATCTTAAA CACAATTAAT
GCTATAAATA CCAATTATCT CACAGTTTCT GCCAATTTAG AGGAAGCAGT AACAGTATTA
GAACAAAGCC GTACTAACGA ATATTCAAAT TATTTTGGAA CAGATTTTAA AGAACAATTT
ATAAACCAAA AAACACCCCG AGAAATATTA ACAGATATGG CTGCTAAAAT AGGAAAAGAA
TCTGCAGTTG TTTATATTAA TGCTTATCCA GAGGAATTAC AAATAATCTT ATATACCAAA
GATGGTCAGC CTATTCTTAA AACTATCCCC GAAGCTAACC GTAAAAAATT AGAGAAAGTA
GTTATAAATT TCCTCAAATT AACGACAAGT CCTGCCTATC GTGATTCTAA TAGTTATCTA
TCACCAGCAA AACAATTATA TGATTGGTTT ATTGCACCTA TATCAGCAGA ATTAGAAGCA
GCAAATCTCG ATACCTTATT GTTCAGTATG GGTGAAGGTT TGCGTATTTT ACCAGTGGCA
GCATTACATG ATGGAGAACA ATTTTTAATT GAGAAATATA GCCTAAGTTT AATTCCGAGT
ATCAGCTTAA TGGATACAAA TTATCGCCCA CTTCAAGGTA CTCAAGTATT AGCTATGGGA
GCTAGTCAGT TTATCAATGA AGAACCTTTA CCAGCAGTAC CTGTGGAGGT AGAAACAATT
TCTGAAAAGC TATGGGAAGG TAGTAAATTT TTGAACGAAG AATTTACCCA GAATAATTTG
TTAACTCAAA GAAAAAATTA TCCCTATCCT ATTATTCACC TAGCTACCCA TGCAACATTT
AATAGAGGAA AACCTAGTAA TTCTTACATT CAGCTATGGG GTAATGAACA AATAAAGTTA
AACCAAGTGC GGGAGTTGGG TTGGAGTAAC CCATCGGTTG ACTTGTTGGT ATTGTCTGCT
TGTCGCACTG CTGTAGGAAA TAGAGAAGCG GAATTAGGGT TTGCAGGGTT AGCAGTAGCA
GCGGGAGTGA AGTCAGCTTT AACGAGTCTT TGGAGTGTGA GTGATGAAGG GACATTGGCC
TTAATGACAG AGTTTTATAC TTATTTAAAT GATGCTAAGA TCAAGTCAGA GGCATTAAGA
CAGGCGCAGT TAGCAATGTT GCAAGGGAAG GTGGTTATTA CAGGTGGGGA GTTGAGGGGA
AGCAGCACTC GTGGTGAGGT GAAGCTACCT TCGGCGTTGG CAAATGTCAA TAATCAAAAT
TTATCTCATC CTTATTATTG GGCAGGGTTT ACTATAGTTG GTAGTCCTTG GTAA
 
Protein sequence
MKPNSSKLCT IKALSLILGT LATTPANSQP ITPAKDGTNT TVTPQGQQFH IKGGTRSGTN 
VFHSFDKFNV HSGQTANFVT TPDTRNVLGR VKGGASYING LIQVLGSSSN LFLMNPAGIM
FGPNASLNVP ASFSVTTATG IGFDQNNFWF KAMGTNDYSN LVGNPSGYRF DVSKPGSIVN
EGNLTLKPQG NLTLSGGTVV NTGELSSPGG NITVTAVEGG STLKISQPGH LLSLEVPLED
GENISNIDPL SLPELLAGGG DIVEATSVVV KENGDVVLNG SNTMVAETPG TATISGKIDV
STTATSLKEK GRVPSQRKKE GAASPGKIDV PPSLKEKRDV AKAGKVNVWG DRVALIDTNI
KADGKNGGGT VLIGGDFQGL GIVPNSQHTF VNNNSFISAD AITNGDGGQV RIWSDGITNF
AGNISAKGGT SSGNGGLVKI GGKEQLIFDG KVDVTAALGT KGRILLDPES VTVGEDNSEG
EKEIVEDNSE VSETENTDNY DGEIIEDNSE VSETENTDNY NGEIVEDNSE VSETENTDNS
TTKNTDNLED KETENPLDPF AADENSDVTI SADNLGELSG NVIIDADNDI TINERIETDS
SVELKAGRSI NINADIDTKS GNGNIDLLGN NDEMNLANRS DGKASINQLD GTILNAGSGG
INIKLGSLGE VGDINLGNLR TTGKVLVDAN GGNIVRVSEN SLINAGSVLF RTSGNGGIGF
LGQPLRLDVQ NLEAVSGSGG VFFESLKNVN IGGVSEDVDG IATFGGDVDI KSAGNVTLNE
TISSNEVVEN NSEATSTENT EVVDGGGGIN IEAAGDIVAT GSGIKGGGEA VSLSGTNIRI
NDEFNETSGD ADVKLSATND IVVEDIEDDA LEFMPGNGEI EFRADVDGDG FGLVEILDNK
PDVGSNPDVF DNGADTIKTN GRGLTIAGAG VVLGNVDTSW LPVYSGGELL KAIDVDKGGA
IPPVGTEGTA TFTFTVDGDL GTIKNIDVRF SAKHTYNKQL DVSLESPQKT KVQLFAEVGN
RGDNFQDTVL DDDASTIISM GSVPFDGTYH PQGSLTDFNG ETPNGRWNLK VTDTSEGYDG
TLYRAGEQAP WGTAIGTQLL LRNPLVQSGG GRGSGNGGAI NLKATHGDIS VGNIRSLSEA
ANGGRIDLNA NNNIITGLIN SSSVQRNGGA IDLVAGGNIT TQDLYSYSWS SSGNSENGGV
IDLDAGSNIT IQSLYSYSYS SLGKSGDGGA INLDADGNIT TQSLSSRSYS PSGSGNGGAI
NLDAGGNITI QSLHSRSTAR SFSGNGGAID LDAGGNITTQ ELYSYSYSSL DNSGNGGAIS
LDAGGNITTP SLNAYSFSPS GNSGNGGAIS LDAGGNITTQ SLNSYSYLSS EKGNSGNGGA
ISLDAGGNIT TQSLNSFSSS EKGNSENGGK IDLDAGDYIT TQDLKSYSFS PSGNSGNGGA
INLDAGDYIT TQDLDSRSSS SSVSGNGGAI NLDAGDYITT QGLDSYSYSE KGNSRNGGDI
KLNANKIKPT SDNEEKLTIH TFSFRKNKFG EGKGGDVNIT TNNLSKTEIF TLSSRSESGK
VTIESKTQEP LQINDSSITT SVQVSVQICS SCGTRLVGEI GDTRSGEVSI NSSGDLSLNN
VTIESDTKSN QAAGDVNIYS IGNITLDKTD IISTTNSQGK AGQINLETKQ NIELTNNSKI
LANTEGTGNA GQINIEANNL ILDQNTKLIT ETARAGNPGD INIQANTIEI GEGAKASTTV
LTGSTSTGEG GNITINTNEL NVTGKLGIFA ETEASKNAGI LRISPYKNNP NLDIIFKNEG
FISASTSSTG NGGSIFIQAP ENIKITGKGF IATETSGTGN AGIIDIKTNN LRISNGVKIN
ASTEDQGNAG EIKINTTDFT LEKGTSLTTE TNSAGLAGNI EINTKKLTIG KNAQISATAL
ERASNKEAGA GGNITINANN LEISGKLGIF AETAGESPAG TLTLKPYKNN PNLNIEFKQQ
GFISARTSSS GNGGNINIQA PEKINITGDG KISAETTGSG NAGTINIQTE NLNLLEQVAI
SAETNSQGQA GNIEINSQTV TIGKGTEISA TAGKKATSTG DGGNITINTN DLEISGKLGI
FAETKGVSNA GTLTLTPYKT NPNLNITFTD QGFISARTKS LGKGGDINIL APENINITGD
GQITVETKGS GDAGIINIET ENLKIAENTK ISASTSDSGN GGEIKINSSE TFQLQGRILT
ETRGTGDGGI INIEAGEITA PNSRISAKST DAGNAGTIDI TAQGDITTGA VTSAAKNDME
TSDGGSISIT SEQGKINATR AIQSFSEGGN AGDVTLKAQT DITAKTISSH GKQEGGQITI
TSETGNIDTS SGNLLANYSG GGDAGDITME APQGNITTND IYSYADGDGG QISIKAGNNI
NIKVNSNIIS ASEPPNEGNS DKQGQGGDIT LEAGNNINTT TAKIYSGANE GDTGKIDITA
DNAIETGKID LASGFVRQQE KVNENFTIIP KPEGEATQGK AREIRLRSRN STIDTTGGTI
NSRSPDGTGD IIINAKGNIS TGKLEASALN PDKPTTGGDV NIISEQGEIN ATQNIETFSE
KGTAGDVNIT AVGQIQTNNI LSQGMERGGN IQVKSEGVEI HSRGNIDSYS EQGRGGSVKV
DAPERVNLAN VSSYGMTESG DLIIQSQQAE VNTVNVTTQA PEGYSGRIVI NGTEVGTGNL
SSIARTSAGE INVEATDGSI ATYDIEMTSD GTIGALTLRA RENITTEDVT QNAGEGNVDI
NIESGRDQNI GNTTQIAKTG DTNNVQTAGR DQNSGNMNQN SGGNANNIQT AGRDQNGRNM
NQNSGGNANN IQTAGRDQNG RNMNQNSGGN ANNIQTAGRD QNIENVNQNS GGNANNIQTA
GRDQNIENVN QNSGGNANNI QAAGRDQNIE NVNQNSGGNA NNIQTAGRDQ NIENVNQNSG
GNANNIQTAG RDQNIENVNQ NSGGNANNIQ TAGGEKNIGQ VTQNAENNTI NIQTASGEQN
IEQINQTFGN ESINIQDQEI NQSLDTTSAI SNNNVPNNQV LNQEDFSNNN NNNIEDNFTT
NPSNNKNILS NLTQSQRSEL ISNSTPSNNN QTTNTNTAQE QTESSINSTT DTKKILNTIN
AINTNYLTVS ANLEEAVTVL EQSRTNEYSN YFGTDFKEQF INQKTPREIL TDMAAKIGKE
SAVVYINAYP EELQIILYTK DGQPILKTIP EANRKKLEKV VINFLKLTTS PAYRDSNSYL
SPAKQLYDWF IAPISAELEA ANLDTLLFSM GEGLRILPVA ALHDGEQFLI EKYSLSLIPS
ISLMDTNYRP LQGTQVLAMG ASQFINEEPL PAVPVEVETI SEKLWEGSKF LNEEFTQNNL
LTQRKNYPYP IIHLATHATF NRGKPSNSYI QLWGNEQIKL NQVRELGWSN PSVDLLVLSA
CRTAVGNREA ELGFAGLAVA AGVKSALTSL WSVSDEGTLA LMTEFYTYLN DAKIKSEALR
QAQLAMLQGK VVITGGELRG SSTRGEVKLP SALANVNNQN LSHPYYWAGF TIVGSPW