Gene Tery_0573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0573 
Symbol 
ID4244599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp913780 
End bp923676 
Gene Length9897 bp 
Protein Length3298 aa 
Translation table11 
GC content41% 
IMG OID638105878 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_720491 
Protein GI113474430 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACTCTG GTCAAACTGC CAACTTTGTC ACGACCCCGG ATACTCGCAA CGTTCTCGGA 
CGAGTTAAGG GAGGCGCTTC TTATATTAAT GGTCTTATTC AGGTTCTAGG TAGCAGCTCT
AACCTCTTCT TAATGAACCC CGCAGGCATA ATGTTTGGCC CAAACGCCAG CCTCAATGTA
CCTGCGTCAT TCAGTGTTAC TACTGCTACT GGTATTGGTT TTGACCAGAA TAATTTCTGG
TTCAAGGCTA TGGGCACTAA TGACTATTCA AATTTGGTCG GAAACCCTAG TGGTTATAGG
TTTGATGTTT CTAAGCCTGG TTCTATTGTT AATGAAGGGA ATTTAACTTT AAAACCCCAA
GGAAATTTAA CTTTGTCGGG TGGAACTGTT GTTAATACAG GGGAACTCTC TAGCCCTGGG
GGTAATATTA CTGTCACTGC TGTCGAAGGT GGTAGTACCC TGAAAATATC TCAACCAGGA
CATTTGTTAA GTTTAGAAGT ACCCCTAGAG GATGGAGAAA ATATTTCTAA TATTGATCCT
CTATCTTTAC CAGAGTTATT AGCTGGTGGA GGAGATATCG TTGAGGCTAC ATCTGTCGTT
GTTAAAGAGA ATGGTGATGT GGTTTTGAAT GGTTCAAATA CTATGGTGGC TGAGACTCCG
GGAACGGCAA CTATTTCGGG AAAGATAGAT GTATCAACTA CTGCTACTTC TCTTAAGGAG
AAGGGGCGTG TTCCTAGTCA GAGAAAGAAG GAGGGTGCTG CTAGTCCAGG AAAGATAGAT
GTTCCTCCTT CCCTCAAGGA GAAGAGAGAT GTTGCTAAGG CCGGTAAAGT TAATGTGTGG
GGCGATCGCG TGGCTCTTAT AGATACAAAT ATCAAGGCTG ATGGCAAAAA TGGGGGTGGA
ACTGTATTAA TAGGAGGAGA TTTTCAAGGG TTGGGTATAG TTCCTAATTC TCAGCATACT
TTTGTTAATA ATAATTCATT TATTTCTGCG GATGCTATCA CAAATGGTGA TGGAGGTCAA
GTTAGAATTT GGTCGGATGG TATTACTAAT TTTGCGGGAA ATATTAGTGC TAAAGGCGGC
ACATCTTCGG GAAATGGAGG GTTGGTTAAA ATTGGGGGAA AAGAACAATT AATCTTTGAT
GGAAAGGTGG ATGTTACTGC GGCTTTAGGA ACTAAAGGTA GGATTTTATT AGACCCAGAA
AGTGTGACGG TGGGAGAGGA TAATTCGGAA GGTGAAAAGG AAATTGTCGA GGATAATTCG
GAGGTTTCTG AGACAGAAAA TACTGATAAT TATGATGGGG AAATTATTGA GGATAATTCC
GAGGTTTCTG AGACAGAAAA TACTGATAAT TATAATGGGG AAATTGTCGA GGATAATTCC
GAGGTTTCTG AGACCGAAAA TACTGATAAT TCCACAACAA AAAATACTGA TAATTTAGAA
GATAAAGAAA CAGAAAATCC TCTTGATCCT TTTGCTGCTG ATGAGAATTC TGATGTGACT
ATTTCGGCGG ATAATTTAGG GGAATTATCG GGGAATGTGA TTATTGATGC TGATAATGAT
ATTACCATAA ATGAAAGGAT AGAAACTGAT TCTTCTGTGG AGTTAAAGGC TGGTCGAAGT
ATTAATATAA ATGCGGATAT TGATACTAAG AGTGGAAATG GGAATATTGA TTTATTGGGT
AATAATGATG AGATGAATTT GGCGAACCGT TCTGATGGGA AAGGTTCAAT AAATCAATTG
GATGGAACAA TTTTAAATGC TGGAAGTGGT GGAATTAATA TTAAGTTGGG AAGTTTAGGG
GAGGTAGGTG ATATAAATCT GGGAAATTTA AGGACGACAG GGAAGGTTTT GGTTGATGCG
AATGGGGGAA ATATTGTTAG AGTTTCTGAG AATTCTTTGA TAAATGCGGG GAGTGTTTTG
TTTAGAACTT CTGGGAATGG AGGTATAGGT TTTCTTGGTC AACCTTTGCG GTTAGATGTG
CAGAATTTGG AGGCGGTTTC TGGTAGTGGT GGGGTGTTTT TTGATGTGGG AAATGTGAAT
ATTGGTGGTG TGAGTGAGGA TGTGGTTGGT ATTGCTACTT TTGGAGGAGA TGTTGATATT
AAGAGTGCGG GAAATGTGAC TTTAAATGAA ACTATTTCTA GTAATGAGGT TGTGGAGAAT
AATTCTGAGG GGGGAACTAC GGAAAATACG GAGGTTGTTG ATGGAGGTGG TGGGATAAAC
ATTGAAGCGG CGGGAGATAT TGTGGCTACG GGTAGTGGTA TTAAAGGTGG TGGGGAAGCT
GTTTCTTTGT CTGGGACTAA TATTAGGATA AATGATGAAT TTGATGAGAC TTCTGGGGAT
GCGGATGTTA AGTTGTCGGC CACAAATGAT ATTGTTGTTG AGGATATAGA GGATGATGTG
TTGGAGTTTA TGCCTGGGAG TGGGGAGATA GAATTTCGTG CGGATAAAGA TGGGGATGGG
TTTGGTCTTG TGAAGATGTT GGATAATAAG CCGGATGTTG GGAGTAACCC TGATATTTTT
GAGAATGGGG CGGATACTAT TAAGACTAAT GGGCGAGGTT TGACTATTGC TGGTGCTGGT
CTGGTTTTGG GGAATGTTGA TACTTCTTGG TTGCCTATTT ATTCTGGTGG TGGGGAGTTG
TTGAAGGCGA TCGATGTGGA TGAGGGGGGG CCAATACCTC CGGAAGGTAC AGAAGGTACT
ACAACATTTA CTTTTACTGT AGATGGTGAC TTGGGAACTG TAGAAAATAT AGACGTTCGC
TTTTCTGCGG CATATACTCA CACTGGGGAC TTAGACGTTA GTCTGGAATC ACCCCAAGGG
AAGGTAGTAC AATTGTTTGC AGGTGTTGGT CGTTGGGGAG ATAACTTTCA GGATACTGTG
TTAGATGATA ATGCTTCTAG AAGTATTGGC ATAAGCAATG CTCCGTTTGA TGGTAGATAT
AGTCCCCAAG GAAGTTTAGT AGATTTTAAT GGAGAAAATC CTAAGGGAAT TTGGACTTTG
AAGGTAAAGG ATACCAATAT ATTTTCAAAT TTGGCTGATG GTAATTTATA CAGAGCTGGT
GAGACTGCTC CCTGGGGAAC TGCCATAGGT ACACAACTGT TGCTGCATAA CCCTCTTGTT
AAGATTGCTG GAGGAATAGA AAGTGGAAAA GGTGGAGCGA TCAACCTAGA AGCAACTCAT
GGTGATATTA GTGTGGGAAA TATTCGTAGT TTGAGCGAAG CTGCTAATGG GGGAAGAATT
GACCTGAATG CTAATAAGGA TATTATTACT GGTCTGATTA ATTCTAGTTC TGCTTCTGTG
CAAGGAAATG GAGGAGCAAT TGATTTGGAT GCTGGTGGCA ATATCACTAC ACAATCCCTC
AATTCTAGGT CCTATTCCTG GGAAGGCAAC TCAGGAAATG GAGGAACCAT TGATTTGGAT
GCTGGAGGCA ATATCACTAC ACAATCTCTC TATTCCAGCT CCTATTCCGG GTCTGGCAAC
TCAGGAGATG GAGGAGCAAT TGATTTGGAT GCTGGTGGTG ATATCACTAC TACACAAGAC
CTTGATTCCA GCTCCTATTC CTGGGAAAGC AACTTAGGAA ATGGAGGAGC AATTGATTTG
GATGCTGGTG GCAATATCAC TACACAAGTC CTTGATTCCA GATCCTTTTC CTGGGAAGGC
AACTCAGGAA ATGGAGGAGC AATTGATTTG GATGCTGGTG GCAATATCAC TACACAAATC
CTTGATTCCT GTTCCTATTC CGGGTCTTAC GACTCAGGAA ATGGAGGAGC AATTGATTTG
GTAGCTGGCG GCGATATCAC TACACAAAAC CTCTATTCCG ATTCCTTTTC CAGGTCTGGC
AACTCAGGAA ATGGAGGAGC AATTGATTTA GTAGCTGGTG GCGATATCAC TATACAAGAC
CTCTATTCCT TCTCCTATTC AGGATCTGGC AACTCAGGAA ATGGAGGAGA CATAACACTG
AATGCAAAAC AAATTAACCC TCCAGAAGAC GGAAATGCAG AAAAACTAAC AATCTATACA
TTTTCAGCAG GAAAAAAAGA ATCTGAGGAA GGAAAAGGTG GAGACGTTAA CATTACCACC
AACAATCTCA GCAACACAGA CATATTAACA CTATCCTCCC ACTCTGGATC AGGGAAAGTA
ACAATAGAAA GTCAGACACA AGAACCACTT CAAATAAAAG ACTCATCCAT TATTACTAGT
GAAAAAATAA CAATAACAAT GCCTTGGGGT GAAGAAATAC AAGTTGAAAC AGGAGATACC
CAATCAGGGG ACGTCTCCAT TAACAGTTCC GGCGACCTCA ACCTCAGCAA CGTCACCATA
GAAAGCGACA CCGAAAGTAA CCAAGCCGCC GGAGACGTCA ACATCTACAG CCGCGGCAAC
ATCACACTCG AAAACACCGA CATCATCAGC ACCACCAACT CCCAAGGCAA CGCCGGCCAA
ATCACCTTAG AAAGCAACCA AAACATAGAA CTCACCAACA ACAGCAAAAT CCTAGCCAAC
ACAGAAGGAA CCGGAAACGC AGGTCAAATA AACATAGAAG CCAACAAACT CATCCTAGAC
CAAAACACCA AACTAATCAC AGAAACCGCA AGAGCTGGAA ACCCCGGAAA CATAAACATC
CAAGCCAACA CCATAGACAT AGGAGAAGGC GCTAAAGCCA GCACCACAGT CTTAACAGGT
TCAACCAGCA CCGGAGAAGG AGGCAACATC ACCATCAACA CAAACGAACT CAACGTCACA
GGAAAACTCG GAATATTTGC CGAAACCGAA GCGAGCAAAA ACGCCGGTAT CCTTCGCATC
TCACCCTACA AAAACAACCC CGACCTAGAC ATCACCTTCA AAAACGAGGG CTTCATCTCC
GCCTCCACCT CATCCACAGG CAACGGAGGC AACATATTCA TCAAAGCCCC AGAAAACATC
AACATAACAG GCCAAGGGTT CATTGCCACC AAAACATCAG GCACAGGTAA CGCCGGAATT
ATCGACATCA AAACCAACAA CCTGAGAATA TCCAATGGAG TCAAAATCAA TGCTTCCACA
GAAGACCAAG GCAACGCCGG CGAAATCAAA ATCAACACCA CAGACTTCAC CCTAGAAAAA
GGAACAAGTC TAACAACAGA AACCAGCAGC GCCGGACTAG CCGGAAACAT AGAAATAAAC
ACCAAAAACC TCACAATAGG ACAAAACGCC CAAATAAGTG CCACAGCACT AGAAGGAGCC
AGCAACAAAG AAGCAGGAGG CAACATCACC ATCAACGCCA ATAACCTAGA CATATCAGGA
AAACTAGGAA TATTCGCGGA AACCGCAGGA GAAAGCCCTG CAGGCACCCT CACCCTGAAC
CCATATAAAA ACAACCCCAA CCTAAATATA GAATTCAAAG AAAAAGGCTT CATTTCTGCT
CGCACCACCT CCAGTGGCAA CGGAGGCAAC ATAAATATTC AAGCACCAGA AAAAATCAAC
ATTACAGGAG ACGGAAAAAT ATCAGCCGAA ACCACAGGGA GTGGAAATGC AGGCACCATC
AATATCCAAA CAGAAAACCT CAACCTATCA GAGCAAGTAG CCATAAGTGC CGAAACCAAC
AGTCAAGGTC AAGCCGGAAA CATTGAAATT AACTCCCAAA CAGTTACCAT AGGAAAAGGC
ACAGAAATAA GTGCAACAGC CGGAAAAAAA GCAACAAGCA CCGGAGATGG AGGCAACATC
ACCATCAACA CCAACGACCT AGAAATCTCA GGAAAACTAG GAATATTCGC CGAAACCAAA
GGAGCATCAA ACGCCGGAAC CTTAACCATC ACCCCTTACC AAACCAACCC AGACCTCAAT
AGCAAAACCG ACCCAAACAT CAATATTACC TTCACCGACC AAGGCTTCAT TTCTGCTAGC
ACCAAATCTC TGGGAAAGGG AGGCGACATC AATATATTGG CACCAGAAAA CATAAATATT
ACAGGAGATG GTCGGATAAC AGTTGAAAGC GAAGGTAGTG GAGATGCAGG CATTATTAAC
ATTGAAACAG AAAACCTGAC AATAGCCGAA AACACAAAAA TATCAGCATC CACATCCGAC
AGCGGAAACG GAGGCGAAAT CAAAATTAAC TCTAGCGAAA CATTCCAACT ACAAGGCCGA
ATATTAACAG AAACCACAGG AACAGGAGAT GGAGGCATTA TCAATATTGA AGCAGGAGAA
ATAACTGCGC CTAACAGCAA GATATCAGCT AAAACTACCG ACGCCGGCAA CGCAGGTACA
ATAGATATTA CTGCCCAAGG AGATATTACC ACAGGAGTTG TTACCTCCGC CGCCAAAAAT
GACATAGAAA CCGCTGACGG AGGTAGCATT AGTATTACCA GTGAACAAGG AAAAATTAAC
GCCACCCGAG CAATTCAATC ATTTTCAGAA GGAGGGAACG CCGGAAATGT TACCCTCAAA
GCCCAAACAG ACATAACCGC CAATACCATA AGTTCTCACG GAAAGCAAGA GGGTGGTCAA
ATAACCATCA GATCAGAAAC CGGAAATATT GACACCAGTA GTGGTAAATT CTTAGCCAAC
TATTCAGGAG GCGGTGACGC GGGAGACATT ACAATGGAAG CACCCCAAGG AAATATTACC
ACAAATAATA TCTATTCCTA CGCCGACGGA GACGGAGGTC AAATAAACAT TAAAGCCGGA
AACAATATTA ATATAGAAGG AAAAAGTAAT ATTATTTCCG CATCCAAGTC ACCAAGTGAT
GGTAGCAGTG ATATGCCAGG AAAAGGAGGA GATATTACTC TAGAAGCAGG CAATAATATT
AATACCAGAA CAGCAAAAAT ATATTCCGGT GCGAACGAGG GCGATACAGG AAAAATTGAT
ATTACTGCAG ATAATGCAAT AGAAACAGGT AAAATAGACT TAGTATCAGG TTTTGTCAAA
GAAAAAGAAA AAGTCAACGA AAACTTTACT ATTATTCCCA AAGGAGAAGG AGAAGCAACT
CAAGGAAAAG CTGGAGACAT CAGACTCCGC AGCAGAAACA GTACAATAGA TACCACAGGC
GGCACAATAA ATTCTCGTTC CCCAGACGGC ACCGGAAATA TTATTATCAA TGCCAAAGGA
AATATTAGCA CAGGGAAATT AGAAGCCAGC GCCCTTAACC CAGACAAGCC TACCACAGGT
GGAGACGTTA ATATTACCAG CGAGCAAGGA GAAATTAATG CAACTCAAAA CATAGAAACC
TTCTCAGAAC AAGGTATAGC AGGAGATGTA AATATTACTG CCTTTGGTCA GATCCAGACA
AATAACATTA GTTCCCAGGG AATGAAACGG GGAGGTGATA TTAATATCAG GAGTGATAGT
GAAAGTAGTA TAGATGCAGC AGGAGTATTA CAAACATATT CAGATGCAGG AACAGCAGGA
AATGTCAACC TCACATCCCC AGGGGATGTT AATATTAGCG GCATTCGTTC AGAAGGAATG
GAGCAGGGAG GTGATATTAA TATTAGGAGT GAACGGGGAG AAATTAACTC CACAGGTGAT
ATAGACTCCT ATTCAAAACA AGGAAAAGGA GGATACGTCA AGGTAGATGC CCTAGAAAGA
GTAAATTTAG CAAATGTATC ATCCTACGGC ATGACAGAAA GTGGTGACTT AATTATTCAG
AGCCAACAAG CAAAAGTTAA CACAGGCAAT GTCACAACAC AAGCTTTAGA GGGAAAGAGT
GGACGTATAG TCATCAACGG AACAGAAGTA GGTACAGGAA ACTTAAGTTC TATCGGCACA
ACAAGCGCCG GAGAAATTAA AGTCACAGCC ACAGATGGTT CCATAAAAAC CTATAATGTA
GAAATAAGAT CAGATGGCAC AATAGGAGTT TTAAGCCTGA AAGCAACAGA AGATATTAAC
ACAGGAGACC AAACAGCTAT TGCTGGAGAG GGTGATGTTT TTATTGATAA CGATGCTGGA
GATGACCTGA CCACAGGAGA CCAAACGGCC ATTACTGGAG AGGGTGATGC TTTTATTGAT
AACGATGCCG GAGATGACCT GACCACAGGA GACCAAACAG CTATTACTGG AGGGGGTGAT
GTTTTTATTG ATAACGATGC CGGAGATGAC CTGACCACAG GAGACCAAAC AGCTATTACT
GGAGGGGGTG ATGCTTTTAT TGATAACCAT GCCGGAGATG AACTGACCAC AGGAGACCAA
ACAGCTATTA CTGGAGGGGG TGATGCTTTT ATTGATAACG ATGCCGGAGA TGACCTAACC
ACAGGAGAGA AAACAGTTAT TACTGGAAAT GTTACTGCTA CTGTAACTAA TTTTATTCAA
AATGATGGAG TGGATCAAAA TCTAGATACT ACTGTAACTA ATTTTATTCA AAATGATGGA
GTGAATCAAA ATCTAGATAC TACCTCAGTA ATTTCTAATA ACAATATACC TAATAATCAA
GTACTTAATC AAGAAAATTT CTCTAATAAT AACAATAACA ATATTGAGAA TAATTCCACT
ACAAATTCCT CAAACAATAA AAACATTTTA TCAAATCTTA CCCAGTCTCA AAGATCTGAA
TTAATTTCAA ATTCTACCCT CAGCAACAAT AACCAAACAA CTAATACAAA TACAGCCCAA
GAACAAACAG AATCATCTAT TAATAGTACA ACGGATACCC AAAAAATCTT AAACATAATT
GATACCGTCA ATACCAATTC CTTGACAGTT GCGACTGGTT CAGACCAAGT AATAACAATG
TTAGAACAAA ACCTTACTAA CGAATATTCT AATTACTTTG GAACAGATTT TAAGGAACAA
TTTATTAACC AAAAAACACC CCGAGAAATC TTAACCGATA TGGCTGCTAA AACAGGAAAA
GAATCTGCAG TTGTTTATAT TAATGCTTAT CCAGAGGAAT TACAAATAAT TTTATATACC
AAAGATGGTC AACCTATTCT TAAAACTATC CCGGAAGCTA ACCGTAAAAA ATTAGAGAAA
GTAGTTATAA ATTTCCTCAA ATTAACGACA AGTCCTGCCT ATCGTGATTT TAATAGTTAT
CTATCACCAG CAAAACAATT ATATGATTGG TTCATTGCAC CTATATCAGC AGAATTAGAA
GCAGCAAATA TCGATACCTT ATTGTTCAGT ATGGGTGAAG GTTTACGTAT TTTACCAGTG
GCAGCATTAC ATGATGGAAA GCAATTTTTA ATTGAGAAAT ATAGCCTAAG TTTAATTCCG
AGTATCAGCT TAATGGATAC AAATTATCGC CCACTTCAAG GTACTCAAGT ATTAGCTATG
GGAGCTAGTA AATTTATCAA TGAAAAACCT TTACCAGCAG TACCTGTGGA GATAGAAACA
ATTTCTGAAC AGTTATGGGA AGGTAGTAAA TTTTTAAACG AAGAATTTAC CAAGAATAAT
TTGTTAACTC AAAGAAAAAA TTATCCCTAT CCTATTATTC ACTTAGCTAC CCATGCAACA
TTTAATAGAG GAAAACCTAG TAATTCTTAC ATTCAGCTAT GGGGTAATGA ACAAATAAAG
TTAGACCAAG TGCGGGAGTT GGGTTGGAGT ACCCCATCGG TTGACTTGTT GGTATTGTCT
GCTTGTCGCA CTGCTGTAGG AAATAGAGAA GCGGAATTAG GGTTTGCAGG GTTAGCAGTA
GCAGCAGGAG TGAAGTCAGC CTTAACGAGC CTTTGGACTG TAAGTGATGA AGGTACATTG
GCATTAATGA CAGAGTTTTA TACTCATTTG AATAATGTCA GTATTAAAGC AGAAGCGTTA
AGACAGGCGC AGTTAGCAAT GTTGCAAGGG CAGGTGCTTA TTACAGGTGG GGAGTTGAGG
GGAAGCAGCA CTCGTGGTGG GGTGGAGCTA CCTTCGGCGT TCGCAAATGT AAACAATCAA
AATTTATCTC ATCCTTATTA TTGGGCAGGG TTTACTATGG TTGGTAGTCC TTGGTAA
 
Protein sequence
MHSGQTANFV TTPDTRNVLG RVKGGASYIN GLIQVLGSSS NLFLMNPAGI MFGPNASLNV 
PASFSVTTAT GIGFDQNNFW FKAMGTNDYS NLVGNPSGYR FDVSKPGSIV NEGNLTLKPQ
GNLTLSGGTV VNTGELSSPG GNITVTAVEG GSTLKISQPG HLLSLEVPLE DGENISNIDP
LSLPELLAGG GDIVEATSVV VKENGDVVLN GSNTMVAETP GTATISGKID VSTTATSLKE
KGRVPSQRKK EGAASPGKID VPPSLKEKRD VAKAGKVNVW GDRVALIDTN IKADGKNGGG
TVLIGGDFQG LGIVPNSQHT FVNNNSFISA DAITNGDGGQ VRIWSDGITN FAGNISAKGG
TSSGNGGLVK IGGKEQLIFD GKVDVTAALG TKGRILLDPE SVTVGEDNSE GEKEIVEDNS
EVSETENTDN YDGEIIEDNS EVSETENTDN YNGEIVEDNS EVSETENTDN STTKNTDNLE
DKETENPLDP FAADENSDVT ISADNLGELS GNVIIDADND ITINERIETD SSVELKAGRS
ININADIDTK SGNGNIDLLG NNDEMNLANR SDGKGSINQL DGTILNAGSG GINIKLGSLG
EVGDINLGNL RTTGKVLVDA NGGNIVRVSE NSLINAGSVL FRTSGNGGIG FLGQPLRLDV
QNLEAVSGSG GVFFDVGNVN IGGVSEDVVG IATFGGDVDI KSAGNVTLNE TISSNEVVEN
NSEGGTTENT EVVDGGGGIN IEAAGDIVAT GSGIKGGGEA VSLSGTNIRI NDEFDETSGD
ADVKLSATND IVVEDIEDDV LEFMPGSGEI EFRADKDGDG FGLVKMLDNK PDVGSNPDIF
ENGADTIKTN GRGLTIAGAG LVLGNVDTSW LPIYSGGGEL LKAIDVDEGG PIPPEGTEGT
TTFTFTVDGD LGTVENIDVR FSAAYTHTGD LDVSLESPQG KVVQLFAGVG RWGDNFQDTV
LDDNASRSIG ISNAPFDGRY SPQGSLVDFN GENPKGIWTL KVKDTNIFSN LADGNLYRAG
ETAPWGTAIG TQLLLHNPLV KIAGGIESGK GGAINLEATH GDISVGNIRS LSEAANGGRI
DLNANKDIIT GLINSSSASV QGNGGAIDLD AGGNITTQSL NSRSYSWEGN SGNGGTIDLD
AGGNITTQSL YSSSYSGSGN SGDGGAIDLD AGGDITTTQD LDSSSYSWES NLGNGGAIDL
DAGGNITTQV LDSRSFSWEG NSGNGGAIDL DAGGNITTQI LDSCSYSGSY DSGNGGAIDL
VAGGDITTQN LYSDSFSRSG NSGNGGAIDL VAGGDITIQD LYSFSYSGSG NSGNGGDITL
NAKQINPPED GNAEKLTIYT FSAGKKESEE GKGGDVNITT NNLSNTDILT LSSHSGSGKV
TIESQTQEPL QIKDSSIITS EKITITMPWG EEIQVETGDT QSGDVSINSS GDLNLSNVTI
ESDTESNQAA GDVNIYSRGN ITLENTDIIS TTNSQGNAGQ ITLESNQNIE LTNNSKILAN
TEGTGNAGQI NIEANKLILD QNTKLITETA RAGNPGNINI QANTIDIGEG AKASTTVLTG
STSTGEGGNI TINTNELNVT GKLGIFAETE ASKNAGILRI SPYKNNPDLD ITFKNEGFIS
ASTSSTGNGG NIFIKAPENI NITGQGFIAT KTSGTGNAGI IDIKTNNLRI SNGVKINAST
EDQGNAGEIK INTTDFTLEK GTSLTTETSS AGLAGNIEIN TKNLTIGQNA QISATALEGA
SNKEAGGNIT INANNLDISG KLGIFAETAG ESPAGTLTLN PYKNNPNLNI EFKEKGFISA
RTTSSGNGGN INIQAPEKIN ITGDGKISAE TTGSGNAGTI NIQTENLNLS EQVAISAETN
SQGQAGNIEI NSQTVTIGKG TEISATAGKK ATSTGDGGNI TINTNDLEIS GKLGIFAETK
GASNAGTLTI TPYQTNPDLN SKTDPNINIT FTDQGFISAS TKSLGKGGDI NILAPENINI
TGDGRITVES EGSGDAGIIN IETENLTIAE NTKISASTSD SGNGGEIKIN SSETFQLQGR
ILTETTGTGD GGIINIEAGE ITAPNSKISA KTTDAGNAGT IDITAQGDIT TGVVTSAAKN
DIETADGGSI SITSEQGKIN ATRAIQSFSE GGNAGNVTLK AQTDITANTI SSHGKQEGGQ
ITIRSETGNI DTSSGKFLAN YSGGGDAGDI TMEAPQGNIT TNNIYSYADG DGGQINIKAG
NNINIEGKSN IISASKSPSD GSSDMPGKGG DITLEAGNNI NTRTAKIYSG ANEGDTGKID
ITADNAIETG KIDLVSGFVK EKEKVNENFT IIPKGEGEAT QGKAGDIRLR SRNSTIDTTG
GTINSRSPDG TGNIIINAKG NISTGKLEAS ALNPDKPTTG GDVNITSEQG EINATQNIET
FSEQGIAGDV NITAFGQIQT NNISSQGMKR GGDINIRSDS ESSIDAAGVL QTYSDAGTAG
NVNLTSPGDV NISGIRSEGM EQGGDINIRS ERGEINSTGD IDSYSKQGKG GYVKVDALER
VNLANVSSYG MTESGDLIIQ SQQAKVNTGN VTTQALEGKS GRIVINGTEV GTGNLSSIGT
TSAGEIKVTA TDGSIKTYNV EIRSDGTIGV LSLKATEDIN TGDQTAIAGE GDVFIDNDAG
DDLTTGDQTA ITGEGDAFID NDAGDDLTTG DQTAITGGGD VFIDNDAGDD LTTGDQTAIT
GGGDAFIDNH AGDELTTGDQ TAITGGGDAF IDNDAGDDLT TGEKTVITGN VTATVTNFIQ
NDGVDQNLDT TVTNFIQNDG VNQNLDTTSV ISNNNIPNNQ VLNQENFSNN NNNNIENNST
TNSSNNKNIL SNLTQSQRSE LISNSTLSNN NQTTNTNTAQ EQTESSINST TDTQKILNII
DTVNTNSLTV ATGSDQVITM LEQNLTNEYS NYFGTDFKEQ FINQKTPREI LTDMAAKTGK
ESAVVYINAY PEELQIILYT KDGQPILKTI PEANRKKLEK VVINFLKLTT SPAYRDFNSY
LSPAKQLYDW FIAPISAELE AANIDTLLFS MGEGLRILPV AALHDGKQFL IEKYSLSLIP
SISLMDTNYR PLQGTQVLAM GASKFINEKP LPAVPVEIET ISEQLWEGSK FLNEEFTKNN
LLTQRKNYPY PIIHLATHAT FNRGKPSNSY IQLWGNEQIK LDQVRELGWS TPSVDLLVLS
ACRTAVGNRE AELGFAGLAV AAGVKSALTS LWTVSDEGTL ALMTEFYTHL NNVSIKAEAL
RQAQLAMLQG QVLITGGELR GSSTRGGVEL PSAFANVNNQ NLSHPYYWAG FTMVGSPW