Gene Tery_0572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0572 
Symbol 
ID4244598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp902632 
End bp912891 
Gene Length10260 bp 
Protein Length3419 aa 
Translation table11 
GC content42% 
IMG OID638105877 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_720490 
Protein GI113474429 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCTA ATAGCTCAAA ACTCTGCACC ATAACAGCTC TGAGTTTAAT ACTAGGCACA 
TTAGCCACCA CTCCCGCTAA CTCCCAGCCC ATAACCCCGG CTAAGGACGG TACCAATACT
ACTGTTACTC CACAGGGCCA ACAATTTCAT ATTAAAGGTG GGACTCGCAG TGGTACTAAT
GTATTTCACA GCTTTGATAA GTTTAATGTG CACTCTGGTC AAACTGCCAA CTTTGTCACG
ACCCCGGATA CTCGCAACGT TCTCGGACGA GTTAAGGGAG GCGCTTCTTA TATTAATGGT
CTTATTCAGG TTCTAGGTAG CAGCTCTAAC CTCTTCTTAA TGAACCCCGC AGGCATAATG
TTTGGCCCAA ACGCCAGCCT CAATGTACCT GCGTCATTCA GTGTTACTAC TGCTACTGGT
ATTGGTTTTG ACCAGAATAA TTTCTGGTTC AAGGCTATGG GCACTAATGA CTATTCAAAT
TTGCTCGGAA GCCCTAGTGG TTATAGGTTT GATGTTTCTA AACCTGGTTC TATTGTTAAT
GAAGGAAGTT TAACTTTAAA TTCTGGAGAA AATTTAACTT TATTAGGGGG AACTGTTGTT
AATACAGGGG AACTCTCTAG CCCTGGGGGT AATATTACTG TCACTGCTGT CGAAGGTGGT
AGTACCCTGA AAATATCTCA ACCAGGACAT TTGTTAAGTT TAGAGGTACC CCTAGAGGAT
GGAGAAAATA TTTCTAATAT TGATCCTCTA TCTTTACCAG AGTTATTAGC TGGTGGAGGA
GATATCGTTG AGGCTACATC TGTTATTGTT AAAGAGAATG GTGATGTGGT TTTGACCGGT
TCAAATACTA TGGTGGCTGA GACTCCGGGA ACGGCAACTA TTTCGGGAAA GATAGATGTA
TCAACTACTG CTAGTTCTCT CCAAAATGAG GGGGGTGCTA GTTTTGGAGG CCAAGTTAAT
GTGTGGGGCG ATCGCGTGGC TCTTATAGAT ACAAATATCA AGGCTGATGG CAAAAATGGG
GGTGGAACTG TATTAATAGG AGGAGATTTT CAAGGGTTGG GTATAGTTCC TAATTCTCAG
CATACTTTTG TTAATAATAA TTCATTTATT TCTGCGGATG CTATCACAAA TGGTGATGGA
GGTCAAGTTA GAATTTGGTC GGATGGTATT ACTAATTTTG CGGGAAATAT TAGTGCTAAA
GGCGGCACAT CTTCGGGAAA TGGAGGGTTG GTTAAAATTG GGGGAAAAGA ACAATTAATC
TTTGATGGAA AGGTGGATGT TACTGCGGCT TTAGGGACTG AAGGTACGAT TTTATTGGAC
CCAGAAAGTG TGACGGTGGG AGAGGATAAT TCGGAAGGTG AAAAAGAAAT TGTAGATGAT
AATTCCGAGG TTTCTGAGAC AGAAAATACT GATAATTATA ATGGGGAAAT TGTAGATGAT
AATTCGGAGG TTTCTGAGAC AGAAAATACT GATAATTTCA AAGACAAAAA ACCAGACACT
CCTAAAGACA AAAAACCAGA CACTCCTCTT GATCCTTTTG CTGCTGATGA GAATTCTGAT
GTGACTATTT CGGCGGATAA TTTAGGGGAA TTATCGGGGA ATGTGATTAT TGATGCTGAT
AATGATATTA CCATAAATGA AAGGATAGAA ACTGATTCTT CTGTGGAGTT AAAGGCTGGT
CGAAGTATTA ATATAAATGC GGATATTGAT ACTAAGAGTG GAAATGGGAA TATTGATTTA
TTGGGTAATA ATGATGAGAT GAATTTGGCG AACCGTTCTG ATGGGAAAGG TTCAATAAAT
CAATTGGATG GAACAATTTT AAATGCTGGA AGTGGTGGAA TTAATATTAA GTTGGGAAGT
TTAGGGGAGG TAGGTGATAT AAATCTGGGA AATTTAAGGA CGACAGGGAA GGTTTTGGTT
GATGCGAATG GGGGAAATAT TGTTAGAGTT TCTGAGAATT CTTTGATAAA TGCGGGGAGT
GTTTTGTTTA GAACTTCTGG GAATGGAGGT ATAGGTTTTC TTGGTCAACC TTTGCGGTTA
GATGTGCAGA ATTTGGAGGC GGTTTCTGGT AGTGGTGGGG TGTTTTTTGA TGTGGGAAAT
GTGAATATTG GTGGTGTGAG TGAGGATGTG GTTGGTATTG CTACTTTTGG AGGAGATGTT
GATATTAAGA GTGCGGGAAA TGTGACTTTA AATGAAACTA TTTCTAGTAA TGAGGTTGTG
GAGAATAATT CTGAGGGGGG AACTACGGAA AATACGGAGG TTGTTGATGG AGGTGGTGGG
ATAAACATTG AAGCGGCGGG AGATATTGTG GCTACGGGTA GTGGTATTAA AGGTGGTGGG
GAAGCTGTTT CTTTGTCTGG GACTAATATT AGGATAAATG ATGAATTTGA TGAGACTTCT
GGGGATGCGG ATGTTAAGTT GTCGGCCACA AATGATATTG TTGTTGAGGA TATAGAGGAT
GATGTGTTGG AGTTTATGCC TGGGAGTGGG GAGATAGAAT TTCGTGCGGA TAAAGATGGG
GATGGGTTTG GTCTTGTGAA GATGTTGGAT AATAAGCCGG ATGTTGGGAG TAACCCTGAT
ATTTTTGAGA ATGGGGCGGA TACTATTAAG ACTAATGGGC GAGGTTTGAC TATTGCTGGT
GCTGGTCTGG TTTTGGGGAA TGTTGATACT TCTTGGTTGC CTATTTATTC TGGTGGTGGG
GAGTTGTTGA AGGCGATCGA TGTGGATAAG GGGGGGGCGA TACCTCCGGA AGGTACAGAA
GGTACTGCAA CATTTACTTT TACTGTAGAT GGTGACTTGG GAACTGTAGA AAATATAGAT
GTTCGCTTTT CTGCGGCACA TACTTGGGAT GAGGACTTAG ACGTTAGTCT GGAATCACCC
CAAGGGAAGG TAGTACAATT GTTTTCACGC GTTGGTGGTA GTGGAGAGAA CTTTCAGGAT
ACTGTGTTGG ATGATGATGC TTCTAGAAGA ATTATCAGTG GCAATGCTCC GTTTGATGGT
ACATATCGTC CCCAAGGAAG TTTAGCAGAT TTTAATGGAG AAAATCCTAA TGGAGCTTGG
ACTTTGAAGG TGACTGATAC CTATCCCTGG GCAGATGATG GCACTTTATA CAGAGCTGGT
GAGACTGCTC CCTGGGGAAC TGCCATAGGT ACACAACTGT TGCTACGTAA TCCTCTTGTT
AAGAGTGGTG GAGGAATAGG AAGTGGAAAT GGTGGAGCGA TCAACCTAGA AGCAACTCAT
GGTGATATTA GTGTAGGAAA TATTCGTAGT TTGAGCGAAA CTGCTAATGG GGGAAGAATT
GACCTGAATG CTAATAAGGA TATTATTTCT GGTCTGATTA ATTCTAGTTC TGTGCAAGGA
AATGGAGGAG CCATTGATTT GGATGCTGGT GGCGATATCA CTACACAATA CCTCAATTCC
TGGTCCTCTT CCTGGGAAAA AGGCAACTCA GGAAATGGAG GAGCCATTGA TTTGGATGCT
GGTGGCGATA TCACTACACC ATACCTCAAT TCCGGGTCCT CTTCCAGCTC AGGTAACTCA
GGAAATGGAG GAGCCATTGA TTTGGATGCT GGTGGCGATA TCACTACACA ATACCTCAAT
TCCTGGTCCT CTTCCAGCTC AGGCAACTCA GGAAATGGAG GAGCCATTGA TTTGGATGCT
GGTGGCGATA TCACTACACA ATACCTCAAT TCCGGGTCCT ATTCCGGCTC AGGCAACTCA
GGAAATGGAG GAGCCATTGA TTTGGATGCT GGTGGCGATA TCACTACACA AGACCTCGAT
TCCGGGTCCG ATTCCTGGGA AGGCAAGTCA GGAAATGGAG GAGCCATTGA TTTGGTAGCT
GGTGGCGATA TCACTACACA AAACCTCGAT TCCAGGTCCG ATTCCAGCTC AGGCAACTCA
GGAAATGGAG GAGCCATTGA TTTGGATGCT GGTGGCGATA TCACTACACA ATACCTCTAT
TCCGGGTCCT CTTCCTGGGA AAAAGGCAAC TCAGGAAATG GAGGAGCCAT TGATTTGGAT
GCTGGTGGCG ATATCACTAC ACAATACCTC TATTCCGGGT CCATTTCCTG GTCAGGCAAC
TCAGGAAATG GAGGAGCCAT TGATTTGGTA GCTGGTGGCG ATATCACTAC ACAATACCTC
TATTCCGGGT CCTATTCCTG GTCAGGCAAC TCAGGAAATG GAGGAGACAT AACACTGAAT
GCAAAGACAA TAAAATTCAA CTCTCCACAA CACGCAAATG GAGAAAAAAT AAAAATCCAT
ACATTTTCAG TAGGAAAAAA AGAATCTGAG GAAGGAAAAG GTGGAGACGT TAACATTACC
ACCAACAACC TCAGCAACAC AGATATATTA ACACTATCCT CCCACTCTGA ATCAGGAAAA
GTAACAATAG AAAGTAAGAC ACAAGAACCA CTTCAAATCA AAGACTCATC CATTATCACT
AGTGAACAAG TAACAGTAGA AATCTTCGAG GAACAAATAC AAATAGAAAC AGGAAATACC
CAATCAGGCG ACGTCTTCAT CAACAGTGAC GGCGACCTCA ACCTAAACAA CGTCACCATA
GAAAGCGACA CCAAAAGTAA CCAAGCCGCC GGAGACGTCA ACATCTACAG CCGCGGCAAC
ATCACACTCG AAAACACCGA CATCATCAGC ACCACCAACT CCCAAGGCAA CGCCGGCCAA
ATCACCTTAG AAACCAACGA AAACATAGAA CTCACCAACA ACAGCAAAAT CCTAGCCAAC
ACAGAAGGAA CCGGAAACGC AGGTCAAATA AACATAGAAG CCAACAACCT CATCCTAGAC
CAAAACACCA AACTCATCAC AGAAACCGCA AGAGCTGGAA ACCCCGGAAA CATAAAAATC
CAAGCCAACA CCATAGACAT AGGAGAAGGC GCTAAAGCCA GCACCACAGT CTTAACAGGT
TCAACCAGCA CCGGAGAAGG AGGCAACATC ACCATCAACA CAAACAAACT CAACGTCACA
GGAAAACTGG GAATATTTGC CGAAACCGAA GCCAGCAAAA ACGCCGGTAC CCTTCGCATC
TCACCCTACA AAAACAACCC CAACCTAGAC ATAACCTTCA AAAACGACGG CTTCATCTCC
GCCTCCACCT CATCCACAGG CAACGGAGGC AACATATTCA TCAAAGCCCC AGAAAACATC
AACATAACAG GCCAAGGGTT CATTGCCACC AAAACATCAG GCACAGGTAA CGCCGGAATT
ATCGACATCA AAACCAACAA CCTGAGAATA TCCAATGGAG TCAAAATCAA TGCTTCCACA
GAAGACCAAG GCAACGCCGG CGAAATCAAA ATCAACACCA CAGACTTCAC CCTAGAAAAA
GGAACAAGTC TAACAACAGA AACCAGCAGC GCCGGACTAG CCGGAAACAT AGAAATAAAC
ACCAAAAACC TCACAATAGG ACAAAACGCC CAAATAAGTG CCACAGCACT AGAAGGAGCC
AGCAACAAAG AAGCAGGCGG CAACATCACC ATCAACGCCA ATAACCTAGA CATATCAGGA
AAACTAGGAA TATTCGCGGA AACCGCAGGA GAAAGCCCCG CAGGCACCCT CACCCTGAAC
CCATATAAAA ACAACCCCAA CCTAAATATA GAATTCAAAC AACAAGGCTT CATTTCTGCT
CGCACCAGCT CCAGTGGCAA CGGAGGCAAC ATAAATATTC AAGCACCAGA AAAAATCAAC
ATTACAGGAG ACGGAAAAAT ATCAGCCGAA ACCACAGGGA GTGGAAATGC AGGCACCATC
AATATCCAAA CAGAAAACCT CAACCTATCA GAGCAAGTAG CCATAAGTGC CGAAACCAAC
AGTCAAGGTC AAGCCGGAAA CATTGAAATT AACTCCCAAA CAGTTACCAT AGGAAAAGGC
ACAGAAATAA GTGCAACAGC CGGAAAAAAA GCAACAAGCA CCGGAGATGG AGGCAACATC
ACCATCAACA CCAACGACCT AGAAATCTCA GGAAAACTAG GAATATTCGC CGAAACCAAA
GGAGCATCAA ACGCCGGAAC CTTAACCATC ACCCCTTACC AAACCAACCC AGACCTCAAT
AGCAAAACCG ACCCAAACAT CAATATTACC TTCACCGACC AAGGCTTCAT TTCTGCTAGC
ACCAAATCTC TGGGAAAGGG AGGCGACATC AATATATTGG CACCAGAAAA CATAAATATT
ACAGGAGATG GTCGGATAAC AGTTGAAAGC GAAGGTAGTG GAGATGCAGG CATTATTAAC
ATTGAAACAG AAAACCTGAC AATAGCCGAA AACACAAAAA TATCAGCATC CACATCCGAC
AGCGGAAACG GAGGCGAAAT CAAAATTAAC TCTAGCGAAA CATTCCAACT ACAAGGCCGA
ATATTAACAG AAACCACAGG AACAGGAGAT GGAGGCATTA TCAATATTGA AGCAGGAGAA
ATAACTGCGC CTAACAGCAA GATATCAGCT AAAACTACCG ACGCCGGCAA CGCAGGTACA
ATAGATATTA CTGCCCAAGG AGATATTACC ACAGGAGTTG TTACCTCCGC CGCCAAAAAT
AAAAGAGAAA CTGCTGACGG AGGTAGCATT AGTATTACCA GTGAACAAGG AAAAATTAAC
GCCACCCGAG CAATTCAATC ATTTTCAGAA GGAGGCAACG CCGGAAATGT TACCCTCAAA
GCCCAAACAG ACATAACCGC CAATACCATA AGTTCTCACG GAAAGCAAGA GGGTGGTCAA
ATAACCATCA GATCAGAAAC CGGAAATATT GACACCAGTA GTGGTAAATT CTTAGCCAAC
TATTCAGGAG GCGGTGACGC GGGAGACATT ACAATGGAAG CACCCCAAGG AAATATTACC
ACAAATAATA TCTATTCCTA CGCCGACGGA GACGGAGGTC AAATAAACAT TAAAGCCGGA
AACAATATTA ATATAGAAGC AAACAGTAAT ATTATTTCTG CATCAGAGCC ACCAAGTGAG
GGAAACAGTG ACAAACAAGG AAAAGGAGGA GATATTACTC TAGAAGCAGG CAACAATATT
AATACTACAG CAGCTAATAT ATATTCTGGT GCCAACGAGG GCGACACAGG ACAAATTGAT
ATTACTGCAG ATAACGCAAT AGAAACAGGT AAAATAGACT TAGCATCAGG TTTTGTCAGA
CAAGAAACAA AAGTAAACCA AAACCTGGTC CTCATCCCTA AACCAGGAGA ACCAGCCACC
AAAGGAAAAG CTGGAGACAT CAGACTCCGC AGCAGAAACA GTACAATAGA CACCACCGGT
GGCACAATAA ATTCTCGTTC CCCAGACGGC ACCGGAGATA TTATTATCAA TGCCAAAGGA
AATATTAGCA CAGGGAAATT AGAAGCCAGC GCCCTTAACC CAGACAACCC TACCACAGGT
GGAGACGTTA ATATTACCAG CGAGCAAGGA GAAATTAATG CAACTCAAAA CATAGAAACC
TTCTCAGAAA AAGGTACAGC AGGAGATGTA AATATTACTG CAGCGGGTCA TATCAATACA
AACACCATTC GTTCAGATGG AATGGAACAG GGAGGAGATA TTAATATTAG GAGCGATAGT
GAAAGTAGCA TAGATGCAGC AGGAGCATTA CAAACATATT CAGATGCAGG AACAGCAGGA
AACGTCAACC TCACATCCCC AGGGAATGTT AATATTAGCG GCATTCGTTC AGAAGGAATG
GAGCAGGGAG GTGATATTAC ACTGAAGAGT CAAGGAGGAG AAATTAACTC TACAGGCGAT
ATAGACTCCT ATTCAAAACA AGGAAAAGGA GGATACGTCA AAGTAGATGC CCCAGAAAGA
GTAAATTTAG CAAATGTATC ATCCTACGGC ATGACAGAAA GTGGTGACTT AATTACTCAG
AGCCAACAAG CAGAAGTTAA TACAGGAAAT GTCACAACAC AAGCTTCAGA GGGAAAGAGT
GGACGTATAG TCATCAACGG AACAGAAGTA GGTACGGGAA ATTTAAGTTC TATCGCAGGA
ACAAGCGCCG GGGAAATTAA TGTAGAAGCA ACAGATGGTT CTATTGCCAC CTATGATATA
GAAATGACAT CAGGTGGTAC ACTAGGAGCT TTAACACTGA GAGCACCAGA AAATATTAAC
ACAGGAGACA TTCGTCAAAA AGCAGGAGAG GGAGATGCGA ATGCAAATAT ATTTTCAGGA
GGAAACCAAA CCACAGGAAA TATTAGTCAA GAAGCAGGCA ATAATACTAA CCTAAATCAA
AATGCAGGAG AGAATATAAA TGCAGGGAAT ATCGAACAAA ATGCTGGTAA TAATACTAAC
CTAAATCAAA CTGCAGGAGA GAATATAAAT GCAGGGAATA TCGAACAAAA TGCTGGTAAT
AATACTAACC TAAATCAAAC TGCAGGAGAG AATATAAATG TAGGGAATAT CGAACAAAAT
GCGGGTAATA ATACTAACCT AAATCAAACT GCAGGAGAGA ATATAAATAC AGGGGATATC
GAACAAAATG CGGGTAATAA TACTAACCTA AATCAAACTG CAGGAGAGAA TATAAATACA
GGGGATATCG AACAAAATGC GGGTAATAAT ACTAACCTAA ATCAAACTGC AGGAGAGAAT
ATAAATACAG GGGATATCGA ACAAAATGCT AGTAATAATA CTAGTATTTA TCAAATTGCA
GAAGGAGAGA TTAATACACC AGTCATTAAT CAAACTTTCG GTAATGAGAC TACGCTCAAT
CAAATATCAG TAAGAGATGT TAATACACCA GTAATTTCTA ATAACAATAT ACCTAATAAT
CAAGGACTTA ATCAAGAAAA TTTCTCTAAT AATAACAATA ACAATATTGA GAATAATTCC
ACTACAAATT CCTCAAACAA TAAAAACATT TCATCAACCC TTACCCAGTC TCAAAGGTCT
GAATTAATTT CAACTTCTAC CCCCAGCAAC AATAACCAAA CAACTAATAC AAATACAGCC
CAAGAACAAA CAGAATCATC TATTAATAGT ACAACGGATA CCCAAAAAAT CTTAAACATA
ATTGATACCG TCAATACCAA TTCCTTGACA GTTGCGACTG GTTCAGACCA AGTAATAACA
ATGTTAGAAC AAAACCTTAC TAACGAATAT TCTAATTACT TTGGAACAGA TTTTAAGGAA
CAATTTATTA ACCAAAAAAC ACCCCGAGAA ATCTTAACCG ATATGGCTGC TAAAACAGGA
AAAGAATCTG CAGTTGTTTA TATTAATGCT TATCCAGAGG AATTACAAAT AATTTTATAT
ACCAAAGATG GTCAACCTAT TCTTAAAACT ATCCCGGAAG CTAACCGTAA AAAATTAGAG
AAAGTAGTTA TAAATTTCCT CAAATTAACG ACAAGTCCTG CCTATCGTGA TTTTAATAGT
TATCTATCAC CAGCAAAACA ATTATATGAT TGGTTCATTG CACCTATATC AGCAGAATTA
GAAGCAGCAA ATATCGATAC CTTATTGTTC AGTATGGGTG AAGGTTTACG TATTTTACCA
GTGGCAGCAT TACATGATGG AAAGCAATTT TTAATTGAGA AATATAGCCT AAGTTTAATT
CCGAGTATCA GCTTAATGGA TACAAATTAT CGCCCACTTC AAGGTACTCA AGTATTAGCT
ATGGGAGCTA GTAAATTTAT CAATGAAAAA CCTTTACCAG CAGTACCTGT GGAGATAGAA
ACAATTTCTG AACAGTTATG GGAAGGTAGT AAATTTTTAA ACGAAGAATT TACCAAGAAT
AATTTGTTAA CTCAAAGAAA AAATTATCCC TATCCTATTA TTCACTTAGC TACCCATGCA
ACATTTAATA GAGGAAAACC TAGTAATTCT TACATTCAGC TATGGGGTAA TGAACAAATA
AAGTTAGACC AAGTGCGGGA GTTGGGTTGG AGTACCCCAT CGGTTGACTT GTTGGTATTG
TCTGCTTGTC GCACTGCTGT AGGAAATAGA GAAGCGGAAT TAGGGTTTGC AGGGTTAGCA
GTAGCAGCGG GAGTGAAGTC AGCCTTAACG AGCCTTTGGA CTGTAAGTGA TGAAGGGACA
TTGGCATTAA TGACAGAGTT TTATACTCAT TTGAATGATG CTAAGATCAA GTCAGAGGCA
TTAAGACAGG CGCAGTTAGC AATGCTGCAA GGGCAGGTGC TTATTACAGG TGGGGAGTTG
AGGGGAAGCA GCACTCGTGG TGGGGTGGAG CTACCTTCGG CGTTCGCAAA TGTAAACAAT
CAAAATTTAT CTCATCCTTA TTACTGGGCA GGGTTTACTA TAGTTGGTAG TCCTTGGTAA
 
Protein sequence
MKPNSSKLCT ITALSLILGT LATTPANSQP ITPAKDGTNT TVTPQGQQFH IKGGTRSGTN 
VFHSFDKFNV HSGQTANFVT TPDTRNVLGR VKGGASYING LIQVLGSSSN LFLMNPAGIM
FGPNASLNVP ASFSVTTATG IGFDQNNFWF KAMGTNDYSN LLGSPSGYRF DVSKPGSIVN
EGSLTLNSGE NLTLLGGTVV NTGELSSPGG NITVTAVEGG STLKISQPGH LLSLEVPLED
GENISNIDPL SLPELLAGGG DIVEATSVIV KENGDVVLTG SNTMVAETPG TATISGKIDV
STTASSLQNE GGASFGGQVN VWGDRVALID TNIKADGKNG GGTVLIGGDF QGLGIVPNSQ
HTFVNNNSFI SADAITNGDG GQVRIWSDGI TNFAGNISAK GGTSSGNGGL VKIGGKEQLI
FDGKVDVTAA LGTEGTILLD PESVTVGEDN SEGEKEIVDD NSEVSETENT DNYNGEIVDD
NSEVSETENT DNFKDKKPDT PKDKKPDTPL DPFAADENSD VTISADNLGE LSGNVIIDAD
NDITINERIE TDSSVELKAG RSININADID TKSGNGNIDL LGNNDEMNLA NRSDGKGSIN
QLDGTILNAG SGGINIKLGS LGEVGDINLG NLRTTGKVLV DANGGNIVRV SENSLINAGS
VLFRTSGNGG IGFLGQPLRL DVQNLEAVSG SGGVFFDVGN VNIGGVSEDV VGIATFGGDV
DIKSAGNVTL NETISSNEVV ENNSEGGTTE NTEVVDGGGG INIEAAGDIV ATGSGIKGGG
EAVSLSGTNI RINDEFDETS GDADVKLSAT NDIVVEDIED DVLEFMPGSG EIEFRADKDG
DGFGLVKMLD NKPDVGSNPD IFENGADTIK TNGRGLTIAG AGLVLGNVDT SWLPIYSGGG
ELLKAIDVDK GGAIPPEGTE GTATFTFTVD GDLGTVENID VRFSAAHTWD EDLDVSLESP
QGKVVQLFSR VGGSGENFQD TVLDDDASRR IISGNAPFDG TYRPQGSLAD FNGENPNGAW
TLKVTDTYPW ADDGTLYRAG ETAPWGTAIG TQLLLRNPLV KSGGGIGSGN GGAINLEATH
GDISVGNIRS LSETANGGRI DLNANKDIIS GLINSSSVQG NGGAIDLDAG GDITTQYLNS
WSSSWEKGNS GNGGAIDLDA GGDITTPYLN SGSSSSSGNS GNGGAIDLDA GGDITTQYLN
SWSSSSSGNS GNGGAIDLDA GGDITTQYLN SGSYSGSGNS GNGGAIDLDA GGDITTQDLD
SGSDSWEGKS GNGGAIDLVA GGDITTQNLD SRSDSSSGNS GNGGAIDLDA GGDITTQYLY
SGSSSWEKGN SGNGGAIDLD AGGDITTQYL YSGSISWSGN SGNGGAIDLV AGGDITTQYL
YSGSYSWSGN SGNGGDITLN AKTIKFNSPQ HANGEKIKIH TFSVGKKESE EGKGGDVNIT
TNNLSNTDIL TLSSHSESGK VTIESKTQEP LQIKDSSIIT SEQVTVEIFE EQIQIETGNT
QSGDVFINSD GDLNLNNVTI ESDTKSNQAA GDVNIYSRGN ITLENTDIIS TTNSQGNAGQ
ITLETNENIE LTNNSKILAN TEGTGNAGQI NIEANNLILD QNTKLITETA RAGNPGNIKI
QANTIDIGEG AKASTTVLTG STSTGEGGNI TINTNKLNVT GKLGIFAETE ASKNAGTLRI
SPYKNNPNLD ITFKNDGFIS ASTSSTGNGG NIFIKAPENI NITGQGFIAT KTSGTGNAGI
IDIKTNNLRI SNGVKINAST EDQGNAGEIK INTTDFTLEK GTSLTTETSS AGLAGNIEIN
TKNLTIGQNA QISATALEGA SNKEAGGNIT INANNLDISG KLGIFAETAG ESPAGTLTLN
PYKNNPNLNI EFKQQGFISA RTSSSGNGGN INIQAPEKIN ITGDGKISAE TTGSGNAGTI
NIQTENLNLS EQVAISAETN SQGQAGNIEI NSQTVTIGKG TEISATAGKK ATSTGDGGNI
TINTNDLEIS GKLGIFAETK GASNAGTLTI TPYQTNPDLN SKTDPNINIT FTDQGFISAS
TKSLGKGGDI NILAPENINI TGDGRITVES EGSGDAGIIN IETENLTIAE NTKISASTSD
SGNGGEIKIN SSETFQLQGR ILTETTGTGD GGIINIEAGE ITAPNSKISA KTTDAGNAGT
IDITAQGDIT TGVVTSAAKN KRETADGGSI SITSEQGKIN ATRAIQSFSE GGNAGNVTLK
AQTDITANTI SSHGKQEGGQ ITIRSETGNI DTSSGKFLAN YSGGGDAGDI TMEAPQGNIT
TNNIYSYADG DGGQINIKAG NNINIEANSN IISASEPPSE GNSDKQGKGG DITLEAGNNI
NTTAANIYSG ANEGDTGQID ITADNAIETG KIDLASGFVR QETKVNQNLV LIPKPGEPAT
KGKAGDIRLR SRNSTIDTTG GTINSRSPDG TGDIIINAKG NISTGKLEAS ALNPDNPTTG
GDVNITSEQG EINATQNIET FSEKGTAGDV NITAAGHINT NTIRSDGMEQ GGDINIRSDS
ESSIDAAGAL QTYSDAGTAG NVNLTSPGNV NISGIRSEGM EQGGDITLKS QGGEINSTGD
IDSYSKQGKG GYVKVDAPER VNLANVSSYG MTESGDLITQ SQQAEVNTGN VTTQASEGKS
GRIVINGTEV GTGNLSSIAG TSAGEINVEA TDGSIATYDI EMTSGGTLGA LTLRAPENIN
TGDIRQKAGE GDANANIFSG GNQTTGNISQ EAGNNTNLNQ NAGENINAGN IEQNAGNNTN
LNQTAGENIN AGNIEQNAGN NTNLNQTAGE NINVGNIEQN AGNNTNLNQT AGENINTGDI
EQNAGNNTNL NQTAGENINT GDIEQNAGNN TNLNQTAGEN INTGDIEQNA SNNTSIYQIA
EGEINTPVIN QTFGNETTLN QISVRDVNTP VISNNNIPNN QGLNQENFSN NNNNNIENNS
TTNSSNNKNI SSTLTQSQRS ELISTSTPSN NNQTTNTNTA QEQTESSINS TTDTQKILNI
IDTVNTNSLT VATGSDQVIT MLEQNLTNEY SNYFGTDFKE QFINQKTPRE ILTDMAAKTG
KESAVVYINA YPEELQIILY TKDGQPILKT IPEANRKKLE KVVINFLKLT TSPAYRDFNS
YLSPAKQLYD WFIAPISAEL EAANIDTLLF SMGEGLRILP VAALHDGKQF LIEKYSLSLI
PSISLMDTNY RPLQGTQVLA MGASKFINEK PLPAVPVEIE TISEQLWEGS KFLNEEFTKN
NLLTQRKNYP YPIIHLATHA TFNRGKPSNS YIQLWGNEQI KLDQVRELGW STPSVDLLVL
SACRTAVGNR EAELGFAGLA VAAGVKSALT SLWTVSDEGT LALMTEFYTH LNDAKIKSEA
LRQAQLAMLQ GQVLITGGEL RGSSTRGGVE LPSAFANVNN QNLSHPYYWA GFTIVGSPW