Gene BURPS1106A_A2802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2802 
Symbol 
ID4906420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2729187 
End bp2738612 
Gene Length9426 bp 
Protein Length3141 aa 
Translation table11 
GC content63% 
IMG OID640145905 
Producthaemagglutinin 
Protein accessionYP_001076831 
Protein GI126455471 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGA ATCGCTATCG GGTTGTCTTC AATCGCGCGC GCGGCGCACT CATGGTGGTG 
CAAGAGAATG GCCGTGCGTC CCACGGTAGC GGCAGCCGCG ACGCGCGCGC GGGCGTCGTC
CCTGCTTGGC TGTCGCTCTC GCCGTTTGCG TTGCGCCATG TTGCACTCGC GGTACTGGTT
GCGGCCGGCG TGGTGCCGAT ATGGGTCAAT GCCCAGGTTG TCGCAGGCGG CGCACACGCC
CCGTCCGTGA TCCAGACACA GAATGGACTT CAACAGGTGA ACATCAACCG CCCCGGCGCG
TCGGGCGTGT CGATGAACAC GTACAACCAG TTCGACGTAC CCAAACCCGG GATTATTCTC
AACAACTCCC CGATTAACGT CCAGACTCAA CTGGGCGGCA TTATCGGCGG CAACCCGAAT
TTCCAGGCCG GCGACGCGGC ACGTCTGATC GTCAACCAGG TCAACAGCAA CAATCCGAGT
TTTATCCGCG GCAAAGTCGA AATCGGGGGC GCGGCCGCCC AGCTCGTGAT CGCGAATCAG
GCTGGCTTGG TCGTGGATGG TGGCGGCTTC CTCAATACGA GCCGAGCCAC GCTGACAACC
GGCAATCCGA ACTTCGGGCC CGACGGCTCG CTCACGGGTT TCAACGTCAA CCAAGGCCTG
ATTTCGGTGG TCGGCGCGGG GCTCGATACG GCCAATGTCG ATCAGGTGGA TTTGTTGGCC
CGCGCCGTAC AAATTAACGC TAAGGCTTAC GCGAAGACCC TGAACGTGGT GGCTGGATCG
AACCAAGTCG ACTACAACAC GCTGAACGCC ACGCCGATCG CCGCCAACGG TCCCGCGCCC
ACCATCGCGA TCGATGTGAG CCAGCTTGGC GGAATGTATG CGAATAGGGT CTTTCTCGTC
TCGTCAGAGA ATGGCGTTGG TGTTGCCAAT GCCGGCGACA TCGCCGCGCA AGCCGGCGAC
CTGACGCTGC AAGCGAACGG CCGCCTCGTC CTGTCGGGCC ACACCAATGC GGCCGGCAAT
ATGTCGCTGT CAGCCTCGGG TGGCATTCAG AACAGCGGCG TCACGTATGG CAAGCAATCG
GTCACGATCA CCACGGGCGC GGACCTGACC AACAGCGGCG CGCTGACCGC CCAACAAAAC
CTGACCGCTA ACGTCGGCAG CCTCAACTCG ACGGGCACGC TCGGCGCAGG CATCAACGTC
GACAGCACGG TCGGCACGAG CGGCGATCTG AACGTCACGA GCAGCGGCCA GTTGACCGCG
ACGGGGACCA ACAGCGCGGC TGGCAACGCG ACCTTCACCG GTAGCGGCGT CAATCTGTCG
AACAGCGCAA CGGCCGCCAA CGGCAACCTC GCGCTGAGCG CGACGGCGGG CGATGTGAAT
CTGGCCGGCT CGACTGTCAG CGCCAAGGGC GCCGTGAACG CGCAGGCGAG CGGCACGGTC
GTCAACGACC GGGGCAATCT GTCCAGCGGC GCGGGCATGA CGCTTGGCGG CGGCAGCCTG
TCGAACCAGG GCGGCCGTGC CAATTCGCAA GGCCCGCTGT CGGTGCAGAT GGCCGGCACC
GTGTCGAACC AAAACGGCAT GCTGAGTTCC CAAAGCACGG CCGACGTTCG CGGGAGTGCG
ATCCAGAACA ATGCCGGGCT GATTCAGAGC GCAGGCAAGC AAACGATTGC TGGCGCATCG
ATCGACAATT CGGCAGGCCG GCTGATTTCC CTGAACGCAG ACGGCCTGTC GGTGACGGCA
ACCGGCGCAC TCACGAACGC GGCCGGCGCG AACGTGAGTG GCGATCCGGG CGGCGTCATC
GGCTGCAAGG GCGACGTGAC CGTACAGGGG AACACGGTCA CGAACAGCGG CTCGATGTCG
GCCGACGCGA CCCTGCACGT GATCGGCCAG AGCGTCGACA ACGGTAATGG CGCGCTGCAC
GCCGGACAGA CTACCACCGT CGACGCCGGC AATCATCTTT CGAATGCCGG CGGCCGGGTT
GAGGGTCAAA GCGCCGTGTT GAACGGCGCG ACGCTGGACA ACTCCCAGGG CACGGTCAAT
GCGGCGACGG TCTCGCTGAA CGGCACGACG TTGCTGAACC ACGGCGGGAC GGTCACGCAG
ACCGGCACCG GCCCGATGAC GGTCGCGATT ACCGACACCC TCGACAACTC GAACAACGGC
CTGATCCAGA CGCGCAGCAC GGATCTGTCA CTCACGTCCA CTACCCTGAT CAACGACAAC
GGCGGCACGA TCACGCACGT TGGCCCGGGC ACGCTGACGG TCGGCAACGG TTCGGGCACG
GTCAGCAACA AAGCGGGCGC CATCGCCAGC AACGGCCGAA CCGTCCTGCA AGGCAAGACG
ATCGACAACT CGGCCGGTTC GGCGTCCGGC CAAACGGGCC TGAGCGTCAA CGCCGCCGAC
TCGATTACGA ATCTGGGCGG CAAGCTCACC TCGAACGCCA ACGTCGACGT GACCGCAGGT
GGCGCACTGG TCAACGACGG CGGTGAACTC GGCTCAAAGA CGGCAGCGAC GACGATTCAT
AGCGCCTCGC TCTCGAATCT GAACGGCAAG ATCGTCTCCC CCACTTTGAC GGCAACCGTT
GCTGGCTTGC TCGACAACAG CCAGAACGGC GACTTCGAAG CCAATCAGCT TGCGTTGACG
GCCGCGAACC TGAAAAACCA GGGCGGCCAC ATCTCGCAAT GGCAGAGCGG CCCGACCACA
CTCGCCGTCT CTGGCACGCT CGACAATTCG AACGGCGGCG TCATCCAGAC GAACAGCACG
GACCTGACGC TCGCGCCGGC CGTTCTGGAC AACTCGAAAG GCACCATCAC GCACGGTGGG
ACCGGCACGC TGACGCTCAC GCCCGGCAAC GGCGCGGGAG CCTTGCAAAA TACTGGCGGC
ACGATCGGCA CCAACGGACA GGCCATCGTG AAGGCCGGCA GTCTGGACAA TGGTAGTGGG
GTCATCGCCG CCAAACTGGG TCTGTCGGCC ACGATCGCCG GGGCGATGAA CAACGCTCAG
GGCTTGATGC GCTCGAACGC GGCGCTGTCG ATCATCAGCA ACGGGGCGCT GTCGAACCAG
CAAGGGCACA TCGAGGCGGG CACAGTCGGC GACACGAGCA CGCTGTCGAT TCAGGCCGCT
TCGATCGACA ACACCGATGG CGCGGTGCAC GACTTCGGCA CGGGCAAGAT GACCGTGCAA
GGCGGCAGTC AGATCGTGAA CAGCCACGCG GGCGGCGTCG ACGGCATGGG GCAGATGACC
GGCCAGGGCG ACGTGACAAT CGGTGCGGCC TCGATCTCGA ACACGCAGGG CGGCCAGTTG
ATGGGCGCGA ACCTGCTGAT TCAAGGCGCT ACGCTGGACA ACTCCGGAGG ACAGGTCGGC
AACGTCGCGA ACGCCACGGG CGACGTGAAC GTCGCGATGT CGGGCGCGGT GACGAATACC
AACGGGTCGA TCACCTCGAC GCGCGACCTG TCGGTAGCGG CTTCCACGCT GCTCGGCGGC
GGCGCATACA GCGCGGCGCA CGACGCGACG ATCAATCTGC AAGGCGACTT CACGACGACG
CCGCAAACGC AGTTCAACAT TGGTCGCGAC CTGACGTTTA CCTTGCCGGG CACGTTCGCC
AACAGCGCGA ACCTGCAATC GGTCCATAAC CTGACGGTCA ACGCCGGCAA CATTGTCAAC
ACGGGCGCGA TGACGGCAGG CAGCCTGCTC AGTACGCATT CGGGCGACCT GACGAACTAT
GGCGCGATGG TCGGCGGCAG CGTCGCCATC CAGGCGAGCG GCACGGTGTC GAACCTCGGC
CCGGTCGCGC TGATCGGCGC ATCGGACACG TCGGGCCTGC TGGAAATCGT CGCGCACGAC
ATCGAGAACC GCGACGACAC GACGTTGGGC GATTCGATGC CGACCACGAC GATCTTCGGG
CTCGGCAAAG TGGCTTTGGC GGGCGGCAAG GACGCGAACG GCAACTATAC GAACGCAGCG
CTGATCAACA ATTCGTCGGC GGCGATCCAA TCGGGCGCTT CGATGGAACT GCACGCCGAC
AAGGTCACGA ACACGCGGCG CGTGATGCAG ACCTCGGGCA ACACGAGCCA GGTCGATCCG
GCATTGCTGC AACAGCTCGG CATCAGCATG TCCGGGTGCG CCGCCTACTA CATTGCGGCC
TGTAGTGGTC AGGACGTGCA CTGGATCAAC CTGTTCCACG ATCCGAACTA CCCCGATTAC
GATCCCGCGC CGATCATTGC CGCGCTCAAA TTGCAGCCGG GCGGGGTCTT CACCGTTCCA
CCGAACGGCG GCCAATGGAA CAGCGGGTAT CAATACACGA CCTATGAGGG CAAGGCGACC
GCCAACACCG TGACGAAGCT GAGCCCGGGC GCGCAGATCG CATCGGGCGG CGATCTCGAC
GCGTCGACGG TCAAGACATT CCAGAACTAC TGGAGCAGTG TGACGGCGGC CGGCAACATC
AAGCAGCCCG CGAGCCTCGA CATGGACGGC TGGGGCGCGA CGGGTCAGCA GGCGCCGGGC
GTGACGGTCG TGTATTCCGG CTACTACCAC TACAACAACT ACGACAACTC GGAACACAAC
TGGACCTTGC CGTTCGGCGA CAAGCCGTTC GTCGGCGGAC CAGGCGGCTA TACGCAGGCC
GCGCCGGCCG ATGTGCGGCA ATACAGCTTG CCCGACTATC GCTCGACGTG GGGCGCGAAC
GGCACGATCT CGGGCAACGG CGTGAGCGTG AACAACACGG CGGCCAATGC GACCATTCCA
TCGCTCGGCC TGCTGCCCGG CCAGGCCGTG CCGGGGCTGA CGATCGGCAC GGTCAGCGGC
AACGCGAGCG GTACGCAGTC GGGAGCCGCC GCGATCAAGG GCGGCACGCC CACCTGGGTC
GATCCGGTGA TCGCGAGCGC GACGGCCGTG AACGTGTTGA GCAACCTGAC GATTCCGCAA
GGCGGCCTGT ACCGACCGAA CTCGGCACCG AACCCGACGT ACCTGATCGA GACGAACCCC
GCGTTCACGC GGATGAACAA TTTCCTGTCG AGCGACTATT ACCTGAACCA GATCGGCGTG
AATCCGCTGA CGACCGAGAA GCGCCTTGGA GATGGATTCT ACGAGCAGCA GCTCGTGCGT
AACCAGGTCA CGCAACTGAC GGGCAAGGCG GTATTGGGCC CGTATACGGA CCTTCAGGGC
ATGTATCAGT CGTTGATGCT CGCGGGCGCG GAATTGTCGA AGTCGCTGAA CTTGCCGCTC
GGCATGAGCC TGTCGGCGCA GCAGGTGGCC GCGCTCACGA CCAACGTGAT CATCATGCAG
ACCGAGACGG TCGGCGGCCA GCAGGTATTG GTGCCGGTCG TGTATCTGGC GAAGGCCGAT
CAGCAGAACG CGAACGGGCC GCTGATCACG GCTGGCAATA TCGATTTGAA GAACACGCAG
GTCTTCACGA ACAGCGGGAC CGTGAAGGCG GACACGACGC TCGCGCTTCA GGGCAAGCAG
ATCGACAACG CGTTCGGTGC GCTGCAAAGC GGCGGCTTGA CGTCGCTCGA CACGACAGGC
AACGTGGATC TCACCTCGGC CAATGTGAAG GCCGGTAGCC TCGACCTGAA CGCGGGCAAC
AAGCTGATTC TGGATACGGC GACGCAGACG ACGCACCAGG TCAGCCGCGA CGGCGCGACG
AGCGACAAGA CGACGCTCGG GCCGGCTGCG AACCTGAACG TGGCTGGCGA CGCGTCGATC
AAGACCGGCG GCGACTTCCA GCAGAACGCG GGCAACCTGA ACGTCGGCGG GAACCTGAAC
GCCAATATCG GCGGGAACTG GAATCTAGGC GTGCAGCAGA CCGGCGAACA CAAGGTCGTG
CAGCGTGCGA ACGGCGTGTC GGATACGGAC CTCAACAGCG CGACCGGCAG CACCGTGAAC
GTCGGCGGCA AGTCGGCCAT CGGTGTGGGC GGTGACTTGA CGGCGCAAGG TGCGCGGCTC
GACTTCGGCC AAGGCGGCAC GGTCGCGGCC AAGGGCAACG TGACCTTCGG TGCGGCGAGC
ACCACGTCGA CCATCAACGC TAACAGCTCG GGCGACCAGG GGAATCGCAG CTATGCGGAA
ACCCGGCATG GGTCGGACCA GGCGCTGACA GGCACGACCG TCAAGGGCGG CGATACGCTG
AATGTGGTGT CGGGCAAGGA CATCAATGTC ATCGGCAGCA CGATCGACCT GAAGAAAGGC
GATGCGAACC TGCTGGCGGC CGGCGATGTG AATGTCGGGG CCGTGACCGA AACGCACGTG
TACAACTCGC GCGAGACGCA TAGCCGTAGC GGCGTCGTGA GCGGCACGAA AATCGCCAGC
AGCCAGGACG CGACCAGCAC CGTCGCGAAC GGCAGCCTGA TTTCGGCGGA CGGCGTGTCG
ATCGGCAGCG GCAAGGACAT CAATGTTCAA GGTAGTACGG TCGTCGGTAC GCACGATGTG
GCGCTGAATG CGGCGCACGA CGTGAACATA ACGACGTCGC AGGACACCAG CCAATCGTCG
ACCACCTATC AGGAACAGCA CTCGGGTTTG ATGTCGGGCG GCGGTCTGTC TTTCTCGGTC
GGCAACAGCA AGCTCGCGCA GCAGAATCAA TCCTCGAGCG TCACGAACAA CGCCAGCACG
GTCGGCTCGG TCGACGGCAA TCTGACCGTC AATGCGGGTA ATACGCTGCA CGTGAAAGGC
AGCGATCTGG TCGCGGGCAA GGATGTGACC GGGACGGCGG CGAACATCGT CGTCGATTCG
GCCACCGACA CCACGCATCA AGCGCAACAG CAGCAGACGA GCAAGAGCGG GCTGACGGTC
GGCCTGTCGG GCTCGGTGGG CGATGCGATC AACAATGCGA TCAGCGAGAC GCAGGCCGCG
CGCGAATCGG CGAAGGATAG CAACGGCCGC GCATCGGCAC TGCATAGCAT CGCGGCGGCC
GGTGATGTGG CATTCGGCGG TTTGGGGGCC AAGGCCCTGC TGGACGGCGC GAAGGGGCCG
CAGGCCCCGA GCATCGGCGT GCAGGTCAGT GTCGGTTCGA GCCATAGTTC GATGCAGTCG
TCGGAAGACC AGACGATTCA GCGTGGCTCA AGCATCAATG CGGGAGGCAA CGCGAAGCTG
ATTGCGACGG GCAACGGCAC GCCGAAGGAC GGCAACATCA CGATCGCGGG CTCGAACGTG
AACGCGGCCA ACGTGGCGCT GATCGCGAAC AATCAGGTCA ATCTCGTCAA TACGACCGAT
ACGGACAAAA CGCAGAGTTC GAACTCGTCG TCTGGGTCGA GCGTCGGCGT GTCGATCGGC
ACGAACGGCA TCGGCGTCTC CGCATCGATG CAGCGCGCGC ACGGCGACGG GAATTCGGAC
GCGGCGATCC AGAACAACAC GCACATCAAC GCCAGCCAGA CCGCGACCAT TGTCAGCGGC
GGCGACACGA ACGTGATCGG CGCGAACGTG AACGCGAACA AGGTTGTGGC CGACGTAGGC
GGCAACCTGA ACGTGGCGAG CGTCCAGGAC ACGACCGTAA GCGCCGCGCA TCAGTCGAGC
GCGGGTGGCG GCTTCACGAT CAGCCAGACC GGCGGCGGGG CTAGTTTCAG TGCACAGAAC
GGCCACGCGG ACGGCAACTA TGCGGGCGTG AAAGAGCAGG CAGGTATCCA GGCCGGCTCG
GGCGGGTTCG ACGTGACCGT GAGGGGCAAC ACCGACCTGA AGGGCGCGTA TATCGCTAGC
ACGGCCGATG CGTCGAAGAA CAGTCTGACG ACGGGCACGC TCACGACGAG CGACATCGAG
AACCACTCGC ACTACAGCGC GAACAGTGCG GGCTTTAGCG CTGGCGCGTC AGTTGGGGTA
AGCACCAAGG CAGTTGGGCC GTCGTCGGTA TCGGGTTCGG GTGGCGTCAC GCCGATGGTG
TTCCAGAACG ACAGCGGCGA CCAAAGCGCG ACGACGAAGA GCGCGGTGAG CGCCGGCACG
ATCAATATCA CGAAGCCGGG CGAGCAGACG CAGGACGTCG CGAACCTGAA CCGCGACACG
ACGAATCTGA ACGGGACCGT TTCGAAGACG CCTGATGTAC AAAAGATGCT GTCGCAGCAG
GCCGATACGA TGAATGCGGC GCAGGCGGCC GGACAGACGG TTTCGCAGGG GATCGGGCTG
TACGCTGACG GCAAGCGTAA GGATGCGATC GACGCGGCCA AGGCCGCCTA CGAGCGCGGC
GATCTCGTGG CGATGCAGTC GTACATTGAT CAGGCGAAGA GCTGGGATGA GGGCGGAGCG
TCACGTGCGG GCTTGCAAGC AACTGGCGGT GCGCTGATCG GCGGTCTTGG CGGCGGCAGC
GTGCTCACGG CGATTGGCGG TGCAGCGGGA GCCGGCACGT CGTCACTGCT GGCTGGCCAG
GCAGAGAAGA TCAGCAAATC GGTGGGCGAT ATGACCGGTT CGTCGCTGGT CGGCAACATC
GCTGCGAACG TTGCGGCAAC GGTCGGTGGC GCACTGGTGG GAGGCAGTGC GGGCGCGGCA
ATGGCATCGA ACGTCGAGCT TTACAACGCA GGCAACGACC CCCAAAAAAC GGATGACCGA
GCGACAATCG CGGGACTGCA GGGGCTGCTC AGTCGGACTG CTGCGATGGC TTCGGATGCC
AAGGCCGGTG TCTGGAACGG AATGGTCAAC GTCGCGGGCG TGATCGTCAA TATTCCGAAC
GGCGGGCCAT TCGCGTCGCC CGGTGATCCG GGCTATGTCT CGCTGGATGG GCTGAAGAAG
CCGTATAAAT CTGGGACTTC AATCGGCCCG GATACTGAAT TCCTGACGCC TATCTTGGCG
ACGCTGGGTC TCGGCGGGAA AGCGGCAGTC GGAACTGATG CGGGAATAAC ATCGGCGGAT
GTCGCCACGG TTGGAAACGG TGCGCTGAAG AATGCGAGCG GTGATCTCTC CGCGGCAGCG
AATTCGGCGA GGAATCAGCC GTATGGTCAG GGAGCAAGCG CCAGTCAGTC TCCGGGAACC
CAAGGGGCAA GCTCAGGTAG TAATATCTCG GCATCAAACG GTTCATCGAG TCCCACGACA
ATTGTTGCGA GCAATCCAGT TGATCTGAAT GCTTTCGATC GATTGAACGT TGTTGATCCG
GCAGTAGGTA AATTTAGACC AGGTGAAGCC GGAGCAGCGG CAGAGCTTGA AAATTATCTG
GGTGGCACGC TCCAACGTGC GCCTCAGGGC TCCTCGGTTG ACTTTGTATT CAGCTCCGGG
CCGAACAACG GTAAGACCGT GGATTTTATG CTTACGCCGG ATACGGTCGC GCAGGCGGCA
AAGATAAATC AGTTTTTTGA TAAAAATCTT AATAATTTCA TGAACACTCT TTCGGATCAT
GCGGCTGCTG CGGATTTTGT GCCGTTGGAT TCTAGATTTT TGAGTGAGGC AAACAAAACA
TTGCTTGTCA AGGCTATTGG CAATCTCCCG CAAAAACTGC AGGCGAAGAT TATTCTGATC
AAGTGA
 
Protein sequence
MNKNRYRVVF NRARGALMVV QENGRASHGS GSRDARAGVV PAWLSLSPFA LRHVALAVLV 
AAGVVPIWVN AQVVAGGAHA PSVIQTQNGL QQVNINRPGA SGVSMNTYNQ FDVPKPGIIL
NNSPINVQTQ LGGIIGGNPN FQAGDAARLI VNQVNSNNPS FIRGKVEIGG AAAQLVIANQ
AGLVVDGGGF LNTSRATLTT GNPNFGPDGS LTGFNVNQGL ISVVGAGLDT ANVDQVDLLA
RAVQINAKAY AKTLNVVAGS NQVDYNTLNA TPIAANGPAP TIAIDVSQLG GMYANRVFLV
SSENGVGVAN AGDIAAQAGD LTLQANGRLV LSGHTNAAGN MSLSASGGIQ NSGVTYGKQS
VTITTGADLT NSGALTAQQN LTANVGSLNS TGTLGAGINV DSTVGTSGDL NVTSSGQLTA
TGTNSAAGNA TFTGSGVNLS NSATAANGNL ALSATAGDVN LAGSTVSAKG AVNAQASGTV
VNDRGNLSSG AGMTLGGGSL SNQGGRANSQ GPLSVQMAGT VSNQNGMLSS QSTADVRGSA
IQNNAGLIQS AGKQTIAGAS IDNSAGRLIS LNADGLSVTA TGALTNAAGA NVSGDPGGVI
GCKGDVTVQG NTVTNSGSMS ADATLHVIGQ SVDNGNGALH AGQTTTVDAG NHLSNAGGRV
EGQSAVLNGA TLDNSQGTVN AATVSLNGTT LLNHGGTVTQ TGTGPMTVAI TDTLDNSNNG
LIQTRSTDLS LTSTTLINDN GGTITHVGPG TLTVGNGSGT VSNKAGAIAS NGRTVLQGKT
IDNSAGSASG QTGLSVNAAD SITNLGGKLT SNANVDVTAG GALVNDGGEL GSKTAATTIH
SASLSNLNGK IVSPTLTATV AGLLDNSQNG DFEANQLALT AANLKNQGGH ISQWQSGPTT
LAVSGTLDNS NGGVIQTNST DLTLAPAVLD NSKGTITHGG TGTLTLTPGN GAGALQNTGG
TIGTNGQAIV KAGSLDNGSG VIAAKLGLSA TIAGAMNNAQ GLMRSNAALS IISNGALSNQ
QGHIEAGTVG DTSTLSIQAA SIDNTDGAVH DFGTGKMTVQ GGSQIVNSHA GGVDGMGQMT
GQGDVTIGAA SISNTQGGQL MGANLLIQGA TLDNSGGQVG NVANATGDVN VAMSGAVTNT
NGSITSTRDL SVAASTLLGG GAYSAAHDAT INLQGDFTTT PQTQFNIGRD LTFTLPGTFA
NSANLQSVHN LTVNAGNIVN TGAMTAGSLL STHSGDLTNY GAMVGGSVAI QASGTVSNLG
PVALIGASDT SGLLEIVAHD IENRDDTTLG DSMPTTTIFG LGKVALAGGK DANGNYTNAA
LINNSSAAIQ SGASMELHAD KVTNTRRVMQ TSGNTSQVDP ALLQQLGISM SGCAAYYIAA
CSGQDVHWIN LFHDPNYPDY DPAPIIAALK LQPGGVFTVP PNGGQWNSGY QYTTYEGKAT
ANTVTKLSPG AQIASGGDLD ASTVKTFQNY WSSVTAAGNI KQPASLDMDG WGATGQQAPG
VTVVYSGYYH YNNYDNSEHN WTLPFGDKPF VGGPGGYTQA APADVRQYSL PDYRSTWGAN
GTISGNGVSV NNTAANATIP SLGLLPGQAV PGLTIGTVSG NASGTQSGAA AIKGGTPTWV
DPVIASATAV NVLSNLTIPQ GGLYRPNSAP NPTYLIETNP AFTRMNNFLS SDYYLNQIGV
NPLTTEKRLG DGFYEQQLVR NQVTQLTGKA VLGPYTDLQG MYQSLMLAGA ELSKSLNLPL
GMSLSAQQVA ALTTNVIIMQ TETVGGQQVL VPVVYLAKAD QQNANGPLIT AGNIDLKNTQ
VFTNSGTVKA DTTLALQGKQ IDNAFGALQS GGLTSLDTTG NVDLTSANVK AGSLDLNAGN
KLILDTATQT THQVSRDGAT SDKTTLGPAA NLNVAGDASI KTGGDFQQNA GNLNVGGNLN
ANIGGNWNLG VQQTGEHKVV QRANGVSDTD LNSATGSTVN VGGKSAIGVG GDLTAQGARL
DFGQGGTVAA KGNVTFGAAS TTSTINANSS GDQGNRSYAE TRHGSDQALT GTTVKGGDTL
NVVSGKDINV IGSTIDLKKG DANLLAAGDV NVGAVTETHV YNSRETHSRS GVVSGTKIAS
SQDATSTVAN GSLISADGVS IGSGKDINVQ GSTVVGTHDV ALNAAHDVNI TTSQDTSQSS
TTYQEQHSGL MSGGGLSFSV GNSKLAQQNQ SSSVTNNAST VGSVDGNLTV NAGNTLHVKG
SDLVAGKDVT GTAANIVVDS ATDTTHQAQQ QQTSKSGLTV GLSGSVGDAI NNAISETQAA
RESAKDSNGR ASALHSIAAA GDVAFGGLGA KALLDGAKGP QAPSIGVQVS VGSSHSSMQS
SEDQTIQRGS SINAGGNAKL IATGNGTPKD GNITIAGSNV NAANVALIAN NQVNLVNTTD
TDKTQSSNSS SGSSVGVSIG TNGIGVSASM QRAHGDGNSD AAIQNNTHIN ASQTATIVSG
GDTNVIGANV NANKVVADVG GNLNVASVQD TTVSAAHQSS AGGGFTISQT GGGASFSAQN
GHADGNYAGV KEQAGIQAGS GGFDVTVRGN TDLKGAYIAS TADASKNSLT TGTLTTSDIE
NHSHYSANSA GFSAGASVGV STKAVGPSSV SGSGGVTPMV FQNDSGDQSA TTKSAVSAGT
INITKPGEQT QDVANLNRDT TNLNGTVSKT PDVQKMLSQQ ADTMNAAQAA GQTVSQGIGL
YADGKRKDAI DAAKAAYERG DLVAMQSYID QAKSWDEGGA SRAGLQATGG ALIGGLGGGS
VLTAIGGAAG AGTSSLLAGQ AEKISKSVGD MTGSSLVGNI AANVAATVGG ALVGGSAGAA
MASNVELYNA GNDPQKTDDR ATIAGLQGLL SRTAAMASDA KAGVWNGMVN VAGVIVNIPN
GGPFASPGDP GYVSLDGLKK PYKSGTSIGP DTEFLTPILA TLGLGGKAAV GTDAGITSAD
VATVGNGALK NASGDLSAAA NSARNQPYGQ GASASQSPGT QGASSGSNIS ASNGSSSPTT
IVASNPVDLN AFDRLNVVDP AVGKFRPGEA GAAAELENYL GGTLQRAPQG SSVDFVFSSG
PNNGKTVDFM LTPDTVAQAA KINQFFDKNL NNFMNTLSDH AAAADFVPLD SRFLSEANKT
LLVKAIGNLP QKLQAKIILI K