Gene BURPS1106A_3880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3880 
Symbol 
ID4899535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3776738 
End bp3786217 
Gene Length9480 bp 
Protein Length3159 aa 
Translation table11 
GC content62% 
IMG OID640137106 
Productfilamentous haemagglutinin/adhesin 
Protein accessionYP_001068101 
Protein GI126454501 
COG category[S] Function unknown 
COG ID[COG5654] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTGGTTTC CGTTCAACGA CTTGGCAATT GTTGACAGTC ATTCTTTTTG TGTTTCGGTA 
GGCTGGCGTT CAATAATCAA TGAACGGACG CGGGCCGCGT CGACGAGAAT GAACAAGGAT
ACGCACCGCC TGGTGTTTTC CAGGCTTCGG GGTATGGTGG TGGCCGTCGC GGAGACGGCG
ACCGCCGAGG GGAAATCGGC GACGGGCGAA GCACGTGCAC GGCCGCGATC GTCGACGTCA
GGTGTCCGCG TCGTTGTGGC AGGTGTCGCA GCGCTCGGCA TGCTGCCTAC CCTGTCCGAT
GCGCAGATCG TGCCGACGCC GGGTACCAGC ACACACGTGA TCCAGACGCC GAACGGCTTG
CCTCAAGTCA ACGTGGCCGC GCCGTCGGGC GCCGGTGTCT CAGTTAACAC CTACAACCAG
TTCGACGTCT CGCGCGCCGG CGCGATCCTG AACAACTCGG CCACGATGGT TCAGACTCAG
CAGGCGGGCT GGATCAACGG CAACCCGAAT TACAGTGCAG GCCAGGCCGC AAGGATCATT
GTCAACCAGG TCAACAGCCC GAACCCCTCC CAAATTCGTG GCGCTTTGGA GATCGCCGGC
TCCCGGGCGG AGCTCGTCCT CGCGAATCCG TCGGGCATTT ATCTTGATGG CGCGTCCTTC
ATCAACACCA GTCGCGCAAC CCTCACAACG GGCGTGCCGT ACTACGGTGC CGATGGCTCG
CTCGCTGGCT ACAATGTCAA CCGCGGGCTC GTGACGGTTG CCGGCGCGGG CCTGAATGCC
GCCAATATCG ACCAAGTGGA CCTCATCGCC CGCGCGGTGC AGGTGAACGC GGCGGTCTAC
GCGAAGAATC TGAATGTGAT CGCCGGCGCG AGCCAGGTCA ACCACGACAC GCTCGCCGCG
ACACCGATTG CCGGCGACGG CCCGGCCCCG TCGGTGGCGA TCGACGTGAG CCAGTTGGGC
GGTATGTATA GCGATCGGAT CTTTCTTGCA TCGAACGAGA ACGGCGTCGG GGTGGGCAAC
GCTGGCACGA TCGCTGCGCA AGCGGGCGAC CTGACGCTGC AGTCGAACGG CCGGCTCGTC
CTGACGGGTA AGACCACGGC GAGCGGCAAC CTCGCGCTGT CGGCCGCGGG CGGCATCCAG
AATAGCGGCA CCACGTACGC GCAGCAATCG CTGTTGGCCA GCACAAGCGC CGATCTTGCG
AACAGCGGCA CGCTTGCGGC GCAGCAGAAT ACAACGGTCA ACGCTGGCAG CGTCAACTCG
ACCGGCACGC TCGGCGCCAG CGTGAACAAC GACGGTTCCG TCACGCACAG CGGCGACCTG
AACTTGACGG CGTCGGGCCA ACTGACCGCC GCCGGCCAGA ATGTCGCGGG CGGCAACGCA
TCGCTGGCGG GCGGCAGCGT GAATCTCGCC GGCAGCCAGA CGGCAGCGAA CGGTAATCTG
TCGCTGAACG CGACAGCCGG CGACGTGAAC CTGTCGAACG CGACGACGAG CGCGCAAGGC
ACCATTCAGG CGAACGCATC GGGCACCGTG ATCAACGATC ACGGCAGCCT GTCGAGCGGT
GGCGGCACAA CGCTGACCGG TGGCAATCTG TCGAACCAGA GTGGCAAGGT TTCATCGCAG
GGGCCGTTGT CAGTCAACGT GGCTGGCCAG ATCGCCAACC AGTCCGGCGA ACTGGTATCC
GAAAGCACGG CTGACCTGAG TGGCGACGCC ATCGCGAACA ACCAGGGCAC CCTTCAAAGC
GCGGCCGGCA TGACGGTGGC TGGTGCGTCA CTGGATAACA CGGCGGGCCG AATTACGTCG
CTCAACGGAG ATGGCCTGTC GGTGACGACG AGTGGCCAGT TGACCAACGT CGCAGGGACG
ACCGCGAACG GTGCGCAAGG CGGTGTGATC GGCGGCAACG GTGATGTTTC GATCCAAGGC
GCAAACGTTG TCAATCGCGG CGCGATTACA TCCAATACGA ATCTGCGGGT ATCGGGGCAG
GCAGTCGACA ACGGTGGTGG CACGCTGCAG GCCGCGCAAA AGGTGGCTGT CGACGCGGGC
GCGCGCCTGA TCAACAACGG CGGCTCGATC GTCGGTCGGA CCGCCGCACT GACTGGTACG
ACGCTCGACA ACAGCGCCGG CACCGTGCAG GCGGACCAGA TGTCGTTGAA CGCGACCGAC
CTCGTGAATC ACGGCGGCAC GATCACGCAG ACCGGCGCCG GCGCGATCAG CGTGAACGTG
TCGGACATGC TCGACAATTC CAGCGGCGGC ACGCTGCAAA CCAACAGTAC CGACCTGACG
CTCGCCCCTG TCGCACTCGT CAACGATGGC GGCACGATCA CGCATGCGGG CAACGGCACG
CTGACGCTCG GAAGCGGTTC AGGCTCTGTG TCGAACGTCG GTGGAGCGAT TGCCAGCAAC
GGGCGCGTCG TCGCGCAAAC TGGCGCGCTG AACAACACAT CGGGCTCGAT CAACGCGCAG
AACGGACTGA CGGCAACCGT CGCCGGCACG CTGAACAATG CGAATGGCAA GCTGTTGTCG
AACACGGATC TGAGTGTCAC CAGTGGCGTG CTGGCGAACG ACGGTGGCCA GATCGGTGCC
GGCACGAATG CGACGATCCG CACGGGCTCG ATGACGAACC AGGGAGGTTC GATCGTTGCA
CCGAATCTGT CGGTCACGGC CGACTCGACG CTGGATAACA GCGGCGGCAA GCTCGAGACC
AATCAGCTGA CGCTGACGGC CGCGAACCTG ACGAACCACG GCGGCACGAT CACGCAGTAC
GGCACGGCGG CGATGGGTGT GAACGTCAGC GGCACGCTCG ACAACTCGGC TGCAGGTGTG
ATTCAGACCA ACAGCACGGA CCTGACGCTC ACCCCTGCCG AACTGAACAA CGCGGGCGGC
ACCATCACGC ATGCCGGTAC GGGCACGCTG ACGATCGCGC CGGGCAACGG CGCCAGCGCA
CTGAACAACG CGTCGGGCAC CATCGTGACG AAGGGACAAG CCATCGTCAA CGCGGCTACC
TGGAACAACG CGAGCGGTAT TCTTGCCGCG CAACGCGGCC TCAATGCGAC CATCGCCGGC
GACGTGAACA ACGCGCAGGG TTTGCTGCGA TCGGACGCGT CGCTGTCGTT GAAGAACGGC
GGCGCTCTAT CAAACCGAGG CGGCCACATC CAGGCCGGGC AATCGGTGGC GGGCGACACC
AGCACGCTCG CCATTCAATC GGCCTCGATC GACAACGCGG ACGGCGCCAT TGTCGACCTC
GGTGCCGGCA AGATGACGGT GCAAGGCGGC AGCCAGATCG CCAACAGCCA CGCCGGCGGC
GTAGCGGGCA TGGGCGCGAT TACCGGCAAC GGTGATGTGA CGGTCAGCGC CGCGTCGATC
AGCAACACGC AGACCGGCCA GCTCAGCGGC GCATCGCTGC ATGTTCAAGG CAACACCTTG
GACAACAGCG GCGGCACGAT CGGCAATGTC ACGAATTCGA ACGGCGACGT GGACGTTACG
ACGACCGGTG CGATCACCAA CACGAACGGT CAGATCAGCT CGACGCACGA CCTGTCGGTC
ACGGCGGTCA CGCTGCAGGG TGGCGGCACC TACAGCGCAA CGCACGACGC GAACGTGAAT
CTGCAGGGCG ATTACACGGC AGCGGCCGAC ACCCAGTTCA ACGTCGGTCA CGATCTTGCC
TTCACGCTGC CCGGCACTTT CACGAACAAT GCGAACCTGC AATCGGTCAA CAACCTGAGC
GTCAACGCGG GCAACATCGT GAACGTGGGC GCGCTGGCTG CCGGCGGCCT GCTGTACACG
CAATCGACCA ACCTGACCAA CACGGGCGCG CTCGTCGGCG CGAGCGTATC GCTGAACGCA
ACGAACACAA TTTCGAACCT CGGCCCGACC GCGCTGATCG GTGCATCGGA CAGCAACGGG
ACGCTCGAAA TCCTCGCGCG CGACATCGAG AACCGCGACG ACACGACGGC GACCGATTCG
ATGGCCACGA CCGCCCTCTT CGGCATGGGC AAGGTCGTAC TGGCAGGCGG CAAGGACGCG
AGCGGCAATT ACACGAACGC GGCTCTCGTC AACAACGTGT CGGCGTTGAT CCAGTCCGAA
GGCGATATGG AGTTGCACGC GGACAAGGTG ACGAGCACGC GGCGCGTGAT GAAGACGTCC
ACCAGTCAGA TCGATCCTGC GTCACTCGCG CAGTTCGGCA TTTCGATCAG CGGCCGCACT
GGCCAGGTTG GCGTGAAGGA TCCGGACAGC ATCGGCGGTG TCTACACCGA TCCGCCTCAT
GGCGGTCAGT GGAACAGCAC ATATCAGTTC ACGACCTACT ACGCGGACAG TGCGACGGCG
ACGACCGTGA CAGACATCAG CCCGGCCGCG CAGATCGTGT CCGGTGGCAA GATCGATGCG
TCGTCGGTCG GTACCCTGCA AAACTACTGG AGCAACATCG CGGCGGTCGG CGACGTCAAG
ATGCCGGGTC ACTACGACGC TGACGGTTGG GCGGCATCCG GCCAGAAACT GCCGGGCGTG
ACGGTTTCCT ACTCGGGGCA GTATCACTAC AACAACTACG ACAATTCCGA ACACGACTGG
CAGTTGCCGT TCGGCAATGC GCCGTTCGTC GGCAGCCGTC CGGGCGGCTA CACGCAGGCG
GCACCGGCAT CCATCAAGGA ATACACGCTG CCAGGCTATT TTTCTACCCT GAGCTCGAAT
GGCACGATTT CGGGCACGGG CGTCAGCGTC AGCAATACGG CGGCCAACGC ATCGATTCCG
TCGCTCGGCT TGCTGCCGGG GCAGTCGGTG CCGGGTCTCA CGCCCACCAA CCTGAGCGGT
AATGCGAGCG GCGCGAAATC GGGAGCGTCG GCAGTGCACG GCGGTCCGCC GGCGCCGGTC
GATCCGATCA TCGCCAGCGC GACGGCGCTG AACGTGCTGA ACAACCTCAC GATTCCGCAG
GGCGGGCTGT ACCGGCCGAC TACCGCGCCA AATGCGAACT ACGTAATCGA GACGAACCCG
GCGTTCACGA ACCAGAAAAA CTTCATCTCG AGCGACTACT TCTTCGGACA GCTCGGCGTA
GACCTCACGC ATATTCCGAA GCGTCTCGGT GACGGCTTCT ATGAGCAGCA GCTCGTGCGC
AATCAGGTGA CGGCGTTGAC CGGCAAGGCG GTGCTGGGGC CGTATGCCGA CTTGCAGACG
ATGTACCAAT CGTTGATGAC GGCGGGTGCT GATCTGTCGA AGTCGCTCGA TCTGCCGATG
GGCGCGAGCC TGTCGGCCGA GCAGGTGTCG AAGCTGACCA GCAACGTGAT CATGATGGAA
ACGCGCGTGG TCGACGGTCA GTCGGTGCTT GTGCCGGTCG TGTATCTCGC GCAGGCGAAC
CAGCAGAATA TCAACGGACC GCTGATTACC GCGGCGAACA TCGACTTTCA GAACGCGCAG
TCGTTCACGA ACAGCGGCAC GATTAAGGCG GACAACACGC TGGCCATTCA GGGCAAGCAG
ATCGACAACG CGTTCGGCGC GCTGCAAAGC GGCGGGCTGA TGTCGCTGAA GACCGAGAAC
AACATCGACC TGACGTCGGC GAACGTGAAG GCCGGCAGTC TGCAGCTGGA CGCCGGGAAA
GACCTGATTC TCGATACGGC AACGAAGACG AACACGCGCG TGAGCCGCGA CGGTGCGACG
AGCGTGGTGA CGACACTCGG GCCGACTGCC AAGCTGGACG TCGCGGGCAA CGCGTCCATC
GTCACCGGTG GCAACTTCCA GCAGAACGCG GGCAACTTGT CGGTCGGCGG CAATCTCGGG
ATGAACGTCG GCGGCAACTG GGATCTCGGT GCGGTGCAGA CCGGCGAGCA CAAGATCGTG
CAGCGGGCGA ACGGCGTGTC GAATACCGAC ATCAACAAGG TCACCGGTAG CTCGGTGACG
GTCGGTGGAC AGTCGAGCAT CGGCGTCGGC GGAGACCTGA CGGCAAAGGG CGCGCAGATC
GACCTCGGCC AGGGCGGCAC GATCGCGGCC AAAGGCAACG TGACGCTCGG CGCGGCGAGC
ACGACATCGA CGGTGAACAG CAACAGTTCG GGTAGCGACA GTCACGGCAG TTACGCCGAG
ACGCTGCATA CGTCGGATCA GGCGCTCACC GGCACGACGC TCAAGGGCGG CGACACCGTT
ACGATTGCGT CAGGCAAGGA TCTCACGATC AGTGGCAGCA CGGTCAGCCT GGATAAAGGC
AATGCGAATC TGATGGCGAG CGGCGATGTG AATATCGGTG CGGCAACAGA GACGCACGAG
CTGAATTCGC ACGAAACGCA CAGCCACAGC AATGTGGTGA GCGGCGCGAA GATCGCGAGC
GGGATCGACC AGACGGCGAC CTATAGCCAG GGCAGCACGG TATCAGCCGA CGGCATCAAC
ATCGTCAGCA ACCGCGACAT CAACGTGACC GGCAGCAACG TCGTGGGTAC GAACGACGTC
ACGTTGCAGG CTAAGCGCGA CGTCAATATC AAGACATCAC AAGATACGAC TCAATCGTCC
AGCTACTACG AGAAGAAGGA ATCCGGCCTG CTCACAAATG GTGGTCTGTC GGTCACGGTG
GGTTCGCGCT CGACCGCACA GCAAGACCAA ACCAGCTCGG TGACGAATAA CGGGAGCGTG
ATTGGATCAT CGCAGGGCAA TGTCACGATC CAGGCCGGCA AGGATGCGAC GATTACGGGT
AGCACGATTG TTGCGGGCCA GGATGTCGGA ATCGCCGCTC AGAACGTGAC GGTGAATGCC
GCGTACGACA CCTACAAGGA CGCGCAGTCG CAGCAATTCA GCCAGTCGGG ATTGAGTGTC
GGACTGGGCG GCGGTCTCGT CGGGCTCGGG CAGTCGATGG CGGGCGCGGT CCGCCAAGGC
GAGCAGTCGG GCGATTCGCG CCTCGCGGCG GTGCAAGCCG TGGCGGCAGC CGAACAGGCA
TACCAGAACC GTGGCGGGAT CAAGGATGCC GCCAATGCCC TGTCGAACGG AAACGTGAGC
GATGCCGCAA AGGGTGTGCA GGTGCAGCTA AGCATCGGAT CGAGCCATAG CAGCAGCAAT
TCGACGACAT CGATCTCGGA TGCGAAAGGT TCGTCGATCA TCGGTAACGG CAACGTATCC
ATTATCGCGA CCGGCACGCC GGACGCGAAC GGAAACGCCC AGGCGGGTAC CGGGAACATT
GCGATGACCG GAGCGTCGGT GCTCGGCAAG AACGTCGTGC TCGATGCCAA CAACGCGATT
ACGCTGCAAA GCGCCCAGAG TACGGAGCAG AATACGAGTT CGAACAGTTC GACCGGCTGG
AATGCTGGCG TCGCGATCGG GGTGGGAAAG AACACGGGCA TCAGCGTTTT TGCTAACGGC
TCGAACGCAC ATGGTCAAGG CAACGGGGAC AGCGTTACGC AGACCAACAC GACGGTGGCG
GCTGGCAACA ACCTGACTAT GAAGTCGGGC GGCGACACGA CGCTGTCGGG CGCGCGGGTG
TCGGGCGATA AGGTCAAGGT GGATGTCGGC GGCGACCTGA CGATGACGAG TCTTCAGGAC
ACGTCGAACT ACAGTAGCAA CCAGCACAAC ACGGGGGTGA GCGGCAGCTT TACGTTTGGC
TACGGTGGTG GCGTCGATGC ATCGATCGGC CACACCAGCA TCGACGCGAA TTATGCGTCG
GTGAATCAGC AAACCGGCAT TGTGGCGGGT AAGGAAGGTT TCGACGTCAA TGTGGTGGGC
CACACCCAAC TCAATGGTGC ACAGATCGCG AGCGCTGCGC CGGCCGATAG CAACACGCTG
ACAACCGGCA GCCTTGGATT TACCGACATC CAGAACAAGA TGTCGTATTC AGGCTCGTCG
GAAGGCTTTT CGACGGCGGG TGGGCCGAGC TTCGCGCAGA CTGGCGACAG CGCGAGCGGT
GTCACACACG CTGCGGTGAG CCCGGCAAAG ATCGTCGTGA AGGCGGACGA ACAGAACGGC
ACGGACAGCA CGGCCGGCCT GTCGCGCGAT ACGGCAAACG CGAACCAGAC AGTTGAGAAT
ACTTTCAACC TGCAGAAGGT TCAGAACAAT CTGGCGTTCG CGCAGGCGTT CGGCAAGGCC
GCAACTTTCG CCGTAGCGGA AGCAGCGACG CAGCTCGAAA ATAGCAGTCC GCAGATGAAG
GCCTTGTTCG GCGAAGGTGG TGCCGGTCGT GACGCGTTGC ACGCCGCGGT GGCGGCAATC
GGGGCTGCGC TGTCAGGCGG GAACGTTGGC GGGGCGATTG CAGGTTCTCT GGCGGGAGAT
GTGTTGCAGT CTCTGGCGCA GCCAATCATT GATCAGACGG TAAGCCAGTT GCCGCTGGAC
GCGCAAACCG CTGCGCGGAA GGCTCTGAAT GAGATCGTGG CGACAGCCGG CGGTGCGGCG
GGCGGCGCAC TCGCGGGTGG TGGCTCGTCG GGCGCTCTCG CCGGAGCGGG GTCGGCCAGC
AACAACGAAC TTTACAATCG GCAACTTCAC CAGAGCGAAG CGGACAAGCT CAAGCAGCTT
CAGAAAGGGC AAAGCCCCGA GCAGCAGTAC AGACTCGCAG CCGCCGAATG TGCGCTCGTG
CATTGTGCGG ACAATATTCT GGACAATGAC CCGAATAAGG CCGCGTTGCA AAAGATTCAA
AACGACGGCG CGCAGTACAC GTACGAACAA AACGTGCTGA AGAAAGCCGG TGCATTCGAC
GGCTATGGAG ACCTGGACCG CTTATCCGAC ACCTACGATC GAAACCAGGT CTCCAATCGT
CTTGTGGGTG CTGTTCAGGG CGTTGGAAGC GTGGCGGTTG CGGCTGGCGC AGTGACGGGC
GGATGCGCGT CAGTGGCAGG CTGTGCGCTA GGTGCGGCCA TTGCGGTAGG CGCGGTCGAC
TATGCAAAGT CAGGCTTCAC CCAGATGATG TCGGGCAATT TGACGCCGAC CTATGGCGAG
CAAGCCTTGC AAAGCCTCGG TATGAGCCCA ACGGGAGCGG CACTGGTATA TGGTGCCCTT
AATCTCGGCG GAGCTGCTGC TCAAGTCGTG GTAACTGGGC GTGCTGTCGA TGCTGCTGCA
GCGGCGAATG CCTGGGCGCG AGGAACATAT AATGGATCAA GTAGTGCTCA ATATTCTGGA
GAGCTCTATC GCTATACGAT GCCGGAATAC GCCGAGGGAA CGTGGAATCT CTACAAAGGG
AACATCGATG CTAATCATCG ATATTCGCCA CCAGGAGTTG GTGCAATTTA CGCTGGAACC
ACCCCTCAGA CGTCGCTGGC AGAAATCACT AGCTATGAGC CTCTGAAGGG GCAGGTATTG
GTGACCAAGA ATTTCGTTAT TAACAACGTC TTGGATTTGA CGAATCCGGC AGCGCGGCAA
GCGTTGGGAG TGACGGTCGA CCAGCTTACG CAGACCAGTC ATGGTGGTGC TGCTTACGAC
GCTACGCAAG CAATTAGCAC ATGGGCCAGG GAGCAAGGTT ACCAAGCAAT TTTGGCTCCC
TCTGCGCAAT TGCCAGGTGG TGTCAACTTG ATTTCATTTA AATCGTTGGG GAAACAGTAA
 
Protein sequence
MWFPFNDLAI VDSHSFCVSV GWRSIINERT RAASTRMNKD THRLVFSRLR GMVVAVAETA 
TAEGKSATGE ARARPRSSTS GVRVVVAGVA ALGMLPTLSD AQIVPTPGTS THVIQTPNGL
PQVNVAAPSG AGVSVNTYNQ FDVSRAGAIL NNSATMVQTQ QAGWINGNPN YSAGQAARII
VNQVNSPNPS QIRGALEIAG SRAELVLANP SGIYLDGASF INTSRATLTT GVPYYGADGS
LAGYNVNRGL VTVAGAGLNA ANIDQVDLIA RAVQVNAAVY AKNLNVIAGA SQVNHDTLAA
TPIAGDGPAP SVAIDVSQLG GMYSDRIFLA SNENGVGVGN AGTIAAQAGD LTLQSNGRLV
LTGKTTASGN LALSAAGGIQ NSGTTYAQQS LLASTSADLA NSGTLAAQQN TTVNAGSVNS
TGTLGASVNN DGSVTHSGDL NLTASGQLTA AGQNVAGGNA SLAGGSVNLA GSQTAANGNL
SLNATAGDVN LSNATTSAQG TIQANASGTV INDHGSLSSG GGTTLTGGNL SNQSGKVSSQ
GPLSVNVAGQ IANQSGELVS ESTADLSGDA IANNQGTLQS AAGMTVAGAS LDNTAGRITS
LNGDGLSVTT SGQLTNVAGT TANGAQGGVI GGNGDVSIQG ANVVNRGAIT SNTNLRVSGQ
AVDNGGGTLQ AAQKVAVDAG ARLINNGGSI VGRTAALTGT TLDNSAGTVQ ADQMSLNATD
LVNHGGTITQ TGAGAISVNV SDMLDNSSGG TLQTNSTDLT LAPVALVNDG GTITHAGNGT
LTLGSGSGSV SNVGGAIASN GRVVAQTGAL NNTSGSINAQ NGLTATVAGT LNNANGKLLS
NTDLSVTSGV LANDGGQIGA GTNATIRTGS MTNQGGSIVA PNLSVTADST LDNSGGKLET
NQLTLTAANL TNHGGTITQY GTAAMGVNVS GTLDNSAAGV IQTNSTDLTL TPAELNNAGG
TITHAGTGTL TIAPGNGASA LNNASGTIVT KGQAIVNAAT WNNASGILAA QRGLNATIAG
DVNNAQGLLR SDASLSLKNG GALSNRGGHI QAGQSVAGDT STLAIQSASI DNADGAIVDL
GAGKMTVQGG SQIANSHAGG VAGMGAITGN GDVTVSAASI SNTQTGQLSG ASLHVQGNTL
DNSGGTIGNV TNSNGDVDVT TTGAITNTNG QISSTHDLSV TAVTLQGGGT YSATHDANVN
LQGDYTAAAD TQFNVGHDLA FTLPGTFTNN ANLQSVNNLS VNAGNIVNVG ALAAGGLLYT
QSTNLTNTGA LVGASVSLNA TNTISNLGPT ALIGASDSNG TLEILARDIE NRDDTTATDS
MATTALFGMG KVVLAGGKDA SGNYTNAALV NNVSALIQSE GDMELHADKV TSTRRVMKTS
TSQIDPASLA QFGISISGRT GQVGVKDPDS IGGVYTDPPH GGQWNSTYQF TTYYADSATA
TTVTDISPAA QIVSGGKIDA SSVGTLQNYW SNIAAVGDVK MPGHYDADGW AASGQKLPGV
TVSYSGQYHY NNYDNSEHDW QLPFGNAPFV GSRPGGYTQA APASIKEYTL PGYFSTLSSN
GTISGTGVSV SNTAANASIP SLGLLPGQSV PGLTPTNLSG NASGAKSGAS AVHGGPPAPV
DPIIASATAL NVLNNLTIPQ GGLYRPTTAP NANYVIETNP AFTNQKNFIS SDYFFGQLGV
DLTHIPKRLG DGFYEQQLVR NQVTALTGKA VLGPYADLQT MYQSLMTAGA DLSKSLDLPM
GASLSAEQVS KLTSNVIMME TRVVDGQSVL VPVVYLAQAN QQNINGPLIT AANIDFQNAQ
SFTNSGTIKA DNTLAIQGKQ IDNAFGALQS GGLMSLKTEN NIDLTSANVK AGSLQLDAGK
DLILDTATKT NTRVSRDGAT SVVTTLGPTA KLDVAGNASI VTGGNFQQNA GNLSVGGNLG
MNVGGNWDLG AVQTGEHKIV QRANGVSNTD INKVTGSSVT VGGQSSIGVG GDLTAKGAQI
DLGQGGTIAA KGNVTLGAAS TTSTVNSNSS GSDSHGSYAE TLHTSDQALT GTTLKGGDTV
TIASGKDLTI SGSTVSLDKG NANLMASGDV NIGAATETHE LNSHETHSHS NVVSGAKIAS
GIDQTATYSQ GSTVSADGIN IVSNRDINVT GSNVVGTNDV TLQAKRDVNI KTSQDTTQSS
SYYEKKESGL LTNGGLSVTV GSRSTAQQDQ TSSVTNNGSV IGSSQGNVTI QAGKDATITG
STIVAGQDVG IAAQNVTVNA AYDTYKDAQS QQFSQSGLSV GLGGGLVGLG QSMAGAVRQG
EQSGDSRLAA VQAVAAAEQA YQNRGGIKDA ANALSNGNVS DAAKGVQVQL SIGSSHSSSN
STTSISDAKG SSIIGNGNVS IIATGTPDAN GNAQAGTGNI AMTGASVLGK NVVLDANNAI
TLQSAQSTEQ NTSSNSSTGW NAGVAIGVGK NTGISVFANG SNAHGQGNGD SVTQTNTTVA
AGNNLTMKSG GDTTLSGARV SGDKVKVDVG GDLTMTSLQD TSNYSSNQHN TGVSGSFTFG
YGGGVDASIG HTSIDANYAS VNQQTGIVAG KEGFDVNVVG HTQLNGAQIA SAAPADSNTL
TTGSLGFTDI QNKMSYSGSS EGFSTAGGPS FAQTGDSASG VTHAAVSPAK IVVKADEQNG
TDSTAGLSRD TANANQTVEN TFNLQKVQNN LAFAQAFGKA ATFAVAEAAT QLENSSPQMK
ALFGEGGAGR DALHAAVAAI GAALSGGNVG GAIAGSLAGD VLQSLAQPII DQTVSQLPLD
AQTAARKALN EIVATAGGAA GGALAGGGSS GALAGAGSAS NNELYNRQLH QSEADKLKQL
QKGQSPEQQY RLAAAECALV HCADNILDND PNKAALQKIQ NDGAQYTYEQ NVLKKAGAFD
GYGDLDRLSD TYDRNQVSNR LVGAVQGVGS VAVAAGAVTG GCASVAGCAL GAAIAVGAVD
YAKSGFTQMM SGNLTPTYGE QALQSLGMSP TGAALVYGAL NLGGAAAQVV VTGRAVDAAA
AANAWARGTY NGSSSAQYSG ELYRYTMPEY AEGTWNLYKG NIDANHRYSP PGVGAIYAGT
TPQTSLAEIT SYEPLKGQVL VTKNFVINNV LDLTNPAARQ ALGVTVDQLT QTSHGGAAYD
ATQAISTWAR EQGYQAILAP SAQLPGGVNL ISFKSLGKQ