Gene Rpic12D_5058 details


Gene Information

Locus tag: Rpic12D_5058
Symbol:
ID: 8017347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ralstonia pickettii 12D
Kingdom: Bacteria
Replicon accession: NC_012855
Strand:
Start bp: 342273
End bp: 350633
Gene Length: 8361 bp
Protein Length: 2786 aa
Translation table: 11
GC content: 63%
IMG OID: 644828746
Product: filamentous hemagglutinin family outer membrane protein
Protein accession: YP_002979946
Protein GI: 241589921
COG category:
COG ID:
TIGRFAM ID: [TIGR01222] septum site-determining protein MinC
            [TIGR01731] adhesin HecA family 20-residue repeat (two copies)
            [TIGR01901] filamentous haemagglutinin family N-terminal domain
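
The coordinates and lengths listed in this record can be cross-checked against each other with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 342273
end_bp = 350633
reported_gene_length = 8361      # bp
reported_protein_length = 2786   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 8361 0 2786
```

Under these assumptions the computed values agree with the reported gene length and protein length.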


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCTGC CTGGCCAGAT GCAGATCGCT GGCGCGCATG CGTCGGGCGA GACTGCAGCA 
CATCAAGAAC AAAAGCAGGT CAATGAGACA GATGCTCATA CGAGCCGTCA TGATCGCGCA
CGTCATTCGC GGCTGATTCG CTCAATCGCG GCTGTCGTCG CATTGATCAG CTGGTTGGGG
CCAGTGCAAA TCACCTGGCA GGCCGCGCAA CGCAGTGCCG CCATCTTCGC GGTGCGTACC
GGGTCTCCCT TTGACGATTT CGGCAACCAC GTTGCATCGT GGCGCACAAC GGGTCGCTTG
CTCGTACGCT GGGGAATACA GCAGGCCGAA GCTGCGCCGA TTACCGATCC CACTGCGCCG
ATCCGCTTCA CGCCCACCAT CACGCAGACC ACGGCGGGTG TCCCGACGAT CAACGTGACA
ACTCCGAATG CTTCAGGGCT TTCGTACAAC CTGCTGCAGT CGCTGACAGT GGATAGCGGT
GTCGGTTTGG TGCTGAACAA CTCGCTCACG GGCGGCGGCA CTTTCCTGGG CGGGAATGTT
TCGGGCAATC CGAACTTGGC TGCATCTGGG CCGGCTTCGA CCATCCTCAC GCAAATCACC
AGCACGACGC CAATCAGGAT AAACGGGGGC GTTGAGGTTT TTGGGGCGCC GGCATCCGTC
ATCTTTGCCG CCCCCGGGGG CGTCTACTTG GCGGGAGCTG GATTTACGAA CACGCCCAGC
GTCACGCTTG TCACCGGCAC GCCGCAGTTT CTGAACGGCA GTGGTAGCCC GGTTGGCTTC
GACCAAGCGA CCGCCATAGG CTACACCGTT AACAGCGGCC GGGTCCAGAT CGATTCGGCA
GCCGGCACCA CCAATGGCGC GGGCATCGAA GGTACGGTTG GAAGCCTGAA CGTCATCGCG
CAAAGCATCG GCGTCAATGC TGCCCTCTAT ACCGGGCAAC AACTGAACCT CATCGCCGGT
AACCAGGTGG TCACGCCGGT GGCCGCCGGT ACCGGTCGCG CAGGTTCGGA TTGGCAGGTC
AGCAGCAATG GCCCCAACAC CGCCGCCAAC AGCCCGAGCG CACAGAACGG CCTCGCTATC
GACGCCACCG CGTTCGGTGC AATGACGAGC GGACAGATCA AAATCGTCTC GACCGCACAG
GGACTTGGGG TGCGCGCCGC CGGCGACATG GCTGCGAACA CGTCGAACGT GAACATCGAC
AGTAACGGTA ACGTTTCGGT CGGCAACGTC TACGGCCAGC AGAACGTTGG TATCACATCC
GCGGGTTCCG TCTCCACCAC GGGTACGGTC AAGGCACTGC AGGATGTTTC CGTTTCCGCA
AACGGCGATG TCAACGTTGG TGGTGCCGCG CAGGCCGGCA ACAACCTGAG CCTGAACGCC
GGCGGCAATT TGACCGGCCC AGGCAATCTG TCTGCGGCCA AGACGCTGAC CGCAACCGCC
GGCAACAGCG TCAACCTCAC GGGCGATCTG AACGCCGCCA ACATTGCCGT CACCGCGCGG
GGGCAGGATG GTACGGGCGA TATCGCCCTG GGCGGCAAAG TGTCGTCGCC GAACACCATC
GACCTGAATG CCGCGCGAGA TCTCGCACTG GCAGGGCAGG TGACCACTAA CGGTGACATC
CAGGCGACGG CAGGCCGCGA TGTGGCTATC TCCGGCACGA CACAAAGCTC AGGCGCCACC
ACGCTCAAGG CCGGCCGCAA CATCGGCGTG ACCAATACCG GCGCGGTGAC GGCCAGCACC
ACGACGACCG CCACAGCAGG CGAGAACATC AACGTCGCTG GCGCGATCGG GTCGCAAGGC
AATCTGCAAT TGGCCACGAC CAATGGCGCC ATCACGACCA CCGGCAGTCT GGCGTCCAAC
GGAGCCATCA CCGCCGACGC GGGCGGTGCC AACGGCGATA TCGCCCTGGG CGGTACCGTC
ACGACGCCAA ACGTATTGAC ATTGACGGCG GCGCGCAATG CGACAGTCGG TGGCCAGCTC
ACGACAGGAA ACAACCTGCA GGCAAGTGCC GGTCTGGATC TCGTCGTCTC TGGAACCACG
CAGAGCGTGG GGGCGACCAC GCTCAGTGCG GCGCAGGATC TCACGGTTGC AGGTACTGGC
TCAGCCACGG CTGGCACAAC GCTGTCGGCC ACAGCAGGCC GTAACCTCGC CGTGTCGGGC
GGCACCGCCT CCGGTGGCGA TACCACCTTG AACGCTACCA GCGGAACGCT TTCCACCTCG
GGCACCGTGC TGGCGGGCGG CAATGTCGCG GCCACCGGCC AAGCCGGCAC GACGCTGGGC
GGGACCGTCT ACGCGACCCA GAGCGTCACC GCCCATTCTG GCGTCGGCAC CACCAGCGCG
ACCGGCAATG TGATTGCGCA CAACGGTAGT GTCTCCCTCA CGGGCACCAA CGTCAGTGTT
TCTGGCGGCA CGCAATCAGC CACGGACACC ACCCTCACTG CGACACAGGG CAACGCCACC
GTCGACGGTC AGGTCGCCGC CCTGGGCAAG CTCGGGGTTT CCGCAACCCA GGACATCACC
GGCAAGGGAA GCACAGCGAG CGTCGGCGAC ACCACGCTCG CCGCAGGCAA CAACATTGTG
CTAACCGGCA ATAGCCAAAC GGCTGGCAAC CTGACCGCGA CCGCAACCAA TGGCGTCTCC
ATCGCGGCAC TGCCTGTCGT GGGCGGAAAC GCCAGCCTGA GCGGTGCCAA TGTGTCGTTG
GGCAGTAACG GCACGACCAG CCAGGTCAAC GGCACGCTGA CCGCAACCGG CACGCAGAGC
GTTGTCACGG CCGGCACCAT CGCCACCAGT TCAGCCAACC TGACCGGCGG AACCGTGACG
AATGCCGGCA CCGTGACCGC GTCCAATGCG CTCACGGCTA CCGGCACCAC GGTCACCAAT
ACCGGCACAC TCGGCGGCGC CACGACAACC GTGCACGGCA CCGACGTGGC CAACTCCGGT
CTGATCGGCG GCCAGACCGT CAACGTGACG GCCGACAACA GCCTCTCGAA CCAGAACGGC
ACGCTGCTGG GTACGCAAGC GCTGAACGTC ACCGCCAACG TGCTGACAAG CAACCAAAAC
GGCGTCATGT TTGCGGGTAG CCCGTCGGGT TCGACCACCA GCGGCGACTT GACCGCCACG
GTCAAGGGCG GCAACGGCAG CTTCAACAAC GCTGGCGGGC AAATCCTTGC CGCCAACAAT
GCCACCCTGA ATCTGCAGAA CCAAACTATT GATGGCGCTA GCAACCTGGG CACGGTCAAT
GCGGGAAATC AACTGACATA CAACGTCGGG GCCGTAGCCA ATACGGGTGC GTGGACACTG
GGCGGCAAAA GCGCCACGAT CAACGCGGCC AATGGCATCA GCAACACGGG TTCGATCCAG
CATGCGGGCG ACCTGACGCT GTCCACGCCG GGCGCTGTAA CGAACAGTGG CCAGATCATT
GCGGGCAAGA ACCTCTCGGT TTCGGGCGGC ACAGTGAGCA ACGCCGCGGG CGCCACGCTG
CATTCAGACA ACGATCTGTC CGTGAGCGGT GCGACCACCA ACCGTGGCAC GGTCGAAGCG
CTCAACGATG TGAACATCAC TGGGGCTGGC TACGACAACG CAGGCGCGTT GACTCAGGCC
AACCGCGACG TGAACGTCAA CGTGTCGGGC AACGTCCTGA ATCAAGCCGG CACGATTGGC
GCCAAGCGCG ACGTGAGCCT GACCGCGAAT CAGATCATCA ACGACGCGAC CGCATCTACG
GGCAGTGGCA CCACCGTCGT CACCGGGCAG GAGGTTAATC CTACTTATCT GTCATCGGTG
GTGATTGGCA CCAAACAGGT TCAACTGCCG ATCCCTGGAG CCGGTTCGGC AGACGGCGGT
CCGGTGTTTG GACCGTATCG CTTCGCCGTC ACGATCGGGG ACTTGAAGCC GGACGCCAAT
GGGGTGATTT CGGCATATCA GGGTGTCGAG ACCTATACCC CAGCCAATGG CGGCGGCCCC
AACGGCAACA TCACGCCATC GCTGAACCTG TGGCATTTCC TGAGCCCGGA TACCACCAGT
TTCCCGAGCG CCGACTATCA GCCTGCCCCT GGGACGCCAA CGCCGTTCAT CACCCTGCCT
ACCGTCACGC GCACAGAGAC GACAACGCAG GACGGTGTCA GCGGGATCAT TCAGGCGGGT
CGCAATCTGG CTGTGACAGC CTCGTCCCTG TCCAATAACG GCGGACAGAT CAGCGCAGCC
AACGACATCA AGATGGCCGT CGGTACGCTC TCTAATGGAG CGTCGGCGGG TTCCTCCAAG
ACCATCACTG AGTCCATCAG CCAAGGCGCC CTCAACGCCT TCATGCAGCA ATTGGCGACG
CAGTTGGGTT ACAACCCCCA GTCGGGGGGC GGCTATCCGC TCGCTGTACT CAACGATGGC
TGCATGCCGG GTGAGTGCAA CAACAGCGGC AACGGCAATG GCCCACCCCC AGGGCCGGAA
GCGCCGCACT GGGTGTGGTT CGGTCCGTCT TCCAATATTG GGTTCAATGG CAACGTGAAC
ATCCCCGGCC TGGCCGGCTT CACCGCCACG CCGCCCGCCC CGCAAACCAC GGTGCAGCAG
ACCGCAGGCA AGCAGGGCGT CATTGCCGCT GGCGGAAACA TTGACCTGGC CCAAGTCGGG
ACGCTGAACA ATGGCGGCCA GATCGCCGCC GCCGGCAATA TCACACTCGG CGGCTCGGTG
AACAACGTTG GCCAACTGAA CGTCAACCGC ACGACGCTGC CCGGCTGCGT CGGCAACCCG
GCAACGTGCA CCGACCCAGG CGTCTCCGGT GGTGGCAGCC CGTACCACGA AGTCATCGAT
CCGAAGCAGC AGGTGGCGAG CATTGTGGCA GGCGGCACGC TCACGGCGAA TCTGTCACAG
CTAACCAACC AGACGGGGAC CGTTGCAGCG GCTGGGGCAG TGCAGATTAC CGCGCCCACG
GTGACGAATA CCGGTGGCAC GATCCAGTCG ACCACAGGGT CAGTGACAAT CAATGCCGCC
AACGGTCTTA CGAACCAGGC GGCACCGACC ACCACGGTCT ACGCGAGCCA CGGCTCGGAC
ACGTCGGTTT GCGGCAAGGT CGGCGGTACC GCATGCGCCA CGGCCACGCA GACGGCCACG
GGCGATGCCG GCATGATCCT GGCGGCCACG GACCTCAACA TCACCGCGGG GTCCGTGCGG
AATAACGGTG GCGCCATTGT CGCCGGTGGG AACAACACGA TTACGACCGG CAGTTTCGAT
AACAGCCCGG TTTTCCTGCG GCAGTATTAC CACTGGTCGT ACTACAACCA GAACTCAACA
GCCAGCGACA CCTGGGGCTG CGATGCAGCA GGCGATATCT CCGGTTGCCA GCGGGTGTTC
GGCAGCAACC TGACCAATGG CTGGAACGGT ACCGCTGAAA ATGCCCCGAC CATCGGGCAA
CTGAACTCCT ATGTGAGCGG TGGCAACCTC GCCATCCGCT CGGGTGGCGC CCTGATCAAC
AGCGGCAACA TTGAGGGTAC GGCGATTTCG CTCTCGGGCG CCACGATCAC GAACGGCATC
ACCAATCCGT CGATCCAGAC CCCGCCGTCG ACCAGCGGCC AGCAAGTGGT GAACTTGGGG
CCGATCGGCA CGCCCAATGC CCAGTTGCCC GCGACGGGCA CGCCGGACAC CTTTAGCGGC
CCCACAACGG TGCTGCAGAA GGGCGTGCCG AACCCGACCA ACCCAGGCAC CTCGAACGGC
CAGTGGCAGT TCAACCCTGT CGTGGTGACT GAGCAAAGCG GTCAGACGGT GACGTGGCAC
TTCAATGCGC CGCAGCTTGG CGGCGCGCTG ACGTCGCCGA CGGCGACGGG CTCGACTGCG
CAATACCTGA CCAGCAGCCC CGCGACAGCG GTGTTGGGTG GTGTGGGCCC GGCCACGCTC
ATCAATGCGC TGCCGGCCGA CTTGCGCCCG GGCAGCACGC CGTTCTACTA CGACCCGCAG
GCGGAAAACG CGCGCCTTGA TCAGCAGGCC TTGGCTACGA CGGGGCGCAC CAGCTTCGTC
AATGGCCTCA CGTATGACAA TCAGAACCAC CTGACGGTCG ACGACCAGCA GAAGCTGATC
CTGTATCAGA ACGCGGTCGA CTACGCGAAG GCCCACAACA TTCAGCTTGG CCAGGCGCTC
ACGCCGGAAC AGCTTGCCGC GCTGGACAAG CCGATGCTGT GGTACGTAAG CCAGTCGGTG
CCGGACCCGA GTTGCATGAG CGGCGCGTGC CCGATGGTGA CGGCGTTGGT GCCACAGGTG
TACCTGCCGC AGGGCTACAG CGGCATTGAG CCGGGCGGCT CGATCGTGGC AACGAAGAGC
CTGGACCTGA ACGCCAACAG TCCAATCCAG AACACCGGCA CGCTGGGCTC CTACGGCACG
CTCACGAGCA ACACGACGAT CGTGAACGAG CAGCGCGCAG CCGATATGAC GGCGGCGTGG
CAGCCCATTG AGGACGGCTG GGCGCGCGTC ACAGGCCAGC AAGGCCAGGC GAACAGCGGC
TTCGTGTTCG CGGCCAACGC GGCGGACATC GCTGGCCAAG TGCAGAACAT CAACGGCGTG
CTGGCGCAGC TCAATGCTGA CGGCAGCATG AGCGCAGCGG AATCGGCGCG CGTGGCGCAA
GCTGTACAGG CTGGCATGGC GGCAGTCACC AGCACGCACA CCGACACGTT TGTGCGTGCG
CCGGACGTGA TGGGGGAAAT CTTCTCCAGC GTGGTGATGG TGGCCATCGG GATCATGACC
GGTGGCGCGG CGATGGCGGC GTATGCCGGG GTGGGCGCGA CGCTGACGGT GGGCGAAATG
ATGGCCCAGG CCGCGATTAC GTCGATGACG ACCAACGCCA TGCAGCAGGC CAGCAGCGGC
ATGGGTTTCA GCTTCGGGGC ACTGGTCAAG GCGGGTGCAA CTTCAGCGCT GACTGCGGGG
CTAACGCAAG GCATCACCAT CAATGCAGAC GGCTCGCTTG GCATGGTCGA TGGCCTGGGC
TCGCTACCGT CGGATCGCAG CATTGCGGCC TTGTCTGGTA CGAAGGTAAT CGGCGACGGC
CTGACACAGG CGGGTGCCGC TACGGGGACA CTCGGCCAAC AAATCGCAGC GCTTGCTCTG
GACGCCACCA TCAAGGCGGG CGTCAACACG GGTATCAACG GTGGCAGTTT CCTGACGAAT
CTGCGGAACA GCGCGGTCAA TGACCTTGCG GCCGTGGGTG CGTACAGCAT CGGCAACCTG
AATGAGAACG GCGCACTAGT CGGGCCGGCG TATTACGCCG CGCATGCTCT CCTTGGCTGC
GCGAGTTCGG CTGCATTGGG TACAGGATGC GGTGGTGGTG CAATTGGAGG GGTTACCAGT
GCGGCGTTGA CGCCGTACTT CTTGGACGCA ATCACTTCGA ATGGTGCGCC TCTGCAGATG
GATGCTGGTC TGCAAGCCAT GATATCGACG GTTGCCGGCT TGGCCGGCGG CGGGCTAGCC
GACGCCCTTG GGCAGAATGC TCAGGGTGGC ATGGCGGCAG CACAGAACGA GGCGCTCAAC
AACACTACGC AGCACTATCA GAACGTCAAT AATCCGCTCT TCAAGGCCAA CGTGAAGGCC
CTGGGAGACT GTGTCGATCT CGTGTCTTGC CGCTCCAACG CAGCTTTCTT GCAGAAGCAG
ATGGATGCGT TGAGCGATAG CAATATCGCC GGGATGTGTG GAACCAACTC GGCTTGCGTC
TCCGCCCGAC AGCAAGAGCG ACAGCTGTAC CAAGATGCCT ATGGCCAAGC CGTCGCGCAT
ATGAATCCTG ATGTGGCTGC ACGAGACTAC CTGGCTGCAC AGAATCAGGC CCAAGGAAAT
CGCTTCACTG CCAACGATCT AGCAAACGCG TTGCAGCGCT ACCAATCTGG CACTAGCGAT
CCAGCCAATT CTGCCGACGC CTTTGTGACC AAGGCCATTG TCGGTAACGC TGCCCTGTTT
GGAGCAATCA AGGGAATAAC TGCAGTTGAT AGCGATTCGG GTGGCGGCGG CAGGCTTCCA
GGAACTGCGA AGATGTCCCT TGCAGATGCA CAAGCCTATG CTAGTAATCG TGCGACCCAG
CTTCAGTCGC AACTTCCTGA AGGGTCGCAA GGCCGAGTAA CTATGGGCGT AGGTGTTGTA
CAAGATGTCA ATGGAAACCA GATCGTCGTT GTATCAACCA GTGAGCCAAG AGGCTACTTG
CGTCCAGGCG TAACACTGAA GCCCGGCGAG CTTGTAGTTC CCGGTACTGG CCATGCTGAA
GCGGATATAG TCAATTGGGC TAGCCAGAAC GGCTACACAG TCTTGACCGT CGGAGCGGGA
AGGCCTATTT GTCCTTCCTG TGCCGCTGCA CTGACCGGTG CGAACGCTAC TCCAGCAACC
ACCTTGAAGC TGCCGAAATG A
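
The gene sequence is listed in space-separated blocks of ten bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the listing is used as a demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence listing, used as a small demonstration.
first_row = "ATGCGTCTGC CTGGCCAGAT GCAGATCGCT GGCGCGCATG CGTCGGGCGA GACTGCAGCA"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two functions to every row of the listing gives a value that can be compared with the GC content field of the record.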
 
Protein sequence
MRLPGQMQIA GAHASGETAA HQEQKQVNET DAHTSRHDRA RHSRLIRSIA AVVALISWLG 
PVQITWQAAQ RSAAIFAVRT GSPFDDFGNH VASWRTTGRL LVRWGIQQAE AAPITDPTAP
IRFTPTITQT TAGVPTINVT TPNASGLSYN LLQSLTVDSG VGLVLNNSLT GGGTFLGGNV
SGNPNLAASG PASTILTQIT STTPIRINGG VEVFGAPASV IFAAPGGVYL AGAGFTNTPS
VTLVTGTPQF LNGSGSPVGF DQATAIGYTV NSGRVQIDSA AGTTNGAGIE GTVGSLNVIA
QSIGVNAALY TGQQLNLIAG NQVVTPVAAG TGRAGSDWQV SSNGPNTAAN SPSAQNGLAI
DATAFGAMTS GQIKIVSTAQ GLGVRAAGDM AANTSNVNID SNGNVSVGNV YGQQNVGITS
AGSVSTTGTV KALQDVSVSA NGDVNVGGAA QAGNNLSLNA GGNLTGPGNL SAAKTLTATA
GNSVNLTGDL NAANIAVTAR GQDGTGDIAL GGKVSSPNTI DLNAARDLAL AGQVTTNGDI
QATAGRDVAI SGTTQSSGAT TLKAGRNIGV TNTGAVTAST TTTATAGENI NVAGAIGSQG
NLQLATTNGA ITTTGSLASN GAITADAGGA NGDIALGGTV TTPNVLTLTA ARNATVGGQL
TTGNNLQASA GLDLVVSGTT QSVGATTLSA AQDLTVAGTG SATAGTTLSA TAGRNLAVSG
GTASGGDTTL NATSGTLSTS GTVLAGGNVA ATGQAGTTLG GTVYATQSVT AHSGVGTTSA
TGNVIAHNGS VSLTGTNVSV SGGTQSATDT TLTATQGNAT VDGQVAALGK LGVSATQDIT
GKGSTASVGD TTLAAGNNIV LTGNSQTAGN LTATATNGVS IAALPVVGGN ASLSGANVSL
GSNGTTSQVN GTLTATGTQS VVTAGTIATS SANLTGGTVT NAGTVTASNA LTATGTTVTN
TGTLGGATTT VHGTDVANSG LIGGQTVNVT ADNSLSNQNG TLLGTQALNV TANVLTSNQN
GVMFAGSPSG STTSGDLTAT VKGGNGSFNN AGGQILAANN ATLNLQNQTI DGASNLGTVN
AGNQLTYNVG AVANTGAWTL GGKSATINAA NGISNTGSIQ HAGDLTLSTP GAVTNSGQII
AGKNLSVSGG TVSNAAGATL HSDNDLSVSG ATTNRGTVEA LNDVNITGAG YDNAGALTQA
NRDVNVNVSG NVLNQAGTIG AKRDVSLTAN QIINDATAST GSGTTVVTGQ EVNPTYLSSV
VIGTKQVQLP IPGAGSADGG PVFGPYRFAV TIGDLKPDAN GVISAYQGVE TYTPANGGGP
NGNITPSLNL WHFLSPDTTS FPSADYQPAP GTPTPFITLP TVTRTETTTQ DGVSGIIQAG
RNLAVTASSL SNNGGQISAA NDIKMAVGTL SNGASAGSSK TITESISQGA LNAFMQQLAT
QLGYNPQSGG GYPLAVLNDG CMPGECNNSG NGNGPPPGPE APHWVWFGPS SNIGFNGNVN
IPGLAGFTAT PPAPQTTVQQ TAGKQGVIAA GGNIDLAQVG TLNNGGQIAA AGNITLGGSV
NNVGQLNVNR TTLPGCVGNP ATCTDPGVSG GGSPYHEVID PKQQVASIVA GGTLTANLSQ
LTNQTGTVAA AGAVQITAPT VTNTGGTIQS TTGSVTINAA NGLTNQAAPT TTVYASHGSD
TSVCGKVGGT ACATATQTAT GDAGMILAAT DLNITAGSVR NNGGAIVAGG NNTITTGSFD
NSPVFLRQYY HWSYYNQNST ASDTWGCDAA GDISGCQRVF GSNLTNGWNG TAENAPTIGQ
LNSYVSGGNL AIRSGGALIN SGNIEGTAIS LSGATITNGI TNPSIQTPPS TSGQQVVNLG
PIGTPNAQLP ATGTPDTFSG PTTVLQKGVP NPTNPGTSNG QWQFNPVVVT EQSGQTVTWH
FNAPQLGGAL TSPTATGSTA QYLTSSPATA VLGGVGPATL INALPADLRP GSTPFYYDPQ
AENARLDQQA LATTGRTSFV NGLTYDNQNH LTVDDQQKLI LYQNAVDYAK AHNIQLGQAL
TPEQLAALDK PMLWYVSQSV PDPSCMSGAC PMVTALVPQV YLPQGYSGIE PGGSIVATKS
LDLNANSPIQ NTGTLGSYGT LTSNTTIVNE QRAADMTAAW QPIEDGWARV TGQQGQANSG
FVFAANAADI AGQVQNINGV LAQLNADGSM SAAESARVAQ AVQAGMAAVT STHTDTFVRA
PDVMGEIFSS VVMVAIGIMT GGAAMAAYAG VGATLTVGEM MAQAAITSMT TNAMQQASSG
MGFSFGALVK AGATSALTAG LTQGITINAD GSLGMVDGLG SLPSDRSIAA LSGTKVIGDG
LTQAGAATGT LGQQIAALAL DATIKAGVNT GINGGSFLTN LRNSAVNDLA AVGAYSIGNL
NENGALVGPA YYAAHALLGC ASSAALGTGC GGGAIGGVTS AALTPYFLDA ITSNGAPLQM
DAGLQAMIST VAGLAGGGLA DALGQNAQGG MAAAQNEALN NTTQHYQNVN NPLFKANVKA
LGDCVDLVSC RSNAAFLQKQ MDALSDSNIA GMCGTNSACV SARQQERQLY QDAYGQAVAH
MNPDVAARDY LAAQNQAQGN RFTANDLANA LQRYQSGTSD PANSADAFVT KAIVGNAALF
GAIKGITAVD SDSGGGGRLP GTAKMSLADA QAYASNRATQ LQSQLPEGSQ GRVTMGVGVV
QDVNGNQIVV VSTSEPRGYL RPGVTLKPGE LVVPGTGHAE ADIVNWASQN GYTVLTVGAG
RPICPSCAAA LTGANATPAT TLKLPK
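
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in which start codons are permitted, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, ordered by first, second, then third base (TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listing.
print(translate("ATGCGTCTGC CTGGCCAGAT GCAGATCGCT"))  # prints: MRLPGQMQIA
```

The result matches the first ten residues of the protein sequence. The last four codons of the gene, `CTG CCG AAA TGA`, translate to `LPK` followed by a stop, which agrees with the end of the protein listing.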