Gene RoseRS_2737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2737 
Symbol 
ID5209706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3405525 
End bp3414119 
Gene Length8595 bp 
Protein Length2864 aa 
Translation table11 
GC content63% 
IMG OID640596337 
Productglycosyltransferase 36 
Protein accessionYP_001277059 
Protein GI148656854 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTTG CCACGTTGAC CAGCGATCTC TCGCACCGGT TTGACACATT GACCCGTACC 
GTGCAGCAGA CGTTTTCCTT CATCCGACGG CGCCAGCCGC TGTCGGATCG CCCGATCCGC
GGGGTGATCC TGACGACCGG GCAACTCGAA GAACGCGCGC GGGCGCTCGC CGCCGAACAC
ACGGTCGTTC TGCAACCCGG CAAAGCGCGC GCTTTGCTCG CGTCGGTGGA TCGCAATCAT
CGCCTGCTGC GCCTGGCGTA CCAGACGCTC GCCGCCGATG CTGCGCAACG CCAGCCGCTC
ACTCCTGCCG CTGCCTGGCT GGTGGACAAC TACCACGTTA TTGTCGGTCA GATCCGCGAA
ATTCGTCAGG ACCTGCCCGG CGGGTACTAC CACGAACTTC CGAAACTGAA AGGCGGACCC
CACGACGGTA AACCGCGCGT CTATGCGTTG GCGCTCGAAC TGATCGAGCA TACCGACGGT
CGGATCGATC TCGAACAGTT GACGCGCTTT GTCCTGGCAT ATGAAGCAGG GGCGCCGCTC
TCGATCGGTG AGATTTGGGC AATCCCGATC ATGCTGCGTG TCGGGTTGAT CCAGAACCTG
GGACGCCTGG CGCGCCTGAT GATCGATGAG CGTCGCCTGC GCCTCGAAGG AGCAGCGTGG
GCGGAACGGA TCCTGGCGCA GAAGCATGTT GGCTCCTTCG AGGGAAATAC TGCCTTTCGA
CAACTGGCGC GGATGCACCC CCAACTTCCG CTGCCGCTCG CCGTCGAACT GATCCAGCGC
CTGCGCAACC AGGAAGGGGA GTTCGACATC GCCCAACTGA TGGTCTGGCT GGAACAGCAA
CCTTCCATCC CCTATAACAC CGCCGAGGAA ATCATTCTCG CCGAACAGCG GCGTCAATCC
GCCAATCAGG TATCGGTCGC CAATACGATC ACCAGCATGC GGACGATGGA CGCGGTCGAC
TGGCCCGACT GGTTCGAGCG CGTCAGTCTG GTCGAACAGG TGTTGCGTCA GGACCCGGCT
GGCGCGTATG GGCAAAGCAC CTTTGCCACC CGCGACCGCT ACCGACACGA ACTGGAGCGT
CTGTCGCGCC GTAGTGGATT GAGCGAAGAC GCCATTGCGC GACGGTTGAT CCACATTGCT
GCACGCGCCC GGGAAGAGGG ACGTCCACTG CGCGAAACGC ACATCGGGTA TTACCTGGTC
GATGAAGGGC GCTTTGCCTT TGAGGCGGCG CTGGGGTGTC GCCTGACGCC CGGCGAGGTG
GTGCGTCGTG CAGTTCTGCG TCATCCCGAA GTGGTCTATC TTGGCGCGAT TACGGCAGGA
ACTGTCGCCA TCACCGCTGC CGGCATTCGT TTTGCCCGCC CGACTGCGCG CAGCGAGCCT
TCCGCCGTTC CACCGATCCT TACCGCCGCC CTTTCGCTTA CGGCGGTCAT TCCGGCATTC
GCGCTGGCGA AAGAACTGGT TGATCGCGCA GTTACGCGCC TGATACCCCC GCGTACCCTG
CCACGACTCG ATTTTCGTCA TGGCATTCCG CGCGAACTGC GCACAATCGT TGTCGTGCCG
ACATTGCTGC TCACACCCGA TAGCATCCGC ACGCAGATCG AGTCGCTGGA AGTGCTGGCG
CTTGCCAATC AGGACCCGCA CCTGCATTTC GCACTGCTCA CCGATTTCGC CGATGCGCCT
CAACAGCATA TGCCGGAAGA TGAGCGCCTG CTCGCCCTGG CAGTCAGCGG TATCGAAAGG
TTGAACGAAC GCTACAGCAG CGACCGCTTC TTCCTGCTGC ATCGCCGCCG CGTGTGGAAT
GAGCGACAGG GTTGCTGGAT GGGATGGGAG CGCAAACGCG GGAAACTTGA AGAGTTCAAC
CGGTTGCTCG CTGGCGCACA CGATACCACC TACGAGACCT TCATCGGCGA TCTCAGCATT
CTGCCACAGA TCCGCTATGT CATTACGCTC GACGCCGATA CGCAACTGCC GCGCGATACG
GCGCGCGCGT TGATCGGCAC ACTGGCACAC CCACTCAACC AGGCAGTGAT CGACCCGCAG
ACGCAGCGTG TGGTGCAGGG TTACGGCATT CTTCAGCCGC GTGTCGGCAT CGACCTGCCG
AGTGCGTTGC GCAGTCGCTT CGCGCGTATT TCATCGGGGA ATGTCGGTGT CGATCCCTAC
ACCACTGCGG TCTCCGATGT CTACATGGAC CTGTTCGGCG AGGGGATTTT CGCCGGGAAA
GGCATCTACG ACCCGGTCGC AATGCGGACG GCGCTGCACG ACCGTTTCCC GGAGAATACG
CTCCTCAGCC ACGACCTGAT CGAGGGATGT TATGCGCGCG CGGCACTCCT CTCCGATGTC
GAACTGCTCG ACAGTTACCC AACGACATAT GCGGCTTACT CGGCGCGTCA GCACCGCTGG
GTGCGCGGTG ACTGGCAGAT TGCAGGATGG CTCCTGCCGC GCGTTCCACG TGCATCGGGC
AGCAGCGCAC CGAACATCCT GCCGTTGATC AGTCGCTTCA AAATCGCCGA TAATCTGCGT
CGCAGTCTGA CTCCGCCAGC AACGCTCGCT CTCCTGGTCG CCGGATGGTT CGTTCTCCCC
GGTCGCCCTG CCGTCTGGAC GCTGCTGGCG CTTGGGCATT ACCTTGCGCC GCTGACCTTT
GCCGTCATAA GCCTGCGGTT GATGCCAGCC GACTGGCGCT ACCTGCGCAC TCATCTGATC
ACGACGATCG GAAGCCTGCG CTGGACGGCA TTGCAACTGC TGTTGAACGT TGCCATGCTT
CCCGATCAGG CATGGCTGAA TGTGGACGCT ATTGTCCGCA CTCTCTGGCG CATGGGAGTG
ACCCATCGGC GGATGCTCGA ATGGGAGACG GCGGCACAGG CGCAGCGACG CCTCACCGAT
TCGTTCGACT ATCTCGTCAG GCGGATGGCG CCGTCGGCGG TTGTGGCGCT GGCACTGGTA
GTGATGCGCA TTGATCGCCT GCGCGCGTCG TGGCTTCCCG CGCTGCCGGT TGTGGCGTCG
TGGGTTACGG CGCCCTTCTT CGCTCGCTGG CTCGATCAGG AGTACATCCC GCGCCTTGTT
GCACCGCTGA GCGTCGATGA TCGCCGGATG CTGCGCCGCG TGGCGCGCGC AACCTGGGCA
TACTTCGACC GGTTCGTTGT GCCGGAGCAG AACTTCCTGG CGCCGGATAA CTTTCAGGAA
ACGCCGCGAC CGGTGGTGGC GGAGCGCACT TCGCCAACCA ACATCGGGTT GCAACTGCTC
GCCGACCTGG CTGCCGTCGA TTTTGGCTAC CTGGGTGCGC GCAGTCTGGT GGAACGCGTC
GAGCGTGTAT TCGACACGAT TCAACGGATG GAGCGCTTCC GCGGACATCT GTACAACTGG
TACGATACGC GCACCCTGCA TCCCTTGCCG CCGCTCTATG TTTCGACCGT TGATAGCGGC
AATCTTGCCG GTCACCTGCT CACTCTGCGC CACGGGTTGC GCGCACTCAT CGAACGCCCG
GCATACGGAC CGTGGATCAT CGAAGGGCTG CGTGACATCC TGGAAATCAT CCAGGAACGG
TTGCCCGCCG ATGCGCGCGG GCGCACCTCG CTGGTTGCGC TCCTTCAGGC GCTCGATGCC
ACCCCGGAGA CGCTGGAAGG GTACCGCGCG CGGTTGCTGC AGGCTGCCGA CCTGGCGATC
ATCCTGGCGC GCGAGTCGCG TGTGGCTGAG TGGGCAGAAG CGCTGGCACG TCAGGTGTAC
AGCCTCCTCG ACGATATGCC GCACGATGAT GTGCCGTCAG CGGCGGAAGA TCCGCAGTTC
CGCGCTGCTG TCGAGCGACT GACAGACATG TGCATCGTTC TGATCACAGA GATGGACTTC
CGCTTCCTCT ACGACGAGCG TCGCCGCCTG TTCGCCATTG GTTACAATGT CAGCGAAGGA
CGGCGCGACA ATTCGTACTA CGATCTGCTT GCGTCTGAAG CTCGACTCGC AAGTTTCATC
GCCATTGCGC TTGGCGAAGT GCCGCAGGAA CACTGGTTCT ATATCGGGCG CAAGATTTCG
CCAGCGGTGG CAACCCCGAC GTTGCTCGCC TGGAGCGGCA CGATGTTCGA GTACCTTATG
CCGCTGCTGG TGATGCGCAA TTTTCCCGAA ACCCTGCTCG ACTCGACCTA TCGCGGGGCG
GTTGAACGTC AGATCGCCTA CGCCGCCGGG CGCGGTATTC CGTGGGGCAT CTCCGAATCG
GCGTTCAATC TGCGCGATTC GCAGATGAAC TACCAGTATC GCGCATTCGG CGTGCCAGGG
TTGGGATTGC AGAGCGGACT GGCAAACGAT CTGGTCGTTG CGCCATATGC GACGCTGCTG
GCATTGCAGG TCGCACCCAA CGAAGCCATC GCCAACCTCC GTACTCTGAT GGAGTACGGC
GCATTCGGCG CGTATGGCTT CTACGAGGCG ATCGACTTTA CGCCCGAACG CCTGCCGCCT
GGCGTCAGGC ACGCCATCGT TCAAACCTAT ATGGTGCATC ACCAGGGGAT GGGACTGCTG
GCGCTCGATA ATTTGTTGCA CGACAACATC ATGCAGCGTC GCTTCCATGC CGAGCCAATC
GTGCAGGCCA CTGAATTGCT GCTCCAGGAG AAGATCCCTG CCGACCGGCC CCTGCCGTTG
CCGCAGGAGA GCGCCGTCAG CGAAATCACA ACCACACCGG ACGTCGCGCT CGAACGTCAC
TTCACAACGC CGCATACTGC AGCACCATAT GCCTACATCC TCTCCAACGG CGTCCTGACC
ACGATAGTCA CCAACGCTGG CGGCGGCGGG AGTCGCTTTA CCCTGCCCGA ACGCACAACG
ACCGTTGCGC TTACCCGCTG GCGACCCGAC CCGACCCGCG ATGCGTCCGG CAGTTTCGTC
TATGTGCGCG ATGTACGCAG CGGCGTCACC TGGTCGCCGA CGTACCAGCC GCTCCGAACA
CTTGGCGACG ATTACCGCGT CACCTGCGGC GCCGGACGGG TCGAGTTCCG CCAGCACTAC
GCTGGCGTCG ACACGCGCCT GGAGATGACC GTATCGCCCG AAGACCACGT CGAGGTGCGG
GTTCTGACTC TGGTGAACAC GACGGCGCAA CCACGCGAAC TGGAAATTAC CACCTATGCC
GAGATTGTGC TTGCGCCCGA CGCCGCCGAC GCCGCCCATC CGGTCTTCTC GAACCTGTTC
GTCGAGACAT CGTTCGATCC CACAAGCGAA GCGTTGCTGG CGACACGCCG ACCGCGCGCG
CCGAACGATG AACGCCTGTG GGTTGCTCAG ACGATCGGCG TGCGTGGGCG CGCCCTGGGT
GAGCCGGAGT ACGAAACCGA CCGCATGACT TTCATCGGGC GCGGGCGCGA TCCTTCACGT
CCACAGGCGC TCGACCGCCC GCTGAACGGT CGCATTGGCG CAGTGCTCGA CCCGATCTTC
AGCCAGCGAC GCCGGGTGCG GATCGTGCCG GGCGGACAGG CGCAGGTCAT CATGACGATG
GCGGTTGCGG CGACACGCGA GGATGCGATC CGCCTGGCGG ATCACTACCG CGATGCTGTC
ATTGCAATGC GTGCCTTCGA TATGGCGCGC ATCCAGGCGC AGGTCGAACT GATGCATCTG
GGCATCAACG CCGACCAGGC GCACCAGTTC CAGCGCCTGG CATCACTCAC TCTCCTCCCC
GATCCGGTGC GCCGCGCCGC ATCTGAGGCG CTGCTGCGCA ACACCAAAGG GCAGCCAGGT
CTGTGGGCAT ACGGCGTCTC CGGTGATTAC CCGATTGTTG TCGGGCGCAT TGCGCCCACT
TCTGATACAT CACTGGCGCG CTCGCTGATC CAGGCGCACG AGTACTGGCG CCTGAAGGGG
GTGCTCATCG ATCTGGTGCT GCTGGTCGAG GATGGCGCCG ATTATCGCCA GGAGCGGTAT
GAGCAGATCA TGGCGCTGGT GCGCAGCAGC CGGTCGAGTC GCTGGCTCAA CCAGCGTGGC
GGGGTCTTCG TGCTGCGCAC CGGGATCATG CCTGAAGCCG ATCAGATTCT CTTCGAGGCG
GTGAGCCGCA TCACACTCCA CAGTCGGCGT GGCGACCTGT CCTACCATCT GCGCCGTCGC
CTGCCCGATA AAGCGCCACC TCCGCCGCCG CTCACGATGC CGCCGTTCGA TGACGCTCCG
CTGCCTGCAG AGAATCTGAT GCTGACCACG GAGTATGGCG GGTTTACCGT TGATGGACGT
GAGTTCGTCA TCGAGGTTGC ACCCGGCAAC TCCACGCCAT TGCCGTGGGT CAATGTCGTT
GCCAACCCAC GCGCCGGTTT CATCGTCTCA GAGAGTGGCT GCGGCTACAC CTGGGCGGAA
AACAGTCGCG AGAACCGTCT GACCCCCTGG TCGAACGATC CGGTCAGCGA TCCGCCCGGC
GAGGCGATCT ACCTGCGCGA CGAAGCCAGC GGCGTGATCT GGTCGCCGCT GCCGCGTCCG
TGCGCAAGCG GGCGCGTTCG CGTTTACCAC GGGATGGGAT ACTCCCGCTT CCTGCAACAG
TCCAATGGCA TCGAGAGCGA AACCACCCTC AGCATTGCGC CTGACGATCC GGTGAAAATC
ATTCGCCTGC GCCTGCGCAA CCGCTCCGCT CACGAGCGTC GCCTGAGCGC GACCATGTAT
GTTGAGTGGG TGCTTGGCGT GCTGCGCGAA CAAACGGCGC TCTTCATCGT CACATCGACG
GCGCCGGAGC GGAGCGCATT GCTGGCGCGC AACGCCTACA GCCACGATTT CGTCGGTCGG
GTTGCATTCC TGGCGTGCAG CGAACCAGAG GTCGCATTCT GTGGCGATCG CGCAGCATTC
ATCGGGCGCA ACGGCGATCT GGCGCGCCCG ATTGCGCTGG CGGGCGCACA CGATGGCGCC
TTCGACAATC GGATCGGCGC AGGTCTTGAC CCATGCGGCG TTGTTACAAC GACATTCACC
CTCAACCCTG GTGAAACACG CGACCTGTTC TTCCTGCTGG GTCAGGGAAC AGATGAGGCT
GAGGCGCTTA CCCTGATTGA CCGCTACCGT GATCCTGCCG CTGCGACCGG CGCCATCGAA
GAGACCATCG CACGCTGGCG CACCCTGGTC AGCACGCTGC GGGTGCGCAC CCCCGACCCG
GCGCTCGATG TGTTGCTCAA CGGATGGCTG ATCTACCAGA CCCTCGTCTG CCGTATCTGG
GGGCGCTCGG CATTCTACCA GTCGGGCGGC GCCTACGGCT TCCGCGATCA GTTGCAGGAT
GTGATGGCAC TGACCATGAT CGAGCCATCG ATTGCACGTG ACCATATCCT GCGCGCCGCA
GCGCGCCAGT TCGTCGAAGG CGATGTGCAA CATTGGTGGC ACCCGCCGCT CGGTCGCGGC
ATCCGTACCG CATTCTCCGA TGATTACCTG TGGCTGCCCT TCGTTGTGTG TCACTATGTC
GAAACGACCG GCGACCGGGC ATTGCTCGAT GCAGTTGCGC CGTACATCAA AGGGCGACCG
CTCGCGGAGG ACGAAGCCGA ATACTATGAT CTCCCCGAGC AGGCGAACGA GGCGGGCAGT
ATCTACGACC ACTGCATTCG CGCGATTGAT CGTGCGTTGA GGCGCACAGG CGCGCATGGG
CTGCCACTGA TGGGATCGGG GGATTGGAAT GATGGTATGA ACCTGGTAGG ACACGGCGGA
CGCGGCGAGA GCGTGTGGGT CGCCTGGTTT TTGATTGTTA TCCTGAACCG CTTCGCTCCG
ATTGCCGAAC AGCGGCGCGA CATCGAGCGC GCAGCGCGCT ATCGCGCCGA AGCGCGCCGG
TTGAGCGAGG CGATTGATCG CCACGCCTGG GATGGCGACT GGTATCTGCG CGCCTTCTAC
GACGACGGCA CGCCGCTCGG CTCGGCGCGT GACGATGAGT GCCGCATCGA TTCCCTGAGT
CAGTCGTGGG CGGTCATTGC CGGCGCCGCC GATCCGACGC GGGCGCGGCA GGCGATGGAG
GCAGTCGACC GCCACCTGGT TGACCGTGAC AATGGCATTA TCAAACTGTT CACGCCGCCG
TTCGACCAGA CGCCGCGCAA TCCCGGCTAT ATCAAAGGGT ATGTGCCCGG TGTCCGTGAA
AACGGCGGGC AGTACACCCA CGCTGCAATC TGGGTCGCCT GGGCATGGAC AATGCTGGGT
GAATACGCGC GCGCGGGTGA ACTGCTGCGG ATGCTCAATC CAGTTCACCA CGCACAATCC
CGTGGTCGCG TCTACGCCGT CGAGCCGTAT GTCATCGCGG CAGACATCTA CAGCGCGCCG
CAGCACCTGG GGCGCGGCGG ATGGACGTGG TACACCGGCT CGGCGGCATG GTTCTACCGG
CTTGGCATCG AGCGTATCCT GGGTATCCAG CGCCACGGCG ATCACCTGAC TCTGACGCCC
TGCCTGCCGC CCGACTGGCC AGGTTATGAG GCGTGGTACC GCTACGGTTC GAGTGAGTGC
CATATCGTCG TCGAGCGCGG CAGCGATGGG TATGCGCTGA CAATCGACGG CGTTCCTGCG
CGCGAACTGA CCATCCCGCT GCACGACGAC GGGCTGCGGC ACGAGGTGCG CCTGTTCCTG
CCAGCAACGG AAGTGAGCGC GCCTGGCAGC GATGGCGCAG GCACAATCCA GGAGCGCAGC
GCTGCACCGC AATGA
 
Protein sequence
MNLATLTSDL SHRFDTLTRT VQQTFSFIRR RQPLSDRPIR GVILTTGQLE ERARALAAEH 
TVVLQPGKAR ALLASVDRNH RLLRLAYQTL AADAAQRQPL TPAAAWLVDN YHVIVGQIRE
IRQDLPGGYY HELPKLKGGP HDGKPRVYAL ALELIEHTDG RIDLEQLTRF VLAYEAGAPL
SIGEIWAIPI MLRVGLIQNL GRLARLMIDE RRLRLEGAAW AERILAQKHV GSFEGNTAFR
QLARMHPQLP LPLAVELIQR LRNQEGEFDI AQLMVWLEQQ PSIPYNTAEE IILAEQRRQS
ANQVSVANTI TSMRTMDAVD WPDWFERVSL VEQVLRQDPA GAYGQSTFAT RDRYRHELER
LSRRSGLSED AIARRLIHIA ARAREEGRPL RETHIGYYLV DEGRFAFEAA LGCRLTPGEV
VRRAVLRHPE VVYLGAITAG TVAITAAGIR FARPTARSEP SAVPPILTAA LSLTAVIPAF
ALAKELVDRA VTRLIPPRTL PRLDFRHGIP RELRTIVVVP TLLLTPDSIR TQIESLEVLA
LANQDPHLHF ALLTDFADAP QQHMPEDERL LALAVSGIER LNERYSSDRF FLLHRRRVWN
ERQGCWMGWE RKRGKLEEFN RLLAGAHDTT YETFIGDLSI LPQIRYVITL DADTQLPRDT
ARALIGTLAH PLNQAVIDPQ TQRVVQGYGI LQPRVGIDLP SALRSRFARI SSGNVGVDPY
TTAVSDVYMD LFGEGIFAGK GIYDPVAMRT ALHDRFPENT LLSHDLIEGC YARAALLSDV
ELLDSYPTTY AAYSARQHRW VRGDWQIAGW LLPRVPRASG SSAPNILPLI SRFKIADNLR
RSLTPPATLA LLVAGWFVLP GRPAVWTLLA LGHYLAPLTF AVISLRLMPA DWRYLRTHLI
TTIGSLRWTA LQLLLNVAML PDQAWLNVDA IVRTLWRMGV THRRMLEWET AAQAQRRLTD
SFDYLVRRMA PSAVVALALV VMRIDRLRAS WLPALPVVAS WVTAPFFARW LDQEYIPRLV
APLSVDDRRM LRRVARATWA YFDRFVVPEQ NFLAPDNFQE TPRPVVAERT SPTNIGLQLL
ADLAAVDFGY LGARSLVERV ERVFDTIQRM ERFRGHLYNW YDTRTLHPLP PLYVSTVDSG
NLAGHLLTLR HGLRALIERP AYGPWIIEGL RDILEIIQER LPADARGRTS LVALLQALDA
TPETLEGYRA RLLQAADLAI ILARESRVAE WAEALARQVY SLLDDMPHDD VPSAAEDPQF
RAAVERLTDM CIVLITEMDF RFLYDERRRL FAIGYNVSEG RRDNSYYDLL ASEARLASFI
AIALGEVPQE HWFYIGRKIS PAVATPTLLA WSGTMFEYLM PLLVMRNFPE TLLDSTYRGA
VERQIAYAAG RGIPWGISES AFNLRDSQMN YQYRAFGVPG LGLQSGLAND LVVAPYATLL
ALQVAPNEAI ANLRTLMEYG AFGAYGFYEA IDFTPERLPP GVRHAIVQTY MVHHQGMGLL
ALDNLLHDNI MQRRFHAEPI VQATELLLQE KIPADRPLPL PQESAVSEIT TTPDVALERH
FTTPHTAAPY AYILSNGVLT TIVTNAGGGG SRFTLPERTT TVALTRWRPD PTRDASGSFV
YVRDVRSGVT WSPTYQPLRT LGDDYRVTCG AGRVEFRQHY AGVDTRLEMT VSPEDHVEVR
VLTLVNTTAQ PRELEITTYA EIVLAPDAAD AAHPVFSNLF VETSFDPTSE ALLATRRPRA
PNDERLWVAQ TIGVRGRALG EPEYETDRMT FIGRGRDPSR PQALDRPLNG RIGAVLDPIF
SQRRRVRIVP GGQAQVIMTM AVAATREDAI RLADHYRDAV IAMRAFDMAR IQAQVELMHL
GINADQAHQF QRLASLTLLP DPVRRAASEA LLRNTKGQPG LWAYGVSGDY PIVVGRIAPT
SDTSLARSLI QAHEYWRLKG VLIDLVLLVE DGADYRQERY EQIMALVRSS RSSRWLNQRG
GVFVLRTGIM PEADQILFEA VSRITLHSRR GDLSYHLRRR LPDKAPPPPP LTMPPFDDAP
LPAENLMLTT EYGGFTVDGR EFVIEVAPGN STPLPWVNVV ANPRAGFIVS ESGCGYTWAE
NSRENRLTPW SNDPVSDPPG EAIYLRDEAS GVIWSPLPRP CASGRVRVYH GMGYSRFLQQ
SNGIESETTL SIAPDDPVKI IRLRLRNRSA HERRLSATMY VEWVLGVLRE QTALFIVTST
APERSALLAR NAYSHDFVGR VAFLACSEPE VAFCGDRAAF IGRNGDLARP IALAGAHDGA
FDNRIGAGLD PCGVVTTTFT LNPGETRDLF FLLGQGTDEA EALTLIDRYR DPAAATGAIE
ETIARWRTLV STLRVRTPDP ALDVLLNGWL IYQTLVCRIW GRSAFYQSGG AYGFRDQLQD
VMALTMIEPS IARDHILRAA ARQFVEGDVQ HWWHPPLGRG IRTAFSDDYL WLPFVVCHYV
ETTGDRALLD AVAPYIKGRP LAEDEAEYYD LPEQANEAGS IYDHCIRAID RALRRTGAHG
LPLMGSGDWN DGMNLVGHGG RGESVWVAWF LIVILNRFAP IAEQRRDIER AARYRAEARR
LSEAIDRHAW DGDWYLRAFY DDGTPLGSAR DDECRIDSLS QSWAVIAGAA DPTRARQAME
AVDRHLVDRD NGIIKLFTPP FDQTPRNPGY IKGYVPGVRE NGGQYTHAAI WVAWAWTMLG
EYARAGELLR MLNPVHHAQS RGRVYAVEPY VIAADIYSAP QHLGRGGWTW YTGSAAWFYR
LGIERILGIQ RHGDHLTLTP CLPPDWPGYE AWYRYGSSEC HIVVERGSDG YALTIDGVPA
RELTIPLHDD GLRHEVRLFL PATEVSAPGS DGAGTIQERS AAPQ