Gene Rcas_1870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1870 
Symbol 
ID5539348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2391084 
End bp2399678 
Gene Length8595 bp 
Protein Length2864 aa 
Translation table11 
GC content64% 
IMG OID640894008 
Productglycosyltransferase 36 
Protein accessionYP_001431979 
Protein GI156741850 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCTG TTCCGTTTGC CAGCGATCGC TCACATCAAC TTGCAGCACT TGCCGCCGCT 
GTGCAACGCT CTTTTCCATG GGTGCGCCGT CGTCCGCCGC TGTCGGATCG CCCGATTCGC
GGGGTGATCC TGACGACGGA GCAACTCGAA GCGCGCGCGC GCACACTTGC TGCCAGCCAT
ACGATTGTCA TGCAGCCGGG GAGAGCAGGT GCGCTGCTCG CATCGGTGGA CCGCAATCAT
CGCCTGCTAC AACTGGCATA CCAGACACTC GCCGCAGATG CTGCGCAACA CCAACCGCTC
ACTCCGGCGG CTGCCTGGCT GGTCGATAAC TATCACGTGA TTGTCGGGCA GATCCGTGAG
ATCCGACAGG ACCTTCCAGG TGGGTATTAC CACGAACTCC CGAAATTGAA GGGGGGTCCT
CACAACGGTA AGCCGCGCGT GTATGCCATG GCGCTCGAAC TCGTCGAGCA TACGGACGGA
CGCATCGACC TCGAACAACT GACGCGCTTT GTGCTGGGAT ACGAAGCCGT TGCGCCGCTG
TCGATCGGCG AAATCTGGGC GATTCCGATC ATGCTGCGCG TCAGCCTGAT CCAGAACTTG
GGGCGCCTGG CGCGCCTTAT GCTCGATGAG CGCCATCTTC GACTCGAGGG CGCGGCATGG
GCGGAACGGA TTCTGGCGCA GAAAGATGTG GGGTCGTTTG AGGGGAATAC GGCCTTCCGT
CAACTCGCGC GCACCCATCC GCAACTCCCG TTGCCGCTTG CGGTCGAGTT GATCCAGCGC
CTGCGCAACC AGGAAGGAGA GTTCGACATT GCCCGGTTGA TGGTCTGGAT CGAGCAGCAA
CCTTCCATTC CCTACAATAC CGCCGAAGAA ATCATTCTCG CCGAGCAGCG CCGCCAGTCC
GCCAATCAGG TGTCGGTCGC CAATACGATC ACAAGCATGC GGACGATGGA TGCGGTGGAC
TGGCCCGACT GGTTCGAGCG CGTCAGCATG GTCGAGCAGA TTCTGCGCCA GGACCCGGCA
GGCGCGTATG GACGGAGCAC CTTCGCAACC CGTGACCGCT ATCGGCACGA ACTCGAGCGT
CTGTCGCGGC GGAGTGGTCT ACGCGAGGAC GCAATTGCGC GACGTTTAAT CCGGGTTGCA
GCGCAGGCGC GGGAAGCGGG TCGTCCGCTG CGCGAAACAC ATATCGGCTA CTACCTGGTG
GACGAAGGGC GCACCGCATT CGAGATGGCG CTTGGGTGTC GTCTGACACC TGGCGAGGTG
GCGCATCGCG CCGTGCTGCG GCATCCCGAA GCGGTCTATT TTGGAGCGAT CGCCGCAGGA
ACTGTGGCTC TTACTGTGGT AAGCCGGCGT CTTGCGCAAT CCGATAACGG ACGCCGCGCC
TCCGGCGCTC CGCCGGTCCT GACTGCTGCG CTGTCACTCA CGGCGACTTT GCCCGCATTC
GCACTGGCAA AAGAACTGGT TGATCGCGCG GTGACGCGCC TGACCCCGCC GCGTGTTCTG
CCGCGCCTTG ATTTCCGCGA CGGTATCCCG CGCGAACTGC GCACGATCGT GGTCGTGCCG
ACGCTACTGC TCACGCCGGA TAGCATCCGC ACGCAGATCG AATCGCTGGA GGTGCTGGCG
CTCGCCAATC AGGACCCGCA CCTGCACTTT GCGCTGCTCA CCGATTTCGC CGACGCGCCG
CAGCCGCACA TGCCCGAAGA TGAGGCGTTG CTTGCGCTGG CGGTCGAGCG TATCCAGGCG
CTCAACGAAC GCTATGGCAG TGATCGCTTC TTCCTGCTCC ACCGCCGCCG CGTCTGGAAC
GAGCGCCAGG GATGCTGGAT GGGTTGGGAG CGCAAACGCG GCAAGCTCGA AGAGTTCAAT
CGTTTACTGA TTGGCGCCAC GGACACAACC TACGAAACAT TCATCGGCGA CCTCAGCATC
CTGCCGCAGA TCCGCTATGT CATCACGCTC GATGCCGACA CCCAACTGCC GCGTGATGCA
GCGCACCGGC TGGTGGGCAC GCTGGCGCAT CCGCTCAATC AGGCAGTGAT CGACCCGCAG
ACACGGCGCG TGGTGAAGGG GTATGGCATT CTCCAACCGC GCGTCGGCAT CGATCTGCCG
AGCGCCCTGC GCAGTCGCTT TGCGCGCATT TCGTCGGGCA ATGTTGGCGT TGATCCATAT
ACCACCGCCG TTTCTGATGT CTACATGGAC CTGTTTGGCG AAGGGATCTT TGCCGGCAAG
GGAATCTACG ACCCGGTCGC CATGCGCGAG ACGTTGCACG ACCGCTTCCC CGACAATACG
CTCCTCAGCC ACGACTTGAT CGAAGGGTGC TATGCGCGCG CGGCGCTGCT CTCCGATATC
GAATTGCTCG ACAGTTACCC GACGACGTAT GCCGCTTATT CAGCGCGCCA GCATCGCTGG
GTGCGCGGCG ACTGGCAGAT CGCCGGATGG TTGCTGCCGC GTGTGCCGCG CGCGTCGGGA
GGGTATGCGC CGAATGTTCT GCCGCCGATC AGCCGGTTCA AGATCGCCGA TAACCTGCGC
CGCAGCCTGA CCCCACCGGC GACTCTGGCA GTGCTCATCG CCGGTTGGTT TGCGCTCCCC
GGTCGTCCGG CAGTCTGGAC GTTGCTCGCG CTTGGGCATT ATCTTGCGCC GTTGACGTTT
GCGATTCTGA ATACGCGCCT CGCGCCGACA GACTGGCGCT ATCTCCACGT CAGCCTGATC
GCAGCTGCGG AAAACCTGCG CTGGCCCGTG CTGCAATTGC TGTTGAATGT TGCCATGCTC
CCCGATCAGG CATGGCTGAA CCTGGACGCT GTCGCGCGCA CGCTCTGGCG TATGGGTGTG
ACCCATCGGC GGCTGCTCGA ATGGGAAACC GCGGCGCAGG CGCAGCGCCG CCTCACCGAT
TCGTTCGATT ACCTGATCAA ACGCATGGGT CCTTCAGCAG TAGTGTTCGT GGCGCTGGCT
GCTGTGCGTT TTGAGCGCCT GCGCGACGCT TGGGTCCCTG CCCTGCCGGT GACGCTCGCA
TGGGGCGCAG CGCCCTTCTT TGCCCGCTGG CTCGATCAAG CATACGTTCC CCGCCGTATC
GAACCGCTCT CCGCCAATGA TCGCCGGATG CTGCGGCGCG TGGCGCGCGC AACGTGGGCA
TACTTCGAGC GCTTCGTCGT TCCAGAACAG CACTTCCTGG CGCCGGACAA CTTCCAGGAG
ACGCCGCGAC CGGTTGTGGC AGAGCGCACC TCGCCGACCA ACATTGGCTT GCAGTTGCTC
TCCGACCTGG CGGCGGTCGA TTTCGGATAC CTGGGTGTGC GCAGTCTGGC CGAGCGCGTC
GGACGTGTGT TCGAGACGAT CCAACAGATG GAACGGTTCC GCGGGCATCT GTACAACTGG
TACGACACCC GCACGCTTCA CCCCCTGTCG CCACTCTATG TGTCCACGGT CGATAGCGGC
AACTTCGCCG GACACCTGAT CACGCTGCGG CAGGGATTGC TTGCGTTGGC AGAACGTCCG
CCATACGGTC CCTGGATCAT CGAAGGGTTG CGCGACACTC TTGAGATCAT TCAGGAGCGC
CTGCCCGCCG ACGCCAGAGG ACGCACGTCG CTTGTGGCGC TGCTCCAGGC GCTCGATGCC
ACGCCTGACA CCCTGGAGGG GCATCGAGCG CGCCTGCTGC AAGCGGCCGA TCTGGCGATC
ATCCTGGCGC GTGAGTCGCG GGTAGCGGAG TGGGCGGAGG CGCTTGCCCG ACAGGCATAT
AGCCTGCTCG ACGATATGCC GCACGACGGG AAACCCGCTG CGCCGGAGGA CCCGCAGTTC
CGCACCGTCG TCGAGCGGTT GTCGGGTATG TGCATTGATC TGATCGCCGA GATGGATTTC
CGCTTCCTCT ATGACGAACG CCGCCGTTTG TTCGCCATTG GGTACAACGT GAGCGAAGGG
CGACGCGACA ATTCCTACTA CGATCTGCTC GCATCCGAAG CGCGCCTGGC GAGTTTCCTC
GCAATCGCGC TTGGAGAAGT TCCACAGGAG CACTGGTTCT ACATTGGGCG CAAGATTTCG
CCGGCAGTGG CGACGCCGAC ACTGCTCGCC TGGAGCGGTA CGATGTTCGA GTATCTGATG
CCGCTTCTGG TTATGCGCAA CTACCCCGAA ACGCTGCTCG ACGCCACGTA CCGCGGCGCA
GTTGAACGTC AGATTGCCTA CGCCGCCGAA CGCGGCATTC CATGGGGGAT TTCTGAGTCG
GCGTTCAACC TGCGCGACTC GCAGATGAAC TATCAGTATC GCGCATTCGG CGTACCGGGG
CTTGGGTTGC AGAGCGGGCT GGCGAACGAC CTGGTGGTTG CGCCATACGC CACGCTCCTG
GCGTTGCAGG TCGCGCCAAA CGAGGCAGTC GCCAATCTCC GCGCCCTGAT GGAGTATGGC
GCATTCGGGC ACTATGGCTT CTACGAAGCG ATCGACTTTA CGCCGGCCCG CCTGCCGCCC
GGCGTGAAGA GCGCCATTGT GCAGACGTAT ATGGTCCATC ATCAGGGCAT GAGTCTGCTG
GCGCTCGATA ATGTATTGAA CGACAACCTG ATGCAGCGTC GCTTCCACGC CGAGCCAATT
GTGCAGGCGA CCGAATTGCT CTTGCAGGAG AAGATTCCTG CCGACCGACC GCTGCCGCTG
CCGCAGGAGA GCGCCGTCAG TGAGATCGCA GTCACGCCGG ATGTTGTGCT GGAACGCCAC
TTCACGACGC CGCATACTGC GGTTCCATAC GCTTACATCC TCTCGAACGG CGTTCTGACT
TCGATAGTGA CGAATGCTGG CAGTGGAGGC TGCCGCTATA CCCTGCCCGA ACGCACGACG
TCGGTCGCCA TCACGCGCTG GCGTCCCGAC CCGACGCGCG ATGCGTCCGG CAGTTTTGTG
TATGTGCGCG ATGTGCGTGG CGGCGCAACA TGGTCGCCGA CGTACCAACC GCTGCGCGTC
CTCGGCGATG AGTATCGGGT CACCTACGGG GCAGGGCGGG TCGAGTTCCG CCAGCAGTAT
GCAGGAATCG ATACACGCCT GGAGATTGCG GTGTCCCCCG AAGACAATGT CGAGGTGCGC
ATGTTGACAC TGGTCAATCT GACGGCGCAG CCGCGTGAAC TGGAGATCAC CGCCTATACC
GAGATTGTGC TGGCGCCTGA CGTTGCCGAC GCTGCACATC CGGCCTTCTC GAATCTGTTC
GTCGAGACAG CGTTCGACCC GCAGAGCGAG GCGTTGCTGG CAACACGCCG TCCGCGCTCA
CCGAATGATG AGCGTCTGTG GGCAGCACAA GCGATTGGTG TGCGTGGCCG CGCCCTGGGA
GAAGCAGAAT ACGAAACGAA CCGCCTGACG TTCATTGGAC GTGGGCGCGA CCCGTCACGT
CCGCAGGCGC TCGACCGTCC GCTGAACGGC AGCGTCGGCG CCGTGCTCGA CCCGATCTTC
AGCCAGCGCC GCCGGGTACG GATCGTGCCG GGCGGACAGG CGCAGGTGAT CGTGACCATG
GCGGTCGCGG CGACACGCGA TGATGCGCTG CGCCTGGCGG CGCACTATCG CGATGCCGCC
ATTGCTATGC GCGCCTTCGA TATGGCGCGC ACCCAGGCGC AGGTCGAGCT GATGCACCTC
GGCATCAACG CCGATCAGGC GCACCAGTTC CAGCGCCTGG CATCGCTGGC GCTCCTGCCC
GACCCGGCGC GCCGTGCGTC GGCCGAAACG CTGGCGCGCA ATAGCAAAGG GCAGCCGGGA
CTGTGGGCAT ACGGCGTCTC CGGCGACTAT CCGATCGTGG TTGGGCGTAT CGCGCCAACG
TCCGATACCT CGCTGGCGCG CACTCTGATC CAGGCGCACG AGTATTGGCG GCTGAAAGGA
GTACTGATCG AACTTGTGCT CCTGGTCGAA GATAGCGCCG ATTATCGGCA GGAACGACAC
GAGCAGGTCA TGAGTCTGGT GCGCAGCAGT CGGTCGAGTC GCTGGCTGAA CCAGCGCGGC
GGGGTGTTTG TGTTGCGCAC CGGAGTGATG CCCGATGCCG ATCAGGTGCT GTTCGAGACG
GTGAGCCGGA TGACGCTCTA CAGTCGGCGT GGTGATCTTG CCTACCATCT GCGTCGCCGC
CTGCCCGACA AGGCGCCGCC GCCGCCAACG GTTGCGCTGC CGGTCGGCGG CGAGCCGCTG
CCGGCAGAAG ATCTGACGCT GACGACAGAG TATGGCGGGT TCACACCCGA TGGGCGCGAG
TATGTGATTG AGGTGACGCC CGACAAGCCA ACGCCGCTGC CATGGGTGAA TGTGGTTGCC
AACCCACGCG CCGGATTCAT TGTCTCGGAA AGCGGCGGCG GCTACACCTG GGCGGAGAAT
AGCCGCGAAA ATCGCCTGAC TCCCTGGTCG AACGATCCGG TCAGTGATCC GCCTGGCGAG
GCGCTCTACC TGCGCGACGA ATCGAGTGGC GCGCTCTGGT CGCCGCTGCC GCGTCCCTGC
GGCGCAGGGC GCGTGCGCGT CTGCCACGGG ATGGGATACA CGCGCTTCCT GCAACGGTAC
GATGGCATCG AGAGCGAAAC GACCCTCAGC ATCGCTCCCA ACGATCCGGT GAAGATCGTC
CGGCTGCGTC TGCGCAACCG ATCCGATCGC CAGCGACGCC TGAGCGCAAC GATGTATGTC
GAGTGGGTGT TGGGCGTGCT GCGCGAACAG ACGGCGCTCT TCATCGTGAC CTCGACTGTG
CCGGAACTGA GCGCGCTGCT GGCGCGCAAC ACCTATAGTC ACGATTTCGC CGGGCGCGTC
GCTTTTCTGG CATGCAGCGA GCCAGAGGCG CTCTTCTGCG GCGACCGGGC AGCGTTCATC
GGGCGCAACG GCGACCTGGC GCGCCCGATT GCCCTGGCAG GCGAGCACGA TGGCGCCTTC
GACCATCGCA TCGGCGCCGG TTTCGACGCG TGCGGCGTCG TCTCGACGCG GCTGACCCTC
GAATCTGGCG AGGAACGCGA TATCTTCTTT ATTCTGGGTC AAGGCGCAAA TGAAGACGAT
GCAATCCGCC TGATAGCGCA CTACCGCGAT CCTGTGGTCG CTGCCGCCGC AATCGCGGAG
ACGATTGCCT GCTGGCGCGA TCTGGTGGGC AGACTTCAGG TGCGCACCCC CGACCCGGCG
CTCGACGTGC TGCTCAATGG CTGGCTGATC TACCAGACCC TCGGCTGCCG GGTGTGGGGA
CGCTCGGCGT TCTACCAGTC GGGTGGCGCG TATGGCTTCC GCGATCAGTT GCAGGATGTG
ATGGCGCTGA CGATGATCGA GCCATCGATT GCGCGCGAAC ATATCCTGCG CGCTGCGGCG
CGGCAGTTCG TCGAGGGCGA TGTGCAACAC TGGTGGCACC CGCCGCTGGG GCGCGGCATT
CGCACCGCGT TCTCCGACGA CTATCTGTGG CTGCCGTTCG TCGTGTGTCA TTATGTCGAA
ACAACCAGCG ACCACGCACT GCTCGATGCG GTGGCGCCCT ACCTCAAGGG GCGACCACTG
GCGGAAAACG AAGCCGAATA CTACGATCTT CCCGAACCGG CGAACGAGCA CGGCAGCCTG
TACGAGCACT GCATCCGCGC AATTGATCGC GCACTGACGC GCATGGGGGC GCACGGGCTG
CCGCTCATGG GTGCGGGCGA CTGGAACGAC GGGATGAACC TGGTCGGGCA CGAAGGGCAC
GGCGAAAGCG TCTGGGTCGC CTGGTTCCTG ATCGTCATTC TGAACCGCTT CGCACCAATT
GCCGAACAGC GGCGCGATGT CGAACGCGCA GCGCGCTACC GCGCTGAGGC GCGCCGGCTG
AGCGAAGCGT TGGACCGGCA CGCCTGGGAC GGCGACTGGT ACCTGCGCGC GTTCTACGAC
GACGGCGCGC CGCTTGGTTC AGCGCAGAGC GATGAGTGCC GCATCGACTC GCTGAGTCAG
TCGTGGGCGG TCGTTGCCGG AACCGCCGAT CCAACCCGCG CGCGGCGAGC GATGGAGGCA
GTCGATATTC ATCTGGTTGA TCGCAAGACC GGGATCATCA AACTGTTTAC TCCTCCCTTC
GACCAGACGC CGCGCAACCC CGGCTACATC AAGGGGTATA TTCCCGGCGT GCGCGAAAAC
GGCGGGCAGT ACACCCACGC CGCCATCTGG GTGGTTTGGG CATGGACGCT GCTCGGCGAC
AACACGCGCG CCGGCGAGTT GCTGCGCATG CTCAACCCGG TGCGACACGC GCAGCAGAAC
GGGCGCGTGT ATGCGGTCGA ACCTTACGTC ATCGCAGCCG ACATCTACGC TGCACCGCAA
CACCTGGGGC GCGGTGGCTG GACGTGGTAC ACCGGGTCGG CAGCGTGGTT CTATCGTCTC
GGTATCGAGC GCATCCTGGG CATTCAGCGC TACGGCGACT ATCTCACCCT GACGCCCTGC
ATGCCTCCCG ACTGGCCCGG CTACGAAGCA TGGTATCACT GCGGTTCGAG CGACTACCAC
ATTATCGTTG AGCGGAGCAG CGGCGATGGC TATGCGCTGA CGATTGACGG CGCGCCGGCG
AGCGATGGGC GCATCCCGCT ATACGACGAC GGACGCGAAC ACATCGTCCG GCTGGCATTG
CCCGGGTCGG GAGAGGTCAC ACCCTCACGC GACGGCAGAC AGGCGACAGC GAAACCGGTG
GAAAAAGGGG AATAG
 
Protein sequence
MNPVPFASDR SHQLAALAAA VQRSFPWVRR RPPLSDRPIR GVILTTEQLE ARARTLAASH 
TIVMQPGRAG ALLASVDRNH RLLQLAYQTL AADAAQHQPL TPAAAWLVDN YHVIVGQIRE
IRQDLPGGYY HELPKLKGGP HNGKPRVYAM ALELVEHTDG RIDLEQLTRF VLGYEAVAPL
SIGEIWAIPI MLRVSLIQNL GRLARLMLDE RHLRLEGAAW AERILAQKDV GSFEGNTAFR
QLARTHPQLP LPLAVELIQR LRNQEGEFDI ARLMVWIEQQ PSIPYNTAEE IILAEQRRQS
ANQVSVANTI TSMRTMDAVD WPDWFERVSM VEQILRQDPA GAYGRSTFAT RDRYRHELER
LSRRSGLRED AIARRLIRVA AQAREAGRPL RETHIGYYLV DEGRTAFEMA LGCRLTPGEV
AHRAVLRHPE AVYFGAIAAG TVALTVVSRR LAQSDNGRRA SGAPPVLTAA LSLTATLPAF
ALAKELVDRA VTRLTPPRVL PRLDFRDGIP RELRTIVVVP TLLLTPDSIR TQIESLEVLA
LANQDPHLHF ALLTDFADAP QPHMPEDEAL LALAVERIQA LNERYGSDRF FLLHRRRVWN
ERQGCWMGWE RKRGKLEEFN RLLIGATDTT YETFIGDLSI LPQIRYVITL DADTQLPRDA
AHRLVGTLAH PLNQAVIDPQ TRRVVKGYGI LQPRVGIDLP SALRSRFARI SSGNVGVDPY
TTAVSDVYMD LFGEGIFAGK GIYDPVAMRE TLHDRFPDNT LLSHDLIEGC YARAALLSDI
ELLDSYPTTY AAYSARQHRW VRGDWQIAGW LLPRVPRASG GYAPNVLPPI SRFKIADNLR
RSLTPPATLA VLIAGWFALP GRPAVWTLLA LGHYLAPLTF AILNTRLAPT DWRYLHVSLI
AAAENLRWPV LQLLLNVAML PDQAWLNLDA VARTLWRMGV THRRLLEWET AAQAQRRLTD
SFDYLIKRMG PSAVVFVALA AVRFERLRDA WVPALPVTLA WGAAPFFARW LDQAYVPRRI
EPLSANDRRM LRRVARATWA YFERFVVPEQ HFLAPDNFQE TPRPVVAERT SPTNIGLQLL
SDLAAVDFGY LGVRSLAERV GRVFETIQQM ERFRGHLYNW YDTRTLHPLS PLYVSTVDSG
NFAGHLITLR QGLLALAERP PYGPWIIEGL RDTLEIIQER LPADARGRTS LVALLQALDA
TPDTLEGHRA RLLQAADLAI ILARESRVAE WAEALARQAY SLLDDMPHDG KPAAPEDPQF
RTVVERLSGM CIDLIAEMDF RFLYDERRRL FAIGYNVSEG RRDNSYYDLL ASEARLASFL
AIALGEVPQE HWFYIGRKIS PAVATPTLLA WSGTMFEYLM PLLVMRNYPE TLLDATYRGA
VERQIAYAAE RGIPWGISES AFNLRDSQMN YQYRAFGVPG LGLQSGLAND LVVAPYATLL
ALQVAPNEAV ANLRALMEYG AFGHYGFYEA IDFTPARLPP GVKSAIVQTY MVHHQGMSLL
ALDNVLNDNL MQRRFHAEPI VQATELLLQE KIPADRPLPL PQESAVSEIA VTPDVVLERH
FTTPHTAVPY AYILSNGVLT SIVTNAGSGG CRYTLPERTT SVAITRWRPD PTRDASGSFV
YVRDVRGGAT WSPTYQPLRV LGDEYRVTYG AGRVEFRQQY AGIDTRLEIA VSPEDNVEVR
MLTLVNLTAQ PRELEITAYT EIVLAPDVAD AAHPAFSNLF VETAFDPQSE ALLATRRPRS
PNDERLWAAQ AIGVRGRALG EAEYETNRLT FIGRGRDPSR PQALDRPLNG SVGAVLDPIF
SQRRRVRIVP GGQAQVIVTM AVAATRDDAL RLAAHYRDAA IAMRAFDMAR TQAQVELMHL
GINADQAHQF QRLASLALLP DPARRASAET LARNSKGQPG LWAYGVSGDY PIVVGRIAPT
SDTSLARTLI QAHEYWRLKG VLIELVLLVE DSADYRQERH EQVMSLVRSS RSSRWLNQRG
GVFVLRTGVM PDADQVLFET VSRMTLYSRR GDLAYHLRRR LPDKAPPPPT VALPVGGEPL
PAEDLTLTTE YGGFTPDGRE YVIEVTPDKP TPLPWVNVVA NPRAGFIVSE SGGGYTWAEN
SRENRLTPWS NDPVSDPPGE ALYLRDESSG ALWSPLPRPC GAGRVRVCHG MGYTRFLQRY
DGIESETTLS IAPNDPVKIV RLRLRNRSDR QRRLSATMYV EWVLGVLREQ TALFIVTSTV
PELSALLARN TYSHDFAGRV AFLACSEPEA LFCGDRAAFI GRNGDLARPI ALAGEHDGAF
DHRIGAGFDA CGVVSTRLTL ESGEERDIFF ILGQGANEDD AIRLIAHYRD PVVAAAAIAE
TIACWRDLVG RLQVRTPDPA LDVLLNGWLI YQTLGCRVWG RSAFYQSGGA YGFRDQLQDV
MALTMIEPSI AREHILRAAA RQFVEGDVQH WWHPPLGRGI RTAFSDDYLW LPFVVCHYVE
TTSDHALLDA VAPYLKGRPL AENEAEYYDL PEPANEHGSL YEHCIRAIDR ALTRMGAHGL
PLMGAGDWND GMNLVGHEGH GESVWVAWFL IVILNRFAPI AEQRRDVERA ARYRAEARRL
SEALDRHAWD GDWYLRAFYD DGAPLGSAQS DECRIDSLSQ SWAVVAGTAD PTRARRAMEA
VDIHLVDRKT GIIKLFTPPF DQTPRNPGYI KGYIPGVREN GGQYTHAAIW VVWAWTLLGD
NTRAGELLRM LNPVRHAQQN GRVYAVEPYV IAADIYAAPQ HLGRGGWTWY TGSAAWFYRL
GIERILGIQR YGDYLTLTPC MPPDWPGYEA WYHCGSSDYH IIVERSSGDG YALTIDGAPA
SDGRIPLYDD GREHIVRLAL PGSGEVTPSR DGRQATAKPV EKGE