Gene EcSMS35_2659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2659 
Symbol 
ID6144015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2719950 
End bp2728025 
Gene Length8076 bp 
Protein Length2691 aa 
Translation table11 
GC content56% 
IMG OID641617530 
ProductRatA-like protein 
Protein accessionYP_001744695 
Protein GI170680333 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.319254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.262746 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGTAT GTCTGAAACC AGGAAAAATC ATTGTGCTTC TGGGCATGTT GGCAGCCTTT 
ATGCTGTCTG ACTTTGCACG GGCAGGAGTG GAATGGCAGA CATATCCTGG TTCTACAGGC
GAGTTTAACG GTACAGTTCC TATTGCCGAT AGCGCTAGCG TACCCGTTTA TCAGGGAAGC
GTTCAGCTTG ATCCTGCTGC ATCTCATGAT GTGGCGTTTT CTGCTAAACC GAATGAATTT
AGCGTTGACG ATGACGCTGC TAACCTGATA GTCGCTAATC CTCAGGACAG TGAAGGCGAC
CAGTTTTCCA CACCACCTGC GCTGCGCTGG GAAAACCAGA CGCCGCCGAC CGTCAGTCTG
GTGTGGGCGG ATGCCGCCAC GCCGGATACG CCGCTCAACC CGCAGCCCAT CGCCAACCGC
AGCTTTTGCG CGCAAGGTCT GGCCGGGCGT TCGCTGGTGG CCTGGCCGCA GATAGATACA
CAACAAACCA TTCCACTGCT TTACCTGCTG ACCTCGACCG GTTATCCCTA TGAGGGGACG
GTGGCGCTGG CTGAACAGAA AGTGACGCTG AACATCGCGC CCGCACAGGG CGATTTAATT
TCGGTGAGCG CCAGTGGTTA TAACGAAACG ATCGCGGCGG CAAAAACCAC CGTTGGTGGC
ACTATCACGC TGACCGTCAC CACCAAAGAT TGTCAGGGCA ACGTCGCGGG TAATATTCCG
TTTATTATCA AACGTAAAGA TGCCCAAAAC CGTCAGGGCG CGGTGAATAA TACCGCGCCG
GTGGTGCTGG GGAGTACCGA GTTAACGACG ACGGCAACCG AGTATCGTGG CACATCGGAT
GCCAACGGTA CGGCAACCAT CACGGTGACC CAACCTAACG GCCCCGGCGT CAAAACGCCG
CTGGTGGCGA GTATTTCCGG CATTGCGCAA ACCAGCGAAA CGGCGGTGAT TTTTACCGTA
CTTACCAGCC CGGACGTGGC GCAGGCCACG ATGTGGGGCC ATATGGCGGA AACGGTTGAG
GCACACGGCT ACACGTTCAG CAGGCCAAAA CTGGCGGCGG AAGTCAGCAA TGAAAATGCC
ACGGTCGTCG ATCACAACGA AACCTGGTCC ACTTTCACCT GGAGCGGGGC AGACAGCCAT
TGCACCGTGC TGCCGGGCAT GCGCCATTTC GGCGCACTGG CGACGGTGAT TCCCTCCACG
GTACAGACTG TCCTTGGCTG GCCGATGCAG GATGATTATT ACTGGACATC GCTGGCGGGC
ACTACCGGGC AGCACCATGC AGCGGATGTC TCTAACCGTG CTGAAGCGCA AAAGCCGGAT
AGCACCACGT TTCTTGTAAG CTGCGTGGAC AAACCCGCGC CTGACGTCGA ACCGAAAATC
GTCCTTACGC CAGAAAACTA TGATGATACG GCACAGGCGA TGAAAGCGAA GGTGGGTGAG
GATGCCACGA TGCGTCTGAC GATCACCGAC AATAAAAACA ACGATCAACC GCTGGCGTAT
TATTACTTCT CGCTGCATCT GGATGACGGC GTTAACCGTA AGAATCAAAC CGACTCCGCC
TGGGAAGCCC ATCCGGTGCA GATTACTGGC GGCAGTAATT TTCGCCAGGT CGACGCACAT
ACCTATGAAG GCATGACCGA CGCCAACGGC CAGGCTTCGC TCACCCTGAG CCAGCCCGGC
GGCGCGGGGG TGAAAACCCA TATCACGGCG AAAATGCGCA GCGATTTTAA CGCCACGGAT
GCCAAAGATG TGATTTTTAC CGTAATCACC AGCCCGGACA GCGACAAAGC GCGGATGTGG
GGCCATATGC GCGGCATTAT TGAGTCCGGC AGTCTGTATA AACGCCCGTT GCTGGCGGAT
GAAACCGAGC ATGAACTGGG AACAGTGCGG GAAAACAACG AAGACTGGGC GTTGTACGAT
CAAAACACCA GTATGCAGGC GGAATGCGGG GTGGGTCATA TACCGCGTCA GAGCTCACTG
GAAAGCCTGT TTTCAGCCCA TCCCGGTAAC GCGATTGGCA CTGAGTATGG CTGGCCTACC
GCACAACAGA GCTACCTAAG CGCGGTGGAG CAAGAAACGC ATTCATCGGT GAATCTGGGG
AACGGGAGCG TCGACAGCTA TTCCGGCTTT AAGCAGAATT ACCTCTCCTG TTCGGGTAAT
GAGATGGTAG CGAATGTTGA AGTCAGCACC GATCATGATG TGTCTGTTGG CACCCAAGCG
CAAGCCAAAG TGGGTGACAC CATCGTAATG ACCGTGCGTA CGATTAACGC GCTAAATAAT
TCCCCGGTAC CCTTTAGCGC ATTCACCATT ACCAAAGGTA TGGGATATAA CCGAGCGGGA
CAGGTGAGCG GGTTTGATGA CCCGAGTAGC GGCTCAATCA CGATGGATAA CTCGCAATAT
GGTACATCAC AGCCATCGAT GGTTTATGCG GGAACAACCG ATGTCCGGGG CGTGGCGACG
GTCGAGATTA AGCAACCACA GGGCGTCGGG CTAAAAACGG TGCTGAGCGT AACACCGGTA
AATTCCTACC TGCCGAATAC CGTTAACTAC AGCGTTATTT TTACTACCCC AACCAGCCCG
GACGTTTCCG GCGCGCAGAT GTGGGGCCAT ATGGATGAAA CGATTACCGT TGATTCATCA
ACCTTTACCC GGCCTAGGCT GGCGGCGGAA ATCGCCAGCC CGGACGGAAC CCTGACTGAA
AATAACGAAA TCTGGGCGCG CGTGAGCCAG GCCAATACGT CCAGTACCAG CAAAGGCGGC
TGCGGCACCA ATATGCTGCC GCGTCGCTCC CAGCTAAGCG CGTTGTATAG CGCCAACAGT
GGTAATGCAG TGCAAACGAC GCACGGCTGG CCAACGCAGC GTCAGCCCTA CTGGAGCAGC
TCTCCGGCGG ATGTGACGCC GCACTTTTTC ACGATAGCGC TGAACGATGG CGCACAAGCT
ATCGGCGGCG ATACGCCGGT TTACGTCAGT TGCCTGACGA CGGCGAACAA ACCTGCCAGC
AGCATTACGC TGGAGGTGGT TGATAAAGCG CAATGGAATG CCGGGAATAA CGCGGCAACG
CTGAAAAAAG GCGAAACGCT ACAGGTTAAG GTGACGGTAA AAGATGCACA GGGCAACGCG
TTAGCGGATA TGCCATTTAC CCTCAGTCGC GGTGATGGCT ATACCCGTAG CGGCGAAAAA
CATATTGCCG GTAGCGGCGA TGCGCTGGTG GCGCCGGTGG TGGTCAATGG CGGGCTGGCC
GATGAAACAA CGTTGAATGA CACCGCAACG GTGTATACCG CCATGACCGG CAGTGACGGC
AGCAAAATTC TTAATATTAC CCGTCCGGAT ACCCACGGCA CCAAAACGGC GCTGACGGCA
ACGCTTTACT CCGATGCGAC GAAAAAAGAC AGCCTGGACA CGATCTTTAC GGTGGTTACC
AGCCCGGATA GCAGCCAGGC GAAAATGTGG GGGCATATGC CGGAAACGGT GACGGCTGAG
GACGGTACGG TCTTTAAACG CCCGCGGTTA CTGAAGGAAC TGAGCAGTCA GACTGGGCGA
ACATCAACGC TTGAAGACAA CGAAAACTGG GCGTTGTTTA ACATTAATTA TGCGAGCTCT
TCCACGACGT ATAGTGGCTG TGGAACCAAC TATATTCCGA CTCAGGCCGG TCTGACGTCA
CTGTTTGCTA ATAATGCGGG CAACACCATG AAAACCGTGC AGGGCTGGCC GGTGGCAACG
CGCTATTTGT CCAACACGTC CGACAATGGC AGCATGGAGC AGCGTAACTA CAAGGCGGTC
GATCTCAGCA ATGGCACGAG TGCGGCAGTG TCATCAACGG CGCTGCAACT GCTGACCTGT
CAGACTACAC CGGTAACCAC GGTGAGCCAA ATCCTGCTGG AAGCGGCAGA TCCGGCGACG
CTGGATACCA CCTATAACGT GGTAAAAGCG AAAAAAGGCG AAGAATCCGT GGTGCGCGTC
ACCACCAAAG ATGCACAAGG CAACCTGGTG GGCAATACGG CATTTATCCT GACGCGTGCC
AACAGCGTGA GTCGGGCTAA TGCATCAGCG ACGATGAGCG TGGGCTCGTT GACGGTGACG
GATGCCTGGG GCAATACGCG AAATAATTTC CAGTCGACAA GTGAAACCAT TTACGGCGTG
ACCGGTGCCG ACGGGTCAAC GACATTGACC CTGAAACAGG ATAACAGTAC CGGGTTAAGA
ACCGATCTCA CGGCAAAACT GGATACGTCC AGCAGCGTCA AGGCAACGCT ACCGGTGGTC
TTTACCGTCG TGACCAGCCC GGATTCGCCA AAAGCGAATT TTTGGGGACA TATGGCCGAA
ACGGTGGCGG CCTCCGATGG TTCAGTCTAT AAACGCCCAC TGTTGCTGGC AGAGCTGGCC
AACACTGGCG GGCGACAAAT CAGTTCAGAG AACGGGGAAA GCTGGGTACG ATTCACCTGG
AATCAGTCGA CAGATCCTTC TGTAAGCGGC TGCGGCGTTG CGTATATGCC AACGTTGGCA
GGCCTGCAAG CGTTGTATGA CGCTAATAGT GGTAATGCAA TGAGTACGGT ACAGGGATGG
CCGGTAAAAG CGACCTATCT CACCAATACC CCAAGTGACA CGCAAACGGG GAGCCGTTAC
TACAATGTGG TGCAACTGGA TTCTGGCGCA GCCTCGCAAA TCACGACGAA CACGGGTGTA
CTGCAAACCT GCCGTACAAC GCCGTTGACG GCTGCCAGCC AGATTACGCT GGAGGCCGCG
GATCCTGCGC AATTTGTCAG CATTGACAGC ACGCTTAGCG CTGTCAAAGT GCAGAAAGGC
GATAGCGTGC CGATCCGCAT TTCGACTAAA GATGCGCAGG GCAATTTCGT CGGCAATACG
CCGTTTGCCC TGAAACACGC GAACTCGATC AATCGTCAAA ATGTGTCGTC ATCGCAGAAA
GTGTCGGTCA CGACGGAGGT GGGTACGACG GTGGATACCA GTGCAACAAC CACCCTGTAT
GGCGTAACCG GGCCAGACGG AACCACCACG ATGACGCTCC GACAGGATGC GTCCACCGGG
TTAAGAACCG ACTTTTATGC CCTGTTGAAC GATACCGGCG TGTCGTCGGA CACCTTGCCG
GTGATCTTTA CCGTCATCAC CAGCCCGGAT ACGCCGCTGG CGGCCAACTG GGGCCATATG
GCGGAAACCT TTACCAGTAG CGAAGGCGTC ACCTTCAAGC GGCCTTCCCT CAAAGCTGAA
CTCTCTGGCG GTACGGCCAT AACGGTCAAT AATGAAGTCT GGTCGCGCCT GACAGCGGCG
GAGAAAGTGG ATGCCAGCAA GGCCGGGTGT GATGAGACGT ATCAACCGCT GGTGAATGAC
ATGCAGGGGT TGTATGCGGA TTACCCTAAC GGCCAGTTAG GATCGGTGCT CGGCTTGCCG
ACGTCTGCCG GGTATTGGTG GGCCTACGAC ATGATGATGG TTTCTGGCAA CTGGACTAAT
CAGGCATTTT CACCTGTTAA TGGTCAGTTG ATGCAGGCAT CGTCCAGCTA TACGGCGATC
GTGATGTGTC TGGTAGAACC ACATACGGAA GCAGCCACAA TAGAGCTTAC TTCAACGGCG
CAGGACGCAG CAAAATCGGC CAGTAATGGT GGACGACCGA GCGCCGTAGC GAAGAAAGGC
GAGACGATTC CGTTGACGGT GACCGTGCGT GATAGCGCCG GGAACCCGAT GCCATATGCT
GAGTTTACCC TCACGCGTGA AACGACGCTG GATCGCTCGA AAGCAACCGT TAACACGAGT
GCGGACGACC TTACCGTAAC GGCATTGGTA CCCGCGAATA CAAATAGTGT GTTGGCAGCA
AGTGGAGCAA AACTGATCGG CACGACGGGC AGCGATGGCA AAGCGACCTT TGAGGTGAGT
CAGAATGCGA CGACCGGGCT GGCGACGCCG TTAACCGTGA CGCTGGCGCG TGATACGACG
AAAGCCGCCA CGCTGGATGT TATCTATACG GTGATCACCA GCCCGGATTC GCCGAGTGCG
AAATTCTGGG GGCATATGCC GGACACCTTT ACCAGCTCGA CGGGCGTCAC CTTCAAACGA
CCGCTGCTGA AAGCAGAAAC CAGCTCAGGC TCATCGATTA GCTCTAACGG CGAAGTTTGG
TCATATATGA GCAACGCGCA AAATTTAACA TCGACAGATT GCCCGCTCGA AAATCAACCG
CGCAGTAATG AGCTGTTGGA TTTGTATAGT GATCACCCGA ACGGCACGCT AATGACCGAC
CTGGGGTTAC CCGTATATGC AGGTAACTGG TGGGCCTATG ATATGGTCAT GCTGAGTGGA
ACGTCGTGGA CTTATCAACT CATCAGCCTG AAAGACGGAA CCATTACGCA GAAGGGGACA
ACCAGTGCGT TGATGCTTTG CCTGGCACAG CCGCATCCGG CCGGCGTCAG CGTAACATTG
ACGTCATCCG CACAGGATGA CGCCAGAACC ACCAGCAATG GCGGCCGTCC CAGCGCGTCA
GCGAAAAAAG GTAACTCAAT ACCGATTGTG GTAACGGTAA AAGATCGTGA TGGCAACCCG
TTGGCGGGCG AAGCGGTGAC GCTCAAACGT GATTCAGCGT TGAGCCGTTC AGGAACAATC
GTGGGGACGC CAGCGAATGA AATAAAGTTG ACGGAACTGA CGCCCGTGTC GGCGACATTC
CCGCTGGTTT CAAATGGTAC ACAATGGCTT GGTTTCACCG GTAGCGATGG TACGGCGACA
TTCAATGTTG AACAGCCTGA TACTGTCGGC CTGGCAATCC CCTTTAACGC CATCCTGGCC
CGTGACACGG CAAAGGTATC TACTCTGGAT CTGATTTTTA CCGTCCTCAC CAGCCCGGAT
GCCGCATTAG CAACTTACTG GGGCCATATG CCGGATACCG TCACGGCGGA AAACGGTGCT
GTCTTTGAGC GCCCTAAACT GTGGAAAGAG CTGCCGTCGA CATCCGGGGT AAGTCAGATA
ACGAATAGCA ATGAAGCCTG GCCAGTGTTT AACGCAACGC AGAGGGCTGA CAGCAATCTC
AGCCCTTGTG AAGCGGCGCG TCGACCATTG AGCGATGATT TAGAAGCGCT CTATACACGT
TATCCAGGAA ACACGATTAC CGCGCAAATT GGCTGGCCTA CGCAGTATGC ATGGTGGGCG
GTGGATAACT GGGAAGGGGG CAGTAATCCG CAGGCCGTCA GCCTGGTAAA TGGTTCGAGA
GGCCAATATA ATCCCGCTCA TCAGGCCTGC CTGGTTAACC CGCGTGCGAG CGTTTCCAGC
GTCACGCTGA CATCAACGGC GCTGGATGAC GCCACACAGG CGGCGACGGT GAAAAAAGGC
GACGCCGCGC CCATTGTCGT TACCGTAAAA GACAGCGCCG GGAAGCCGGT GCCGAATATC
GCTTTTATCC TAAAACGCGG TGAGGCCGTG CCGCGTAACA GCGGGGCGAC GCTGTATGGC
GATGTGGATG CGATGGACGA TTTGACTGTG CAACCTTCAT CCGGTGCGGC AGTCACGTTG
GCTGACAGTG GAAATACCAT CGACGGAGTC ACGGGAGCGG ACGGTACGGC AAGCTTTACC
GTTCGTCAGG ATAATACGCC AGGATATAAA ACGCCGTTGA CGGTGACGCT GACGGATAAC
GCCACCATAA CGGCGACGCT GGACACGATT TTCACCGTGC CGACCAGCCC GAATGTCGCC
ACGGCGTATT TCTGGGGTCA TATGGCGGAT ACCGCCACGG TAAGTGGCAA GACGCTGCAT
CGCCCGCTAT TAAAATCGGA GTTGCCGTCA GGCGCTACCG CAGCAGCAAC GCCTGATGTA
AATAATGAAA CCTGGGCACT GGCGCATGTT ATGGATTCCT CTAAATGGGA TGTCGCGCAG
CAGTGTGGCA GTATGAATAA CGTGCCGAGC AGTGCCGAAT TACAAACATT ACACTCCGGC
TTTAGCACAT TGGGATGGCC GTCCTCGATA AGCTTCCCGT ATTTATCTAC TGATAAGGCT
GGGTCGTTCT ATTGTGGTGT TGATGAAGGC TCTGGCAGTC TCAACTGTGG CATTCAGCCT
GCGAAAACGC CGGGGTTTGC AACCTGCTTC CAGTAA
 
Protein sequence
MSVCLKPGKI IVLLGMLAAF MLSDFARAGV EWQTYPGSTG EFNGTVPIAD SASVPVYQGS 
VQLDPAASHD VAFSAKPNEF SVDDDAANLI VANPQDSEGD QFSTPPALRW ENQTPPTVSL
VWADAATPDT PLNPQPIANR SFCAQGLAGR SLVAWPQIDT QQTIPLLYLL TSTGYPYEGT
VALAEQKVTL NIAPAQGDLI SVSASGYNET IAAAKTTVGG TITLTVTTKD CQGNVAGNIP
FIIKRKDAQN RQGAVNNTAP VVLGSTELTT TATEYRGTSD ANGTATITVT QPNGPGVKTP
LVASISGIAQ TSETAVIFTV LTSPDVAQAT MWGHMAETVE AHGYTFSRPK LAAEVSNENA
TVVDHNETWS TFTWSGADSH CTVLPGMRHF GALATVIPST VQTVLGWPMQ DDYYWTSLAG
TTGQHHAADV SNRAEAQKPD STTFLVSCVD KPAPDVEPKI VLTPENYDDT AQAMKAKVGE
DATMRLTITD NKNNDQPLAY YYFSLHLDDG VNRKNQTDSA WEAHPVQITG GSNFRQVDAH
TYEGMTDANG QASLTLSQPG GAGVKTHITA KMRSDFNATD AKDVIFTVIT SPDSDKARMW
GHMRGIIESG SLYKRPLLAD ETEHELGTVR ENNEDWALYD QNTSMQAECG VGHIPRQSSL
ESLFSAHPGN AIGTEYGWPT AQQSYLSAVE QETHSSVNLG NGSVDSYSGF KQNYLSCSGN
EMVANVEVST DHDVSVGTQA QAKVGDTIVM TVRTINALNN SPVPFSAFTI TKGMGYNRAG
QVSGFDDPSS GSITMDNSQY GTSQPSMVYA GTTDVRGVAT VEIKQPQGVG LKTVLSVTPV
NSYLPNTVNY SVIFTTPTSP DVSGAQMWGH MDETITVDSS TFTRPRLAAE IASPDGTLTE
NNEIWARVSQ ANTSSTSKGG CGTNMLPRRS QLSALYSANS GNAVQTTHGW PTQRQPYWSS
SPADVTPHFF TIALNDGAQA IGGDTPVYVS CLTTANKPAS SITLEVVDKA QWNAGNNAAT
LKKGETLQVK VTVKDAQGNA LADMPFTLSR GDGYTRSGEK HIAGSGDALV APVVVNGGLA
DETTLNDTAT VYTAMTGSDG SKILNITRPD THGTKTALTA TLYSDATKKD SLDTIFTVVT
SPDSSQAKMW GHMPETVTAE DGTVFKRPRL LKELSSQTGR TSTLEDNENW ALFNINYASS
STTYSGCGTN YIPTQAGLTS LFANNAGNTM KTVQGWPVAT RYLSNTSDNG SMEQRNYKAV
DLSNGTSAAV SSTALQLLTC QTTPVTTVSQ ILLEAADPAT LDTTYNVVKA KKGEESVVRV
TTKDAQGNLV GNTAFILTRA NSVSRANASA TMSVGSLTVT DAWGNTRNNF QSTSETIYGV
TGADGSTTLT LKQDNSTGLR TDLTAKLDTS SSVKATLPVV FTVVTSPDSP KANFWGHMAE
TVAASDGSVY KRPLLLAELA NTGGRQISSE NGESWVRFTW NQSTDPSVSG CGVAYMPTLA
GLQALYDANS GNAMSTVQGW PVKATYLTNT PSDTQTGSRY YNVVQLDSGA ASQITTNTGV
LQTCRTTPLT AASQITLEAA DPAQFVSIDS TLSAVKVQKG DSVPIRISTK DAQGNFVGNT
PFALKHANSI NRQNVSSSQK VSVTTEVGTT VDTSATTTLY GVTGPDGTTT MTLRQDASTG
LRTDFYALLN DTGVSSDTLP VIFTVITSPD TPLAANWGHM AETFTSSEGV TFKRPSLKAE
LSGGTAITVN NEVWSRLTAA EKVDASKAGC DETYQPLVND MQGLYADYPN GQLGSVLGLP
TSAGYWWAYD MMMVSGNWTN QAFSPVNGQL MQASSSYTAI VMCLVEPHTE AATIELTSTA
QDAAKSASNG GRPSAVAKKG ETIPLTVTVR DSAGNPMPYA EFTLTRETTL DRSKATVNTS
ADDLTVTALV PANTNSVLAA SGAKLIGTTG SDGKATFEVS QNATTGLATP LTVTLARDTT
KAATLDVIYT VITSPDSPSA KFWGHMPDTF TSSTGVTFKR PLLKAETSSG SSISSNGEVW
SYMSNAQNLT STDCPLENQP RSNELLDLYS DHPNGTLMTD LGLPVYAGNW WAYDMVMLSG
TSWTYQLISL KDGTITQKGT TSALMLCLAQ PHPAGVSVTL TSSAQDDART TSNGGRPSAS
AKKGNSIPIV VTVKDRDGNP LAGEAVTLKR DSALSRSGTI VGTPANEIKL TELTPVSATF
PLVSNGTQWL GFTGSDGTAT FNVEQPDTVG LAIPFNAILA RDTAKVSTLD LIFTVLTSPD
AALATYWGHM PDTVTAENGA VFERPKLWKE LPSTSGVSQI TNSNEAWPVF NATQRADSNL
SPCEAARRPL SDDLEALYTR YPGNTITAQI GWPTQYAWWA VDNWEGGSNP QAVSLVNGSR
GQYNPAHQAC LVNPRASVSS VTLTSTALDD ATQAATVKKG DAAPIVVTVK DSAGKPVPNI
AFILKRGEAV PRNSGATLYG DVDAMDDLTV QPSSGAAVTL ADSGNTIDGV TGADGTASFT
VRQDNTPGYK TPLTVTLTDN ATITATLDTI FTVPTSPNVA TAYFWGHMAD TATVSGKTLH
RPLLKSELPS GATAAATPDV NNETWALAHV MDSSKWDVAQ QCGSMNNVPS SAELQTLHSG
FSTLGWPSSI SFPYLSTDKA GSFYCGVDEG SGSLNCGIQP AKTPGFATCF Q