Gene EcSMS35_4295 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4295 
SymbolrhaB 
ID6144349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4398407 
End bp4399876 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content55% 
IMG OID641619116 
Productrhamnulokinase 
Protein accessionYP_001746240 
Protein GI170683095 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID[TIGR02627] rhamnulokinase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0758535 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTTC GCAATTGTGT CGCCGTCGAT CTCGGCGCAT CCAGTGGGCG CGTGATGCTG 
GCGCGTTACG AGCGTGAATG CCGCAGCCTG ACGCTGCGCG AAATCTATCG TTTTAACAAT
GGCCTGCATA GCCAGAACGG TTATGTCACC TGGGATGTGG ATAGCCTGGA AAGTGCCATT
CGCCTTGGAT TAAACAAGGT GTGCGAGGAA GGGATTCGTA TCGATAGCAT TGGGATTGAT
ACCTGGGGCG TGGACTTTGT GCTGCTCGAC CTACAGGGTA AGCGTGTGGG TCTGCCCGTT
GCTTATCGCG ATAGCCGCAC CAATGGCCTG ATGACGCAGG CACAACAGCA ACTCGGCAAA
CGCGATATTT ATCAGCGTAG CGGCATCCAG TTTCTGCCCT TCAATACGCT TTATCAGTTG
CGTGCGCTGA CGGAGCAACA ACCTGAGCTT ATTCCACACA TTGCTCACGC TCTGCTGATG
CCGGATTACT TCAGTTATCG CCTGACCGGC AAGATGAACT GGGAGTACAC CAACGCCACG
ACCACGCAAC TGGTCAATAT TAATAGCGAC GACTGGGACG AGTCGCTACT GGCGTGGAGC
GGGGCCAACA AAGCCTGGTT TGGTCGCCCG ACGCATCCGG GTAATGTCAT TGGGCACTGG
ATTTGCCCGC AGGGTAATGA GATTCCGGTG GTCGCCGTTG CCAGCCATGA TACCGCCAGC
GCGGTTATCG CCTCGCCGTT AAACGGTTCA CGCGCCGCTT ATCTCTCTTC TGGCACCTGG
TCATTGATGG GCTTCGAAAG CCAGACACCA TTTACCAATG ACACGGCGCT GGCAGCCAAC
ATCACCAATG AAGGCGGGGC GGAAGGTCGC TATCGGGTGC TGAAAAATAT TATGGGCTTA
TGGCTGCTTC AGCGGGTGCT TCAGGAGCGG CAAATCAACG ATCTTCCGGC GCTTATCGCC
GCGACACAGG CACTGCCGGC TTGCCGCTTC ACCATCAATC CCAATGACGA TCGCTTTATT
AATCCTGACG AGATGTGCAG CGAAATTCAG GCTGCGTGTC GGGAAACGGC GCAACCAATC
CCGGAAAGTG ATGCTGAACT GGCGCGCTGT ATTGTCGACA GTCTGGCGCT GCTGTATGCC
GATGTGTTGC ATGAGCTGGC GCAACTGCGC GGTGAAGATT TCTCACAACT GCATATTGTC
GGCGGTGGCT GCCAGAACGC GCTGCTCAAC CAGTTATGTG CTGATGCCTG CGGTATTCGG
GTGATCGCCG GGCCTGTTGA AGCCTCGACG CTCGGCAATA TCGGCATCCA GTTAATGACG
CTGGATGAAC TCAACAATGT GGATGATTTC CGTCAGGTCG TCAGCACCAC CGCGAATCTG
ACCACCTTTA CCCCTAATCC TGACAGTGAA ATTGCCCACT ATGTGGCGCA GATTCACTCT
ACACGACAGA CAAAGGAGCT TTGTGCATGA
 
Protein sequence
MTFRNCVAVD LGASSGRVML ARYERECRSL TLREIYRFNN GLHSQNGYVT WDVDSLESAI 
RLGLNKVCEE GIRIDSIGID TWGVDFVLLD LQGKRVGLPV AYRDSRTNGL MTQAQQQLGK
RDIYQRSGIQ FLPFNTLYQL RALTEQQPEL IPHIAHALLM PDYFSYRLTG KMNWEYTNAT
TTQLVNINSD DWDESLLAWS GANKAWFGRP THPGNVIGHW ICPQGNEIPV VAVASHDTAS
AVIASPLNGS RAAYLSSGTW SLMGFESQTP FTNDTALAAN ITNEGGAEGR YRVLKNIMGL
WLLQRVLQER QINDLPALIA ATQALPACRF TINPNDDRFI NPDEMCSEIQ AACRETAQPI
PESDAELARC IVDSLALLYA DVLHELAQLR GEDFSQLHIV GGGCQNALLN QLCADACGIR
VIAGPVEAST LGNIGIQLMT LDELNNVDDF RQVVSTTANL TTFTPNPDSE IAHYVAQIHS
TRQTKELCA