Gene ECH74115_5356 details


Gene Information

Locus tag: ECH74115_5356
Symbol: rhaB
ID: 6966779
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4999052
End bp: 5000521
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 55%
IMG OID: 643389012
Product: rhamnulokinase
Protein accession: YP_002273421
Protein GI: 209396044
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1070] Sugar (pentulose and hexulose) kinases
TIGRFAM ID: [TIGR02627] rhamnulokinase
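
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are inclusive and that the protein length excludes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids for a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(4999052, 5000521)
print(length_bp)                  # 1470
print(protein_length(length_bp))  # 489
```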


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTTC GCAATTGTGT CGCCGTCGAT CTCGGCGCAT CCAGTGGGCG CGTGATGCTG 
GCGCGTTACG AGCGTGAATG CCGCAGCCTG ACGCTGCGCG AAATCCATCG TTTTAACAAT
GGGCTGCATA GTCAGAACGG CTATGTCACC TGGGATGTGG ATAGCCTTGA AAGTGCCATT
CGCCTTGGAT TAAACAAGGT GTGCGAGGAA GGGATTCGTA TCGATAGCAT TGGGATTGAT
ACCTGGGGCG TGGACTTTGT GCTGCTCGAC CAACAGGGTC AGCGTGTGGG CCTGCCCGTT
GCTTATCGCG ATAGCCGCAC CAATGGCCTA ATGGCGCAGG CACAACAACA ACTCGGCAAA
CGCGATATTT ATCAACGTAG CGGCATCCAG TTTCTGCCCT TCAATACGCT TTATCAGTTG
CGTGCGCTGA CGGAGCAACA ACCTGAACTT ATTCCACACA TTGCTCACGC TCTGCTGATG
CCGGATTACT TCAGTTATCG CCTGACCGGC AAGATGAACT GGGAATATAC CAACGCCACG
ACCACGCAAC TGGTCAATAT CAATAGCGAC GACTGGGACG AGTCGCTACT GGCGTGGAGC
GGGGCCAACA AAGCCTGGTT TGGTCGCCCG ACGCATCCGG GTAATGTCAT AGGTCACTGG
ATTTGCCCGC AGGGTAATGA GATTCCAGTG GTCGCCGTTG CCAGCCATGA TACCGCCAGC
GCGGTTATCG CCTCGCCGTT AAACGGTTCA CGTGCTGCTT ATCTCTCTTC TGGCACCTGG
TCATTGATGG GCTTCGAAAG CCAGACGCCA TTTACCAATG ACACGGCACT GGCAGCCAAC
ATCACCAATG AAGGCGGGGC GGAAGGTCGC TATCGGGTGC TGAAAAATAT TATGGGCTTA
TGGCTGCTTC AGCGAGTGCT TCAGGAGCGG CAAATCAACG ATCTTCCGGC GCTTATCTCC
GCGACACAGG CACTTCCGGC TTGCCGCTTC ATTATCAATC CCAATGACGA TCGCTTTATT
AATCCTGAGA CGATGTGCAG CGAAATTCAG GCTGCGTGTC GGGAAACGGC GCAACCGATC
CCGGAAAGTG ATGCTGAACT GGCGCGCTGC ATTTTCGACA GTCTGGCGCT GCTGTATGCC
GATGTGTTGC ATGAGCTGGC GCAGCTGCGC GGTGAAGATT TCTCGCAACT GCATATTGTC
GGCGGAGGCT GCCAGAACAC GCTGCTCAAC CAGCTATGCG CCGATGCCTG CGGTATTCGG
GTGATCGCCG GGCCTGTTGA AGCCTCGACG CTCGGCAATA TCGGCATCCA GTTAATGACG
CTGGATGAAC TCAACAATGT GGATGATTTC CGTCAGGTCG TCAGCACCAC CGCGAATCTG
ACCACCTTTA CCCCTAATCC TGACAGTGAA ATTGCCCACT ATGTGGCGCG GATTCACTCT
ACACGACAGA CAAAGGAGCT TTGCGCATGA
 
Protein sequence
MTFRNCVAVD LGASSGRVML ARYERECRSL TLREIHRFNN GLHSQNGYVT WDVDSLESAI 
RLGLNKVCEE GIRIDSIGID TWGVDFVLLD QQGQRVGLPV AYRDSRTNGL MAQAQQQLGK
RDIYQRSGIQ FLPFNTLYQL RALTEQQPEL IPHIAHALLM PDYFSYRLTG KMNWEYTNAT
TTQLVNINSD DWDESLLAWS GANKAWFGRP THPGNVIGHW ICPQGNEIPV VAVASHDTAS
AVIASPLNGS RAAYLSSGTW SLMGFESQTP FTNDTALAAN ITNEGGAEGR YRVLKNIMGL
WLLQRVLQER QINDLPALIS ATQALPACRF IINPNDDRFI NPETMCSEIQ AACRETAQPI
PESDAELARC IFDSLALLYA DVLHELAQLR GEDFSQLHIV GGGCQNTLLN QLCADACGIR
VIAGPVEAST LGNIGIQLMT LDELNNVDDF RQVVSTTANL TTFTPNPDSE IAHYVARIHS
TRQTKELCA
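
The protein sequence corresponds to the gene sequence read codon by codon. A minimal sketch of that translation follows, assuming the codon-to-amino-acid assignments of translation table 11 (which match the standard code; the tables differ in their permitted start codons). The helper names clean, gc_percent and translate are hypothetical, and only the first and last 30-base blocks of the gene sequence are used as input.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last 30-base blocks of the gene sequence above (both are in frame).
first_block = "ATGACCTTTC GCAATTGTGT CGCCGTCGAT"
last_block = "ACACGACAGA CAAAGGAGCT TTGCGCATGA"
print(translate(first_block))  # MTFRNCVAVD
print(translate(last_block))   # TRQTKELCA
```

Passing the complete gene sequence to translate and gc_percent in the same way would allow the listed protein sequence and GC content to be compared against the record.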