Gene EcE24377A_4435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4435 
SymbolrhaB 
ID5589027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4423652 
End bp4425121 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content55% 
IMG OID640928050 
Productrhamnulokinase 
Protein accessionYP_001465394 
Protein GI157155428 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID[TIGR02627] rhamnulokinase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTTTC GCAATTGTGT CGCCGTCGAT CTCGGCGCAT CCAGTGGGCG CGTGATGCTG 
GCGCGTTACG AGCGTGAATG CCGCAGCCTG ACGCTGCGCG AAATCCATCG TTTTAAAAAT
GGGCTGCATA GCCAGAACGG TTATGTCACC TGGAATGTGG ATAGCCTGGA AAGTGCCATT
CGCCTTGGAT TAAACAAGGT GTGCGAGGAA GGGATTCGTA TCGATAGCAT TGGGATTGAT
ACCTGGGGCG TGGACTTTGT GCTGCTCGAC CAACAGGGTC AGCGTGTGGG CCTGCCCGTT
GCTTATCGCG ATAGCCGCTC CAATGGCCTA ATGGCGCAGG CACAGCAACA ACTCGGCAAA
CGCGATATTT ATCAACGTAG CGGCATCCAG TTTCTGCCCT TCAATACGCT TTATCAACTG
CGTGCGCTGA CGGAGCAACA ACCTGAACTT ATTCCACACA TTGCTCACGC TCTGCTGATG
CCGGATTACT TCAGCTATCG CCTGACCGGC AAGATGAACT GGGAGTACAC CAATGCCACC
ACCACACAAC TGGTCAATAT CAATAGCGAC GACTGGGACG AGTCGCTACT GGCGTGGAGC
GGGGCCAACA AAGCCTGGTT TGGTCGCCCG ACGCATCCGG GTAATGTCAT TGGGCACTGG
ATTTGCCCGC AGGGTAATGA GATTCCGGTG GTCGCCGTTG CCAGCCATGA TACCGCCAGC
GCGGTTATCG CCTCGCCGTT AAACGGTTCA CGCGCCGCTT ATCTCTCTTC TGGCACCTGG
TCATTGATGG GCTTCGAAAG CCAGACGCCA TTTACCAATG ACACGGCGCT GGCAGCCAAC
ATCACCAATG AAGGCGGGGC GGAAGGTCGC TATCGGGTGC TGAAAAATAT TATGGGCTTA
TGGCTGCTTC AGCGAGTGCT TCAGGAACGG CAAATCAACG ATCTCCCGGC GCTTATCGCC
GCGACACAGG CACTTCCGGC CTGTCGCTTC ATCATCAATC CCAATGACGA TCGCTTTATT
AATCCTGAAG CGATGTGCAG CGAAATTCAG GCTGCGTGTC GGGAAACGGC GCAACCGATC
CCGGAAAGTG ATGCTGAACT GGCGCGCTGT ATTTTCGACA GTCTGGCGCT GCTGTATGCC
GATGTGTTGC ATGAGCTGGC GCAGCTGCGC GGTGAAGATT TCTCGCAACT GCATATTGTC
GGCGGCGGCT GCCAGAACAC GCTGCTCAAC CAGCTATGCG CCGATGCCTG CGGTATTCGG
GTGATCGCCG GGCCTGTTGA AGCCTCAACG CTCGGCAATA TCGGCATCCA GTTAATGACG
CTGGATGAAC TCAACAATGT GGATGATTTC CGTCAGGTCG TCAGCACCAC CGCGAATCTG
ACCACCTTTA CCCCTAATCC TGACAGTGAA ATTGCCCACT ATGTGGCGCA GATTCACTCT
ACACGACAGA CAAAGGAGCT TTGCGCATGA
 
Protein sequence
MTFRNCVAVD LGASSGRVML ARYERECRSL TLREIHRFKN GLHSQNGYVT WNVDSLESAI 
RLGLNKVCEE GIRIDSIGID TWGVDFVLLD QQGQRVGLPV AYRDSRSNGL MAQAQQQLGK
RDIYQRSGIQ FLPFNTLYQL RALTEQQPEL IPHIAHALLM PDYFSYRLTG KMNWEYTNAT
TTQLVNINSD DWDESLLAWS GANKAWFGRP THPGNVIGHW ICPQGNEIPV VAVASHDTAS
AVIASPLNGS RAAYLSSGTW SLMGFESQTP FTNDTALAAN ITNEGGAEGR YRVLKNIMGL
WLLQRVLQER QINDLPALIA ATQALPACRF IINPNDDRFI NPEAMCSEIQ AACRETAQPI
PESDAELARC IFDSLALLYA DVLHELAQLR GEDFSQLHIV GGGCQNTLLN QLCADACGIR
VIAGPVEAST LGNIGIQLMT LDELNNVDDF RQVVSTTANL TTFTPNPDSE IAHYVAQIHS
TRQTKELCA