Gene EcHS_A4133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4133 
SymbolrhaB 
ID5592696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4123988 
End bp4125457 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content55% 
IMG OID640923235 
Productrhamnulokinase 
Protein accessionYP_001460694 
Protein GI157163376 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID[TIGR02627] rhamnulokinase 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTTTC GCAATTGTGT CGCCGTCGAT CTCGGCGCAT CCAGTGGGCG CGTGATGCTG 
GCGCGTTACG AGCGTGAATG CCGCAGCCTG ACGCTGCGCG AAATCCATCG TTTTAACAAT
GGGCTGCATA GTCAGAACGG CTATGTCACC TGGGATGTGG ATAGCCTTGA AAGTGCCATT
CGCCTTGGAT TAAACAAGGT GTGCGAGGAA GGGATTCGTA TCGATAGCAT TGGGATTGAT
ACCTGGGGCG TGGACTTTGT GCTGCTCGAC CAACAGGGTC AGCGTGTGGG CCTGCCCGTT
GCTTATCGCG ATAGCCGCAC CAATGGCCTA ATGGCGCAGG CACAACAACA ACTCGGCAAA
CGCGATATTT ATCAACGTAG CGGCATCCAG TTTCTGCCCT TCAATACGCT TTATCAGTTG
CGTGCGCTGA CGGAGCAACA ACCTGAACTT ATTCCACACA TTGCTCACGC TCTGCTGATG
CCGGATTACT TCAGTTATCG CCTGACCGGC AAGATGAACT GGGAATATAC CAACGCCACG
ACCACGCAAC TGGTCAATAT CAATAGCGAC GACTGGGACG AGTCGCTACT GGCGTGGAGC
GGGGCCAACA AAGCCTGGTT TGGTCGCCCG ACGCATCCGG GTAATGTCAT AGGTCACTGG
ATTTGCCCGC AGGGTAATGA GATTCCAGTG GTCGCCGTTG CCAGCCATGA TACCGCCAGC
GCGGTTATCG CCTCGCCGTT AAACGGCTCA CGTGCTGCTT ATCTCTCTTC TGGCACCTGG
TCATTGATGG GCTTCGAAAG CCAGACGCCA TTTACCAATG ACACGGCACT GGCAGCCAAC
ATCACCAATG AAGGCGGGGC GGAAGGTCGC TATCGGGTGC TGAAAAATAT TATGGGCTTA
TGGCTGCTTC AGCGAGTGCT TCAGGAGCAG CAAATCAACG ATCTTCCGGC GCTTATCTCC
GCGACACAGG CACTTCCGGC TTGCCGCTTC ATTATCAATC CCAATGACGA TCGCTTTATT
AATCCTGAGA CGATGTGCAG CGAAATTCAG GCTGCGTGTC GGGAAACGGC GCAACCGATC
CCGGAAAGTG ATGCTGAACT GGCGCGCTGC ATTTTCGACA GTCTGGCGCT GCTGTATGCC
GATGTGTTGC ATGAGCTGGC GCAGCTGCGC GGTGAAGATT TCTCGCAACT GTATATTGTC
GGCGGAGGCT GCCAGAACAC GCTGCTCAAC CAGCTATGCG CCGATGCCTG CGGTATTCGG
GTGATCGCCG GGCCTGTTGA AGCCTCGACG CTCGGCAATA TCGGCATCCA GTTAATGACG
CTGGATGAAC TCAACAATGT GGATGATTTC CGTCAGGTCG TCAGCACCAC CGCGAATCTG
ACCACCTTTA CCCCTAATCC TGACAGTGAA ATTGCCCACT ATGTGGCGCG GATTCACTCT
ACACGACAGA CAAAGGAGCT TTGCGCATGA
 
Protein sequence
MTFRNCVAVD LGASSGRVML ARYERECRSL TLREIHRFNN GLHSQNGYVT WDVDSLESAI 
RLGLNKVCEE GIRIDSIGID TWGVDFVLLD QQGQRVGLPV AYRDSRTNGL MAQAQQQLGK
RDIYQRSGIQ FLPFNTLYQL RALTEQQPEL IPHIAHALLM PDYFSYRLTG KMNWEYTNAT
TTQLVNINSD DWDESLLAWS GANKAWFGRP THPGNVIGHW ICPQGNEIPV VAVASHDTAS
AVIASPLNGS RAAYLSSGTW SLMGFESQTP FTNDTALAAN ITNEGGAEGR YRVLKNIMGL
WLLQRVLQEQ QINDLPALIS ATQALPACRF IINPNDDRFI NPETMCSEIQ AACRETAQPI
PESDAELARC IFDSLALLYA DVLHELAQLR GEDFSQLYIV GGGCQNTLLN QLCADACGIR
VIAGPVEAST LGNIGIQLMT LDELNNVDDF RQVVSTTANL TTFTPNPDSE IAHYVARIHS
TRQTKELCA