Gene EcHS_A2182 details


Gene Information

Locus tag: EcHS_A2182
Symbol: rfbB1
ID: 5594768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2160574
End bp: 2161638
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 59%
IMG OID: 640921315
Product: dTDP-glucose 4,6-dehydratase
Protein accession: YP_001458854
Protein GI: 157161536
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1088] dTDP-D-glucose 4,6-dehydratase
TIGRFAM ID: [TIGR01181] dTDP-glucose 4,6-dehydratase
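
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2160574
END_BP = 2161638


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1065 354
```

Under these assumptions the computed values agree with the reported Gene Length (1065 bp) and Protein Length (354 aa).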


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.0103955
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATTC TTGTCACCGG TGGTGCAGGC TTTATCGGCT CTGCTGTAGT TCGTCATATC 
ATTGAAAATA CCCGGGATGA AGTCCGCGTG ATGGACTGCC TGACCTATGC CGGCAACCTC
GAATCCCTGG CGCCGGTGGC CGGGAGCGAA CGCTACTCGT TTTCCCAGAC CGATATCACC
GATGCCGCTG CCGTGGCGGC CCAGTTCAGC GAGTTCCGCC CGGATATCGT GATGCATCTG
GCGGCAGAAA GTCATGTGGA CCGTTCCATT GATGGCCCGG CCGCCTTCAT CCAGACCAAC
GTGATCGGCA CCTTCACTCT GCTGGAGGCG GCCCGTCACT ACTGGTCCGG GCTTGGGGAC
GCGCAGAAGC AGGCCTTCCG CTTCCACCAT ATTTCCACCG ATGAGGTGTA CGGCGACCTG
CACGGCACCG ATGACCTGTT CACCGAAGAG ACTCCGTACG CCCCGAGCAG CCCGTACTCT
GCCTCCAAAG CGGGCAGCGA CCATCTGGTT CGCGCCTGGA ACCGCACCTA CGGCCTGCCG
GTGGTGGTGA CCAACTGCTC CAACAACTAT GGTCCGTATC ACTTCCCGGA GAAACTGATC
CCGCTGACTA TCCTTAATGC CCTGGCGGGT AAACCCCTGC CGGTGTATGG CAACGGGGAG
CAGATCCGTG ACTGGCTGTA TGTTGAGGAC CATGCCCGTG CGCTGTATAA AGTGGCGACC
GAAGGCAAGA GCGGCGAAAC CTACAATATT GGCGGTCATA ACGAGCGTAA AAATATCGAT
GTGGTGCGCA CCATCTGCGC CATTCTCGAC AAGGTGGTGG CGCAGAAGCC GGGCAACATC
GCCCACTTCG CTGACCTGAT CACTTTTGTC ACCGACCGTC CGGGACACGA CCTGCGTTAT
GCCATTGATG CCGCGAAAAT TCAGCGCGAT CTGGGCTGGG TGCCGCAGGA GACGTTCGAG
AGCGGGATTG AAAAAACCGT GCACTGGTAT CTTAACAACC AGACCTGGTG GCAGCGCGTG
CTGGATGGCT CCTATGCCGG TGAGCGTCTG GGCCTAAATA ACTGA
 
Protein sequence
MKILVTGGAG FIGSAVVRHI IENTRDEVRV MDCLTYAGNL ESLAPVAGSE RYSFSQTDIT 
DAAAVAAQFS EFRPDIVMHL AAESHVDRSI DGPAAFIQTN VIGTFTLLEA ARHYWSGLGD
AQKQAFRFHH ISTDEVYGDL HGTDDLFTEE TPYAPSSPYS ASKAGSDHLV RAWNRTYGLP
VVVTNCSNNY GPYHFPEKLI PLTILNALAG KPLPVYGNGE QIRDWLYVED HARALYKVAT
EGKSGETYNI GGHNERKNID VVRTICAILD KVVAQKPGNI AHFADLITFV TDRPGHDLRY
AIDAAKIQRD LGWVPQETFE SGIEKTVHWY LNNQTWWQRV LDGSYAGERL GLNN