Gene EcHS_A4005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4005 
SymbolrfbB2 
ID5592287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4000935 
End bp4002002 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content53% 
IMG OID640923109 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_001460580 
Protein GI157163262 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAA TTCTGATAAC AGGTGGTGCC GGGTTTATTG GCTCGGCGCT GGTGCGTTAT 
ATCATCAACG AAACGAGCGA TGCGGTGGTA GTGGTCGATA AGCTGACCTA CGCCGGAAAC
CTGATGTCGC TGGCACCGGT CGCGCAAAGC GAGCACTTTG CCTTTGAGAA AGTTGATATC
TGCGATCGGG CAGAACTGGC ACGCGTATTC ACTGAGCATC AGCCAGACTG TGTCATGCAT
CTGGCAGCCG AAAGCCATGT TGACCGTTCT ATTGACGGCC CGGCAGCGTT TATTGAAACC
AACATTGTCG GGACTTATAC ATTGCTTGAA GCGGCGCGGG CTTACTGGAA TACGCTGACG
GAAGATAAAA AATCAGCGTT CCGTTTTCAT CATATCTCCA CCGACGAAGT ATATGGTGAC
CTGCACTCGA CGGATGATTT CTTCACCGAA ACCACGCCGT ATGCGCCGAG CAGCCCTTAT
TCCGCGTCAA AAGCCAGCAG CGACCATCTG GTGCGCGCCT GGCTGCGGAC CTACGGTCTG
CCGACGCTGA TCACCAACTG CTCGAATAAC TACGGTCCTT ACCACTTTCC GGAAAAACTG
ATCCCGCTGA TGATCCTCAA CGCGCTGGCG GGTAAACCGC TGCCGGTATA TGGCAACGGG
CAGCAAATCC GTGACTGGCT GTATGTGGAA GATCACGCCC GCGCGCTGTA TTGCGTGGCG
ACCACCGGGA AAGTCGGTGA AACCTATAAT ATTGGTGGTC ACAACGAGCG TAAGAATCTC
GATGTTGTGG AAACCATTTG CGAGCTGCTG GAAGAACTGG CTCCGAACAA GCCGCACGGC
GTGGTGCACT ATCGTGACTT GATCACCTTT GTCGCTGACC GTCCGGGGCA TGATCTGCGT
TATGCCATTG ATGCTTCGAA AATTGCCCGT GAACTTGGTT GGCTGCCACA GGAAACCTTT
GAAAGTGGAA TGCGTAAAAC GGTGCAGTGG TATCTGGCTA ATGAAAGCTG GTGGAAGCAG
GTGCAGGACG GCAGCTATCA GGGCGAGCGT TTAGGTCTGA AAGGCTAA
 
Protein sequence
MRKILITGGA GFIGSALVRY IINETSDAVV VVDKLTYAGN LMSLAPVAQS EHFAFEKVDI 
CDRAELARVF TEHQPDCVMH LAAESHVDRS IDGPAAFIET NIVGTYTLLE AARAYWNTLT
EDKKSAFRFH HISTDEVYGD LHSTDDFFTE TTPYAPSSPY SASKASSDHL VRAWLRTYGL
PTLITNCSNN YGPYHFPEKL IPLMILNALA GKPLPVYGNG QQIRDWLYVE DHARALYCVA
TTGKVGETYN IGGHNERKNL DVVETICELL EELAPNKPHG VVHYRDLITF VADRPGHDLR
YAIDASKIAR ELGWLPQETF ESGMRKTVQW YLANESWWKQ VQDGSYQGER LGLKG