Gene ECH74115_5221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5221 
SymbolrfbB2 
ID6969387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4869024 
End bp4870091 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content53% 
IMG OID643388886 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_002273306 
Protein GI209399629 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.175839 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAGA TTCTGATAAC AGGCGGTGCC GGGTTTATTG GCTCGGCGCT GGTGCGTTAT 
ATCATCAACG AAACGAGCGA CGCGGTGGTA GTGGTCGATA AGCTGACCTA CGCCGGAAAC
CTGATGTCGC TGGCACCGGT CGCGCAAAGC GAGCGCTTTG CCTTTGAGAA AGTTGATATC
TGCGATCGGG CAGAACTGGC GCGCGTATTC ACTGAGCATC AGCCAGACTG TGTCATGCAT
CTGGCAGCCG AAAGCCATGT TGACCGTTCT ATTGACGGCC CGGCAGCGTT TATTGAAACC
AACATTGTCG GGACTTATAC ATTGCTTGAA GCGGCGCGGG CTTACTGGAA TGCGCTGACG
GAAGATAAAA AATCAGCGTT CCGTTTTCAT CATATCTCCA CTGACGAAGT ATATGGTGAC
CTGCACTCGA CGGATGATTT CTTCACCGAA ACCACGCCGT ATGCGCCGAG CAGCCCTTAT
TCCGCGTCAA AAGCCAGCAG CGACCATCTG GTGCGCGCCT GGCTGCGAAC CTATGGTCTG
CCAACGCTTA TCACCAACTG CTCGAATAAC TACGGTCCTT ACCACTTTCC GGAAAAACTG
ATCCCGCTGA TGATCCTCAA CGCGCTGGCG GGTAAACCGC TACCGGTATA TGGCAACGGG
CAGCAAATCC GTGACTGGCT GTATGTTGAA GATCACGCCC GCGCGCTGTA TTGCGTGGCG
ACCACCGGGA AAGTCGGTGA AACCTATAAT ATTGGTGGTC ACAACGAGCG TAAGAATCTC
GATGTTGTGG AAACCATTTG CGAGCTGCTG GAAGAACTGG CTCCGAACAA GCCGCAAGGC
GTGGTGCATT ATCGTGACTT GATCACCTTT GTCGCTGACC GTCCGGGGCA TGATCTGCGC
TATGCCATTG ATGCTTCGAA AATTGCCCGT GAACTTGGCT GGCTGCCGCA GGAAACCTTT
GAAAGTGGAA TGCGTAAAAC GGTGCAGTGG TATCTGGCTA ATGAAAGCTG GAGGAAGCAG
GTGCAGGACG GCAGCTATCA GGGCGAGCGT TTAGGTCTGA AAGGCTAA
 
Protein sequence
MRKILITGGA GFIGSALVRY IINETSDAVV VVDKLTYAGN LMSLAPVAQS ERFAFEKVDI 
CDRAELARVF TEHQPDCVMH LAAESHVDRS IDGPAAFIET NIVGTYTLLE AARAYWNALT
EDKKSAFRFH HISTDEVYGD LHSTDDFFTE TTPYAPSSPY SASKASSDHL VRAWLRTYGL
PTLITNCSNN YGPYHFPEKL IPLMILNALA GKPLPVYGNG QQIRDWLYVE DHARALYCVA
TTGKVGETYN IGGHNERKNL DVVETICELL EELAPNKPQG VVHYRDLITF VADRPGHDLR
YAIDASKIAR ELGWLPQETF ESGMRKTVQW YLANESWRKQ VQDGSYQGER LGLKG