Gene ECH74115_5683 details


Gene Information

Locus tag: ECH74115_5683
Symbol:
ID: 6967054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5323006
End bp: 5324538
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 57%
IMG OID: 643389316
Product: hypothetical protein
Protein accession: YP_002273709
Protein GI: 209398092
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
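
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the protein length excludes the stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 5323006
END_BP = 5324538
GENE_LENGTH_BP = 1533
PROTEIN_LENGTH_AA = 510


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5324538 - 5323006 + 1 gives 1533 bp, and 1533 / 3 - 1 gives 510 aa, which agrees with the reported lengths.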


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000899207
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAA ACCCCGTAAG TATACCACAC ACCGTCTGGC ACGCCGACGA TATCCGCCGC 
GGAGAACGCG AGGCGGCAGA TGCGCTGGGG CTCACACTCT ATGAGCTGAT GCTTCGCGCT
GGCGAGGCCG CATTCCAGGT GTGTCGTTCG GCGTATCCTG ACGCCCGCCA CTGGCTGGTG
TTGTGCGGTC ATGGTAATAA CGGCGGCGAT GGCTACGTGG TCGCGCGACT GGCCAAAGCG
GTCGGCATTG AGGTCACGTT GTTGGCCCAG GAGAGCGATA AACCGTTGCC GGAAGAGGCC
GCGCTGGCAC GCGAGGCATG GTTAAACGCG GGAGGCGAGA TCCATGCTTC GAATATTGTC
TGGCCCGAGT CGGTAGCTCT GATTGTTGAT GCGCTGCTCG GTACCGGCTT GCAGCAAGCA
CCCCGCGAAT CCATTAGCCA GTTAATCGAC CACGCTAATT CCCATCCTGC GCCGATTGTG
GCGGTTGATA TCCCTTCCGG CCTGCTGGCT GAAACCGGCG CTACGCCAGG CGCAGTGATC
AACGCCGATC ACACCATCAC TTTTATTGCG CTGAAACCAG GCTTGCTCAC TGGAAAAGCG
CGGGATGTTA CAGGACAACT GCATTTTGAC TCACTGGGGC TGGATAGCTG GCTGGCAGGT
CAGGAGACGA AAATTCAGCG GTTTTCGGCA GAACAACTTT CTCACTGGCT GAAACCGCGT
CGCCCGACTT CGCATAAAGG CGATCACGGG CGGCTGGTGA TTATCGGTGG CGATCACGGC
ACGGCGGGGG CTATTCGTAT GACGGGGGAA GCGGCGCTAC GTGCTGGTGC TGGTTTAGTC
CGAGTACTGA CCCGCAGTGA GAACATTGCG CCGCTGCTGA CTGCACGACC AGAATTGATG
GTGCATGAAC TGACTATGGA CTCTCTTACC GAAAGCCTGG AATGGGCCGA TGTGGTGGTG
ATTGGTCCCG GTCTGGGCCA GCAAGAGTGG GGGAAAAAAG CCCTGCAAAA AGTTGAGAAT
TTTCGCAAAC CGATGTTGTG GGATGCCGAT GCATTGAACC TGCTGGCAAT CAATCCCGAT
AAGCGTCACA ATCGCGTGAT CACGCCGCAT CCTGGCGAGG CCGCACGGTT GTTAGGCTGT
TCCGTCGCTG AAATTGAAAG TGACCGCTTA CATTGCGCCA AACGTCTGGT ACAACGTTAT
GGCGGCGTAG CGGTGCTGAA AGGTGCCGGA ACCGTGGTCG CCGCCCATCC TGACGCTTTA
GGCATTATTG ATGCCGGAAA TGCAGGCATG GCAAGCGGCG GCATGGGCGA TGTGCTCTCT
GGTATTATTG GCGCATTGCT TGGGCAAAAA CTGTCGCCGT ATGATGCAGC CTGTGCAGGC
TGTGTCGCGC ACGGTGCGGC AGCTGACGTA CTGGCGGCGC GTTTTGGAAC GCGCGGGATG
CTGGCAACCG ATCTCTTTTC CACGCTACAG CGTATTGTTA ACCCGGAAGT GACTGATAAA
AACCATGATG AATCGAGTAA TTCCGCTCCC TGA
 
Protein sequence
MKKNPVSIPH TVWHADDIRR GEREAADALG LTLYELMLRA GEAAFQVCRS AYPDARHWLV 
LCGHGNNGGD GYVVARLAKA VGIEVTLLAQ ESDKPLPEEA ALAREAWLNA GGEIHASNIV
WPESVALIVD ALLGTGLQQA PRESISQLID HANSHPAPIV AVDIPSGLLA ETGATPGAVI
NADHTITFIA LKPGLLTGKA RDVTGQLHFD SLGLDSWLAG QETKIQRFSA EQLSHWLKPR
RPTSHKGDHG RLVIIGGDHG TAGAIRMTGE AALRAGAGLV RVLTRSENIA PLLTARPELM
VHELTMDSLT ESLEWADVVV IGPGLGQQEW GKKALQKVEN FRKPMLWDAD ALNLLAINPD
KRHNRVITPH PGEAARLLGC SVAEIESDRL HCAKRLVQRY GGVAVLKGAG TVVAAHPDAL
GIIDAGNAGM ASGGMGDVLS GIIGALLGQK LSPYDAACAG CVAHGAAADV LAARFGTRGM
LATDLFSTLQ RIVNPEVTDK NHDESSNSAP
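
The gene and protein sequences above can be compared by translating the nucleotide sequence codon by codon. The sketch below is illustrative only: it builds the codon table from the compact 64-letter string used for NCBI translation table 11 (the amino acid assignments are the same as in the standard table; table 11 differs in which codons may act as start codons), strips the spacing used in the listing, and stops at the first stop codon. It does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are included as sample input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence as listed above.
FIRST_LINE = "ATGAAGAAAA ACCCCGTAAG TATACCACAC ACCGTCTGGC ACGCCGACGA TATCCGCCGC"
LAST_LINE = "AACCATGATG AATCGAGTAA TTCCGCTCCC TGA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MKKNPVSIPHTVWHADDIRR
    print(translate(LAST_LINE))   # NHDESSNSAP
```

Because every full line of the listing holds 60 bases (20 codons), each line starts in frame, so the last line can be translated on its own. Passing the complete gene sequence to `translate` and `gc_fraction` would allow a comparison with the protein sequence and the reported GC content.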