Gene EcolC_3846 details

Gene Information

Locus tag: EcolC_3846
Symbol:
ID: 6066898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4201463
End bp: 4202995
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 57%
IMG OID: 641603258
Product: hypothetical protein
Protein accession: YP_001726777
Protein GI: 170021823
COG category: [G] Carbohydrate transport and metabolism; [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region
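
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record itself does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4201463
END_BP = 4202995


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming both start and end positions are included."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1533 510
```

Under these assumptions the result matches the listed Gene Length (1533 bp) and Protein Length (510 aa).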


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000624619
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000292473
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGAAAA ACCCCGTAAG TATACCACAC ACCGTCTGGT ACGCCGACGA TATCCGCCGC 
GGAGAACGCG AGGCGGCAGA TGTGCTGGGG CTCACACTCT ATGAGCTGAT GCTTCGCGCT
GGCGAGGCCG CATTCCAGGT GTGTCGTTCG GCGTATCCTG ACGCCCGCCA CTGGCTGGTG
CTGTGCGGTC ATGGTAATAA CGGCGGCGAT GGCTACGTGG TCGCGCGACT GGCCAAAGCG
GTCGGCATTG AGGTCACGTT GTTGGCCCAG GAGAGCGACA AACCGTTGCC GGAAGAGGCC
GCGCTGGCAC GCGAAGCATG GTTAAACGCG GGTGGCGAGA TCCATGCTTC GAATATTGTC
TGGCCCGAAT CGGTAGATCT GATTGTTGAT GCGCTGCTCG GTACCGGTTT GCGGCAAGCG
CCCCGCGAAT CCATTAGCCA GTTAATCGAC CACGCTAATT CCCATCCTGC GCCGATTGTG
GCGGTTGATA TCCCTTCCGG CCTGCTGGCT GAAACTGGCG CTACGCCAGG CGCGGTGATC
AACGCCGATC ACACCATCAC TTTTATTGCG CTGAAACCAG GCTTGCTCAC TGGAAAAGCG
CGGGATGTTA CCGGACAACT GCATTTTGAC TCACTGGGGC TGGATAGTTG GCTGGCAGGT
CAGGAGACGA AAATTCAGCG GTTTTCAGCA GAACAACTTT CTCACTGGCT AAAACCGCGT
CGCCCGACTT CGCATAAAGG CGATCACGGG CGGCTGGTAA TTATCGGTGG CGATCACGGC
ACGGCGGGGG CTATTCGTAT GACGGGGGAA GCGGCGCTGC GTGCTGGTGC TGGTTTAGTC
CGAGTACTGA CCCGCAGTGA AAACATTGCG CCGCTGCTGA CTGCACGACC GGAATTGATG
GTGCATGAAC TGACGATGGA CTCTCTTACC GAAAGCCTGG AATGGGCCGA TGTGGTGGTG
ATTGGTCCCG GTCTGGGCCA GCAAGAGTGG GGGAAAAAAG CACTGCAAAA AGTTGAGAAT
TTTCGCAAAC CGATGTTGTG GGATGCCGAT GCATTGAACC TGCTGGCAAT CAATCCCGAT
AAGCGTCACA ATCGCGTGAT CACGCCGCAT CCTGGCGAGG CCGCACGGTT GTTAGGCTGT
TCCGTCGCTG AAATTGAAAG TGACCGCTTA CATTGCGCCA AACGTCTGGT ACAACGTTAT
GGCGGCGTAG CGGTGCTGAA AGGTGCCGGA ACCGTGGTCG CCGCCCATCC TGACGCTTTA
GGCATTATTG ATGCCGGAAA TGCAGGCATG GCGAGCGGCG GCATGGGCGA TGTGCTCTCT
GGTATTATTG GCGCATTGCT TGGGCAAAAA CTGTCGCCGT ATGATGCAGC CTGTGCAGGC
TGTGTCGCGC ACGGTGCGGC AGCTGACGTA CTGGCGGCGC GTTTTGGAAC GCGCGGGATG
CTGGCAACCG ATCTCTTTTC CACGCTACAG CGTATTGTTA ACCCGGAAGT GACTGATAAA
AACCATGATG AATCGAGTAA TTCCGCTCCC TGA
 
Protein sequence
MKKNPVSIPH TVWYADDIRR GEREAADVLG LTLYELMLRA GEAAFQVCRS AYPDARHWLV 
LCGHGNNGGD GYVVARLAKA VGIEVTLLAQ ESDKPLPEEA ALAREAWLNA GGEIHASNIV
WPESVDLIVD ALLGTGLRQA PRESISQLID HANSHPAPIV AVDIPSGLLA ETGATPGAVI
NADHTITFIA LKPGLLTGKA RDVTGQLHFD SLGLDSWLAG QETKIQRFSA EQLSHWLKPR
RPTSHKGDHG RLVIIGGDHG TAGAIRMTGE AALRAGAGLV RVLTRSENIA PLLTARPELM
VHELTMDSLT ESLEWADVVV IGPGLGQQEW GKKALQKVEN FRKPMLWDAD ALNLLAINPD
KRHNRVITPH PGEAARLLGC SVAEIESDRL HCAKRLVQRY GGVAVLKGAG TVVAAHPDAL
GIIDAGNAGM ASGGMGDVLS GIIGALLGQK LSPYDAACAG CVAHGAAADV LAARFGTRGM
LATDLFSTLQ RIVNPEVTDK NHDESSNSAP
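
The gene and protein sequences can be cross-checked by translating the nucleotide sequence codon by codon. The following is a minimal sketch in plain Python, not the method used to produce this record. It builds the codon table from the usual NCBI base order (T, C, A, G). For internal codons, translation table 11 assigns the same amino acids as the standard code, so alternative start codons are ignored here; that is harmless for this gene because it starts with ATG. The names `translate`, `gc_fraction`, and `clean` are illustrative helpers.

```python
from itertools import product

# Codon table in NCBI order (third base varies fastest over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 33 nucleotides of the gene sequence above.
first_block = "ATGAAGAAAA ACCCCGTAAG TATACCACAC"
last_block = "AACCATGATG AATCGAGTAA TTCCGCTCCC TGA"
print(translate(first_block))  # MKKNPVSIPH
print(translate(last_block))   # NHDESSNSAP
```

Both fragments reproduce the corresponding ends of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.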