Gene EcE24377A_4724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4724 
Symbol 
ID5590045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4726191 
End bp4727738 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content57% 
IMG OID640928336 
Producthypothetical protein 
Protein accessionYP_001465664 
Protein GI157158402 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000041383 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGACC ATACAATGAA GAAAAACCCC GTAAGTATAC CACACACCGT CTGGCACGCC 
GACGATATCC GCCGCGGAGA ACGTGAGGCG GCAGATGCGC TGGGGCTTAC ACTCTATGAG
CTGATGCTTC GCGCTGGCGA AGCCGCATTC CAGGTGTGTC GTTCGGCGTA TCCTGACGCC
CGCCACTGGC TGGTGCTGTG CGGTCATGGT AATAACGGCG GCGATGGCTA CGTGGTCGCG
CGACTGGCCA AAGCGGTCGG CATTGAGGTC ACGTTGTTGG CCCAGGAGAG CGACAAACCG
TTGCCGGAAG AGGCCGCGCT GGCACGCGAA GCATGGTTAA ACGCGGGAGG CGAGATCCAT
GCTTCGAATA TTGTCTGGCC CGAATCGGTA GATCTGATTG TTGATGCGCT GCTCGGTACC
GGCTTGCAGC AAGCGCCCCG CGAATCCATT AGCCAGTTAA TCGACCACGC TAATTCCCAT
CCTGCGCCGA TTGCGGCGGT TGATATCCCT TCCGGCCTGC TGGCTGAAAC CGGCGCTACG
CCAGGCGCAG TGATCAACGC CGATCACACC ATCACTTTTA TTGCGCTGAA ACCAGGCTTG
CTCACTGGAA AAGCGCGGGA TGTTACCGGA CAACTGCATT TTGACTCACT GGGGCTGGAT
AGTTGGCTGG CAGGTCAGGA GACGAAAATT CAGCGGTTTT CGGCAGAACA ACTTTCTCAC
TGGCTAAAAC CGCGTCGCCC GACTTCGCAT AAAGGCGATC ACGGGCGGCT GGTAATTATC
GGTGGCGATC ACGGCACGGC GGGGGCTATT CGTATGACGG GGGAAGCGGC GCTACGTGCT
GGTGCTGGTT TAGTCCGAGT ACTGACCCGC AGTGAAAACA TTGCGCCGCT GCTGACTGCA
CGACCGGAAT TGATGGTGCA TGAACTGACG ATGGACTCTC TTACCGAAAG CCTGGAATGG
GCCGATGTGG TGGTGATTGG TCCCGGTCTG GGCCAGCAAG AGTGGGGGAA AAAAGCACTG
CAAAAAGTTG AGAATTTTCG CAAACCGATG TTGTGGGATG CCGATGCATT GAACCTGCTG
GCAATCAATC CCGATAAGCG TCACAATCGC GTGATCACGC CGCATCCTGG CGAGGCCGCA
CGGTTGTTAG GCTGTTCCGT CGCTGAAATT GAAAGTGACC GCTTACATTG CGCCAAACGT
CTGGTACAAC GTTATGGCGG CGTAGCGGTG CTGAAAGGTG CCGGAACCGT GGTCGCCGCC
CATCCTGACG CTTTAGGCAT TATTGATGCC GGAAATGCAG GCATGGCGAG CGGCGGCATG
GGCGATGTGC TCTCTGGTAT TATTGGCGCA TTGCTTGGGC AAAAACTGTC GCCGTATGAT
GCCGCCTGTG CGGGCTGTGT CGCGCACGGT GCTGCAGCTG ACGTACTGGC GGCGCGTTTT
GGAACGCGCG GGATGCTGGC AACCGATCTC TTTTCCACGC TACAGCGTAT TGTTAACCCG
GAAGTGACTG ATAAAAACCA TGATGAATCG AGTAATTCCG CTCCCTGA
 
Protein sequence
MTDHTMKKNP VSIPHTVWHA DDIRRGEREA ADALGLTLYE LMLRAGEAAF QVCRSAYPDA 
RHWLVLCGHG NNGGDGYVVA RLAKAVGIEV TLLAQESDKP LPEEAALARE AWLNAGGEIH
ASNIVWPESV DLIVDALLGT GLQQAPRESI SQLIDHANSH PAPIAAVDIP SGLLAETGAT
PGAVINADHT ITFIALKPGL LTGKARDVTG QLHFDSLGLD SWLAGQETKI QRFSAEQLSH
WLKPRRPTSH KGDHGRLVII GGDHGTAGAI RMTGEAALRA GAGLVRVLTR SENIAPLLTA
RPELMVHELT MDSLTESLEW ADVVVIGPGL GQQEWGKKAL QKVENFRKPM LWDADALNLL
AINPDKRHNR VITPHPGEAA RLLGCSVAEI ESDRLHCAKR LVQRYGGVAV LKGAGTVVAA
HPDALGIIDA GNAGMASGGM GDVLSGIIGA LLGQKLSPYD AACAGCVAHG AAADVLAARF
GTRGMLATDL FSTLQRIVNP EVTDKNHDES SNSAP