Gene EcHS_A4408 details

Gene Information

Locus tag: EcHS_A4408
Symbol:
ID: 5594085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4418802
End bp: 4420349
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 57%
IMG OID: 640923507
Product: hypothetical protein
Protein accession: YP_001460948
Protein GI: 157163630
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
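
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 4418802
END_BP = 4420349


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Length in aa, assuming the final codon is a stop codon (an assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1548
print(protein_length(length_bp))  # 515
```

Under these assumptions the results agree with the Gene Length (1548 bp) and Protein Length (515 aa) fields.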


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000000123487
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGACC ATACAATGAA GAAAAACCCC GTAAGTATAC CACACACCGT CTGGCACGCC 
GACGATATCC GCCGCGGAGA ACGTGAGGCG GCAGATGCGC TGGGGCTTAC ACTCTATGAG
CTGATGCTTC GCGCTGGCGA AGCCGCATTC CAGGTGTGTC GTTCGGCGTA TCCTGACGCC
CGCCATTGGC TGGTGCTGTG CGGTCATGGT AATAACGGCG GCGATGGCTA CGTGGTCGCG
CGGCTGGCCA CAGCGGTCGG CATTGAGGTC ACGCTGCTGG CCCAGGAGAG CGACAAACCG
TTGCCGGAAG AGGCCGCGCT GGCACGCGAA GCATGGTTAA ACGCGGGAGG CGAGATCCAT
GCTTCGAATA TTGTCTGGCC CGAATCGGTA GATCTGATTG TTGATGCGCT GCTCGGTACC
GGCTTGCAGC AAGCGCCCCG CGAATCCATT AGCCAGTTAA TCGACCACGC TAATTCCCAT
CCTGCGCCGA TTGCGGCGGT TGATATCCCT TCCGGCCTGC TGGCTGAAAC CGGCGCTACG
CCAGGCGCAG TGATCAACGC CGATCACACC ATCACTTTTA TTGCGCTGAA ACCAGGCTTG
CTCACTGGAA AAGCGCGGGA TGTTACCGGA CAACTGCATT TTGACTCACT GGGGCTGGAT
AGTTGGCTGG CAGGTCAGGA GACGAAAATT CAGCGGTTTT CGGCAGAACA ACTTTCTCAC
TGGCTAAAAC CGCGTCGCCC GACTTCGCAT AAAGGCGATC ACGGGCGGCT GGTAATTATC
GGTGGCGATC ACGGCACGGC GGGGGCTATT CGTATGACGG GGGAAGCGGC GCTACGTGCT
GGTGCTGGTT TAGTCCGAGT ACTGACCCGC AGTGAAAACA TTGCGCCGCT GCTGACTGCA
CGACCGGAAT TGATGGTGCA TGAACTGACG ATGGACTCTC TTACCGAAAG CCTGGAATGG
GCCGATGTGG TGGTGATTGG TCCCGGTCTG GGCCAGCAAG AGTGGGGGAA AAAAGCACTG
CAAAAAGTTG AGAATTTTCG CAAACCGATG TTGTGGGATG CCGATGCATT GAACCTGCTG
GCAATCAATC CCGATAAGCG TCACAATCGC GTGATCACGC CGCATCCTGG CGAGGCCGCA
CGGTTATTAG GCTGTTCCGT CGCTGAAATT GAAAGTGACC GCTTACATTG CGCCAAACGT
CTGGTACAAC GTTATGGCGG CGTAGCGGTG CTGAAAGGTG CCGGAACCGT GGTCGCCGCC
CATCCTGACG CTTTAGGCAT TATTGATGCC GGAAATGCAG GCATGGCGAG CGGCGGCATG
GGCGATGTGC TCTCTGGTAT TATTGGCGCA TTGCTTGGGC AAAAACTGTC GCCGTATGAT
GCCGCCTGTG CGGGCTGTGT CGCGCACGGT GCTGCAGCTG ACGTACTGGC GGCGCGTTTT
GGAACGCGCG GGATGCTGGC AACCGATCTC TTTTCCACGC TACAGCGTAT TGTTAACCCG
GAAGTGACTG ATAAAAACCA TGATGAATCG AGTAATTCCG CTCCCTGA
 
Protein sequence
MTDHTMKKNP VSIPHTVWHA DDIRRGEREA ADALGLTLYE LMLRAGEAAF QVCRSAYPDA 
RHWLVLCGHG NNGGDGYVVA RLATAVGIEV TLLAQESDKP LPEEAALARE AWLNAGGEIH
ASNIVWPESV DLIVDALLGT GLQQAPRESI SQLIDHANSH PAPIAAVDIP SGLLAETGAT
PGAVINADHT ITFIALKPGL LTGKARDVTG QLHFDSLGLD SWLAGQETKI QRFSAEQLSH
WLKPRRPTSH KGDHGRLVII GGDHGTAGAI RMTGEAALRA GAGLVRVLTR SENIAPLLTA
RPELMVHELT MDSLTESLEW ADVVVIGPGL GQQEWGKKAL QKVENFRKPM LWDADALNLL
AINPDKRHNR VITPHPGEAA RLLGCSVAEI ESDRLHCAKR LVQRYGGVAV LKGAGTVVAA
HPDALGIIDA GNAGMASGGM GDVLSGIIGA LLGQKLSPYD AACAGCVAHG AAADVLAARF
GTRGMLATDL FSTLQRIVNP EVTDKNHDES SNSAP