Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4724 |
Symbol | |
ID | 5590045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4726191 |
End bp | 4727738 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640928336 |
Product | hypothetical protein |
Protein accession | YP_001465664 |
Protein GI | 157158402 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000041383 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACC ATACAATGAA GAAAAACCCC GTAAGTATAC CACACACCGT CTGGCACGCC GACGATATCC GCCGCGGAGA ACGTGAGGCG GCAGATGCGC TGGGGCTTAC ACTCTATGAG CTGATGCTTC GCGCTGGCGA AGCCGCATTC CAGGTGTGTC GTTCGGCGTA TCCTGACGCC CGCCACTGGC TGGTGCTGTG CGGTCATGGT AATAACGGCG GCGATGGCTA CGTGGTCGCG CGACTGGCCA AAGCGGTCGG CATTGAGGTC ACGTTGTTGG CCCAGGAGAG CGACAAACCG TTGCCGGAAG AGGCCGCGCT GGCACGCGAA GCATGGTTAA ACGCGGGAGG CGAGATCCAT GCTTCGAATA TTGTCTGGCC CGAATCGGTA GATCTGATTG TTGATGCGCT GCTCGGTACC GGCTTGCAGC AAGCGCCCCG CGAATCCATT AGCCAGTTAA TCGACCACGC TAATTCCCAT CCTGCGCCGA TTGCGGCGGT TGATATCCCT TCCGGCCTGC TGGCTGAAAC CGGCGCTACG CCAGGCGCAG TGATCAACGC CGATCACACC ATCACTTTTA TTGCGCTGAA ACCAGGCTTG CTCACTGGAA AAGCGCGGGA TGTTACCGGA CAACTGCATT TTGACTCACT GGGGCTGGAT AGTTGGCTGG CAGGTCAGGA GACGAAAATT CAGCGGTTTT CGGCAGAACA ACTTTCTCAC TGGCTAAAAC CGCGTCGCCC GACTTCGCAT AAAGGCGATC ACGGGCGGCT GGTAATTATC GGTGGCGATC ACGGCACGGC GGGGGCTATT CGTATGACGG GGGAAGCGGC GCTACGTGCT GGTGCTGGTT TAGTCCGAGT ACTGACCCGC AGTGAAAACA TTGCGCCGCT GCTGACTGCA CGACCGGAAT TGATGGTGCA TGAACTGACG ATGGACTCTC TTACCGAAAG CCTGGAATGG GCCGATGTGG TGGTGATTGG TCCCGGTCTG GGCCAGCAAG AGTGGGGGAA AAAAGCACTG CAAAAAGTTG AGAATTTTCG CAAACCGATG TTGTGGGATG CCGATGCATT GAACCTGCTG GCAATCAATC CCGATAAGCG TCACAATCGC GTGATCACGC CGCATCCTGG CGAGGCCGCA CGGTTGTTAG GCTGTTCCGT CGCTGAAATT GAAAGTGACC GCTTACATTG CGCCAAACGT CTGGTACAAC GTTATGGCGG CGTAGCGGTG CTGAAAGGTG CCGGAACCGT GGTCGCCGCC CATCCTGACG CTTTAGGCAT TATTGATGCC GGAAATGCAG GCATGGCGAG CGGCGGCATG GGCGATGTGC TCTCTGGTAT TATTGGCGCA TTGCTTGGGC AAAAACTGTC GCCGTATGAT GCCGCCTGTG CGGGCTGTGT CGCGCACGGT GCTGCAGCTG ACGTACTGGC GGCGCGTTTT GGAACGCGCG GGATGCTGGC AACCGATCTC TTTTCCACGC TACAGCGTAT TGTTAACCCG GAAGTGACTG ATAAAAACCA TGATGAATCG AGTAATTCCG CTCCCTGA
|
Protein sequence | MTDHTMKKNP VSIPHTVWHA DDIRRGEREA ADALGLTLYE LMLRAGEAAF QVCRSAYPDA RHWLVLCGHG NNGGDGYVVA RLAKAVGIEV TLLAQESDKP LPEEAALARE AWLNAGGEIH ASNIVWPESV DLIVDALLGT GLQQAPRESI SQLIDHANSH PAPIAAVDIP SGLLAETGAT PGAVINADHT ITFIALKPGL LTGKARDVTG QLHFDSLGLD SWLAGQETKI QRFSAEQLSH WLKPRRPTSH KGDHGRLVII GGDHGTAGAI RMTGEAALRA GAGLVRVLTR SENIAPLLTA RPELMVHELT MDSLTESLEW ADVVVIGPGL GQQEWGKKAL QKVENFRKPM LWDADALNLL AINPDKRHNR VITPHPGEAA RLLGCSVAEI ESDRLHCAKR LVQRYGGVAV LKGAGTVVAA HPDALGIIDA GNAGMASGGM GDVLSGIIGA LLGQKLSPYD AACAGCVAHG AAADVLAARF GTRGMLATDL FSTLQRIVNP EVTDKNHDES SNSAP
|
| |