Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3846 |
Symbol | |
ID | 6066898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4201463 |
End bp | 4202995 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641603258 |
Product | hypothetical protein |
Protein accession | YP_001726777 |
Protein GI | 170021823 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000624619 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000292473 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGAAAA ACCCCGTAAG TATACCACAC ACCGTCTGGT ACGCCGACGA TATCCGCCGC GGAGAACGCG AGGCGGCAGA TGTGCTGGGG CTCACACTCT ATGAGCTGAT GCTTCGCGCT GGCGAGGCCG CATTCCAGGT GTGTCGTTCG GCGTATCCTG ACGCCCGCCA CTGGCTGGTG CTGTGCGGTC ATGGTAATAA CGGCGGCGAT GGCTACGTGG TCGCGCGACT GGCCAAAGCG GTCGGCATTG AGGTCACGTT GTTGGCCCAG GAGAGCGACA AACCGTTGCC GGAAGAGGCC GCGCTGGCAC GCGAAGCATG GTTAAACGCG GGTGGCGAGA TCCATGCTTC GAATATTGTC TGGCCCGAAT CGGTAGATCT GATTGTTGAT GCGCTGCTCG GTACCGGTTT GCGGCAAGCG CCCCGCGAAT CCATTAGCCA GTTAATCGAC CACGCTAATT CCCATCCTGC GCCGATTGTG GCGGTTGATA TCCCTTCCGG CCTGCTGGCT GAAACTGGCG CTACGCCAGG CGCGGTGATC AACGCCGATC ACACCATCAC TTTTATTGCG CTGAAACCAG GCTTGCTCAC TGGAAAAGCG CGGGATGTTA CCGGACAACT GCATTTTGAC TCACTGGGGC TGGATAGTTG GCTGGCAGGT CAGGAGACGA AAATTCAGCG GTTTTCAGCA GAACAACTTT CTCACTGGCT AAAACCGCGT CGCCCGACTT CGCATAAAGG CGATCACGGG CGGCTGGTAA TTATCGGTGG CGATCACGGC ACGGCGGGGG CTATTCGTAT GACGGGGGAA GCGGCGCTGC GTGCTGGTGC TGGTTTAGTC CGAGTACTGA CCCGCAGTGA AAACATTGCG CCGCTGCTGA CTGCACGACC GGAATTGATG GTGCATGAAC TGACGATGGA CTCTCTTACC GAAAGCCTGG AATGGGCCGA TGTGGTGGTG ATTGGTCCCG GTCTGGGCCA GCAAGAGTGG GGGAAAAAAG CACTGCAAAA AGTTGAGAAT TTTCGCAAAC CGATGTTGTG GGATGCCGAT GCATTGAACC TGCTGGCAAT CAATCCCGAT AAGCGTCACA ATCGCGTGAT CACGCCGCAT CCTGGCGAGG CCGCACGGTT GTTAGGCTGT TCCGTCGCTG AAATTGAAAG TGACCGCTTA CATTGCGCCA AACGTCTGGT ACAACGTTAT GGCGGCGTAG CGGTGCTGAA AGGTGCCGGA ACCGTGGTCG CCGCCCATCC TGACGCTTTA GGCATTATTG ATGCCGGAAA TGCAGGCATG GCGAGCGGCG GCATGGGCGA TGTGCTCTCT GGTATTATTG GCGCATTGCT TGGGCAAAAA CTGTCGCCGT ATGATGCAGC CTGTGCAGGC TGTGTCGCGC ACGGTGCGGC AGCTGACGTA CTGGCGGCGC GTTTTGGAAC GCGCGGGATG CTGGCAACCG ATCTCTTTTC CACGCTACAG CGTATTGTTA ACCCGGAAGT GACTGATAAA AACCATGATG AATCGAGTAA TTCCGCTCCC TGA
|
Protein sequence | MKKNPVSIPH TVWYADDIRR GEREAADVLG LTLYELMLRA GEAAFQVCRS AYPDARHWLV LCGHGNNGGD GYVVARLAKA VGIEVTLLAQ ESDKPLPEEA ALAREAWLNA GGEIHASNIV WPESVDLIVD ALLGTGLRQA PRESISQLID HANSHPAPIV AVDIPSGLLA ETGATPGAVI NADHTITFIA LKPGLLTGKA RDVTGQLHFD SLGLDSWLAG QETKIQRFSA EQLSHWLKPR RPTSHKGDHG RLVIIGGDHG TAGAIRMTGE AALRAGAGLV RVLTRSENIA PLLTARPELM VHELTMDSLT ESLEWADVVV IGPGLGQQEW GKKALQKVEN FRKPMLWDAD ALNLLAINPD KRHNRVITPH PGEAARLLGC SVAEIESDRL HCAKRLVQRY GGVAVLKGAG TVVAAHPDAL GIIDAGNAGM ASGGMGDVLS GIIGALLGQK LSPYDAACAG CVAHGAAADV LAARFGTRGM LATDLFSTLQ RIVNPEVTDK NHDESSNSAP
|
| |