Gene Information |
Locus tag | ECH74115_5683 |
Symbol | |
ID | 6967054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5323006 |
End bp | 5324538 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643389316 |
Product | hypothetical protein |
Protein accession | YP_002273709 |
Protein GI | 209398092 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region |
|
|
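The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS span includes the stop codon. Both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose span includes the stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(5323006, 5324538)
print(length_bp)                  # 1533
print(protein_length(length_bp))  # 510
```

Under these assumptions the computed values agree with the Gene Length (1533 bp) and Protein Length (510 aa) fields of the record.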
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000899207 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAA ACCCCGTAAG TATACCACAC ACCGTCTGGC ACGCCGACGA TATCCGCCGC GGAGAACGCG AGGCGGCAGA TGCGCTGGGG CTCACACTCT ATGAGCTGAT GCTTCGCGCT GGCGAGGCCG CATTCCAGGT GTGTCGTTCG GCGTATCCTG ACGCCCGCCA CTGGCTGGTG TTGTGCGGTC ATGGTAATAA CGGCGGCGAT GGCTACGTGG TCGCGCGACT GGCCAAAGCG GTCGGCATTG AGGTCACGTT GTTGGCCCAG GAGAGCGATA AACCGTTGCC GGAAGAGGCC GCGCTGGCAC GCGAGGCATG GTTAAACGCG GGAGGCGAGA TCCATGCTTC GAATATTGTC TGGCCCGAGT CGGTAGCTCT GATTGTTGAT GCGCTGCTCG GTACCGGCTT GCAGCAAGCA CCCCGCGAAT CCATTAGCCA GTTAATCGAC CACGCTAATT CCCATCCTGC GCCGATTGTG GCGGTTGATA TCCCTTCCGG CCTGCTGGCT GAAACCGGCG CTACGCCAGG CGCAGTGATC AACGCCGATC ACACCATCAC TTTTATTGCG CTGAAACCAG GCTTGCTCAC TGGAAAAGCG CGGGATGTTA CAGGACAACT GCATTTTGAC TCACTGGGGC TGGATAGCTG GCTGGCAGGT CAGGAGACGA AAATTCAGCG GTTTTCGGCA GAACAACTTT CTCACTGGCT GAAACCGCGT CGCCCGACTT CGCATAAAGG CGATCACGGG CGGCTGGTGA TTATCGGTGG CGATCACGGC ACGGCGGGGG CTATTCGTAT GACGGGGGAA GCGGCGCTAC GTGCTGGTGC TGGTTTAGTC CGAGTACTGA CCCGCAGTGA GAACATTGCG CCGCTGCTGA CTGCACGACC AGAATTGATG GTGCATGAAC TGACTATGGA CTCTCTTACC GAAAGCCTGG AATGGGCCGA TGTGGTGGTG ATTGGTCCCG GTCTGGGCCA GCAAGAGTGG GGGAAAAAAG CCCTGCAAAA AGTTGAGAAT TTTCGCAAAC CGATGTTGTG GGATGCCGAT GCATTGAACC TGCTGGCAAT CAATCCCGAT AAGCGTCACA ATCGCGTGAT CACGCCGCAT CCTGGCGAGG CCGCACGGTT GTTAGGCTGT TCCGTCGCTG AAATTGAAAG TGACCGCTTA CATTGCGCCA AACGTCTGGT ACAACGTTAT GGCGGCGTAG CGGTGCTGAA AGGTGCCGGA ACCGTGGTCG CCGCCCATCC TGACGCTTTA GGCATTATTG ATGCCGGAAA TGCAGGCATG GCAAGCGGCG GCATGGGCGA TGTGCTCTCT GGTATTATTG GCGCATTGCT TGGGCAAAAA CTGTCGCCGT ATGATGCAGC CTGTGCAGGC TGTGTCGCGC ACGGTGCGGC AGCTGACGTA CTGGCGGCGC GTTTTGGAAC GCGCGGGATG CTGGCAACCG ATCTCTTTTC CACGCTACAG CGTATTGTTA ACCCGGAAGT GACTGATAAA AACCATGATG AATCGAGTAA TTCCGCTCCC TGA
|
Protein sequence | MKKNPVSIPH TVWHADDIRR GEREAADALG LTLYELMLRA GEAAFQVCRS AYPDARHWLV LCGHGNNGGD GYVVARLAKA VGIEVTLLAQ ESDKPLPEEA ALAREAWLNA GGEIHASNIV WPESVALIVD ALLGTGLQQA PRESISQLID HANSHPAPIV AVDIPSGLLA ETGATPGAVI NADHTITFIA LKPGLLTGKA RDVTGQLHFD SLGLDSWLAG QETKIQRFSA EQLSHWLKPR RPTSHKGDHG RLVIIGGDHG TAGAIRMTGE AALRAGAGLV RVLTRSENIA PLLTARPELM VHELTMDSLT ESLEWADVVV IGPGLGQQEW GKKALQKVEN FRKPMLWDAD ALNLLAINPD KRHNRVITPH PGEAARLLGC SVAEIESDRL HCAKRLVQRY GGVAVLKGAG TVVAAHPDAL GIIDAGNAGM ASGGMGDVLS GIIGALLGQK LSPYDAACAG CVAHGAAADV LAARFGTRGM LATDLFSTLQ RIVNPEVTDK NHDESSNSAP
|
| |
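The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a gene sequence might be cleaned, checked for GC content, and translated. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares with the standard code; alternative start codons allowed by table 11 are not handled here, which does not matter for this gene because it starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position
# (the layout used in the NCBI genetic code tables).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[pos:pos + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGAAGAAAA ACCCCGTAAG TATACCACAC"
print(translate(prefix))  # MKKNPVSIPH
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be compared in the same way.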