Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3074 |
Symbol | |
ID | 6969424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2845395 |
End bp | 2846657 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643386906 |
Product | putative tagatose-6-phosphate kinase |
Protein accession | YP_002271374 |
Protein GI | 209400959 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4573] Predicted tagatose 6-phosphate kinase |
TIGRFAM ID | [TIGR02810] D-tagatose-bisphosphate aldolase, class II, non-catalytic subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.00154416 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAACGT TAATTGCCCG GCATAAAGCT GGTGAACATA TCGGCATATG TTCAGTCTGT TCTGCCCATC CGTTGGTTAT CGAAGCGGCG CTGGCATTTG ATCGCAACAG CACGCGCAAA GTGCTGATTG AAGCAACGTC AAACCAGGTC AATCAATTTG GCGGTTATAC CGGAATGACA CCGGCAGACT TTCGCGAATT TGTTTTTGCG ATTGCCGATA AAGTCGGATT TGCACGCGAG CGTATTATTC TCGGCGGTGA CCATCTGGGG CCAAACTGCT GGCAGCAAGA AAATGTGGAT GCGGCGATGG AAAAATCCGT CGAGCTGGTA AAGGCATATG TTCGTGCTGG CTTCAGTAAA ATTCATCTTG ATGCGTCAAT GTCCTGCGCG GGGGATCCCA TACCGTTAGC ACCAGAAACG GTTGCGGAAC GAGCTGCTGT GCTTTGCTTT GCTGCGGAAA GTGTGGCGAC AGATTGCCAG CGTGAGCAAC TGAGCTATGT CATTGGCACC GAAGTTCCGG TTCCGGGCGG TGAGGCCAGC GCCATTCAGT CAGTACACAT CACCCATGTT GAAGATGCCG CCAATACTTT ACGTACGCAT CAAAAGGCCT TTATTGCCCG TGGGCTGACA GAGGCATTAA CACGCGTGAT TGCCATCGTG GTGCAGCCGG GTGTGGAATT TGATCACAGC AATATTATCC ATTATCAGCC GCAGGAAGCG CAGGCGCTGG CGCAATGGAT AGAAAATACC CGAATGGTTT ATGAAGCACA TTCTACCGAT TACCAGACCC GGACGGCTTA TTGGGAATTA GTCCGCGATC ACTTTGCAAT ATTGAAAGTC GGTCCCGCAT TAACCTTTGC TTTACGTGAG GCGATATTTG CGCTGGCGCA AATTGAGCAG GAACTTATCG CCCCCGAAAA TCGCAGCGGT TGCCTGGCGG TAATTGAAGA AGTGATGCTC GACGAACCGC AATACTGGAA AAAATATTAT CGCACGGGTT TTAACGATTC ATTACTGGAT ATTCGTTACA GCCTGTCGGA TCGTATTCGT TATTACTGGC CGCATAGTCG GATTAAAAAT AGCGTTGAAA CGATGATGGT GAATCTGCAA GGCGTGGACA TCCCACTGGG CATGATTAGT CAGTATCTTC CCAAACAATT TGAACGCATT CAGTCCGGGG AATTATCAGC AATACCGCAT CAGCTGATTA TGGATAAAAT TTATGATGTT TTGCGCGCCT ATCGCTACGG CTGTGCGGAA TAA
|
Protein sequence | MKTLIARHKA GEHIGICSVC SAHPLVIEAA LAFDRNSTRK VLIEATSNQV NQFGGYTGMT PADFREFVFA IADKVGFARE RIILGGDHLG PNCWQQENVD AAMEKSVELV KAYVRAGFSK IHLDASMSCA GDPIPLAPET VAERAAVLCF AAESVATDCQ REQLSYVIGT EVPVPGGEAS AIQSVHITHV EDAANTLRTH QKAFIARGLT EALTRVIAIV VQPGVEFDHS NIIHYQPQEA QALAQWIENT RMVYEAHSTD YQTRTAYWEL VRDHFAILKV GPALTFALRE AIFALAQIEQ ELIAPENRSG CLAVIEEVML DEPQYWKKYY RTGFNDSLLD IRYSLSDRIR YYWPHSRIKN SVETMMVNLQ GVDIPLGMIS QYLPKQFERI QSGELSAIPH QLIMDKIYDV LRAYRYGCAE
|
| |