Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2231 |
Symbol | |
ID | 5592452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2217408 |
End bp | 2218670 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640921361 |
Product | putative tagatose-6-phosphate kinase |
Protein accession | YP_001458897 |
Protein GI | 157161579 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4573] Predicted tagatose 6-phosphate kinase |
TIGRFAM ID | [TIGR02810] D-tagatose-bisphosphate aldolase, class II, non-catalytic subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 0.26036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACGT TAATTGCCCG GCATAAAGCT GGTGAACATA TCGGCATATG TTCAGTCTGT TCTGCCCATC CGTTGGTTAT CGAAGCGGCG CTGGCATTTG ATCGCAACAG CACGCGCAAA GTGCTGATTG AAGCAACGTC AAACCAGGTC AATCAATTTG GCGGTTATAC CGGAATGACA CCGGCAGACT TTCGCGAATT TGTTTTTACG ATTGCCGATA AAGTTGGGTT TGCACGCGAA CGCATTATTC TCGGCGGCGA TCATCTGGGG CCAAACTGCT GGCAGCAAGA AAATGCGGAT GCGGCGATGG AAAAATCCGT CGAGCTGGTA AAGGAATATG TTCGTGCCGG CTTCAGTAAA ATTCATCTTG ATGCGTCAAT GTCCTGCGCG GGGGATCCCA TACCGTTAGC ACCAGAAACG GTTGCGGAAC GAGCTGCTGT GCTTTGCTTT GCTGCGGAAA GTGTGGCGAC AGATTGCCAG CGTGAGCAAC TGAGCTATGT CATTGGCACC GAAGTTCCGG TTCCGGGCGG TGAGGCCAGC GCCATTCAGT CAGTACACAT CACCCATGTT GAAGATGCCG CCAATACTTT ACGTACGCAT CAAAAGGCCT TTATTGCCCG TGGGCTGACA GAGGCGTTAA CACGTGTGAT TGCCATCGTG GTGCAGCCGG GTGTGGAATT TGATCACAGC AATATTATCC ATTATCAGCC GCAGGAAGCG CAGCCGCTGG CGCAATGGAT AGAAAACACC CGAATGGTTT ATGAAGCACA TTCTACCGAT TACCAGACCC GGACGGCTTA TTGGGAATTA GTCCGCGATC ACTTTGCAAT ATTGAAAGTC GGTCCCGCAT TAACCTTTGC TTTACGCGAG GCGATATTTG CACTGGCACA AATTGAGCAG GAACTTATCG CCCCTGAAAA TCGCAGCGGT TGCCTGGCGG TAATTGAAGA AGTGATGCTC GACGAACCGC AATACTGGAA AAAATATTAT CGTACGGGTT TTAACGATTC ATTACTGGAT ATTCGTTACA GCCTGTCGGA TCGTATTCGT TATTACTGGC CGCATAGTCG GATTAAAAAT AGCGTCGATA CGATGATGGT GAATCTTGAA GGCGTGGACA TCCCACTGGG CATGATTAGT CAGTATCTTC CCAAACAATT TGAACGCATT CAGTCCGGGG AATTATCAGC AATACCGCAT CAGCTGATTA TGGATAAAAT TTATGATGTT TTGCGCGCCT ATCGCTACGG CTGTGCGGAA TAA
|
Protein sequence | MKTLIARHKA GEHIGICSVC SAHPLVIEAA LAFDRNSTRK VLIEATSNQV NQFGGYTGMT PADFREFVFT IADKVGFARE RIILGGDHLG PNCWQQENAD AAMEKSVELV KEYVRAGFSK IHLDASMSCA GDPIPLAPET VAERAAVLCF AAESVATDCQ REQLSYVIGT EVPVPGGEAS AIQSVHITHV EDAANTLRTH QKAFIARGLT EALTRVIAIV VQPGVEFDHS NIIHYQPQEA QPLAQWIENT RMVYEAHSTD YQTRTAYWEL VRDHFAILKV GPALTFALRE AIFALAQIEQ ELIAPENRSG CLAVIEEVML DEPQYWKKYY RTGFNDSLLD IRYSLSDRIR YYWPHSRIKN SVDTMMVNLE GVDIPLGMIS QYLPKQFERI QSGELSAIPH QLIMDKIYDV LRAYRYGCAE
|
| |