Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0969 |
Symbol | |
ID | 6145820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 979763 |
End bp | 981025 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641615856 |
Product | putative tagatose-6-phosphate kinase |
Protein accession | YP_001743048 |
Protein GI | 170681136 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4573] Predicted tagatose 6-phosphate kinase |
TIGRFAM ID | [TIGR02810] D-tagatose-bisphosphate aldolase, class II, non-catalytic subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.566701 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACGT TAATTGCCCG GCATAAAGCT GGTGAACATA TCGGCATATG TTCAGTCTGT TCTGCCCATC CGTTGGTTAT CGAAGCGGCG CTGGCATTTG ATCGCAACAG CACGCGCAAA GTGCTGATTG AAGCAACGTC AAACCAGGTC AATCAATTTG GCGGTTATAC CGGAATGACA CCGGCAGACT TTCGCGAATT TGTTTTTGCG ATTGCCAATA AAGTTGGGTT TGCACGCGAA CGCATTATTC TCGGCGGCGA TCATCTGGGG CCAAACTGCT GGCAGCAAGA AAATGCGGAT GCGGCGATGG AAAAATCCGT CGAGCTGGTA AAGGCATATG TTCGTGCCGG CTTCAGTAAA ATTCATCTTG ATGCGTCAAT GTCCTGCGCG GGGGATCCTA TACCGTTAGC CCCAGAAACG GTTGCTGAAC GAGCTGCTGT GCTTTGTCTG GCGGCGGAAA GTGTGGCGAC AGATTGCCAG CGTGAGCAAC TGAGCTATGT CATTGGCACC GAAGTTCCGG TTCCGGGTGG TGAGGCCAGC GCCATTCAGT CAGTACACAT CACCCAGGTT GAAGATGCCG CCAATACTTT ACGCACGCAT CAAAAGGCCT TTATTGCCCG TGGTCTGGCA GAGGCGTTAA CACGCGTGAT TGCCATCGTG GTGCAGCCTG GTGTGGAATT TGACCATAGC AATATTATTC ATTATCAGGC GCAAGAAGCT CAGGCGCTGG CGCAATGGAT AGAAAAAACC AAAATGGTTT ATGAAGCGCA TTCCACCGAT TACCAGACTC AGGCGGCCTA TCGGGAATTA GTCCGCGATC ACTTTGCAAT ATTGAAAGTC GGTCCCGCAT TAACCTTTGC TTTACGTGAG GCGGTATTTG CGCTGGCACA AATTGAGCAG GAACTTATCG CCCCCGAAAA TCGCAGCGGT TGCCTGGCGG TAATTGAAGA AGTAATGCTC GACGAACCGC AGTACTGGAA GAAATATTAT CGCACGGGTT TTCACGATTC ATTACTGGAT ATTCGTTACA GCCTGTCGGA TCGTATTCGT TATTACTGGC CGCATAGCCG GATTAAAAAT AGCGTCGAAA CGATGATGGT GAATCTGGAA GGCGTGGACA TCCCACTGGG CATGATTAGT CAGTATCTTC CCAAACAATT TGAACGCATT CAGTCCGGGG AATTATCAGC AATACCGCAT CAGTTGATTA TGGATAAAAT TTATGATGTT TTGCGCGCCT ATCGCTACGG CTGTGCGGAA TAA
|
Protein sequence | MKTLIARHKA GEHIGICSVC SAHPLVIEAA LAFDRNSTRK VLIEATSNQV NQFGGYTGMT PADFREFVFA IANKVGFARE RIILGGDHLG PNCWQQENAD AAMEKSVELV KAYVRAGFSK IHLDASMSCA GDPIPLAPET VAERAAVLCL AAESVATDCQ REQLSYVIGT EVPVPGGEAS AIQSVHITQV EDAANTLRTH QKAFIARGLA EALTRVIAIV VQPGVEFDHS NIIHYQAQEA QALAQWIEKT KMVYEAHSTD YQTQAAYREL VRDHFAILKV GPALTFALRE AVFALAQIEQ ELIAPENRSG CLAVIEEVML DEPQYWKKYY RTGFHDSLLD IRYSLSDRIR YYWPHSRIKN SVETMMVNLE GVDIPLGMIS QYLPKQFERI QSGELSAIPH QLIMDKIYDV LRAYRYGCAE
|
| |