Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tbd_2414 |
Symbol | |
ID | 3672294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiobacillus denitrificans ATCC 25259 |
Kingdom | Bacteria |
Replicon accession | NC_007404 |
Strand | + |
Start bp | 2484060 |
End bp | 2485784 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637711120 |
Product | phosphoenolpyruvate--protein phosphotransferase |
Protein accession | YP_316172 |
Protein GI | 74318432 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0115007 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTCT CCATGCACGG CATCGGCGTC TCGGGCGGCA TCGCGATCGG CTATGCCCAT CTCGCATCGA GCGCGCGCGT CGAAGTGCCG CAGTACATGC TCGACCACAA GTACATCAAG GACGAGCTGG CACGCTTCGA CGAGGCGATC CTCGCCACCC GCGCAGAACT CGAAACGCTC CGCAGCCACA TCCCGGCGAA CGCCCCGGCC GAGTTGTCGG CGTTTCTCGA CATGCACCTG ATGTTCCTCG GCGACTCGAT GATTGCCGAG CAGCCCAAGC GGCTGATCCG TGAAACGCAA TGCAACGCCG AGTGGGCGCT TGCGCAGCAG ATGGAAGCGC TGGTCGCGCG CTTCGACGAG ATCGACGACC CCTACCTCAG GAGCCGGCAG GAAGACGTCG TGCAGGTGGT GCAGCGCGTG CTGAAGGCGC TGATGGGCCA TCCGAGCCAT TTGCCGCTCG ACGTCGACTT CGACAGCGAG CGCATCCTGG TCGCGCACGA ATTGTCGCCG GCCGACATGG TGATCTTCAA GAACGTGCAC TTCGCGGCCT TCGTCACCGA CCTCGGCGGC ACCACCTCGC ATACCGCGAT TCTCGCCCGC AGCATGGGCA TGCCGTCGGT GATGGCGCTG CACAATGCGC GCGGGCTGAT CCGCGACCAC GATCTCCTGA TCGTCGACGG TCGCGAGGGC GTCGTCATCG TCAATCCCGA CGAGTCGGTG CTCGCCGAAT ACCGCTTGCG CCAGAACCAG TGGCGCATCG ACACCGACAA GCTCAAGCGG CTGAAGACCA GCAAGTCGGC GACGCTCGAC GGTACGCCGG TCGAGCTGAT GGCGAACATC GAGCTGCTTT CCGACATCGA TGCGGTCAAG GCCGCGGGCG CGCATGGGAT CGGCCTGTTC CGCAGCGAAT TCCTGTTCCT CAACCGCAGC GACCTGCCGA CCGAGGAGGA GCAATACGAG TCCTACAAGG CCGTCGCCGA AGCGCTCGAC GGCAAGCCGG TGACGATACG CACGCTCGAC CTGGGCGCGG ACAAGCAGGC GCCGTGGGGG CACACGGTCG CCGACAACCC TGCGCTCGGC CTGCGCGCGA TCCGTCTGTG CCTCGCCGAG CCGGGGCTCT TCCACACCCA GTTGCGCGCG ATCCTGCGGG CCTCGGCACA CGGCCGTGTC CGCATGCTGA TTCCGATGCT GGCGAACTTC GTCGAGTTGC GGCAGACGCT GCAGCGCATC GACGAGGCCA AGGACTCGCT ACGCCGCGAC GGACTGGCCT TCGACGAAGG CATCCCGGTC GGCGGCATGG TCGAGGTCCC GGCGGCGGCG CTCGCCGCGT CGTTCTTCGC CGACCAGCTC GATTTCCTGT CGATCGGCAC CAACGACCTG ATCCAGTACA CGCTCGCGAT CGACCGCGCC GACGACAGCG TCGCCCATCT CTACGACCCC TTGCACCCCG CCGTGCTGGG CCTGATCCAG CAGACGATCC GCGCCGGCGC CAAGGCGGGC AAGCCGGTCG CGGTGTGCGG GGAGATGGCG GGCGATCCCT TGCTGACGCG GCTGCTGCTG GGTCTCGGCC TGCGCACGTT CTCGATGCAT CCGGCGAGCC TGCTGCAGGT CAAGCAGCAG ATCTTGCGCT CGCACCTCGC CGAGCTCCCG GCGCTGACCC AGCGGCTGCT GAAGAACACC GATCCCGACC GCACGCTGAC GCTGCTCGGC CGCCTCAACG GCTGA
|
Protein sequence | MSFSMHGIGV SGGIAIGYAH LASSARVEVP QYMLDHKYIK DELARFDEAI LATRAELETL RSHIPANAPA ELSAFLDMHL MFLGDSMIAE QPKRLIRETQ CNAEWALAQQ MEALVARFDE IDDPYLRSRQ EDVVQVVQRV LKALMGHPSH LPLDVDFDSE RILVAHELSP ADMVIFKNVH FAAFVTDLGG TTSHTAILAR SMGMPSVMAL HNARGLIRDH DLLIVDGREG VVIVNPDESV LAEYRLRQNQ WRIDTDKLKR LKTSKSATLD GTPVELMANI ELLSDIDAVK AAGAHGIGLF RSEFLFLNRS DLPTEEEQYE SYKAVAEALD GKPVTIRTLD LGADKQAPWG HTVADNPALG LRAIRLCLAE PGLFHTQLRA ILRASAHGRV RMLIPMLANF VELRQTLQRI DEAKDSLRRD GLAFDEGIPV GGMVEVPAAA LAASFFADQL DFLSIGTNDL IQYTLAIDRA DDSVAHLYDP LHPAVLGLIQ QTIRAGAKAG KPVAVCGEMA GDPLLTRLLL GLGLRTFSMH PASLLQVKQQ ILRSHLAELP ALTQRLLKNT DPDRTLTLLG RLNG
|
| |