Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0467 |
Symbol | |
ID | 7084978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 529182 |
End bp | 530912 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643697499 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_002354141 |
Protein GI | 217968907 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.688262 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTCA CGCTGCACGG CCTGGCGGTC TCGCAGGGCA TCGCGATCGG CCATGTCCAC CTCGTCTCTC ACGCGCTGCT CGAGGTCAAT CACTACCACG TCGCGCCGCG CTACCTCGAC GATGAACTCG CCCGTCTCGA CGAGGCCCTC GCCACCGTCC AGGGCGAGCT GATCGGTCTC AAGGCGATCA CCACCTCCGG CCAGGCGCAC AGCGAGGTCG GCGCCTTCGT CGATCTGCAG CTGATGATGC TCGCCGACCC GATGCTGGTC GACGCCGCCC GCCAGCTCGT CACCGAGCGC CGCTGCAACG CCGAGTGGGC GCTGGTGCAG CAGATGGAGC ACCTGGTCGA GCAGTTCCGC CAGATCGAGG ACCCCTACCT GCGCGAACGC CAGGCCGATG TCGTCCAGGT CGTCGAGCGC CTGGTCAAGG TGCTGCTCGG CCACCCCGGC CACCTGCCGC CGCGCCGGCG CGACGGGCTG GGCAGCATCG TCGTCGCCCA CGATCTGTCG CCCGCCGACA CCATCGGCTT CCGCGACCAC AACATCGCCG GCTTCGTCAC CGACGTCGGC GGCCCCACCA GCCACACCGC GATCGTCGCG CGCAGCCTCA AGATCCCCGC GGTGGTCGGC CTGCACCACG TGCGCGACCT GCTCGAGGAC GACGAGCTCA TCATCGTCGA CGGCACCCGC GGCGTGATCA TCGTCGCCCC CGACGAGCAG ATCGTCGAGG AATACCGCCT GCGCCGCAGC GAGCTCGAGA TCGAGCGTTC CAAGCTCAAC CGCCTGCGCG ACACCCGCGC CGCCACGCTC GACGGCGAGG AGGTCAACCT GCTCGCCAAC ATCGAGGGCC CCAAGGACCT GCCCGCGGTC AAGAGCGCCA ACGCCGACGG CATCGGCCTG TACCGCACCG AGTTCCTCTT CATCGGCCGC GACACCCTGC CCGACGAGGA AGAGCAGTAC GAGGCCTACC GCGCCGTGGT CAAGGCCCTG CCCGGCAAGC CGGTCACCAT CCGCACCTTC GACGTCGGCG CCGACAAGGC GCTCAACGGT GCCCAGGCGC GCGAGGAACC CAACCCGGCG CTCGGCCTGC GCGCGGTGCG CTACTCGCTG GCCGAGCCCA AGATGTTCAA GACGCAGCTG CGCGCGCTGC TGCGCGCCTC GGCCCACGGC CGCCTGCAGA TCATGGTGCC GATGCTCGCC CACGCCCAGG AGGTCGACCA GTGCCTGACC CTGCTCGAGA AGGCCAAGGC CGAGCTGCGC GCCGAAAAGA TCAAGTTCGA CGAGGGCATC CGCATCGGCG GCATGATCGA GGTGCCCGCC GCGGCGCTGT CGCTCGGCAT GTTCATCCGG CGGCTGGCCT TCCTGTCGAT CGGCACCAAC GACCTCATCC AGTACACCCT GGCGATCGAC CGCTCGGACG AGGCCGTCGC GCACCTCTAC GACCCGCTCC ACCCCGCCGT GCTCAAGCTC ATCGCCGGCA CCATCTCCAG CGGCGCGCGC TTCGGCCTGC CGGTGTCGGT GTGCGGCGAG ATGGCGGGCG ACCCGCTGTA CACCGAGCTG CTGCTCGGCA TGGGCCTGCG CAACTTCTCG ATGCACCCGG GCAGCATCCT CGAGATCAAG CAGCAGGTGC TGCGCGCCGA CCTCGGCGAG CTCGCCCCCA GGGTGCAGCG CATCCTCAAG ATGGACGAGC CGGCACGCAT CCGCGAGGCG GTGGAACGGC TCACCGCCTG A
|
Protein sequence | MSFTLHGLAV SQGIAIGHVH LVSHALLEVN HYHVAPRYLD DELARLDEAL ATVQGELIGL KAITTSGQAH SEVGAFVDLQ LMMLADPMLV DAARQLVTER RCNAEWALVQ QMEHLVEQFR QIEDPYLRER QADVVQVVER LVKVLLGHPG HLPPRRRDGL GSIVVAHDLS PADTIGFRDH NIAGFVTDVG GPTSHTAIVA RSLKIPAVVG LHHVRDLLED DELIIVDGTR GVIIVAPDEQ IVEEYRLRRS ELEIERSKLN RLRDTRAATL DGEEVNLLAN IEGPKDLPAV KSANADGIGL YRTEFLFIGR DTLPDEEEQY EAYRAVVKAL PGKPVTIRTF DVGADKALNG AQAREEPNPA LGLRAVRYSL AEPKMFKTQL RALLRASAHG RLQIMVPMLA HAQEVDQCLT LLEKAKAELR AEKIKFDEGI RIGGMIEVPA AALSLGMFIR RLAFLSIGTN DLIQYTLAID RSDEAVAHLY DPLHPAVLKL IAGTISSGAR FGLPVSVCGE MAGDPLYTEL LLGMGLRNFS MHPGSILEIK QQVLRADLGE LAPRVQRILK MDEPARIREA VERLTA
|
| |