Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0841 |
Symbol | |
ID | 7084698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 929904 |
End bp | 931028 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643697865 |
Product | deoxyguanosinetriphosphate triphosphohydrolase-like protein |
Protein accession | YP_002354506 |
Protein GI | 217969272 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCAAC TCGCCCCCTA TGCCGTCACC GAGGCCGCCT CGCGCGGCCG CGTCCACGAC GAGCCCGCCC CGGTCGCGCG CGGCCAGTTC CAGCGCGACC GCGACCGCAT CGTGCATTCC ACCGCCTTCC GCCGCCTGGA ATACAAGACC CAGGTGTTCG TGAACCACGA GGGCGACCTC TTCCGCACCC GCCTCACCCA CAGCCTCGAG GTCGCCCAGC TCACCCGCGG CCTCGCGCGC GAGCTCGGTC TCAACGAGGA CCTCGCCGAG GCGATCGCGC TCGCCCACGA CCTCGGCCAC ACCCCCTTCG GCCACGCCGG GCAGGATGCG CTCAACGCCT GCATGAAGGA CTTCGGCGGC TTCGAGCACA ACCTGCAGTC GCTGCGCACG GTGGACCTGC TCGAAGACCG CTACGCCGGC TTCGACGGGC TCAACCTGAT GTTCGAGACC CGCGAGGGCA TCCTCAAGCA CTGCTCGCGC GCCAACGCCG AGCGCCTCGG CGAGCTCGGC CAGCGCTTCC TCGACAGCAC CCAGCCCTCA CTCGAGGCCC AGCTCGCCAA TCTCGCCGAC GAGATCGCCT ACAACAACCA CGATGTGGAC GACGGCCTGC GCTCAGGGCT GATCACGCTC GAGCAGCTCG ACGAGGTGCC GATCTTCGCG GTGCAGCGGC GCGAGGCCGA GGCGCGCTGG CCGGGGCTGT CGGGGCGCAA GCTGATCAAC GAGACGGTGC GACGCATGAT CCACCTGATG GTGATCGACC TCATCGAGCA GACCCGCGCC AACATCGCCG CCGAAGGCGT CCGGACGCTC GCCGACGTCC ATGCCGCGCC GCGCCTGGTG GCGTATTCCG ACACGCTGCT GCCGCGCCTG CGCGAGCTCA AGGTCTTCCT GCGCGACAAG CTCTATCGCC ACTACCAGGT GCTGCGCATG ACCAACAAGG CGCGCCGCAT CGTCGGCGAC CTGTTCACGG CGTTCATGGA CGACCCCCAC ATCCTGCCGC CGCAGTATCA GGCGATGGCG CGCGAGGACA AGCCGCGCGC CATCGCCGAC TACATCGCCG GCATGACCGA CCGCTATGCG ATGAAGGAGC ACCGGCGGCT GTTCGCGGTG GGGGAGATCC ATTAA
|
Protein sequence | MQQLAPYAVT EAASRGRVHD EPAPVARGQF QRDRDRIVHS TAFRRLEYKT QVFVNHEGDL FRTRLTHSLE VAQLTRGLAR ELGLNEDLAE AIALAHDLGH TPFGHAGQDA LNACMKDFGG FEHNLQSLRT VDLLEDRYAG FDGLNLMFET REGILKHCSR ANAERLGELG QRFLDSTQPS LEAQLANLAD EIAYNNHDVD DGLRSGLITL EQLDEVPIFA VQRREAEARW PGLSGRKLIN ETVRRMIHLM VIDLIEQTRA NIAAEGVRTL ADVHAAPRLV AYSDTLLPRL RELKVFLRDK LYRHYQVLRM TNKARRIVGD LFTAFMDDPH ILPPQYQAMA REDKPRAIAD YIAGMTDRYA MKEHRRLFAV GEIH
|
| |