Gene Tmz1t_0841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0841 
Symbol 
ID7084698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp929904 
End bp931028 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content69% 
IMG OID643697865 
Productdeoxyguanosinetriphosphate triphosphohydrolase-like protein 
Protein accessionYP_002354506 
Protein GI217969272 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAAC TCGCCCCCTA TGCCGTCACC GAGGCCGCCT CGCGCGGCCG CGTCCACGAC 
GAGCCCGCCC CGGTCGCGCG CGGCCAGTTC CAGCGCGACC GCGACCGCAT CGTGCATTCC
ACCGCCTTCC GCCGCCTGGA ATACAAGACC CAGGTGTTCG TGAACCACGA GGGCGACCTC
TTCCGCACCC GCCTCACCCA CAGCCTCGAG GTCGCCCAGC TCACCCGCGG CCTCGCGCGC
GAGCTCGGTC TCAACGAGGA CCTCGCCGAG GCGATCGCGC TCGCCCACGA CCTCGGCCAC
ACCCCCTTCG GCCACGCCGG GCAGGATGCG CTCAACGCCT GCATGAAGGA CTTCGGCGGC
TTCGAGCACA ACCTGCAGTC GCTGCGCACG GTGGACCTGC TCGAAGACCG CTACGCCGGC
TTCGACGGGC TCAACCTGAT GTTCGAGACC CGCGAGGGCA TCCTCAAGCA CTGCTCGCGC
GCCAACGCCG AGCGCCTCGG CGAGCTCGGC CAGCGCTTCC TCGACAGCAC CCAGCCCTCA
CTCGAGGCCC AGCTCGCCAA TCTCGCCGAC GAGATCGCCT ACAACAACCA CGATGTGGAC
GACGGCCTGC GCTCAGGGCT GATCACGCTC GAGCAGCTCG ACGAGGTGCC GATCTTCGCG
GTGCAGCGGC GCGAGGCCGA GGCGCGCTGG CCGGGGCTGT CGGGGCGCAA GCTGATCAAC
GAGACGGTGC GACGCATGAT CCACCTGATG GTGATCGACC TCATCGAGCA GACCCGCGCC
AACATCGCCG CCGAAGGCGT CCGGACGCTC GCCGACGTCC ATGCCGCGCC GCGCCTGGTG
GCGTATTCCG ACACGCTGCT GCCGCGCCTG CGCGAGCTCA AGGTCTTCCT GCGCGACAAG
CTCTATCGCC ACTACCAGGT GCTGCGCATG ACCAACAAGG CGCGCCGCAT CGTCGGCGAC
CTGTTCACGG CGTTCATGGA CGACCCCCAC ATCCTGCCGC CGCAGTATCA GGCGATGGCG
CGCGAGGACA AGCCGCGCGC CATCGCCGAC TACATCGCCG GCATGACCGA CCGCTATGCG
ATGAAGGAGC ACCGGCGGCT GTTCGCGGTG GGGGAGATCC ATTAA
 
Protein sequence
MQQLAPYAVT EAASRGRVHD EPAPVARGQF QRDRDRIVHS TAFRRLEYKT QVFVNHEGDL 
FRTRLTHSLE VAQLTRGLAR ELGLNEDLAE AIALAHDLGH TPFGHAGQDA LNACMKDFGG
FEHNLQSLRT VDLLEDRYAG FDGLNLMFET REGILKHCSR ANAERLGELG QRFLDSTQPS
LEAQLANLAD EIAYNNHDVD DGLRSGLITL EQLDEVPIFA VQRREAEARW PGLSGRKLIN
ETVRRMIHLM VIDLIEQTRA NIAAEGVRTL ADVHAAPRLV AYSDTLLPRL RELKVFLRDK
LYRHYQVLRM TNKARRIVGD LFTAFMDDPH ILPPQYQAMA REDKPRAIAD YIAGMTDRYA
MKEHRRLFAV GEIH