Gene Tmz1t_4031 details

Gene Information

Locus tag: Tmz1t_4031
Symbol:
ID: 7873677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4428337
End bp: 4429551
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 67%
IMG OID: 643700968
Product: DNA mismatch endonuclease Vsr
Protein accession: YP_002890991
Protein GI: 237654677
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
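
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 4428337
END_BP = 4429551


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1215
print(protein_length(length_bp))  # 404
```

Both values agree with the Gene Length and Protein Length fields in the record.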


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACGA GTGACTGGAA GATTGCCGTC GTCGGTGCAG GTATCGGTGG CCTGACCCTC 
GCCCTTGCCC TGCGCCAGCA TGGCATCGAA GTTGAACTCT ATGAGCAGAC GCCGGAGTTG
AGGGAGGTCG GTGCGGCCGT GGCGCTGTCG GCTAATGCGA CACGCTTTTA TGACCGGATC
GGCTTGCGGA GCCAGTTCGA CGAGGTCTGC TACTCCATCT CGACCCTGAT CTACCGCGAT
GGACGCGACG GCCGTGTCAT CGGCCGCCAC AGTGGTGAGC CGGACTACGA GGGCCAGTTC
GGCGCCCGCT ACTGGGGCAT TCACCGCGCC GACCTGCAAG CCATCCTGTC GCGCGCCGTC
GGCATAGAGC ACATTCACCT TGGCAAGCGC GTCAGCAACC TCAAGGATGA CGGCAACGAG
GTCGTGCTCG AGTTCGAGGA CGGCAGCTCC GTGCGTGCTG ACCTGGTAAT TGGCGGCGAC
GGCGCGCGTT CCGTCGTGCG CCGCTGGATG CTCGGGTATG ACGATGCGCT GTATTCCGGG
TGCTCGGGCT TTCGCGGCAT TGTCCCGCCG GCGATGCTCG ACCTGTTGCC CGATCCCGAG
GCCATCCAGT TCTGGATCGG CCCGGGCGCC CATCTGCTGC ATTACCCGAT CGGCAACGGC
GACCAGAACT TCCTGCTGGT CGAGCGCAGC CCCTCGCCGT GGCCGGTGCG CGAGTGGGTG
ACCGGCGCCG AGCAGGGCGA ACAGCTGCAG CGCTTCGCCG ACTGGCACCC GGCGGTAGTA
CAGATGATCA GCGCCGTACC CACCAGCCAG CGCTGGGCCT TGTTCCACCG GCCGCCGCTG
GGGCGCTGGA CGCGCGGCCG GGTGACCCTG CTCGGCGATG CCGCGCATGC ACTGGTGCCG
CACCATGGCC AGGGCGCCAA CCAGTCCATC GAGGACTCGG TGGTGCTGGC GGCGCAACTC
GCCGAAAAGG GCCCGGCACG CTTCGAGCAG GCGCTGGAGG ATTACGAGCA CCTGCGCCGC
GGCCGTACCC GCAAGGTGCA GTTCGCCTCG ATCTCGACCG CCGATGTCCT GCACCTGCCC
GACGGCCCCG CCGCCGACCT GCGCAATGCC CGCTTCGCGG ATCGCGAGGA GATGATGAAT
CACCTCGGCT GGATCCATGA CTTCGATCCG GCCACCCAGA TTCCGAGCGA GCGGCAAGGC
GGCACCTGGC TGTAA
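
The GC content reported in the Gene Information section can be recomputed from a sequence laid out like the one above. This is a minimal sketch, assuming the sequence is pasted as a string with the same 10-base blocks and line breaks. The function name `gc_percent` is illustrative. The example call uses only the first 10-base block, not the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 2 G + 2 C out of 10 bases.
print(gc_percent("ATGACAACGA"))  # 40.0
```

Passing the full gene sequence as one multi-line string works the same way, since all whitespace is removed before counting.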
 
Protein sequence
MTTSDWKIAV VGAGIGGLTL ALALRQHGIE VELYEQTPEL REVGAAVALS ANATRFYDRI 
GLRSQFDEVC YSISTLIYRD GRDGRVIGRH SGEPDYEGQF GARYWGIHRA DLQAILSRAV
GIEHIHLGKR VSNLKDDGNE VVLEFEDGSS VRADLVIGGD GARSVVRRWM LGYDDALYSG
CSGFRGIVPP AMLDLLPDPE AIQFWIGPGA HLLHYPIGNG DQNFLLVERS PSPWPVREWV
TGAEQGEQLQ RFADWHPAVV QMISAVPTSQ RWALFHRPPL GRWTRGRVTL LGDAAHALVP
HHGQGANQSI EDSVVLAAQL AEKGPARFEQ ALEDYEHLRR GRTRKVQFAS ISTADVLHLP
DGPAADLRNA RFADREEMMN HLGWIHDFDP ATQIPSERQG GTWL
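
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11, as listed in the Gene Information section. This is a minimal sketch: the codon-to-amino-acid assignments are written out by hand in the usual TCAG ordering, and the function name `translate` is illustrative. It does not handle the alternative start codons that table 11 allows, which is not an issue here because this gene starts with ATG.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate("ATGACAACGA GTGACTGGAA GATTGCCGTC"))  # MTTSDWKIAV
# Last 15 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GGCACCTGGC TGTAA"))  # GTWL
```

The two outputs match the first ten and last four residues of the protein sequence above.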