Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_4031 |
Symbol | |
ID | 7873677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4428337 |
End bp | 4429551 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643700968 |
Product | DNA mismatch endonuclease Vsr |
Protein accession | YP_002890991 |
Protein GI | 237654677 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACGA GTGACTGGAA GATTGCCGTC GTCGGTGCAG GTATCGGTGG CCTGACCCTC GCCCTTGCCC TGCGCCAGCA TGGCATCGAA GTTGAACTCT ATGAGCAGAC GCCGGAGTTG AGGGAGGTCG GTGCGGCCGT GGCGCTGTCG GCTAATGCGA CACGCTTTTA TGACCGGATC GGCTTGCGGA GCCAGTTCGA CGAGGTCTGC TACTCCATCT CGACCCTGAT CTACCGCGAT GGACGCGACG GCCGTGTCAT CGGCCGCCAC AGTGGTGAGC CGGACTACGA GGGCCAGTTC GGCGCCCGCT ACTGGGGCAT TCACCGCGCC GACCTGCAAG CCATCCTGTC GCGCGCCGTC GGCATAGAGC ACATTCACCT TGGCAAGCGC GTCAGCAACC TCAAGGATGA CGGCAACGAG GTCGTGCTCG AGTTCGAGGA CGGCAGCTCC GTGCGTGCTG ACCTGGTAAT TGGCGGCGAC GGCGCGCGTT CCGTCGTGCG CCGCTGGATG CTCGGGTATG ACGATGCGCT GTATTCCGGG TGCTCGGGCT TTCGCGGCAT TGTCCCGCCG GCGATGCTCG ACCTGTTGCC CGATCCCGAG GCCATCCAGT TCTGGATCGG CCCGGGCGCC CATCTGCTGC ATTACCCGAT CGGCAACGGC GACCAGAACT TCCTGCTGGT CGAGCGCAGC CCCTCGCCGT GGCCGGTGCG CGAGTGGGTG ACCGGCGCCG AGCAGGGCGA ACAGCTGCAG CGCTTCGCCG ACTGGCACCC GGCGGTAGTA CAGATGATCA GCGCCGTACC CACCAGCCAG CGCTGGGCCT TGTTCCACCG GCCGCCGCTG GGGCGCTGGA CGCGCGGCCG GGTGACCCTG CTCGGCGATG CCGCGCATGC ACTGGTGCCG CACCATGGCC AGGGCGCCAA CCAGTCCATC GAGGACTCGG TGGTGCTGGC GGCGCAACTC GCCGAAAAGG GCCCGGCACG CTTCGAGCAG GCGCTGGAGG ATTACGAGCA CCTGCGCCGC GGCCGTACCC GCAAGGTGCA GTTCGCCTCG ATCTCGACCG CCGATGTCCT GCACCTGCCC GACGGCCCCG CCGCCGACCT GCGCAATGCC CGCTTCGCGG ATCGCGAGGA GATGATGAAT CACCTCGGCT GGATCCATGA CTTCGATCCG GCCACCCAGA TTCCGAGCGA GCGGCAAGGC GGCACCTGGC TGTAA
|
Protein sequence | MTTSDWKIAV VGAGIGGLTL ALALRQHGIE VELYEQTPEL REVGAAVALS ANATRFYDRI GLRSQFDEVC YSISTLIYRD GRDGRVIGRH SGEPDYEGQF GARYWGIHRA DLQAILSRAV GIEHIHLGKR VSNLKDDGNE VVLEFEDGSS VRADLVIGGD GARSVVRRWM LGYDDALYSG CSGFRGIVPP AMLDLLPDPE AIQFWIGPGA HLLHYPIGNG DQNFLLVERS PSPWPVREWV TGAEQGEQLQ RFADWHPAVV QMISAVPTSQ RWALFHRPPL GRWTRGRVTL LGDAAHALVP HHGQGANQSI EDSVVLAAQL AEKGPARFEQ ALEDYEHLRR GRTRKVQFAS ISTADVLHLP DGPAADLRNA RFADREEMMN HLGWIHDFDP ATQIPSERQG GTWL
|
| |