Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1823 |
Symbol | |
ID | 7084245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2045064 |
End bp | 2046950 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643698845 |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_002355471 |
Protein GI | 217970237 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000270616 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCACA TCCACCGCCT CTCCGACCTG CTGGTCAACC AGATCGCCGC CGGCGAGGTG GTCGAGCGCC CGGCCTCGGT GCTCAAGGAA ATACTCGAGA ACGCGGTCGA TGCTGGCGCG CGCGCGATCG AGGTGCAGCT CGAGCAGGGC GGCGTGCGGC GCATCCGCGT CGCCGACGAC GGCTGCGGCA TCGCGCGCGA CGAGCTCGCG CTCGCGCTCG AGCGCCACGC CACCAGCAAG ATCGCCACCC TCGACGACCT CGAGCGCGTG GGCACGATGG GGTTTCGCGG CGAGGCGCTG GCGGCCATCG CCGCGGTGGC GCGCACCGGC ATCACCAGCC GCGCCGAGGG CGCGAGCCAC GCCTGGCGCA TCGAGGCCGG GCGCGAGCCC GAGCCGGCGG CGCTCGACCA GGGCACAGTG GTCGACGTCG CCGACCTCTA CTACAACACC CCCGCGCGGC GGAAGTTCCT CAAGACCGAA TCCACCGAGT TCGCCCACTG CGACGACATG TTCCGCCGCG TCGCGCTCGC GCGCCCCGAC ATCGGCCTGC AGCTCGCCCA CAACGGCCGC GTGATCCACC GCCTCCCTCC CTCACCGCCC GCCGCGCGGG TGGCGGCGCT GATGGGCGAC GACTTCCTGC AGCACGCCCG CGAGGTCCAG GCCGACGCCG GGATACTGCG CCTGGCCGGC TTCGCGTCGC TGCCCGCCTA TTCGCGCGCC AGCCGCGACG CGCAGTATTT CTTCGTCAAT GGCCGCTTCG TGCGCGACAA GCTGCTCGCC CACGCGGTGC GCGAAGCCTA CGCCGACATC CTGCACGGCG CGCGCCATCC GGCTTACGTG CTCTTCCTCG AGCTCGACCC CGCCGGGGTG GATGTGAACG TGCATCCGGC CAAGATCGAG GTGCGCTTCC GCGAGTCGCG CGCGGTGCAT CAGTTCGTCT TCCACGCCGT GCGCCGCACC CTGGCCGAGA GCGGAGCGGG ACGCGCTGCG GAGCTGCACG CGCCCGCAGC CATTGGCGTC GGTGCCGGCC CCGCGGCCTC CCCGGCATCG GGCACCGTCC CGCAAGGGGG CTTCTCGAGC GCCCTCCCCC CGCTTTCCGT CCCGACGGCC GCCCACCACC CCGCATCGCG GTCCTGGCCG CCCGCAGACG CCCGCCAAGG ACGCCTGGCG ATGGAGTCGG CGAGCCGCGC CTACTTTGAT TTCGCCGCGG GCGCACGCGA TAGCGGTGGC GCCAGCCTGC CCGACCGCCC CGGCGAGATC CAGTCGGGCC CGCGCCCGGC GCCGACGGAG ACCGCCACGG GCGAGGCGCC GCCGCTCGGC TACGCGCTCG GCCAGCTGCA CGGCATCTAC ATCCTCGCCC AGAATGCCGC CGGCCTCGTG CTGGTGGACA TGCACGCCGC CCACGAACGC ATCCTCTACG AGAAGCTCAA GACCGTGATC GACGGCCGCC CGGCCGTGCA GCGCCTGCTG ATCCCGGCGG TGTTCTCGGT GGGCGCGAAG GACATGGCGG CGGCCGAGGA AGGCACCGAG GTACTCGCCG GCATGGGCTT CGAGATCGCC GCCGCCGGTC CGCAGGAGCT CGTGGTGCGC AGCGTGCCCG CGCTGCTGGC GAGCGCGCCG GTGGCCGAAC TGGTGCGCGA GCTGCTGCAG GAGCTGCGCG AATTCCCCGC CACCGAGGTC GTCACCGCGC GCCGCAACGA GCTGCTCGCC ACCATGGCCT GCCACGGCGC GGTGCGCGCC AATCGCCAGC TCACCCTGCC CGAGATGAAC GCCCTGCTGC GCGACATGGA GGCGACCGAG CGCGCCGACC AGTGCAACCA CGGCCGCCCC ACGTGGACGC AGCTGAGCTT GGCCGAGCTC GACCGCTTCT TCATGCGCGG GCAGTAG
|
Protein sequence | MPHIHRLSDL LVNQIAAGEV VERPASVLKE ILENAVDAGA RAIEVQLEQG GVRRIRVADD GCGIARDELA LALERHATSK IATLDDLERV GTMGFRGEAL AAIAAVARTG ITSRAEGASH AWRIEAGREP EPAALDQGTV VDVADLYYNT PARRKFLKTE STEFAHCDDM FRRVALARPD IGLQLAHNGR VIHRLPPSPP AARVAALMGD DFLQHAREVQ ADAGILRLAG FASLPAYSRA SRDAQYFFVN GRFVRDKLLA HAVREAYADI LHGARHPAYV LFLELDPAGV DVNVHPAKIE VRFRESRAVH QFVFHAVRRT LAESGAGRAA ELHAPAAIGV GAGPAASPAS GTVPQGGFSS ALPPLSVPTA AHHPASRSWP PADARQGRLA MESASRAYFD FAAGARDSGG ASLPDRPGEI QSGPRPAPTE TATGEAPPLG YALGQLHGIY ILAQNAAGLV LVDMHAAHER ILYEKLKTVI DGRPAVQRLL IPAVFSVGAK DMAAAEEGTE VLAGMGFEIA AAGPQELVVR SVPALLASAP VAELVRELLQ ELREFPATEV VTARRNELLA TMACHGAVRA NRQLTLPEMN ALLRDMEATE RADQCNHGRP TWTQLSLAEL DRFFMRGQ
|
| |