Gene Tmz1t_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1823 
Symbol 
ID7084245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2045064 
End bp2046950 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content73% 
IMG OID643698845 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_002355471 
Protein GI217970237 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000270616 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCACA TCCACCGCCT CTCCGACCTG CTGGTCAACC AGATCGCCGC CGGCGAGGTG 
GTCGAGCGCC CGGCCTCGGT GCTCAAGGAA ATACTCGAGA ACGCGGTCGA TGCTGGCGCG
CGCGCGATCG AGGTGCAGCT CGAGCAGGGC GGCGTGCGGC GCATCCGCGT CGCCGACGAC
GGCTGCGGCA TCGCGCGCGA CGAGCTCGCG CTCGCGCTCG AGCGCCACGC CACCAGCAAG
ATCGCCACCC TCGACGACCT CGAGCGCGTG GGCACGATGG GGTTTCGCGG CGAGGCGCTG
GCGGCCATCG CCGCGGTGGC GCGCACCGGC ATCACCAGCC GCGCCGAGGG CGCGAGCCAC
GCCTGGCGCA TCGAGGCCGG GCGCGAGCCC GAGCCGGCGG CGCTCGACCA GGGCACAGTG
GTCGACGTCG CCGACCTCTA CTACAACACC CCCGCGCGGC GGAAGTTCCT CAAGACCGAA
TCCACCGAGT TCGCCCACTG CGACGACATG TTCCGCCGCG TCGCGCTCGC GCGCCCCGAC
ATCGGCCTGC AGCTCGCCCA CAACGGCCGC GTGATCCACC GCCTCCCTCC CTCACCGCCC
GCCGCGCGGG TGGCGGCGCT GATGGGCGAC GACTTCCTGC AGCACGCCCG CGAGGTCCAG
GCCGACGCCG GGATACTGCG CCTGGCCGGC TTCGCGTCGC TGCCCGCCTA TTCGCGCGCC
AGCCGCGACG CGCAGTATTT CTTCGTCAAT GGCCGCTTCG TGCGCGACAA GCTGCTCGCC
CACGCGGTGC GCGAAGCCTA CGCCGACATC CTGCACGGCG CGCGCCATCC GGCTTACGTG
CTCTTCCTCG AGCTCGACCC CGCCGGGGTG GATGTGAACG TGCATCCGGC CAAGATCGAG
GTGCGCTTCC GCGAGTCGCG CGCGGTGCAT CAGTTCGTCT TCCACGCCGT GCGCCGCACC
CTGGCCGAGA GCGGAGCGGG ACGCGCTGCG GAGCTGCACG CGCCCGCAGC CATTGGCGTC
GGTGCCGGCC CCGCGGCCTC CCCGGCATCG GGCACCGTCC CGCAAGGGGG CTTCTCGAGC
GCCCTCCCCC CGCTTTCCGT CCCGACGGCC GCCCACCACC CCGCATCGCG GTCCTGGCCG
CCCGCAGACG CCCGCCAAGG ACGCCTGGCG ATGGAGTCGG CGAGCCGCGC CTACTTTGAT
TTCGCCGCGG GCGCACGCGA TAGCGGTGGC GCCAGCCTGC CCGACCGCCC CGGCGAGATC
CAGTCGGGCC CGCGCCCGGC GCCGACGGAG ACCGCCACGG GCGAGGCGCC GCCGCTCGGC
TACGCGCTCG GCCAGCTGCA CGGCATCTAC ATCCTCGCCC AGAATGCCGC CGGCCTCGTG
CTGGTGGACA TGCACGCCGC CCACGAACGC ATCCTCTACG AGAAGCTCAA GACCGTGATC
GACGGCCGCC CGGCCGTGCA GCGCCTGCTG ATCCCGGCGG TGTTCTCGGT GGGCGCGAAG
GACATGGCGG CGGCCGAGGA AGGCACCGAG GTACTCGCCG GCATGGGCTT CGAGATCGCC
GCCGCCGGTC CGCAGGAGCT CGTGGTGCGC AGCGTGCCCG CGCTGCTGGC GAGCGCGCCG
GTGGCCGAAC TGGTGCGCGA GCTGCTGCAG GAGCTGCGCG AATTCCCCGC CACCGAGGTC
GTCACCGCGC GCCGCAACGA GCTGCTCGCC ACCATGGCCT GCCACGGCGC GGTGCGCGCC
AATCGCCAGC TCACCCTGCC CGAGATGAAC GCCCTGCTGC GCGACATGGA GGCGACCGAG
CGCGCCGACC AGTGCAACCA CGGCCGCCCC ACGTGGACGC AGCTGAGCTT GGCCGAGCTC
GACCGCTTCT TCATGCGCGG GCAGTAG
 
Protein sequence
MPHIHRLSDL LVNQIAAGEV VERPASVLKE ILENAVDAGA RAIEVQLEQG GVRRIRVADD 
GCGIARDELA LALERHATSK IATLDDLERV GTMGFRGEAL AAIAAVARTG ITSRAEGASH
AWRIEAGREP EPAALDQGTV VDVADLYYNT PARRKFLKTE STEFAHCDDM FRRVALARPD
IGLQLAHNGR VIHRLPPSPP AARVAALMGD DFLQHAREVQ ADAGILRLAG FASLPAYSRA
SRDAQYFFVN GRFVRDKLLA HAVREAYADI LHGARHPAYV LFLELDPAGV DVNVHPAKIE
VRFRESRAVH QFVFHAVRRT LAESGAGRAA ELHAPAAIGV GAGPAASPAS GTVPQGGFSS
ALPPLSVPTA AHHPASRSWP PADARQGRLA MESASRAYFD FAAGARDSGG ASLPDRPGEI
QSGPRPAPTE TATGEAPPLG YALGQLHGIY ILAQNAAGLV LVDMHAAHER ILYEKLKTVI
DGRPAVQRLL IPAVFSVGAK DMAAAEEGTE VLAGMGFEIA AAGPQELVVR SVPALLASAP
VAELVRELLQ ELREFPATEV VTARRNELLA TMACHGAVRA NRQLTLPEMN ALLRDMEATE
RADQCNHGRP TWTQLSLAEL DRFFMRGQ