Gene Information |
Locus tag | Tmz1t_2284 |
Symbol | |
ID | 7083716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2570452 |
End bp | 2573145 |
Gene Length | 2694 bp |
Protein Length | 897 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643699303 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002355919 |
Protein GI | 217970685 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
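
The coordinate fields above agree with each other: if Start bp and End bp are read as an inclusive, 1-based interval, their difference plus one gives the gene length, and dropping the terminal stop codon gives the protein length. A minimal Python sketch of that check follows. The function names are illustrative only and are not part of the database; the inclusive 1-based convention is an assumption that the listed values happen to satisfy.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive, 1-based interval (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len // 3 - 1


length = gene_length(2570452, 2573145)
print(length)                  # 2694
print(protein_length(length))  # 897
```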
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.339354 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCAATCC GCAAGCCGCC CTTCGGCGAC ACGGCCATCC GCCCCGCCGC GCTGAGCGAG GCCGAGATCG GCGCCCACAC TCCGATGATG CAGCAGTACC TCCGGATCAA GCAGCAGCAT CCGGACACCC TGCTCTTCTA CCGCATGGGC GACTTCTACG AGCTCTTCTT CGACGACGCC GAGAAGGCCG CGCGCCTGCT CGACATCACG CTCACCACCC GCGGCCAGTC GGGCGGCAAG CCGATCCGCA TGGCGGGCGT GCCCTTCCAC GCGGTCGAGC AATACCTCGC GCGCCTGGTC AAGCTCGGCG AATCGGTGGT GATCGCCGAG CAGGTGGGCG AGCCGGGCGC GAACAAGGGG CCGATGGAGC GCGCGGTGAG CCGCATCGTC ACCCCCGGTA CGCTCACGGA CGCCGCGCTG CTCGACGACC GCCGCGACTC CTTGCTGCTC GCCGCCAGCC TGCACCGCGG CGTGCTCGGG CTGGCCTGGC TGAATCTCGC CAACGGCGAC TTCCGCCTCA TGCAGTGCCC ATCGGACGCA CTGCAGGCGC AGTTCGAGCG CCTGCGCCCG GCCGAGGTGC TGGTGCCCGA CGGCCTCGCC CTGCCGCTCC TCGACACCCT CGCCCCCGCG CTGCGCCGCC TGGCCGACTG GCAGTTCGAC GCCGAGACCG GCACGCGGCT GCTGACGACC CACTTCGGCA CGCGAGACCT CGCCGGCTTC GGCGTCGAGG ACCTGCCGGT GGCACTCGGC GCCGCCGCGG CCTTGTACGA CTACGCCCAG GCCACCCAGC GCCAGACGCT CTCCCACGTC ACCGGCCTCG TGGTCGAACG CGAGTCGGAA TATCTGCGCC TGGACGCCGC AACCCGGCGC AACCTGGAGC TCACCGAGAC CCTGCGCGGC GAGTCCTCGC CCACCCTGCT GTCGCTGCTC GATACCTGCG TGACCAGCAT GGGTTCGCGC TGGCTGCGCC ATGCGTTGCA CCACCCGCTG CGCGAGCGCG CCGAGCCCGC CGCGCGCCAT GCCGCGGTGG CGGAGCTGGT GGGTACGGTC GAAGGCGAGA TGAGCGAGCA CACCGGCGCA GCCCCCGGCG GCTTCGCGCT CGGTGGCCGC GACGGCCGCA TCGCCTTCGC GGTGCGCAGC GCGCTGCGCG GCGTGGCCGA CGTCGACCGC ATCACCGCCC GCATCGCGCT GCGCAGCGCG CGCCCGCGCG ACCTCTCGGC GCTGCGCGAG AGCCTGCTGC GGCTGCCCGA GCTCGCCGCC GCGCTCGCAC CCTGCCGGGC GCCGCTGCTC GCCGAGATCG TCGGCGCGCT CGCCATCGCC CCCGAGCCGC TCGCCCTGCT GCAGCAAGCC ATCGCCGCCG AGCCCGCCGC CATGGTGCGC GACGGCGGCG TGATCGCCCC CGGCTTTGAC ACCGAGCTCG ACGAACTGCG CGGTATCCAG ACCAATTGCG GCGAATTCCT CATGGCGCTG GAGACCCGCG AGCGCGAGCG CAGCGGCATC CCCGGGCTCA AGGTCGAGTT CAACAAGGTG CACGGCTTCT ACATCGAGGT CAGCCGCGCC AACGCCGACA AGGTGCCCGA CGACTACCGC CGCCGCCAGA CGCTGAAGAA CGCCGAGCGC TACATCACGC CCGAGCTCAA GGCCTTCGAG GACAAGGCGC TGTCGGCCAA CGAGCGTGCG CTGGCGCGCG AGAAGACGCT CTACGACCAG GTGCTCGAGG TGCTCGCCGC CCACATCCCG GCGCTGCAGC GCATCGCGCG CGCGCTGGCC CTGCTCGACG GCCTCGCCGC GCTGGCCGAG 
GCCGCGCTGC GTTACGGCTA CGTACAGCCA CGCTTCCTGG AGACACCCGG GCTCACCATT ACAGGCGGCC GCCATCCGGT GGTCGAGCGC CAGGTCGAGA GCTTCATCCG CAACGACGCC CGCCTGGCCG CCACGCGCCG CATGCTGATG ATCACCGGCC CCAACATGGG CGGCAAATCG ACCTTCATGC GCCAGGTCGC GCTGATCTGC CTGCTCGCCC ATGTCGGCAG CTTCGTGCCG GCGGATGCCG TCGAGCTCGG CCCGCTCGAC GCCATCTTCA CTCGCATCGG CGCCTCCGAC GACCTCGCCT CGGGGCGCTC GACCTTCATG GTCGAGATGA CCGAGGCCGC CGCCATCCTG CACGGCGCCA CCGAGCGCAG CCTGGTGCTG ATGGACGAGA TCGGCCGCGG CACCTCGACC TTCGACGGCC TCGCGCTCGC GTTCGCGATC GCCCGCCACC TGCTGGAGAA GTCGCGCGCG CTGACACTGT TCGCCACGCA TTATTTCGAG CTCACCCGGC TCAACGCCGA CTATCCCGAG TGCGCCAACG TGCATCTGGA CGCGGTCGAG CACGGCCACC GCATCGTCTT CCTGCACGCG CTCGAAGAAG GCCCGGCGAG CCAGAGCTAC GGCATCGAGG TCGCCGCGCT CGCCGGCATC CCGGCCGCGG TGATCCGCGA CGCCAAGCGC CGGCTGCGCG CGCTGGAGAA CCGCGAGATC GACGCCGGCC CGCAGGCCGA CCTCTTCGCC GCCCTGCCCG AGCCGGAGGA CGCGCCGCTG TCGCACCCGG TGCTCAGCGC GCTCGCCGAC ATCGACCCCG ATGCACTGAG CCCGCGCGAG GCGCTGGAAC GCCTCTACGC CCTCAAGCGC CTGGCCGGAG ACCGCAACAC ATGA
|
Protein sequence | MAIRKPPFGD TAIRPAALSE AEIGAHTPMM QQYLRIKQQH PDTLLFYRMG DFYELFFDDA EKAARLLDIT LTTRGQSGGK PIRMAGVPFH AVEQYLARLV KLGESVVIAE QVGEPGANKG PMERAVSRIV TPGTLTDAAL LDDRRDSLLL AASLHRGVLG LAWLNLANGD FRLMQCPSDA LQAQFERLRP AEVLVPDGLA LPLLDTLAPA LRRLADWQFD AETGTRLLTT HFGTRDLAGF GVEDLPVALG AAAALYDYAQ ATQRQTLSHV TGLVVERESE YLRLDAATRR NLELTETLRG ESSPTLLSLL DTCVTSMGSR WLRHALHHPL RERAEPAARH AAVAELVGTV EGEMSEHTGA APGGFALGGR DGRIAFAVRS ALRGVADVDR ITARIALRSA RPRDLSALRE SLLRLPELAA ALAPCRAPLL AEIVGALAIA PEPLALLQQA IAAEPAAMVR DGGVIAPGFD TELDELRGIQ TNCGEFLMAL ETRERERSGI PGLKVEFNKV HGFYIEVSRA NADKVPDDYR RRQTLKNAER YITPELKAFE DKALSANERA LAREKTLYDQ VLEVLAAHIP ALQRIARALA LLDGLAALAE AALRYGYVQP RFLETPGLTI TGGRHPVVER QVESFIRNDA RLAATRRMLM ITGPNMGGKS TFMRQVALIC LLAHVGSFVP ADAVELGPLD AIFTRIGASD DLASGRSTFM VEMTEAAAIL HGATERSLVL MDEIGRGTST FDGLALAFAI ARHLLEKSRA LTLFATHYFE LTRLNADYPE CANVHLDAVE HGHRIVFLHA LEEGPASQSY GIEVAALAGI PAAVIRDAKR RLRALENREI DAGPQADLFA ALPEPEDAPL SHPVLSALAD IDPDALSPRE ALERLYALKR LAGDRNT
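
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is one of the permitted alternative initiation codons and is read as methionine when it starts a CDS. The sketch below shows one way to reproduce the protein from the gene sequence with plain Python. The helper name `translate_cds` is hypothetical, and the code assumes an unspliced, in-frame CDS on the listed strand, as this record indicates.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGGCAATCC GCAAGCCGCC CTTCGGCGAC"))  # MAIRKPPFGD
```

Passing the full gene sequence, with the two wrapped lines joined, should yield a string that can be compared against the protein sequence after removing its spaces.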
|
| |