Gene Information |
Locus tag | Mboo_1216 |
Symbol | |
ID | 5410388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 1235929 |
End bp | 1238640 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640868443 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001404377 |
Protein GI | 154150759 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
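The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the unspliced CDS includes its terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Mboo_1216 record above.
length_bp = gene_length(1235929, 1238640)
print(length_bp)                  # 2712
print(protein_length(length_bp))  # 903
```

Both results agree with the Gene Length (2712 bp) and Protein Length (903 aa) fields.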
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0405457 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACTG AGCCTTCTCT CGGCACCGGC CCGGCAGCCG ATGATGGCAG AACCCTTACG CCCGGGATGC GGCAGTACCG GGAGATCAAG GCGCAGTACC CCGATGCAAT TCTCTTTTTC CATATAGGGG ACTTTTTTGA GACGCTGGAA GGCGATGCGG AGATCGTGTC AAAGGAGCTC GATCTTGTAC TTACCTCACG GTCCAAGAAT GGCGACCAGC GTATACCGCT TGCCGGGGTT CCCCATCATG CCGGTGAGGG ATACATCGCC CGCCTCGTGG CAAAAGGGTA CAAGGTTGCC ATCTGCGAAC AGACCGAGGA CCCGAAAACA GCAAAGGGCC TGGTAAAACG CGAGGTGGTC CGGGTGATCA CGCCCGGTAC AGTAATTGAT CCGGTAATGC TCCCCTCATC GGCTGCTGCG TACCTGATGG CGGTCTCTCC CGGCACAAAA GGTGCGGATT GGGGCATCGC GCTCCTGGAC ATCTCCACCG GTGAATTTTT TGTCCTTGCA ATTCCGGCGG AAGGGAGTAC CGGGAACCTC CGGTCCGAGA TTGCCCGGTA CCGCCCGGCG GAGTGTATCG TCCCCTCATC CGTGGACGAA GAACTGCGTA CCTCGCTGCG GCGTGACGGT GTGGTGGTGA CGCCTTATGC CGATGACCGT TTCCTTCCCG ACCGGGCCGG GCGCACCCTT TGCGAACACT TCGGTGTCGC CTCCCTTGCC GGGTATGGCT GTGAGGGAAT GCCGGCTGCG GTCTCAGCGG CCGGTGCAGC GCTCTCCTAC GCGGCTGACA CCCAGAAGTC CACACTCCCT CATGTGAGTT CCCTTTTCAC GCGCTCATCA GCACAGGGCA TGATGCTCGA TGCGATCACC CTGCGCAACC TCGAGGTGCA TGAGAGCATA CGGGGCGGGA CAAAAGGGGC AACGCTCTTT TCCACCCTTG ACCGTACAAA GACCCCGATG GGCAGCCGGT TTTTACGCTT GCATCTGACC CGGCCGCTCA CGGATATTGC CCGGATCAAT GCACGGCTCG ATGCTGTCGA ATATTTCACT GCTTCAGCCA CACTGCGCAT GACCCTGCGC GAGCTTCTTA CGCGCCATGC TGATATCGAG CGGATCACGG CACGAATCTC ATATGGGAAT GCCGGTCCGC GGGATCTTGT TGCCCTTGCC GAAACGCTCG CCACCCTCCC CGAGATCCGG AAACTGCTTG CCGGGCCTTC AGTGGGATCG GATAACAGCG CACCGGTTCC CCGGCGTGTT GCTGCTGCCC GGGACTGCCT TTTCGATCTC CCGTCCATAA TCGATCTTAT CCGGAGAGCG ATCACCGATG ATCCCCCGGC GATCGCAAAG AATGGCGGAG TTATCCGGTC CGGCTATTCG GCAGAGCTCG ACGGGATGAC CGGTGTCCTG CACTCAGGGA AGAACTGGAT TGCAGAGCTC CAGCAGCAGG AGCGCGAAAA GACCGGGATC AAATCCCTCA AGATCGGATA CAATCGGATC TTCGGGTATT ACATTGATGT GAGCAAGAGC AATATTGCAC TCGTGCCGGC CCGGTACGAG CGCCGGCAGA CCACGGCAAC CGGGGAACGG TACACCATTC CCGAACTCCG GGAAAAGGAG GCCGTGATCA CGGATGCAGA TGAGCGCGTT CTTGCGCTTG AACGCTCGCT GTATGCCGGC CTTATCGATG AGATCAGAAA GGAGATCCCC GCGCTCCAGA GCATTTCCCG CGGGATCGCA ACGCTTGATG TTGCCGCTGC CCTTGCCGAT GCGGCCACGG ATTTCGATTA CGTCCGCCCC 
CGGCCGGATA CCGGGGATGC GATTACCTGC CGGGATATCC GTCACCCTGT GGTGGAGCAA AGCCTTGCCG GGGGTTTTGT CCCCAATGAT ACCGAGCTCT CCGGGTCAAA AAACCAGATC ATGATCATCA CCGGCGCCAA CATGGCGGGC AAGTCCACCT ACATGCGCTC GGTGGCGCTC TGCTGCATCA TGGCCCAGGC GGGAAGTTTT GTCCCGGCCC GGTCTGCCCA AATTGGGATC ATCGACCGGA TCTTTACCCG TGTCGGGGCA TTTGACGATC TTGCAAGCGG CCAGAGTACC TTCTTTGTCG AGATGCTCGA ACTGGCAAAT ATCCTCAACA ACATGACAGA AAAAAGTCTT GTGATCCTTG ACGAGATCGG GCGTGGCACG AGCACGGCTG ACGGGTGCTC GATCGCGCAG GCGGTACTCG AATACCTGCA TGGAAAATCC TCTGCCGGCC CAAAGACACT CTTTGCCACC CACTTCCATG AACTCATTGG CATGGAAGCG GAGCTTAAGC GGGTAAAAAA TTACCATTTT GCCGTGCAGG AGACTAAACA GGATGTAGTC TTCCTCCGGA AACTGATCCC CGGGGCAACC GACAAGAGTT ACGGTATCCA TGTTGCACGG CTTGCGGGCA TCCCAAAAAA AGCAACTGAC CGGGCCGAAG TCCACCTTTG CGAAACCCTC AAACGCGATG CTTCCGGTGG CTCAAAAGCA CGGCGCTATA CCCAGCTCCT GCTGGCAGAC GACAACCTGC CGGCCCGGGC TGCTGCTGCT CCGGATCCGG TAATTGCGGA GATTGCCGGA CTGGATCCTG ATTCCATGAC GCCAATGCAG GCACTCTCAA AACTGGCAGA ACTCAAAAGT CGTGCAGGCA GGTCATGCAA CATGCCCGGG AAGGATCTCT GA
|
Protein sequence | MKTEPSLGTG PAADDGRTLT PGMRQYREIK AQYPDAILFF HIGDFFETLE GDAEIVSKEL DLVLTSRSKN GDQRIPLAGV PHHAGEGYIA RLVAKGYKVA ICEQTEDPKT AKGLVKREVV RVITPGTVID PVMLPSSAAA YLMAVSPGTK GADWGIALLD ISTGEFFVLA IPAEGSTGNL RSEIARYRPA ECIVPSSVDE ELRTSLRRDG VVVTPYADDR FLPDRAGRTL CEHFGVASLA GYGCEGMPAA VSAAGAALSY AADTQKSTLP HVSSLFTRSS AQGMMLDAIT LRNLEVHESI RGGTKGATLF STLDRTKTPM GSRFLRLHLT RPLTDIARIN ARLDAVEYFT ASATLRMTLR ELLTRHADIE RITARISYGN AGPRDLVALA ETLATLPEIR KLLAGPSVGS DNSAPVPRRV AAARDCLFDL PSIIDLIRRA ITDDPPAIAK NGGVIRSGYS AELDGMTGVL HSGKNWIAEL QQQEREKTGI KSLKIGYNRI FGYYIDVSKS NIALVPARYE RRQTTATGER YTIPELREKE AVITDADERV LALERSLYAG LIDEIRKEIP ALQSISRGIA TLDVAAALAD AATDFDYVRP RPDTGDAITC RDIRHPVVEQ SLAGGFVPND TELSGSKNQI MIITGANMAG KSTYMRSVAL CCIMAQAGSF VPARSAQIGI IDRIFTRVGA FDDLASGQST FFVEMLELAN ILNNMTEKSL VILDEIGRGT STADGCSIAQ AVLEYLHGKS SAGPKTLFAT HFHELIGMEA ELKRVKNYHF AVQETKQDVV FLRKLIPGAT DKSYGIHVAR LAGIPKKATD RAEVHLCETL KRDASGGSKA RRYTQLLLAD DNLPARAAAA PDPVIAEIAG LDPDSMTPMQ ALSKLAELKS RAGRSCNMPG KDL
|
| |
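The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the sequences are pasted as shown in this record (blocks of 10 characters separated by whitespace) and uses the amino acid assignments of NCBI translation table 11, which are identical to those of the standard code. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first and last codons of the gene are used here, not the full sequence.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in the
# conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAAGACTG AGCCTTCTCT CGGCACCGGC"))  # MKTEPSLGTG
print(translate("AAGGATCTCT GA"))                     # KDL
```

The two outputs match the first ten and last three residues of the protein sequence listed above. Applying `gc_fraction` to the complete gene sequence would be one way to check the reported GC content.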