Gene Mboo_1217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1217 
SymbolmutL 
ID5410389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1238640 
End bp1240478 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content59% 
IMG OID640868444 
ProductDNA mismatch repair protein 
Protein accessionYP_001404378 
Protein GI154150760 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.469734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0661873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCAG AGGAGCATCC CCCCGCGATA CGGGTACTGG ACCCGGCTAC TGTCAACCAG 
ATCGCGGCGG GAGAAGTGAT CGAACGGCCG GCTTCGGTAG TAAAGGAAAT GGTGGAGAAT
GCGATCGATG CCGGCGCCCG CACGATCCGG ATTGATATTA CTTCCGTGCA GGGTGGGATC
ACAGCGATAA AGGTAACCGA CGACGGGTGC GGGATGTCGC CGGTTGATGC AGAGCTTGCC
TTTGTCCCGC ATGCCACAAG CAAGATCCAT ACGCTTGACG ATCTCTTCTC CATCCATTCC
CTGGGATTCC GGGGTGAGGC ACTGGCAAGC ATTGCGGCGA TCGCAAAAGT TACACTTATT
ACAAAGCCAC AAGGCAGCGA CCGGGTGCCC GGTACCCGGA TTGTGGTGGT GGGCGGAGAG
ATCCAATTAC GTGGCGGGAC CGGTGCCCCC GAAGGCACAA GCGTGCTTGT GGAAGAATTG
TTCTTTAATA CCCCTGCCCG GAAAAAGTTC CAGAAGAGCC TTACAACAGA AATTGCCCGC
ATCCACGGCA TCCTCGAAGG CCTCTGCCTT GCCTGCCCGC AGATCTCGTT TAAGGTCTTC
CATAACAACC GCGAGCAGCT GGCCACCGAG CGGACCGGCC GGCCGCTCGA CACAATCGCC
CGGATCTTCG GAAACGAATC TGCCCGCGAA CTTATCCCGG CCGCCGCTGC CCTCCCGTTC
ATGCGTATAT CCGGCTACAT TTCCCGTCCG GCTCTCTCCC GCAAGGATCA TGACCGGATC
CTCATTGCGA TCAATGGCAG GTACATCTCA TCCCCACCCG TAACAACTGC CATCCGCGAA
GGCTATGGGA CTCTCCTTCC TCATGGCCGG TATCCTGTTG CGTTTCTCTC ACTTGAGATC
GACACCCGGC TTGTGGACAT CAACGTCCAT CCTACCAAAA AGGAGGTCCG GCTCACCAAA
GAAAAAGAGA TCACTGATGG TGTACGTGAA GCAGTGCGGG CAGCGCTTGC ATCGGGCGAT
CTGATCCCTG AGGTGAACGC ACCGAAACCG GTTTACCGGA AACTGGATGC CGGGGGATCT
GACTTATCTC CTGTGCCGTA TGTTGCAGAA CCTGCAGAAC CGTACTGTGC CGGAACCCTC
CCTTCAGCAG TATCCACAGG AACGCTCTCG CCATTCACGG AACCGACCCA TACCGGGACG
GTTGCAACCG ATTATCGCCT CCGCCAGACC GAGCTTGCAA GCGGTGTCCC GCCGGTTACG
GCCGTAGTGC CGGAGATGGA TGTAATCGGG CAGATTGGCG GGATCTATAT CCTTGCCGAA
GCGGCTGGCG GGGAACTTAT CATTATCGAC CAGCACGCTG CCCACGAGCG AATCTTCTAT
GAGCAGGTGA CAAGGAGCAT GGCAGCCCGG CAGGCTCAGG AGCTGCTTGT CCCGGCAATC
ATCCACTGCC CTCCCAAAGA TACTGCGATT CTCAAAAGCC TGATCCCCGC GCTTGCTCAG
GAAGGTGTTA TTATCGAGGA GTTCGGGGCC GGATCCTTTC TGGTCCGGGC AGTTCCTGCC
CTGATGGGAA AGGTGGAGGG GCCGGCAATG ATTGACGACC TGGTAAGCGA TCTCCTCCAC
AAGGACCTTG ACCGCCCGGT CAGCGACCGG GAGCGCCTGA CCCGGATCAT TGCCTGCCGG
AGCGCGATAA AAGCCGGTAC GGTCTGCACC GTCGAACAGT GCCGGCGGCT TATTTCCCAG
CTCAGGGCAA CAACGACACC GTTTACCTGC CCGCACGGCC GGCCCACCAT GGTCAGGTTC
ACCCGCGCAA AACTGGACGA GATGTTCAAG CGGACATAA
 
Protein sequence
MGAEEHPPAI RVLDPATVNQ IAAGEVIERP ASVVKEMVEN AIDAGARTIR IDITSVQGGI 
TAIKVTDDGC GMSPVDAELA FVPHATSKIH TLDDLFSIHS LGFRGEALAS IAAIAKVTLI
TKPQGSDRVP GTRIVVVGGE IQLRGGTGAP EGTSVLVEEL FFNTPARKKF QKSLTTEIAR
IHGILEGLCL ACPQISFKVF HNNREQLATE RTGRPLDTIA RIFGNESARE LIPAAAALPF
MRISGYISRP ALSRKDHDRI LIAINGRYIS SPPVTTAIRE GYGTLLPHGR YPVAFLSLEI
DTRLVDINVH PTKKEVRLTK EKEITDGVRE AVRAALASGD LIPEVNAPKP VYRKLDAGGS
DLSPVPYVAE PAEPYCAGTL PSAVSTGTLS PFTEPTHTGT VATDYRLRQT ELASGVPPVT
AVVPEMDVIG QIGGIYILAE AAGGELIIID QHAAHERIFY EQVTRSMAAR QAQELLVPAI
IHCPPKDTAI LKSLIPALAQ EGVIIEEFGA GSFLVRAVPA LMGKVEGPAM IDDLVSDLLH
KDLDRPVSDR ERLTRIIACR SAIKAGTVCT VEQCRRLISQ LRATTTPFTC PHGRPTMVRF
TRAKLDEMFK RT