Gene Information |
Locus tag | Mpe_A2031 |
Symbol | |
ID | 4784251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2172909 |
End bp | 2174723 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640090601 |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_001021224 |
Protein GI | 124267220 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
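The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the last codon of the CDS is a stop codon; the record does not state either convention explicitly, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2172909
END_BP = 2174723


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1815 604"
```

Under these assumptions the computed values agree with the listed Gene Length (1815 bp) and Protein Length (604 aa).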
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.440642 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCCG TGTTGTCCCC GGCCGGCCCG CGCCGGCAGA TCCGTGAGCT GCCCGACGAG CTGGTGAGCC AGATCGCCGC CGGCGAAGTG GTCGAGCGGC CGGCCTCGGT GGTGCGAGAG CTGGTCGACA ACGCCCTCGA CGCCGGCGCC CGCGAGATCG TCGTCAAGCT GATGGCCGGC GGCGTGCGCG CCATCCTGGT GGAGGACGAC GGTGCCGGCA TCCCCGCCAG CGAGCTGCCG CTGGCGCTCA AGCGCCACGC GACCAGCAAG ATCGCCTCGC TCGACGAGCT GGAGAACGTG TCGACGATGG GCTTTCGCGG CGAGGCGCTG GCCGCCATCG CCGCGGTGTC GGAGCTGTCG ATCGCCAGCC GCCACGCCGA CGCCCCCCAC GCGCAGCGCC TCGACGCCCG CTCCGGTGAG CTGGTGCCCG CCGCGCGCGG CGTGGGCACC AGCGTGGAGG TGCGCGAGCT GTTCTTCAGC ACGCCCGCAC GCCGCAAGTT CCTGAAGACC GACGCCACCG AGCTGGCCCA CTGCCTGGAG GCGGTGCGCC GCCACGCGCT GGCGCGGCCC GACGTGGGCT TCGCGGTCTG GCACGAGGGC AAGCTGCTGG CGCAGTGGCG CCGCGCGCCG CTCGAGCAGC GCATCCGCGA CGCGCTGGGC GAAGACTTCA TGGCCCACAG CCGCGAGGTC ACGGCCCAGC CCACCGGCCT GCGCATCAGC GGCCGCATCG GCCTGCCCGA TGCCGCGCGT GCCCGGGCCG ACGAGCAGTA CGTCTACGTC AACGGCCGCC ACGTGCGCGA CCGGCTCATC TCCCACGGCC TGCGCACCGC CTACGCCGAC GTGCTGCACG GCGGGCGCCA GCCGAGCTAC GTGCTGTTCA TCGAGATCGC CCCCTCGCGG GTCGACGTGA ACGTGCACCC GACCAAGATC GAGGTGCGCT TCCGCGACGG CCGCGAGGTG CACCAGGCGG TGCGACACGC CGCCGAGGAC GCGCTGGCGC TGCCGCGTGC CGACGAGACC CGGCCCGCGC TGTTCGAGCC CACCCGGCCG GCCGTGTGGA GCCCGCTGGC CGAGCAGGCC GGGCTGGGCC TGGGGGCAGG CGTCAGCGAG CGCCGGCCGG CCTGGCCCGC ACCGAGCGGC GAATCGGTCG ACCTGCTGCT GCACCTCGAT GCGGCGCCGG ACGGTGGCGC TCCGTCCCCG TTCCTGCCGC TGACCCTTCA GAGCGGTGAC GACTGGCCGC TGGGCCGCGC GCTGGCGCAG TTGGGCGGCG TGTACATCCT GGCCGAGAAC CGCGACGGCC TGGTCATCGT CGACATGCAC GCCGCGCACG AGCGCGTGGT CTACGAGCGG CTCAAGGCCG GCCTGGCCGG CGCACGCATC GAATCGCAGC CGCTGCTGAT CCCCGCCATC TTCCCGGCCA CCGCCGCCGA GGTGGCTACC GCCGAGGCGC AGGTGGAGAC GCTGGCGCGG CTCGGCCTCG ACCTCACGGT GCTGTCGTCC AACGTGCTGG CGCTGCGCTC GCACCCGGCC GCGCTGGCCG GCGGCGACAT GGTGGCGCTG GCGCGGTCGG TGCTCGCCGA ACTGGCACGC TACGACGCCA GCCACGCGAT CGAGCGTGCG CAGCACGAGC TGCTGTCCAG CATGGCCTGC CACGGCGCGG TGCGCGCCAA CCGGCGCCTC AGCGTCGAAG AGATGAACGC GCTGCTGCGC GACATGGAGC GCACCGAGCG CGCCGACCAA TGCAACCACG GCCGCCCCAC CTGGCGCCAG CTCACGCTGA AGGAGCTGGA CCAGCGCTTC 
CTGCGCGGCC GCTGA
|
Protein sequence | MNAVLSPAGP RRQIRELPDE LVSQIAAGEV VERPASVVRE LVDNALDAGA REIVVKLMAG GVRAILVEDD GAGIPASELP LALKRHATSK IASLDELENV STMGFRGEAL AAIAAVSELS IASRHADAPH AQRLDARSGE LVPAARGVGT SVEVRELFFS TPARRKFLKT DATELAHCLE AVRRHALARP DVGFAVWHEG KLLAQWRRAP LEQRIRDALG EDFMAHSREV TAQPTGLRIS GRIGLPDAAR ARADEQYVYV NGRHVRDRLI SHGLRTAYAD VLHGGRQPSY VLFIEIAPSR VDVNVHPTKI EVRFRDGREV HQAVRHAAED ALALPRADET RPALFEPTRP AVWSPLAEQA GLGLGAGVSE RRPAWPAPSG ESVDLLLHLD AAPDGGAPSP FLPLTLQSGD DWPLGRALAQ LGGVYILAEN RDGLVIVDMH AAHERVVYER LKAGLAGARI ESQPLLIPAI FPATAAEVAT AEAQVETLAR LGLDLTVLSS NVLALRSHPA ALAGGDMVAL ARSVLAELAR YDASHAIERA QHELLSSMAC HGAVRANRRL SVEEMNALLR DMERTERADQ CNHGRPTWRQ LTLKELDQRF LRGR
|
| |
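The gene and protein sequences above can be checked against each other, and against the listed GC content, with a short script. This is a minimal sketch, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11
# uses the same amino acid assignments. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the gene sequence above.
print(translate("ATGAATGCCG TGTTGTCCCC G"))  # prints "MNAVLSP"
print(translate("CTGCGCGGCC GCTGA"))         # prints "LRGR"
```

The two fragments reproduce the first seven and last four residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and the complete protein sequence.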