Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plim_1838 |
Symbol | |
ID | 9138539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Planctomyces limnophilus DSM 3776 |
Kingdom | Bacteria |
Replicon accession | NC_014148 |
Strand | + |
Start bp | 2403677 |
End bp | 2405755 |
Gene Length | 2079 bp |
Protein Length | 692 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_003629867 |
Protein GI | 296122089 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0019765 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCGGA TACAACAGCT CTCCAACAGT GTAGTGAACA AAATCGCAGC AGGCGAAGTG ATCGAGCGCC CTGCCAGCGT GGTGAAAGAA CTGATCGAAA ACAGCATCGA TTCCGGCGCA CTTCGGGTTG AAGTCGATAT TGGCGCGGGT GGCAGTGAAT ATATTCGCGT GACCGATGAT GGCGGAGGCA TTCACGCGGA TGACATTCTC CTCGCTGTGA CCACACATGC GACCAGCAAG ATCCGCACGG CTGACGACCT GTTTCAGATT TCCACACTCG GTTTTCGTGG CGAGGCCCTG GCGTCGATTG CCGAAGTGAG TCAACTGAGA ATCCGGTCGC GAATTCCAGA CAGCGATGTG GGTTCAGAGC TGGTCGTCAA TCTGGGTGAG CGGGGAGAAG TCGTTCCTTG CGGCTGTGCG ACGGGAACAT GTATCGAGAT TCGACAACTC TTTGCAAATA CACCTGTCCG CAGAAAATTT CTCAAGAGCG ATGCCACCGA ATTCGGACAC ATCAGTGAAC AATTTACCCG GATTGCACTG GCCCGCTCTC ATGTGCACAT GGTGCTTCGT CACAACGATA AAATTGTGTA CGAACTCCCT GCCACAGACC GACTGATTGA CCGTCTCGAA ATGTTCTATG GTTCGGCCAT TTCCGAACAG TTGATCTCGA TTGATCATCA ACTCGGAGAA ATGCGACTCT GGGGTTATGT CGGCCATCCG GCTGTCAATA AAGCCACACG GAAATGGCAG CATCTTTTCT TGAATGGCCG CTGGTTCCAG GATCGATCCA TTCAACATGC TCTTTCCGAG GCTTATCGCG GCTTGATCAT GGTGCAGCGT CATCCGATCT GTTTTCTCTT TCTCGAACTC AACCCGGCTG ATGTCGATGT GAATGTGCAT CCCACCAAGG TGGAAGTACG CTTTCAAGAT CCGCAGTCAA TCTATCGCCT CATGCTCTCA TCACTACGCA ATCGATTTCT AGGCATGGAT CTCGACAGCA AGCTCAAGGT TCCTGCTGAA GCCAGCACAG TCGATCCTCA AGAGCAGAAC GAAATCCAGA AAGAGTTCTC CCAATGGGCA AAGACCGAAC TTTTCAAAAC GGCTGAGATC GCCGGGTCTC CGCTCATGGT TGGATCGACG ATCGGCAGTG CCAACGCAAA TAGTCCACTC TCGCTGCTTT CGCCGAACCT GCCGGATTCA GCCACTTCTC CCTGGAGCTC GGAAATCGCT CCCGGATTGA ATCGCTTCGA GACTGAGAAT CTGGAATGGT CGGCCACGAC TCACCGGACA GTCAGCAGAG TGAATTCACC ATCGCATTTT GATCACGTAG CCCATTCCGA ATCGACGGAT GGCATGGATA CGCCAGAAGT CGAAGAGGCG GCATTCGCAG AGCAGCCCCA GAAATCTGAT CATGAACTGG ACTCCTCTCA GGCGAGGGAA ATCTCTGAAC TTCCTCATGC CGCTGGGAGC CAGAACCATC ATTTCTCGAC ACAATTCCTG GAAGGGGTGC GAGCGCTGCA GATTCATGAC TGCTATCTGG TGGTCGAAAC TCCCGAAGGG ATGACGGTCA TCGATCAACA TGCCTTGCAT GAACGAATTC TTTACGAAGA ATTCCGCCGT CGAGTGCATA GCAAGGCCAT GGAATCGCAG CGACTGCTCA TCCCGGTACC GATTGCCCTG GGCTTTCGGG GATCCAGCCT GCTCCTCGAT TGCCGGGAAG CGTTAGATCA ACTGGGATTC GAGATTTGCG AATTCGGGCA GGGAACCATT CTCCTTTCGG CCTACCCGGC GATGCTCGGA AAACTGAATC AGGAGCAATT GCTTCGTGAT CTGGCGGAAC AATTGGAGTC TTCATCACTG GAAAGTGCTC ATCGCGATAT TCTGGATGAG CTACTGAATA TGATGGCTTG TAAGGCGGCT GTAAAATCGG GGCAGAAGTT GAGCCAGGAG GAAATCGAAG AGCTGTTGCG ACAACGACAT CTTGTCGCCG ATGCTCATCA TTGTCCTCAT GGCCGGCCTA CGGCACTCAA TCTGTCAAGA AGCGAACTTG ATCGGCAATT TGGACGTCTG GGTTCCTGA
|
Protein sequence | MGRIQQLSNS VVNKIAAGEV IERPASVVKE LIENSIDSGA LRVEVDIGAG GSEYIRVTDD GGGIHADDIL LAVTTHATSK IRTADDLFQI STLGFRGEAL ASIAEVSQLR IRSRIPDSDV GSELVVNLGE RGEVVPCGCA TGTCIEIRQL FANTPVRRKF LKSDATEFGH ISEQFTRIAL ARSHVHMVLR HNDKIVYELP ATDRLIDRLE MFYGSAISEQ LISIDHQLGE MRLWGYVGHP AVNKATRKWQ HLFLNGRWFQ DRSIQHALSE AYRGLIMVQR HPICFLFLEL NPADVDVNVH PTKVEVRFQD PQSIYRLMLS SLRNRFLGMD LDSKLKVPAE ASTVDPQEQN EIQKEFSQWA KTELFKTAEI AGSPLMVGST IGSANANSPL SLLSPNLPDS ATSPWSSEIA PGLNRFETEN LEWSATTHRT VSRVNSPSHF DHVAHSESTD GMDTPEVEEA AFAEQPQKSD HELDSSQARE ISELPHAAGS QNHHFSTQFL EGVRALQIHD CYLVVETPEG MTVIDQHALH ERILYEEFRR RVHSKAMESQ RLLIPVPIAL GFRGSSLLLD CREALDQLGF EICEFGQGTI LLSAYPAMLG KLNQEQLLRD LAEQLESSSL ESAHRDILDE LLNMMACKAA VKSGQKLSQE EIEELLRQRH LVADAHHCPH GRPTALNLSR SELDRQFGRL GS
|
| |