Gene Plim_1838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_1838 
Symbol 
ID9138539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2403677 
End bp2405755 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content53% 
IMG OID 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_003629867 
Protein GI296122089 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0019765 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTCGGA TACAACAGCT CTCCAACAGT GTAGTGAACA AAATCGCAGC AGGCGAAGTG 
ATCGAGCGCC CTGCCAGCGT GGTGAAAGAA CTGATCGAAA ACAGCATCGA TTCCGGCGCA
CTTCGGGTTG AAGTCGATAT TGGCGCGGGT GGCAGTGAAT ATATTCGCGT GACCGATGAT
GGCGGAGGCA TTCACGCGGA TGACATTCTC CTCGCTGTGA CCACACATGC GACCAGCAAG
ATCCGCACGG CTGACGACCT GTTTCAGATT TCCACACTCG GTTTTCGTGG CGAGGCCCTG
GCGTCGATTG CCGAAGTGAG TCAACTGAGA ATCCGGTCGC GAATTCCAGA CAGCGATGTG
GGTTCAGAGC TGGTCGTCAA TCTGGGTGAG CGGGGAGAAG TCGTTCCTTG CGGCTGTGCG
ACGGGAACAT GTATCGAGAT TCGACAACTC TTTGCAAATA CACCTGTCCG CAGAAAATTT
CTCAAGAGCG ATGCCACCGA ATTCGGACAC ATCAGTGAAC AATTTACCCG GATTGCACTG
GCCCGCTCTC ATGTGCACAT GGTGCTTCGT CACAACGATA AAATTGTGTA CGAACTCCCT
GCCACAGACC GACTGATTGA CCGTCTCGAA ATGTTCTATG GTTCGGCCAT TTCCGAACAG
TTGATCTCGA TTGATCATCA ACTCGGAGAA ATGCGACTCT GGGGTTATGT CGGCCATCCG
GCTGTCAATA AAGCCACACG GAAATGGCAG CATCTTTTCT TGAATGGCCG CTGGTTCCAG
GATCGATCCA TTCAACATGC TCTTTCCGAG GCTTATCGCG GCTTGATCAT GGTGCAGCGT
CATCCGATCT GTTTTCTCTT TCTCGAACTC AACCCGGCTG ATGTCGATGT GAATGTGCAT
CCCACCAAGG TGGAAGTACG CTTTCAAGAT CCGCAGTCAA TCTATCGCCT CATGCTCTCA
TCACTACGCA ATCGATTTCT AGGCATGGAT CTCGACAGCA AGCTCAAGGT TCCTGCTGAA
GCCAGCACAG TCGATCCTCA AGAGCAGAAC GAAATCCAGA AAGAGTTCTC CCAATGGGCA
AAGACCGAAC TTTTCAAAAC GGCTGAGATC GCCGGGTCTC CGCTCATGGT TGGATCGACG
ATCGGCAGTG CCAACGCAAA TAGTCCACTC TCGCTGCTTT CGCCGAACCT GCCGGATTCA
GCCACTTCTC CCTGGAGCTC GGAAATCGCT CCCGGATTGA ATCGCTTCGA GACTGAGAAT
CTGGAATGGT CGGCCACGAC TCACCGGACA GTCAGCAGAG TGAATTCACC ATCGCATTTT
GATCACGTAG CCCATTCCGA ATCGACGGAT GGCATGGATA CGCCAGAAGT CGAAGAGGCG
GCATTCGCAG AGCAGCCCCA GAAATCTGAT CATGAACTGG ACTCCTCTCA GGCGAGGGAA
ATCTCTGAAC TTCCTCATGC CGCTGGGAGC CAGAACCATC ATTTCTCGAC ACAATTCCTG
GAAGGGGTGC GAGCGCTGCA GATTCATGAC TGCTATCTGG TGGTCGAAAC TCCCGAAGGG
ATGACGGTCA TCGATCAACA TGCCTTGCAT GAACGAATTC TTTACGAAGA ATTCCGCCGT
CGAGTGCATA GCAAGGCCAT GGAATCGCAG CGACTGCTCA TCCCGGTACC GATTGCCCTG
GGCTTTCGGG GATCCAGCCT GCTCCTCGAT TGCCGGGAAG CGTTAGATCA ACTGGGATTC
GAGATTTGCG AATTCGGGCA GGGAACCATT CTCCTTTCGG CCTACCCGGC GATGCTCGGA
AAACTGAATC AGGAGCAATT GCTTCGTGAT CTGGCGGAAC AATTGGAGTC TTCATCACTG
GAAAGTGCTC ATCGCGATAT TCTGGATGAG CTACTGAATA TGATGGCTTG TAAGGCGGCT
GTAAAATCGG GGCAGAAGTT GAGCCAGGAG GAAATCGAAG AGCTGTTGCG ACAACGACAT
CTTGTCGCCG ATGCTCATCA TTGTCCTCAT GGCCGGCCTA CGGCACTCAA TCTGTCAAGA
AGCGAACTTG ATCGGCAATT TGGACGTCTG GGTTCCTGA
 
Protein sequence
MGRIQQLSNS VVNKIAAGEV IERPASVVKE LIENSIDSGA LRVEVDIGAG GSEYIRVTDD 
GGGIHADDIL LAVTTHATSK IRTADDLFQI STLGFRGEAL ASIAEVSQLR IRSRIPDSDV
GSELVVNLGE RGEVVPCGCA TGTCIEIRQL FANTPVRRKF LKSDATEFGH ISEQFTRIAL
ARSHVHMVLR HNDKIVYELP ATDRLIDRLE MFYGSAISEQ LISIDHQLGE MRLWGYVGHP
AVNKATRKWQ HLFLNGRWFQ DRSIQHALSE AYRGLIMVQR HPICFLFLEL NPADVDVNVH
PTKVEVRFQD PQSIYRLMLS SLRNRFLGMD LDSKLKVPAE ASTVDPQEQN EIQKEFSQWA
KTELFKTAEI AGSPLMVGST IGSANANSPL SLLSPNLPDS ATSPWSSEIA PGLNRFETEN
LEWSATTHRT VSRVNSPSHF DHVAHSESTD GMDTPEVEEA AFAEQPQKSD HELDSSQARE
ISELPHAAGS QNHHFSTQFL EGVRALQIHD CYLVVETPEG MTVIDQHALH ERILYEEFRR
RVHSKAMESQ RLLIPVPIAL GFRGSSLLLD CREALDQLGF EICEFGQGTI LLSAYPAMLG
KLNQEQLLRD LAEQLESSSL ESAHRDILDE LLNMMACKAA VKSGQKLSQE EIEELLRQRH
LVADAHHCPH GRPTALNLSR SELDRQFGRL GS