Gene Pnap_1359 details

Gene Information

Locus tag: Pnap_1359
Symbol:
ID: 4686631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 1445917
End bp: 1448529
Gene Length: 2613 bp
Protein Length: 870 aa
Translation table: 11
GC content: 67%
IMG OID: 639834362
Product: DNA mismatch repair protein MutS
Protein accession: YP_981595
Protein GI: 121604266
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
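
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1445917, 1448529)
print(length)                           # 2613
print(expected_protein_length(length))  # 870
```

Under these assumptions both values agree with the Gene Length and Protein Length fields above.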


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.219365
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0149931
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAG ACACACTCAA ACCCGGCGAA GCACCGGTGC GGGCCGCCCC CCACACGCCG 
ATGATGACCC AGTACCACGC CATCAAGGCC GAGTATCCCG ACACGCTGGT GTTCTACCGC
ATGGGCGACT TCTATGAGGT TTTCTACGCC GATGCCGAGA AAGCCTCCAG CCTGCTCGAC
ATCACCCTGA CCAAACGCGG CCAGTCGGCG GGCGAACCGG TCGTCATGGC GGGCATCCCG
TTTCACGCGC TCGAAGGCTA CCTGGCCAAG CTCATCAAGC TGGGCGAGTC GGTGGCGATT
TGCGAGCAGG TCGGCGACGT GGCGACGGCC AAGGGGCCGG TCGAGCGAAA AGTGGTGCGC
GTGGTCACGC CCGGCACATT GACCGACACC GAACTGCTGA GCGACAAGAC CGAATCCATC
CTGCTGGCCG TGCACCAGGG CGCGCGCAAC ACCTGCGGCC TGGCCTGGCT CAGCGTGACG
CAGGGCGAGA TCCACCTGGC CCATTGCGCC AACGGCGAGC TGGAAACCTG GCTGGCCCGG
ATTGCGCCCA GCGAACTGCT CTACAACGTC GATGCCACGC CCGCGTTCGA GCAGCGCCTG
CAAACCCAGC GCTGCGCCGC CAGCGCCCGC CCGGCCTGGC AGTTTGACGC GGCCCTGGGC
GTGCGGCGCC TGCTGGACCA GCTCAAGGTC GCTTCGCTGG CCTCGTGGCA CGCCGAAGGG
CTGAATGAAG CGCACGCCGC CGCCTCGGCG CTGCTGGGTT ACGCCGAGCA CACGCAGGGC
CGCGCCCTGC CCCATGTGCA GGGCCTGCAG GTGGTGCGTT CGGGCGAGCT GATCGAGCTG
CCGCCGGCCA CGCGGCGCAA CCTCGAACTG ACGCAGACCC TGCGCGGCGA GGATTCGCCC
ACGCTGTTCT CGCTGCTGGA CACCTGCACC ACCGGCATGG GCAGCCGGGC GCTGAAAAGC
TGGCTGCTCA GCCCGCGCCG CGACCGCGCC CAGGCAGCAA GCCGCTTAGA GGCCATCACG
CAATTGCGCA GCGGCGCGCA GCAAACCTTG CGCACCCAAC TCAAGGGCTG CAGCGACGTC
GAGCGCATCA CCGCCCGCAT CGCGCTGCGC CAGGTGCGCC CGCGCGAACT CGTGGCGCTG
CAGCTGACGC TACAAAAAGC AGAGCTGCTC ACGCCCGTGG ATAGTGCGCA AATACCTCTT
TTGACCACTA TTTTCGAGGA TTTGCAGCCG CCCCTTGGCT GCGCCGAACT GCTGGGCCAG
TGCATCCTGG ACGAGCCGGC CGCGCTGATC CGCGACGGCG GCGTCATCAA CCACGGCCTT
GATGCCGAAC TCGACGAGCT GCGCGCCATC CAGACCAACT GCGACGGCTT TCTGCTCGAC
CTGGAAATCC GCGAGAAAGC CCGCACCGGC ATTGCCAACC TGCGCGTGCA GTTCAACAAG
GTGCACGGCT TTTACATCGA GGTCACGCAG GGCCAGCTGG ACAAGGTGCC GGCCGACTAC
CGCCGCCGCC AGACGCTGAA GAATGCCGAG CGCTACATCA CGCCCGAACT GAAAACCTTC
GAGGACAAGG CGCTGTCGGC GCAGGAACGC GCGCTGGCGC GCGAGAAGTG GCTGTACGAG
CAGTTGCTCG ACCAACTGCA GGAGTTCGTC CCCGCCCTGA GCCGGCTGGC GCGCGCCATT
GCCGCGCTCG ATGCGCTGTG CGCGCTGGCC GAGCGCTCGC TGACGCTGAA CTGGTGCGCG
CCAGTATTTG TGAAGGAGCC GTGCATCGCC ATCGGCCAGG GCCGGCATCC GGTGGTGGAG
GCGCGGCTGG CAGAGACGGG CGGCGGCGCC TTCATCGCCA ACGACTGCAG CCTGACTGGC
AAAAGCCGCA TGCAGATGAT CACCGGCCCC AACATGGGCG GCAAATCGAC CTACATGCGG
CAGGTCGCGC TGATCGTGCT GCTGGCCAGC GTGGGCTCCT ACGTTCCCGC CAGCGCCTGC
CGGCTCGGGC CGATAGACGC CATCCACACC CGCATCGGCG CGGCCGACGA CGTGGCCAAC
GCGCAATCGA CCTTCATGCT GGAGATGCTG GAGGCCGCGC AGATTTTGCA TGCCGCCACT
CCTTATTCGC TGGTGCTGAT GGACGAGATC GGGCGCGGCA CCTCGACCTT CGACGGGCTG
GCGCTGGCCG GCGGCATTGC CGCTTATCTG CACAACAAGG CGCAGGCCTT CACGCTGTTC
GCCACGCATT ACTTCGAGCT GACCGAGTTC GCGGCCCAGC ACCACGGCGC CATGAATGTG
CATGTCAGCG CGGTCGAGTC GGGCAGCGAC ATTGTGTTTT TGCACCACAT CGAGCCCGGC
CCGGCCAGCA AGAGCTACGG CATCGCGGTC GCCAAGCTGG CCGGCGTTCC CGCCGCCGTC
GTCAACCATG CGCGCCATGC GCTGGCCGCG CTGGAGGCCC AGCAAAGCCA GGCCAGCGCC
CAGGTGGACC TGTTCGCCGC GCCGCCCGAA GCGCCGGCAT CCGGGCAAAC CGCTATTGAC
AAGGCGCTGG CGAGCATAGA CCCTGATATC CTGAGCCCCC GAGAAGCGCT TGAAGCGCTT
TACCAACTAA AAAAACTGGC CAGCGCCGCC TGA
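
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch under that assumption; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the short input in the usage lines is a toy string rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Toy input in the same block layout as the record.
print(clean_sequence("atgc ggcc"))  # ATGCGGCC
print(gc_fraction("ATGC GGCC"))     # 0.75
```

Applying `gc_fraction` to the full sequence above should give a value comparable to the GC content field, and `len(clean_sequence(...))` should match the stated gene length.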
 
Protein sequence
MQKDTLKPGE APVRAAPHTP MMTQYHAIKA EYPDTLVFYR MGDFYEVFYA DAEKASSLLD 
ITLTKRGQSA GEPVVMAGIP FHALEGYLAK LIKLGESVAI CEQVGDVATA KGPVERKVVR
VVTPGTLTDT ELLSDKTESI LLAVHQGARN TCGLAWLSVT QGEIHLAHCA NGELETWLAR
IAPSELLYNV DATPAFEQRL QTQRCAASAR PAWQFDAALG VRRLLDQLKV ASLASWHAEG
LNEAHAAASA LLGYAEHTQG RALPHVQGLQ VVRSGELIEL PPATRRNLEL TQTLRGEDSP
TLFSLLDTCT TGMGSRALKS WLLSPRRDRA QAASRLEAIT QLRSGAQQTL RTQLKGCSDV
ERITARIALR QVRPRELVAL QLTLQKAELL TPVDSAQIPL LTTIFEDLQP PLGCAELLGQ
CILDEPAALI RDGGVINHGL DAELDELRAI QTNCDGFLLD LEIREKARTG IANLRVQFNK
VHGFYIEVTQ GQLDKVPADY RRRQTLKNAE RYITPELKTF EDKALSAQER ALAREKWLYE
QLLDQLQEFV PALSRLARAI AALDALCALA ERSLTLNWCA PVFVKEPCIA IGQGRHPVVE
ARLAETGGGA FIANDCSLTG KSRMQMITGP NMGGKSTYMR QVALIVLLAS VGSYVPASAC
RLGPIDAIHT RIGAADDVAN AQSTFMLEML EAAQILHAAT PYSLVLMDEI GRGTSTFDGL
ALAGGIAAYL HNKAQAFTLF ATHYFELTEF AAQHHGAMNV HVSAVESGSD IVFLHHIEPG
PASKSYGIAV AKLAGVPAAV VNHARHALAA LEAQQSQASA QVDLFAAPPE APASGQTAID
KALASIDPDI LSPREALEAL YQLKKLASAA
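
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hand-rolled illustration, not the tool the database used. It builds the codon table in the conventional TCAG order; translation table 11 uses the same elongation-codon assignments as the standard code and differs in the set of permitted start codons, which does not matter here because this gene starts with ATG. The name `translate` and the choice of `X` for unrecognised codons are assumptions made for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCAAAAAG ACACACTCAA ACCCGGCGAA"))  # MQKDTLKPGE
```

The output matches the first ten residues of the protein sequence listed above.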