Gene Pnap_1157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_1157 
Symbol 
ID4689533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp1227891 
End bp1229825 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content69% 
IMG OID639834161 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_981394 
Protein GI121604065 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.299575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCTC AAGACCCGGC CGCGCGCCGC CCCATCCGTG ACCTTCCCGA TGAACTGATC 
AGCCAGATCG CGGCTGGCGA GGTGGTCGAG CGCCCGGCGT CGGTGGTGCG CGAGCTGCTG
GACAACGCGC TCGATGCCGG CGCGACGCAG GTCACGGTGC GCCTGTCGGC GGGCGGCGTG
CGGCTGATCC TGGTCGAAGA CGACGGCCTG GGCATTCCGC GCGAAGAATT GCCGGTGGCG
CTACGCCGCC ACGCCACCAG CAAGATCGCG TCGCTGCAGG ACCTGGAAGC CGTCGGCACC
ATGGGTTTTC GGGGCGAGGC GCTGGCCGCG ATCAACTCGA TTGCCGACAT GAGCCTGCTG
TCCAGAACAC TGGACACTGA GCACGCCTGG CAGCTCGATG GCCGCACCGG CGAGATCAAG
CCGGCGTCCC GCGCGCGCGG CACCAGCGTC GAGGTGCGCG AGCTGTTCTA CGCCACGCCG
GCCCGGCGCA AATTCTTGAA GACCGACGCC ACCGAACTGG CGCATTGCAT CGACGCCGTG
CGCCGCCATG CGCTGGTGCG GCCCGACGTG GGTTTTGCCA TCTGGCACGA GGGCAAGCTG
GTCGAGCAGT GGCGCGCCTG CAGCGGCGCA TTGGCCGGCG AGCACAGCCT GCCGCACCAC
GACCAGCGCC TGGCCGACGT GCTGGGCGCG GAGTTCGTCG AGCAGTCGAT TGCAGTGCAT
TACGAAAGCA GCGCCCGCCG CGCCGATGGC CGGCCCGCCG TTCGCGTCTG GGGCCGGGCC
GGCATTCCGG ACGCCGCCCG CACGCGCGCC GACCAGCAGT TTGCCTACGT CAACGGCCGC
TACGTGCGCG ACAAGGTGCT GAGCCACGCC GCCCGCAGCG CCTACGAAGA CGTGCTGCAC
GGCCAGCGCC AGCCGGTGTA TGCGCTGTAT GTCGAGATCG ACCCGGCGCG GATCGACGTG
AACGTGCATC CGACCAAGAT CGAGGTGCGC TTTCGCGACA GCCGCGAGGT GCATCAGGCG
GTGCGCCACG CGACCGAGGA TGCGCTGGCC GCGCCGCGCG CAGGCGCGGC CCTTGCCGAC
AAACCGGCGG CCGAAAGCGC CGTGCAGCCG GTCGTTGGCG GTATGTTTGA ATCAAACAAG
GCATTTGCGC AACAGGGATG GGCGCAACCA GCTATCAATT TCAGAGCAAA CCAGGGCAGC
CACGCCTCGG ACTTTGAAGC CCTGTGGCCG GCGCCGCCGG GAACAGGCCG CCCGTCGGCC
CCCAGCAGCC CGATGCCGGG TGCGCCCCCT GCCCCCACGG AAAACCCGCA GGCCAGCGCG
CCGCCTGAAA CCAGCAGCCT GCCCCAGGGC GACTGGCCGC TGGGCCGCGC CCTCGCCCAG
CTGCAGGGCG TCTATATCCT GGCCGAAAAC GCCCAGGGTC TGATCATCGT GGACATGCAC
GCGGCCCACG AGCGCATCGT CTATGAGCGG CTGAAAAGCC AGTTCGATGC CTCGCGCATC
GCCAGCCAGC CGCTGTTGAT CCCGGCGACG TTTGCCGCCA CGGCGCAGGA AGTGGCCACG
GCTGAAGCCT CAACCGACAC GCTGGCCCTG CTGGGACTGG AGATCACGCC TTTTTCGCCC
AAAACGCTGG CGGTGCGGGC CGTGCCGACC AGCCTGGCTG CGGGCGACGC GGTCGAACTG
GCGCGCAGTG TGCTGGCCGA ACTGGCCCAG CATGACGCCA GCACCGTGAT TCAGCGCGCG
CAGAACGAGC TGCTCGGCAC CATGGCCTGC CACGGCGCGG TGCGCGCCAA CAGGCGGCTG
ACCGTTGATG AAATGAACGC CCTGCTGCGC CAGATGGAAG CGACCGAACG CTCCGACCAG
TGCAACCACG GCCGGCCGAC CTGGCGCCAG CTCAGCCTGA AGGAACTCGA CAGCCTGTTT
TTGCGCGGGC GCTAG
 
Protein sequence
MTPQDPAARR PIRDLPDELI SQIAAGEVVE RPASVVRELL DNALDAGATQ VTVRLSAGGV 
RLILVEDDGL GIPREELPVA LRRHATSKIA SLQDLEAVGT MGFRGEALAA INSIADMSLL
SRTLDTEHAW QLDGRTGEIK PASRARGTSV EVRELFYATP ARRKFLKTDA TELAHCIDAV
RRHALVRPDV GFAIWHEGKL VEQWRACSGA LAGEHSLPHH DQRLADVLGA EFVEQSIAVH
YESSARRADG RPAVRVWGRA GIPDAARTRA DQQFAYVNGR YVRDKVLSHA ARSAYEDVLH
GQRQPVYALY VEIDPARIDV NVHPTKIEVR FRDSREVHQA VRHATEDALA APRAGAALAD
KPAAESAVQP VVGGMFESNK AFAQQGWAQP AINFRANQGS HASDFEALWP APPGTGRPSA
PSSPMPGAPP APTENPQASA PPETSSLPQG DWPLGRALAQ LQGVYILAEN AQGLIIVDMH
AAHERIVYER LKSQFDASRI ASQPLLIPAT FAATAQEVAT AEASTDTLAL LGLEITPFSP
KTLAVRAVPT SLAAGDAVEL ARSVLAELAQ HDASTVIQRA QNELLGTMAC HGAVRANRRL
TVDEMNALLR QMEATERSDQ CNHGRPTWRQ LSLKELDSLF LRGR