Gene Pnap_4778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4778 
Symbol 
ID4685995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008761 
Strand
Start bp15591 
End bp16838 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content54% 
IMG OID639826767 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_973929 
Protein GI121583503 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value0.16943 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones135 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCTG AGTGGCAATT CGGAAAGCTC GGCGACTTCA TTGAACTAAA ACGTGGTTAC 
GATTTACCGC AAGCGAAAAG AACTTCTGGC CCGTTTCCTC TCGTTTCATC CTCCGGTGTT
AGTGACTGCC ACTCCGTGCC AATGGTGCGA GGGCCAGGCG TGGTCACTGG GCGATACGGA
ACGATTGGGC AAGTCTATTT TGTTGAAGAT GATTTTTGGC CGCTGAACAC AACCCTTTAT
GTTCGTGATT TCAAGGGTAA TGACCCAAAG TTCATCAGCT ATTTTTTGAA GACCGTTGAT
TTTTTCGCCT ATTCAGACAA GGCGGCAGTG CCTGGCGTAA ACAGAAATCA TCTTCATGAA
GCTCTTGGTG CAATTCCTGA TTTACCCACT CAACAGGAGA TAGCGAGAAC GCTCGGTGTC
CTGGACGACC GCATCGCCCT GCTGCGCGAA ACCAATGCCA CGCTCGAAGC CATCGCTCAG
GCGCTGTTCA AGTCGTGGTT TGTCGATTTC GACCCGGTGC GCGCCAGGAT GGAAGGCCGC
GCCCCCGAAG GCATGGACGA GGCCACGGCG GCGCTGTTTC CGGATGGGTT CGAGGATTCG
GAGCTGGGAT TGGTGCCGAA GGGGTGGGCG ACACGCACTA TGGCGGATAT ATCAACCGTG
GGGATTGGAA AGACACCTCC TCGCAAGGAA CAACATTGGT TCAGCGAAGA CCCAAGCGAT
GTTCGATGGG TTTCCATTCG CGATATGGGC GCTGTTGGGG TTTACGCCGC AGTGACCAGC
GAGTTTCTGA AGAAAGAGGC CATTGAAAAG TTCAACATCC GACGAGTGCC TGACAACACG
GTATTGATGA GCTTCAAGAT GACCATTGGC CGCGTGGCAA TTACCGATGG CGAAATGACA
ACCAACGAAG CCATTGCCCA CTTCAAACTG GCCCCGGATG CACAGTTGAG CACAGAGTAC
ATCTATCTGC ATTTGAAACA GTTCGACTTC TCCACTTTGA GCAGCACATC CTCGATTGCA
GATGCCGTCA ACTCCAAGAC CGTGCGCGAA ATTCCAATAC TAATGCCGAG CCTTGAAGGC
TTGACTGCAT TCCAAAGCCA AGTCGCAGCG CTCTTTGCGA AACTGAAAAA TACAGAACAG
CACGCCCAAA CCCTCGTCAC ACTGCGCGAC ACCCTGCTCC CACGCCTGAT CTCGGGCCAG
CTGCGCCTGC CCGAAGCCGA GGCGCTGCTC GAAGAAGCCT GCGCATGA
 
Protein sequence
MSSEWQFGKL GDFIELKRGY DLPQAKRTSG PFPLVSSSGV SDCHSVPMVR GPGVVTGRYG 
TIGQVYFVED DFWPLNTTLY VRDFKGNDPK FISYFLKTVD FFAYSDKAAV PGVNRNHLHE
ALGAIPDLPT QQEIARTLGV LDDRIALLRE TNATLEAIAQ ALFKSWFVDF DPVRARMEGR
APEGMDEATA ALFPDGFEDS ELGLVPKGWA TRTMADISTV GIGKTPPRKE QHWFSEDPSD
VRWVSIRDMG AVGVYAAVTS EFLKKEAIEK FNIRRVPDNT VLMSFKMTIG RVAITDGEMT
TNEAIAHFKL APDAQLSTEY IYLHLKQFDF STLSSTSSIA DAVNSKTVRE IPILMPSLEG
LTAFQSQVAA LFAKLKNTEQ HAQTLVTLRD TLLPRLISGQ LRLPEAEALL EEACA