Gene Pnap_1088 details


Gene Information

Locus tag: Pnap_1088
Symbol:
ID: 4688047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 1154205
End bp: 1155185
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 67%
IMG OID: 639834088
Product: proline iminopeptidase
Protein accession: YP_981326
Protein GI: 121603997
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
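
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes 1-based coordinates with both ends included, and an unspliced CDS whose final codon is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based coordinates with both ends included."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1154205, 1155185)
print(length_bp, protein_length(length_bp))  # 981 326
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields. `gc_percent` can be applied to the gene sequence given later in the record to compare against the GC content field.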


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.445968
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAACCT CCATGACTTC TTCCGCCCCC ATCGTTGCCG CGCCTGCGCT TTACCCGCCC 
ATCGCGCCGT TTCGCACCGG CACGCTCGAT ACCGGCGACG GCCATTCGAT TTACTGGGAG
CTGTGCGGCA ACCCGCAAGG CAAGCCAGCG GTGTTTTTGC ATGGCGGGCC GGGTTCGGGC
TGCTCGCCAG ACCACCGGCG GCTGTTCGAC CCCGAGCGCT ACTGCGTGCT GCTGTTCGAC
CAGCGCGGCT GCGGCCGTTC GCGCCCGCAT GCTTCGCTCC ACAACAACAC GACCTGGCAC
CTGGTCGCCG ACATCGAGCG GCTGCGCACC CTCTTGGGCG TCAAGCGCTG GCTGGTGTTC
GGCGGCTCCT GGGGCTCATC GCTGGCGCTG GCCTATGCGC AAACCCATCC GGCGCAGGTC
TCCGAACTCG TCGTGCGCGG CATCTTCACG CTGCGCCGCG CCGAACTGCT CTGGTATTAC
CAGGAGGGCG CTTCGTGGCT GTTTCCCGAT CTGTGGGAAG ATTTTGTCGC GCCGATTCCA
CCGGCCGAGC GCGGCGACCT GATGGCCGCC TACCGCCAGC GGCTGGTCGG CAGCGACCGC
GCCGCGCAAC TGGCCTGCGC CCGCGCCTGG AGCCTGTGGG AAGGCCAGAC CATCACGCTG
TTGCCCGATC CGACTGGTGC GGCCAAGCAT GGCGATGACG ACTTCGCGCT GGCGTTTTCG
CGCATCGAAA ACCATTATTT CGTGCATGGC GGCTGGCTGG AGGAGGGCCA GCTGATCCGG
GACGCGGGCA AGCTGGCCGG CATTCCCGGC GTCATCGTGC AGGGCCGCTA CGACATGGCC
TGCCCCGCCA GAACCGCCTG GGACTTGCAC CGCGCCTGGC CGCAGGCGGA ATTTCACCTG
ATTGCCGATG CCGGCCATGC CTTCAACGAG CCCGGCATCC TGGCGCAGCT GATCGCCGCG
ACTGACCGGT TTGCACGCTG A
 
Protein sequence
METSMTSSAP IVAAPALYPP IAPFRTGTLD TGDGHSIYWE LCGNPQGKPA VFLHGGPGSG 
CSPDHRRLFD PERYCVLLFD QRGCGRSRPH ASLHNNTTWH LVADIERLRT LLGVKRWLVF
GGSWGSSLAL AYAQTHPAQV SELVVRGIFT LRRAELLWYY QEGASWLFPD LWEDFVAPIP
PAERGDLMAA YRQRLVGSDR AAQLACARAW SLWEGQTITL LPDPTGAAKH GDDDFALAFS
RIENHYFVHG GWLEEGQLIR DAGKLAGIPG VIVQGRYDMA CPARTAWDLH RAWPQAEFHL
IADAGHAFNE PGILAQLIAA TDRFAR
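
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an illustrative assumption, not the method used to produce the record. It uses the amino-acid assignments shared by the standard code and translation table 11, does not handle alternative start codons, and uses hypothetical names (`CODON_TABLE`, `translate`). Only the first and last rows of the gene sequence above are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order: first base varies slowest, third base
# fastest, each running over T, C, A, G. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence in this record.
first_row = "ATGGAAACCT CCATGACTTC TTCCGCCCCC ATCGTTGCCG CGCCTGCGCT TTACCCGCCC"
last_row = "ACTGACCGGT TTGCACGCTG A"

print(translate(first_row))  # METSMTSSAPIVAAPALYPP
print(translate(last_row))   # TDRFAR
```

The two outputs match the first 20 and last 6 residues of the protein sequence. The last row can be translated on its own because the 960 bases before it are a whole number of codons.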