Gene PP_5028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPP_5028 
Symbolpip 
ID1045446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas putida KT2440 
KingdomBacteria 
Replicon accessionNC_002947 
Strand
Start bp5729386 
End bp5730357 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content64% 
IMG OID637148427 
Productproline iminopeptidase 
Protein accessionNP_747129 
Protein GI26991704 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACCC TCTACCCGCA GATCAAACCC TACGCCAGGC ACGATCTGGC CGTGGAAGCG 
CCGCATGTGC TTTATGTCGA CGAAAGCGGC TCGCCGGAAG GTCTGCCCGT GGTGTTCATC
CACGGAGGCC CCGGTGCTGG CTGCGACGCC CAGAGCCGCT GCTACTTTGA TCCCAACCTG
TACCGCATCA TCACCTTCGA CCAGCGCGGC TGTGGCCGCT CCACGCCCCA TGCGAGCCTG
GAGAACAACA CCACCTGGCA CCTGGTCGAA GACCTGGAGC GCATTCGCGA ACACCTTGGC
ATAGACAAGT GGGTGCTGTT CGGTGGTTCT TGGGGCTCGA CCCTGGCCCT GGCCTACGCT
CAGGCTCACC CCGAACGCGT GCACGGCCTG ATCCTGCGCG GCATCTTCCT GTGCCGGCCG
CAGGAAATCG AGTGGTTCTA CCAGGAAGGC GCCAGCCGCC TGTTCCCCGA CTACTGGCAG
GACTACATCG CACCGATTCC ACCGGAGGAA CGCGGCGACC TGGTCAGGGC CTTCCACAAG
CGCCTGACCG GTAACGACCA GATCGCCCAG ATGCACGCCG CCAAGGCGTG GTCCACCTGG
GAAGGCCGTA CCGCCACCCT GCGCCCCAAC CCGCTGGTGG TCGACCGCTT TTCCGAACCG
CAGCGGGCGC TGTCGATCGC CCGCATCGAA TGCCACTACT TCATGAACAA CGCCTTCCTC
GAACCGGACC AGCTGATCCG CGATCTGCCC AAAATCGCCC ACCTGCCGGC GGTGATCGTG
CATGGTCGCT ACGATGTGAT CTGCCCCTTG GACAACGCCT GGGCGTTGCA CCAGGCCTGG
CCGAACAGTG AGTTGAAAGT GATCCGTGAC GCCGGTCACG CGGCTTCCGA GCCTGGCATC
ACCGATGCCT TGGTGCGTGC CGCCGACCAG ATGGCCCGGC GCCTGCTCGA TTTGCCTCTG
GAAGAAGCAT GA
 
Protein sequence
MQTLYPQIKP YARHDLAVEA PHVLYVDESG SPEGLPVVFI HGGPGAGCDA QSRCYFDPNL 
YRIITFDQRG CGRSTPHASL ENNTTWHLVE DLERIREHLG IDKWVLFGGS WGSTLALAYA
QAHPERVHGL ILRGIFLCRP QEIEWFYQEG ASRLFPDYWQ DYIAPIPPEE RGDLVRAFHK
RLTGNDQIAQ MHAAKAWSTW EGRTATLRPN PLVVDRFSEP QRALSIARIE CHYFMNNAFL
EPDQLIRDLP KIAHLPAVIV HGRYDVICPL DNAWALHQAW PNSELKVIRD AGHAASEPGI
TDALVRAADQ MARRLLDLPL EEA