Gene PSPTO_5164 details


Gene Information

Locus tag: PSPTO_5164
Symbol: pip
ID: 1186849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas syringae pv. tomato str. DC3000
Kingdom: Bacteria
Replicon accession: NC_004578
Strand:
Start bp: 5876768
End bp: 5877739
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 60%
IMG OID: 637396483
Product: proline iminopeptidase
Protein accession: NP_794895
Protein GI: 28872276
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
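
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are inclusive (an assumption, though it is consistent with the listed 972 bp) and that the final codon is a stop codon that is not counted in the protein length:

```python
# Values copied from the record above.
start_bp = 5876768
end_bp = 5877739

# Assumption: both coordinates are inclusive, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bases; the terminal stop codon does not encode a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```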


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGACTT TGTACCCGCA GATCAAACCC TACGCCCGGC ACGATCTGGC CGTGGAACAA 
CCGCATGTGC TCTACGTCGA TGAAAGCGGT TCGCCTGAAG GTTTGCCTGT GGTGTTCATT
CACGGTGGCC CGGGTTCTGG ATGCGATGCG CACAGCCGCT GCTATTTCGA TCCCAACCTG
TACCGAATTG TTACCTTCGA TCAGCGTGGC TGTGGCCGCT CCACACCTCA TGCCAGCCTC
GAAAACAATA CCACCTGGAA GCTGGTGGAA GACCTTGAGG TCATTCGCGA GCACTTGGGC
ATCGACAAAT GGGTACTGTT CGGCGGCTCG TGGGGTTCGA CCCTCGCGCT GGCTTACGCT
CAGACCCACC CCGACCGCGT GCATGCGCTG ATTCTGCGTG GCGTGTTTCT GGCCCGTCAG
CAAGAAATCG ACTGGTTCTA TCAAGCGGGT GCCAGCCGCC TGTTCCCCGA TTACTGGCAG
GACTACGTCG CCCCTATCCC GTTGGATGAG CGCAACAATA TTCTCGCTGC CTTTCACAAG
CGTCTCACCG GCGCAGACCA GATTGCCCAG ATGCATGCCG CCAAGGCCTG GTCGACGTGG
GAAGGCCGCT GCGCAACCTT GCGTCCCAAT CCTCAGGTGG TCGACCGCTT TACCGATCCG
CACCGTGCCC TGTCCATCGC GCGTATCGAA TGCCACTACT TCATGAACAA GGCGTTTCTG
GAAGAGAACC AGCTGATTCG CGACATGCCG AAGATCGCTC ACCTGCCGGC AATCATTGTG
CACGGTCGTT ACGATGTCAT CTGCCCGCTG GACAATGCCT GGGAGCTGCA TCAGAACTGG
CCCGACAGCG AGCTGCAGAT CATTCGCGAC GCAGGGCATT CGGCCGCCGA AACCGGTATC
GCCGATGCGC TGGTACGTGC CGCTGCGCAG ATTGCGCAGA ACCTGCTCGA TCTGCCGCCC
GAAGAAGCCT GA
 
Protein sequence
MQTLYPQIKP YARHDLAVEQ PHVLYVDESG SPEGLPVVFI HGGPGSGCDA HSRCYFDPNL 
YRIVTFDQRG CGRSTPHASL ENNTTWKLVE DLEVIREHLG IDKWVLFGGS WGSTLALAYA
QTHPDRVHAL ILRGVFLARQ QEIDWFYQAG ASRLFPDYWQ DYVAPIPLDE RNNILAAFHK
RLTGADQIAQ MHAAKAWSTW EGRCATLRPN PQVVDRFTDP HRALSIARIE CHYFMNKAFL
EENQLIRDMP KIAHLPAIIV HGRYDVICPL DNAWELHQNW PDSELQIIRD AGHSAAETGI
ADALVRAAAQ IAQNLLDLPP EEA
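
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which this sketch does not model). It strips the 10-base spacing used in the listing before processing.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 20 bases of the gene sequence above; the trailing partial codon is ignored.
print(translate("ATGCAGACTT TGTACCCGCA"))
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content listed in the record.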