Gene CPS_3964 details

Gene Information

Locus tag: CPS_3964
Symbol: pip2
ID: 3520788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Colwellia psychrerythraea 34H
Kingdom: Bacteria
Replicon accession: NC_003910
Strand:
Start bp: 4152998
End bp: 4153954
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 41%
IMG OID: 637286410
Product: proline iminopeptidase
Protein accession: YP_270622
Protein GI: 71280323
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
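
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names gene_length and protein_length are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4152998, 4153954)
print(length)                  # 957
print(protein_length(length))  # 318
```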


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.290293
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAGA TGAATCAGTT TTACCCTGAA ATATCCCCGT TTAATGACTT TCTACTTGAT 
GTTGATGGCC AACATCGAAT TTATGTAGAG CAGTGTGGTA ATCCTAAAGG GCAACCCGTG
CTTTTCATTC ATGGTGGTCC AGGAGGCGGT TGTTCAACTA ATGACAGGCG TTTTTTTGAT
CCTGAGCAAT ACCATATCAT TTTATTTGAT CAACGTGGCT GTGGTCGTTC ATTACCTCAT
GGTTGTCTAG ATAACAACGA AACCAATTTC TTAGTTGCAG ATATAGAAAA GATTCGCCAA
CACTTGAATA TTGAACAATG GCATGTGTTT GGTGGTTCGT GGGGCTCTAC GTTATCTTTA
GTCTATGCTG AGGCGCATCC GGTCAGTGTT AAAAGTTTAG TCTTACGTGG TATTTTCCTT
GGACGTGAGG TTGATACTAA TTGGACATTT TCAGGTGGCG GGGCAACACG TATTTTCCCT
GATTATTGGC AAGATTATAT TGATGTACTT CCGCTAGGCA GAGAACAAGC GACGACTAAA
GCGGCTTACG AAATGTTGAT TGGTGAGGAT AAAGCCCTAG CGCAAAAAAT AGCAACGGCT
TGGAGTATTT GGGAAATCCG TTGCTGTACC TTAATACCTG ATCAAGCCTT TGTTGATGCA
GCTACAGGTG ACGACCATGC TTGGACATTA GCTCGCCATG AGGCACACTT TATGGTGAAT
GATTGTTTTT TAACTGACAA TCAGATCTTA GCGAACTGCG ATAAAATAAA AGATATTCCA
ACGACTATCG TTCATGGTCG TTATGATATT GTTTGTCCTG CTGATAATGC GTGGTTATTA
CATCAACAGT TACCTAATTC GCGTCTTGTT ATTAGTGAGG CATCAGGTCA TGCTTCTGTT
GAACCAAACA CTAAGCATCA CTTGATTGCG GCAACTCAAT CAATGTTGTC ATTGTAG
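
The gene sequence above is displayed in blocks of ten bases separated by spaces. As a rough sketch of how such a listing could be joined into one string and its GC fraction computed, the helper names clean_sequence and gc_fraction below are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence (six blocks of ten bases).
first_line = (
    "ATGAACAAGA TGAATCAGTT TTACCCTGAA "
    "ATATCCCCGT TTAATGACTT TCTACTTGAT"
)
print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 2))
```

Applying gc_fraction to the complete listing, pasted as a single string, would give a value to compare against the GC content reported in the Gene Information section.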
 
Protein sequence
MNKMNQFYPE ISPFNDFLLD VDGQHRIYVE QCGNPKGQPV LFIHGGPGGG CSTNDRRFFD 
PEQYHIILFD QRGCGRSLPH GCLDNNETNF LVADIEKIRQ HLNIEQWHVF GGSWGSTLSL
VYAEAHPVSV KSLVLRGIFL GREVDTNWTF SGGGATRIFP DYWQDYIDVL PLGREQATTK
AAYEMLIGED KALAQKIATA WSIWEIRCCT LIPDQAFVDA ATGDDHAWTL ARHEAHFMVN
DCFLTDNQIL ANCDKIKDIP TTIVHGRYDI VCPADNAWLL HQQLPNSRLV ISEASGHASV
EPNTKHHLIA ATQSMLSL