Gene RSP_1174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_1174 
Symbolpip 
ID3718167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp2941873 
End bp2942838 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content68% 
IMG OID640072405 
Productprolyl aminopeptidase 
Protein accessionYP_354259 
Protein GI77464755 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.651315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCAAA GATCAGGCCA AAAGCGCGCA GTCGAGTTCC TCTATCCGTC GATCGATCCC 
TACGATCAGC GGGTCATCGA CATGGGCGAC GGCCATCGGA TCTATGTCGA GCAATGCGGC
GACCCGGACG GCGAGCCGGT GCTGGTGCTG CATGGCGGCC CCGGGGGCGG ATGCAGCCCC
TCGATGCGGC GCTATTTCGA CCCGAGCCGC TACCGCGTGA TCCTGTTCGA CCAGCGCGGC
TGCGGCCGGT CGCGGCCCCA TGCCTCGGTC GAGGCGAACA CGACCTGGCA CCTCGTCTCG
GACATCGAGG CGATCCGCCG GAAGCTCGGC ATCGACCGCT GGACCTGCTT CGGCGGCAGC
TGGGGGGCGA CGCTTGCGCT GATCTATGCG ATCTCGCACC CCGAGCGGGT GTCGAACCTG
ATCCTGCGCG GCGTCTTCCT GATGACCAAG GCCGAGCTCG ACTGGTTCTA CGGCGGCGGA
GCGGCGGCCT TCTTCCCCGA CATCTGGGCC CGGTTCGTGG CCCCCGTCCC GCCGGAGGAG
CGGGGCGATC TCGTCGCGGC CTATCGGCGG CGGCTCTTCT CGGGAAACCT GATGGAAGAG
ACGCGTTTCG GCCGCACCTG GGCCAACTGG GAGAATGCGC TGGCCTCGGT CGCGCAGGAC
GGGCCGCTGG GCGAGAGCCC GTCGGAATAT GCCCGCGCCT TCGCCCGGCT CGAGAACCAC
TATTTCTCCC ACGCAGGCTT CCTCGAGCAC GACGGCTGGA TCCTCGCCAA CCGCCACCGG
ATCGAGCATA TCCCGGCGGT GATCGTGCAG GGGCGCTACG ACATGATCTG CCCGCCGGTC
TCCGCCTGGA CGCTGGCCGA CGGGTGGGAG AAGGCGGATC TCAGGATCGT GCCGTTCGCG
GGCCACGCGC TCTCGGAACC CGGCATCAGC GCCGAACTCG TCCGCGTGAT GGACACGCTT
CCCTGA
 
Protein sequence
MDQRSGQKRA VEFLYPSIDP YDQRVIDMGD GHRIYVEQCG DPDGEPVLVL HGGPGGGCSP 
SMRRYFDPSR YRVILFDQRG CGRSRPHASV EANTTWHLVS DIEAIRRKLG IDRWTCFGGS
WGATLALIYA ISHPERVSNL ILRGVFLMTK AELDWFYGGG AAAFFPDIWA RFVAPVPPEE
RGDLVAAYRR RLFSGNLMEE TRFGRTWANW ENALASVAQD GPLGESPSEY ARAFARLENH
YFSHAGFLEH DGWILANRHR IEHIPAVIVQ GRYDMICPPV SAWTLADGWE KADLRIVPFA
GHALSEPGIS AELVRVMDTL P