Gene Rsph17025_2767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2767 
Symbol 
ID5085120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2813324 
End bp2814292 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content67% 
IMG OID640484330 
Productproline iminopeptidase 
Protein accessionYP_001168959 
Protein GI146278800 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAAA GATCAGGCCA AAAGCGCGCA GTCGAGTTCC TTTATCCGTC GATCGACCCG 
TTCGATCAGC GGGTGATCGA CATGGGCGAC GGCCACCGGA TCTATGTCGA GCAGTGCGGC
AACCCGCAAG GTGAGGCGGT GCTCGTGCTT CATGGCGGAC CGGGCGGCGG CTGCAGCCCC
TCGATGCGGC GCTATTTCGA CCCGGTACGC TACCGGGTGG TGCTCTTCGA CCAGCGCGGC
TGCGGGCGGT CGCGGCCCCA TGCCTCGGTC GAGGCCAACA CGACCTGGCA TCTCGTCTCG
GACATCGAGG TGATCCGCGC CAAGCTCGGG ATCGACCGCT GGACCTGCTT CGGGGGCAGC
TGGGGCGCCA CTCTGGCGCT GATCTACGCC ATTTCGCACC CCGAGCGGGT GACGAACCTC
GTGCTGCGCG GCGTCTTCCT GATGACCCGG GCGGAACTGG ACTGGTTCTA CGGCGGTGGC
GCGGCGACCT TCTTCCCCGA CATCTGGGCG CGGTTCGTGG CACCTGTCCC CGCGGCCGAG
AGAGGCGACA TGATCGCCGC CTATCACCGG CGGCTGTTCT CGGGGAACCT GATGGAAGAG
AGCCGGTTCG GCCGCGCCTG GGCGAACTGG GAGAACGCGC TCGCCTCGGT CTCGCAAGAC
GTTCCGGTGG GCGAGAGCCC CTCGGAATAT GCGCGCGCCT TCGCCCGACT GGAGAATCAC
TATTTCTCGA ATGCAGGTTT CCTCGAGCAG GACGGCTGGA TCCTCGCCAA CCGCTCGCGG
ATCGCGCACA TTCCGGCCGT GATCGTGCAG GGCCGCTATG ACATGATCTG CCCCCCGCTC
TCTGCCTGGA AACTGGCTGA GGGCTGGGAC AAGGCGGACC TGCGGCTGGT GCCCTTCGCG
GGCCACGCAC TTTCGGAACC CGGCATCAGC GCCGAGCTGG TGCGCGTGAT GGACACGCTT
CCCCGCTAG
 
Protein sequence
MDQRSGQKRA VEFLYPSIDP FDQRVIDMGD GHRIYVEQCG NPQGEAVLVL HGGPGGGCSP 
SMRRYFDPVR YRVVLFDQRG CGRSRPHASV EANTTWHLVS DIEVIRAKLG IDRWTCFGGS
WGATLALIYA ISHPERVTNL VLRGVFLMTR AELDWFYGGG AATFFPDIWA RFVAPVPAAE
RGDMIAAYHR RLFSGNLMEE SRFGRAWANW ENALASVSQD VPVGESPSEY ARAFARLENH
YFSNAGFLEQ DGWILANRSR IAHIPAVIVQ GRYDMICPPL SAWKLAEGWD KADLRLVPFA
GHALSEPGIS AELVRVMDTL PR