Gene Rsph17029_2836 details


Gene Information

Locus tag: Rsph17029_2836
Symbol:
ID: 4897366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2989052
End bp: 2990017
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 68%
IMG OID: 640113439
Product: proline iminopeptidase
Protein accession: YP_001044710
Protein GI: 126463596
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
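
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon (the listed gene sequence ends in TGA). The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2989052
END_BP = 2990017
GENE_LENGTH_BP = 966
PROTEIN_LENGTH_AA = 321


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span is 966 bp, and 322 codons minus the stop codon give 321 residues.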


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCAAA GATCAGGCCA AAAGCGCGCA GTCGAGTTCC TCTATCCGTC GATCGATCCC 
TACGATCAGC GGGTCATCGA CATGGGCGAC GGCCATCGGA TCTATGTCGA GCAATGCGGC
GACCCGGACG GCGAGCCGGT GCTGGTGCTG CATGGCGGCC CCGGGGGCGG CTGCAGCCCC
TCGATGCGGC GCTATTTCGA CCCGAGCCGC TACCGCGTGA TCCTGTTCGA CCAGCGCGGC
TGCGGCCGGT CGCGGCCCCA TGCCTCGGTC GAGGCGAACA CGACCTGGCA CCTCGTCTCG
GACATCGAGG CGATCCGCCG GAAGCTCGGC ATCGACCGCT GGACCTGCTT CGGCGGCAGC
TGGGGGGCAA CCCTTGCGCT GATCTATGCG ATCTCGCACC CCGAGCGGGT GTCGAATCTG
ATCCTGCGCG GCGTCTTCCT GATGACCAAG GCCGAGCTCG ACTGGTTCTA CGGCGGCGGA
GCGGCGGCCT TCTTCCCCGA CATCTGGGCC CGGTTCGTGG CCCCCGTCCC GCCGGAGGAG
CGGGGCGATC TCGTCGCGGC CTACCGGCGG CGGCTCTTCT CGGGCAACCT GATGGAAGAG
ACGCGTTTCG GCCGCACTTG GGCCAACTGG GAGAATGCGC TGGCCTCGGT CGCGCAGGAC
GGGCCGCTGG GCGAGAGCCC GTCGGAATAT GCCCGCGCCT TCGCCCGGCT CGAGAACCAC
TATTTCTCCC ATGGGGGCTT CCTCGAGCAC GACGGCTGGA TCCTCGCCAA CCGCCACCGG
ATCGAGCATA TCCCGGCGGT GATCGTGCAG GGGCGCTACG ACATGATCTG CCCGCCGGTC
TCCGCCTGGA CGCTGGCCGA CGGGTGGGAA AAGGCGGATC TCAGGGTCGT GCCGTTCGCG
GGCCACGCGC TCTCGGAACC CGGCATCAGC GCCGAACTCG TCCGCGTGAT GGACACGCTT
CCCTGA
 
Protein sequence
MDQRSGQKRA VEFLYPSIDP YDQRVIDMGD GHRIYVEQCG DPDGEPVLVL HGGPGGGCSP 
SMRRYFDPSR YRVILFDQRG CGRSRPHASV EANTTWHLVS DIEAIRRKLG IDRWTCFGGS
WGATLALIYA ISHPERVSNL ILRGVFLMTK AELDWFYGGG AAAFFPDIWA RFVAPVPPEE
RGDLVAAYRR RLFSGNLMEE TRFGRTWANW ENALASVAQD GPLGESPSEY ARAFARLENH
YFSHGGFLEH DGWILANRHR IEHIPAVIVQ GRYDMICPPV SAWTLADGWE KADLRVVPFA
GHALSEPGIS AELVRVMDTL P
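
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which this sketch does not model). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    first_bases = "ATGGATCAAA GATCAGGCCA AAAGCGCGCA"
    print(translate(clean(first_bases)))  # MDQRSGQKRA
```

The ten residues produced from the first 30 bases match the start of the listed protein sequence. Passing the full gene sequence through `clean` and `translate` would allow the same comparison over the whole record.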