Gene RPD_3473 details


Gene Information

Locus tag: RPD_3473
Symbol:
ID: 4023987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3852277
End bp: 3853266
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 66%
IMG OID: 637963677
Product: proline iminopeptidase
Protein accession: YP_570597
Protein GI: 91977938
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
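
The length fields above can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record, but the record does not state them. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3852277, 3853266)
print(length)                     # 990
print(protein_length_aa(length))  # 329
```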


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.598272
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACCCG ACGCGAAGTC CGAGATTCGT TCCGACAGCA ACGGCAAATC CACGGCGCCG 
CTCACCGCGC AAATGCTCGC GGTCGGCGAC GGCCACGAGT TATATGTCGA AACCAACGGC
AACCCCGATG GTCTCGCCGC GGTCTACCTG CATGGCGGCC CCGGCAGCGG TTGCCAGCCC
GATCATCGGC GGCTGTTCGA TGCGCAGCGG TTTCATGCCG TGTTGTTCGA TCAGCGCGGC
GCGGGACGTA GCCGCCCGAA AGGCGGGCGT TACGCGAACA CGCTGCCGCA TCTGATCGCC
GACATGGAAA TGATCCGCAC CACGCTCGGC ATCGAACGCT GGCTCGTAGT CGGCGGATCG
TGGGGCGCGA CGCTGGCGCT GGCCTATGCG CAGTCGCATC CGCAGCGCGT CAGCGGCGTC
GTTCTGCGTG CGGCTTTTCT CGGCACGCGC GCGGAACTCG AGGGTGCCTT CATGTCGAGC
CTGCCGCGGT TCTATCCGGA ACTGCACGCG GATTTTCTCG GCATCCTTCC CGCGGCGGAG
CGCAGCGCGC CGCTCGACGC CTATTGGCGG CGCATCCTCG ATCCCGATCC GGAGGTGCAC
GGCCCCGCGG CGCGGGCCTG GGGCGAAACC GAGGCGATCA TGTCGCAAAT CGGGCCAAAG
CGGTCACGGC TCGAAATCTC CAATGAAAAC AATACCCGGC CGATCCCGTC GACGCCGTTC
ATGGAAGCGC ATTACTTCGT CCACGACTGC TTCATGCGCC CCGATCAATT GCTGCATGAC
GCGCCGGCGC TCGCGGGCAT TCCCGGCGTC ATCGTGCAAG GCCGCTACGA TCTGCTCTGC
CCGCCGGCCA CCGCGCATCG GCTCGGCGCG GCGTGGCCGG ACGCCGAACT ACGCGTCATC
GATGCCGCCG GACATCTGTT GTACGATCCG GGAATCCGCG ACGCGGTGAT CGCCGCGATC
AACGACCTCG CGACCAAGAT CAAAGCGTGA
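
The gene sequence is printed in space-separated blocks of ten bases. The sketch below shows one way to strip that layout and compute a GC percentage. The helper names are hypothetical and the example input is only the first two blocks of the sequence, so the printed value is not expected to match the whole-gene GC content reported in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used only as sample input.
sample = clean_sequence("ATGGCACCCG ACGCGAAGTC")
print(len(sample), gc_percent(sample))
```

To check the whole gene, paste all sequence lines into one string and pass it through clean_sequence before calling gc_percent.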
 
Protein sequence
MAPDAKSEIR SDSNGKSTAP LTAQMLAVGD GHELYVETNG NPDGLAAVYL HGGPGSGCQP 
DHRRLFDAQR FHAVLFDQRG AGRSRPKGGR YANTLPHLIA DMEMIRTTLG IERWLVVGGS
WGATLALAYA QSHPQRVSGV VLRAAFLGTR AELEGAFMSS LPRFYPELHA DFLGILPAAE
RSAPLDAYWR RILDPDPEVH GPAARAWGET EAIMSQIGPK RSRLEISNEN NTRPIPSTPF
MEAHYFVHDC FMRPDQLLHD APALAGIPGV IVQGRYDLLC PPATAHRLGA AWPDAELRVI
DAAGHLLYDP GIRDAVIAAI NDLATKIKA
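
The protein sequence can be checked against the gene sequence by translating codon by codon. This is a minimal sketch, not the database's own method. It builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as starts). Because this gene begins with ATG, no special start-codon handling is included. The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGCACCCG ACGCGAAGTC CGAGATTCGT"))  # MAPDAKSEIR
```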