Gene RPB_1225 details


Gene Information

Locus tag: RPB_1225
Symbol:
ID: 3910160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1401931
End bp: 1403061
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 69%
IMG OID: 637883119
Product: carboxypeptidase
Protein accession: YP_484846
Protein GI: 86748350
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID:
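
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the helper names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1401931
END_BP = 1403061
GENE_LENGTH_BP = 1131
PROTEIN_LENGTH_AA = 376


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks print True for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: 1403061 - 1401931 + 1 = 1131 bp, and 1131 / 3 - 1 = 376 aa.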


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCAG CAAATCTGCC GTTTGATTCC GAAGCCATGC TGCAAGGCCT GCGCGCCTGG 
GTCGAGTGCG AAAGCCCGAC CTGGGACAAA GCCGCCGTCG AGCGCATGCT CGACCTCGCC
GCGCGCGACA TGGCGGTGAT GGGTGCGTCG ATCGAACGCA TCGCCGGACG GCAGGGCTTC
GCCGGCTGTG TTCGCGCACG CTTTCCGCAC CCGCGGCAGG GCGAGCCCGG CATCCTGATC
GCCGGCCATC TCGACACCGT GCATCCGGTC GGCACGATCG AGAAACTGCA ATGGCGCCGC
GACGGCAACA AATGCTACGG CCCGGGCATC TTCGACATGA AGGGCGGCAA CTATCTGACG
CTCGAAGCCA TCCGCCAGCT CGCGCGCGCG TCGTTCACGA CGCCGCTGCC GGTCACCGTG
CTGTTCACGC CGGACGAGGA AGTCGGCACG CCCTCGACCC GGGACATCAT CGAGGCGGAG
GCCGCCCGCA ACAAATACGT GCTGGTGCCG GAGCCCGGCC GCCCCGACAA CGGCGTCGTC
ACCGGCCGCT ACGCGATCGC GCGATTCAAT CTGACGGCGA CCGGCAAGCC CAGCCACGCC
GGCGCGACGC TGTCCTCGGG ACGTTCCGCG ATCCGGGAAA TGGCGCGGCA GATATTGGCG
ATCGACGCGA TGACGACGGA GGACTGCACG TTCAGCGTCG GCATCGTGCA CGGCGGACAA
TGGGTCAATT GCGTCGCCAC CACCTGCACC GGCGAGGCGC TCAGCATGGC GAAGCGGCAG
GCCGATCTCG ACCGCGGCGT CGAACGGATG CTGGCGCTGT CCGGCACCAG CAACGACGTC
GGCTTCGAAG TGACGCGCGG CGTGACGCGG CCGGTCTGGG AGCCCGACGC CGGCACCATG
GCGCTGTACC AGAAGGCGGC CGCGATCGCC GACCAGCTCG GGCTGAAGCT GCCGCACGGC
AGCGCCGGCG GCGGTTCCGA CGGCAACTTC ACCGGCGCGA TGGGGATCCC GACTCTCGAC
GGCCTCGGCG TGCGTGGCGC CGACGCCCAC ACGCTGAACG AGCATATCGA AGTCGATAGT
CTGGCGGAAC GCGGGCGCCT GATGGCCGGG CTGCTCGCGA CTCTCGCATG A
 
Protein sequence
MNPANLPFDS EAMLQGLRAW VECESPTWDK AAVERMLDLA ARDMAVMGAS IERIAGRQGF 
AGCVRARFPH PRQGEPGILI AGHLDTVHPV GTIEKLQWRR DGNKCYGPGI FDMKGGNYLT
LEAIRQLARA SFTTPLPVTV LFTPDEEVGT PSTRDIIEAE AARNKYVLVP EPGRPDNGVV
TGRYAIARFN LTATGKPSHA GATLSSGRSA IREMARQILA IDAMTTEDCT FSVGIVHGGQ
WVNCVATTCT GEALSMAKRQ ADLDRGVERM LALSGTSNDV GFEVTRGVTR PVWEPDAGTM
ALYQKAAAIA DQLGLKLPHG SAGGGSDGNF TGAMGIPTLD GLGVRGADAH TLNEHIEVDS
LAERGRLMAG LLATLA
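
The GC content field in the Gene Information section can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the whole gene sequence; the function name `gc_percent` is hypothetical, and the exact rounding used by the source database is not known.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence in this record.
first_block = "ATGAACCCAG"
print(gc_percent(first_block))  # 50.0
```

To check the reported value, the full gene sequence can be pasted into `gc_percent` as a single string; the 10-base grouping and line breaks do not need to be removed first.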