Gene RPD_4023 details

Gene Information

Locus tag: RPD_4023
Symbol:
ID: 4024540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4471424
End bp: 4472554
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 67%
IMG OID: 637964226
Product: carboxypeptidase
Protein accession: YP_571143
Protein GI: 91978484
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID:
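
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; neither convention is stated in this record, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 4471424
end_bp = 4472554
gene_length_bp = 1131
protein_length_aa = 376

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the listed gene length
print(span_bp % 3 == 0)                          # span is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop match the protein length
```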


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.50112
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.963752
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCAG CCAATTTACC GTTTGATTCC GAAGTGATGC TGCAGGGCCT GCGCGCCTGG 
GTTGAATGCG AAAGCCCGAC CTGGGACGCC GCCGCGGTCG AGCGCATGCT CGACCTCGCC
GCGCGCGACA TGGCCATCAT GGGTGCGGCG ATCGAACGCA TCGCCGGACG GCAGGGCTTC
GCCGGCTGCG TCCGCGCACG CTTCCCGCAT CCGCGGCAGG GCGAGCCCGG CATCCTGATC
GCCGGCCATC TCGACACCGT GCACCCGGTC GGCACGCTGC AGAAACTGCA ATGGCGCCGC
GACGGCAACA AATGCTTCGG CCCCGGCATC TTCGACATGA AGGGCGGCAA CTACCTGACG
CTTGAAGCCA TTCGACAGCT CGCGCGCGCA TCGTTCACCA CGCCGTTGCC GATCACCGTC
TTGTTCACGC CGGACGAGGA AGTCGGCACG CCCTCGACGC GAGACATCAT CGAGGCCGAG
GCCGCCCGCA ACAAATATGT GCTGGTTCCG GAACCCGGCC GTCCGGACAA CGGCGTCGTC
ACCGGTCGCT ACGCGATCGC CCGCTTCAAC CTGACGGCAA CCGGAAAGCC GAGCCACGCC
GGCGCAACGC TGTCGTCGGG GCGTTCCGCG ATACGGGAAA TGGCGCGGCA GATTCTGGCG
ATCGACGCGA TGACGACCGA CGACTGCACC TTCAGCGTCG GCATCGTGCA TGGCGGGCAA
TGGGTCAATT GCGTCGCCAC CACATGCACC GGCGAAGCGC TGAGCATGGC CAAGCGACAG
GCCGATCTGG ATCGCGGCGT CGAACGGATG CTGGCGCTGT CCGGCGCCGG CAACGACGTC
GGTTTCGAGG TGACCCGCGG CGTCACCCGC CCGGTCTGGG AACCCGATGC CGGCACGATG
GCCCTGTACG AGAAAGCCTC CGCCGTCGCC AAACAGATGG GCATGTCGCT GCCTCACGGC
AGTGCAGGCG GCGGCTCCGA CGGCAACTTC ACCGGCGCGA TGGGAATTCC GACGCTGGAC
GGACTCGGCG TGCGCGGCGC CGATGCCCAC ACGCTGAACG AGCATATCGA GGTCGACAGT
CTGGCCGAGC GCGGCCGCTT GATGGCGGGG CTGCTGGCGA CTCTGGAATG A
 
Protein sequence
MNPANLPFDS EVMLQGLRAW VECESPTWDA AAVERMLDLA ARDMAIMGAA IERIAGRQGF 
AGCVRARFPH PRQGEPGILI AGHLDTVHPV GTLQKLQWRR DGNKCFGPGI FDMKGGNYLT
LEAIRQLARA SFTTPLPITV LFTPDEEVGT PSTRDIIEAE AARNKYVLVP EPGRPDNGVV
TGRYAIARFN LTATGKPSHA GATLSSGRSA IREMARQILA IDAMTTDDCT FSVGIVHGGQ
WVNCVATTCT GEALSMAKRQ ADLDRGVERM LALSGAGNDV GFEVTRGVTR PVWEPDAGTM
ALYEKASAVA KQMGMSLPHG SAGGGSDGNF TGAMGIPTLD GLGVRGADAH TLNEHIEVDS
LAERGRLMAG LLATLE
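
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation of the kind that could be compared against the GC content field; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, used here as a short sample only.
sample = "ATGAACCCAG CCAATTTACC GTTTGATTCC GAAGTGATGC TGCAGGGCCT GCGCGCCTGG"
dna = clean_sequence(sample)

print(len(dna))
print(round(gc_fraction(dna), 2))
```

To reproduce the figure reported for the whole gene, pass all lines of the gene sequence to `clean_sequence` instead of the one-line sample.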