Gene RPD_0858 details


Gene Information

Locus tag: RPD_0858
Symbol:
ID: 4021332
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 961351
End bp: 963441
Gene Length: 2091 bp
Protein Length: 696 aa
Translation table: 11
GC content: 68%
IMG OID: 637961048
Product: peptidyl-dipeptidase Dcp
Protein accession: YP_567997
Protein GI: 91975338
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
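
The start, end, gene length, and protein length fields above are related by simple arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon. The sketch below checks this; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(961351, 963441)
print(length)                     # 2091
print(protein_length_aa(length))  # 696
```

Both results agree with the Gene Length (2091 bp) and Protein Length (696 aa) fields of this record.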


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.556421
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAA GCTCCGGACC GAATGCCGCA GCCCCGAGCA CCGGCAATCC GCTGTTGCAG 
GCCTGGACCA CCCCGTTCGA AACCCCGCCA TTCGCCGAGA TCGCGCCGGA GCATTTCCTG
CCGGCGTTCG AGCAGGCGTT CGCCGACCAT GCCGCCGAGA TCGCCGCGAT CACCCACGAT
CCGACCGAAC CGGACTTCGA CAACACCATC ACCGCGCTGG AGCGCTCCGG CAAGCTGCTC
AACAGGGTGG CGGCGGTGTT CTACGACCTG GTCTCGGCGC ACTCCAATCC GGCGCTTCTG
GAGATCGACA AGGACGTGTC GCTGCGGATG GCGCGGCACT GGAATCCGAT CATGATGAAC
GCCGTGCTGT TCGGCCGCAT CGCCGCGCTG CGCGACAAGC GCGCCGAGCT GAAGCTGACC
TCCGAGCAGC AGCGGCTGCT GGAGCGCACC TATACCCGCT TCCATCGCTC CGGTGCCGGC
CTCGACGACG CCGCCAAGGC GCGGATGGCC GAGATCAACG AGCGGCTGGC GCAGCTCGGC
ACCAATTTCG GCCATCATCT GCTCGGCGAC GAGCAGGACT GGTTCATGGA ATTGGGCGAA
AGCGACACCG ACGGCCTGCC CGCGAGCTAC GTCGCCGCCG CCCGTGCCGC GGCGAAGGAA
CGCGGGATGC CCGGCAAGGC GGTGGTGACG CTGTCGCGCT CCTCGGTCGA GCCGTTCCTG
AAGAGTTCGA GCCGCCGCGA CCTGCGCGAG AAGGTCTATC GCGCTTTCAT CGCCCGCGGC
GACAACGGCA ACGCCAACGA CAACAACGCG CTGATCGGCG AGATCCTCAG TCTGCGCGAG
GAGACCGCGA AGCTGCTCGG CTACCCGACC TACGCGGCCT ACCGGCTCGA GGATTCGATG
GCCAAGACGC CGGAAGCGGT GCGCGGCCTG CTGGAGCGGG TGTGGAAACC GGCGCGCGCC
CGCGCGATGG CCGACCGCGA CGCGCTGCAG GAGCTGGTCG CCGAGGACGG CGGCAATTTC
AAGCTGGCGC CGTGGGACTG GCGCTATTAC GCCGAGAGGC TGCGGCAGCG CCGCGCCAAT
TTCGACGATT CCGCGATCAA GCCGTATCTC GCGCTCGACA ACATGATCGC CGCCGCGTTC
GACACCGCGA CGCGGCTGTT CGGCGTCAGC TTCGCCGAGC GCAACGACGT CCCGGTGTGG
CACCCCGACG TCCGGGTCTG GGAGGTCAAG GATGCCGACG GCGCGCATCG CGGGCTGTTC
TACGGCGATT ACTACGCCCG GCCGTCGAAG CGCTCCGGCG CCTGGATGAC CTCGCTGCGC
GACCAGCAGA AGCTCGACGG CGCGGTGGCG CCGCTGATCA TCAATGTCTG CAACTTCGCC
AAGGGCGCCG ACGGCGAACC GTCGCTGCTG TCGCCCGATG ACGCCCGCAC GCTGTTCCAC
GAATTCGGCC ACGGCCTGCA CGGGATGCTG TCGGACGTGG TCTACCCGTC GCTGTCCGGC
ACCAGCGTGT TCACCGATTT CGTCGAACTG CCGTCGCAGC TCTATGAACA TTGGCAGGAG
CAGCCGCAGG TTCTGCAGCA GTTCGCCCGG CACTACCAGA CCGGCGAGCC GCTGCCCGAC
GATCTGTTGA GGCGCTTCAT CGCCGCGCGC AAATTCAACC AGGGCTTCGC CACGGTGGAA
TTCGTGTCCT CGGCGCTGCT CGACCTCGAG TTCCACACCC AGCCGGCCTC CGCGATCGGC
GAGGTCCGCG CCTTCGAACG CCGGGAGCTC GAGAAGATCG GGATGCCGGA GGAGATCGCG
CTGCGCCACC GGCCGCCACA GTTCGCCCAC ATCTTCACCG GCGATCATTA CGCCTCGGGT
TACTACAGTT ATATGTGGTC CGAGGTGATG GACGCCGACG CGTTCGGCGC GTTCGAGGAG
GCCGGCGACA TCTACGATCC GCAAGTCGCC AAGAGGCTGC GCGACGACAT CTACGCCTCG
GGCGGCTCGC GCGATCCGGA GGAGGCCTAT ATCGCCTTCC GCGGCCGCGC GCCGGAGCCC
GATGCGCTGC TGCGCCGGCG TGGCCTGCTC GAAACCCCGG AGGCCGCGTA A
 
Protein sequence
MSESSGPNAA APSTGNPLLQ AWTTPFETPP FAEIAPEHFL PAFEQAFADH AAEIAAITHD 
PTEPDFDNTI TALERSGKLL NRVAAVFYDL VSAHSNPALL EIDKDVSLRM ARHWNPIMMN
AVLFGRIAAL RDKRAELKLT SEQQRLLERT YTRFHRSGAG LDDAAKARMA EINERLAQLG
TNFGHHLLGD EQDWFMELGE SDTDGLPASY VAAARAAAKE RGMPGKAVVT LSRSSVEPFL
KSSSRRDLRE KVYRAFIARG DNGNANDNNA LIGEILSLRE ETAKLLGYPT YAAYRLEDSM
AKTPEAVRGL LERVWKPARA RAMADRDALQ ELVAEDGGNF KLAPWDWRYY AERLRQRRAN
FDDSAIKPYL ALDNMIAAAF DTATRLFGVS FAERNDVPVW HPDVRVWEVK DADGAHRGLF
YGDYYARPSK RSGAWMTSLR DQQKLDGAVA PLIINVCNFA KGADGEPSLL SPDDARTLFH
EFGHGLHGML SDVVYPSLSG TSVFTDFVEL PSQLYEHWQE QPQVLQQFAR HYQTGEPLPD
DLLRRFIAAR KFNQGFATVE FVSSALLDLE FHTQPASAIG EVRAFERREL EKIGMPEEIA
LRHRPPQFAH IFTGDHYASG YYSYMWSEVM DADAFGAFEE AGDIYDPQVA KRLRDDIYAS
GGSRDPEEAY IAFRGRAPEP DALLRRRGLL ETPEAA
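
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before use. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be joined, its GC fraction computed, and its translation compared with the protein sequence. It assumes the codon assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, not in codon meanings); the helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' is stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino = CODON_TABLE.get(seq[pos:pos + 3], "X")  # 'X' for unknown codons
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in this record.
head = clean_sequence("ATGTCTGAAA GCTCCGGACC GAATGCCGCA")
print(translate(head))  # MSESSGPNAA
```

The translated prefix matches the first block of the protein sequence. Applying `clean_sequence`, `gc_fraction`, and `translate` to the complete gene sequence would allow the GC content and Protein Length fields to be checked in the same way.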