Gene RPC_4839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4839 
Symbol 
ID3973543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5399807 
End bp5401897 
Gene Length2091 bp 
Protein Length696 aa 
Translation table11 
GC content66% 
IMG OID637927951 
Productpeptidyl-dipeptidase Dcp 
Protein accessionYP_534680 
Protein GI90426310 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGACT CCTCGGTCGC GACGGCCAAT TCCGAACGGC TCGACAATCC CTTGTTGCAG 
GCCTGGCGGA CGCCGTTCGA GACCCCGCCG TTCGCAGAGA TCGCGCCGGA ACACTTCCTG
CCGGCGTTCG AGCAGGCCTT CGACGACCAC ACCGCCGAGA TCGCCGCGAT CACCGACGAT
CCCTCGGCGC CCGACTTCGC CAACACCATC ACCGCGCTGG AGCGCTCCGG CCGGCTTTTG
AACCGGGTCG CCGCGGTGTT CTACGATCTG GTGTCGGCGC ATTCCAACCC GGCGCTGCTC
GCGATCGACA CCGAAGTGTC GCAGCGGATG GCGCGGCACT GGAATCCGAT CATGATGAAC
GCCGCGCTGT TCGGCCGGAT CGCGCTGCTT TACGGGCTGC GGTCCAGGCT CAAGCTGTCC
GGCGAAGAGC TGCGGCTGTT GGAGCGCACC TACACCCGCT TCCATCGCTC CGGCGCCGGG
CTCGACGCCA AGGCCAAGGC GCGGATGGCC GAGATCAACG AGCGGCTGGC CAACCTCGGC
ACCGCATTCA GCCATCATCT GCTCGGCGAC GAGCAGGACT GGACCCTGGA GCTCGGCGAC
AACGATTATG ACGGGCTGTC CGACAGCTTC GTCGCCGCCG CCAAGGCGGC GGCTGCCGAG
CGCGGACGGC CCGGCAAGGC GGTGGTGACG CTGTCGCGCT CCTCGGTCGA GCCGTTCCTG
AAGAGCTCAT CCCGGCGTGA TTTGCGCGAA AAGGTCTACA AGGCGTTCAC CGCGCGCGGC
AACAACGGCA ACGCCAACGA CAACAACGCG GCGATCACCG AGATCCTGCA GCTGCGCGAA
GAGACCGCCA AGCTGCTCGG CTTCGCCAGC TTCGCCGAAT ATCGGCTGGA AGATTCCATG
GCTAAGACGC CGCAGGCAGT GCGCGGGCTG CTGGAACGGG TGTGGCGGCC GGCGCGCGCC
CGGGCGCTGG CGGATCGCGA CGCGCTGCAG GCGCTGATCG CGGAGGAAGG CGGCAATTTC
GCGCTGGCAG CCTGGGACTG GCGCTATTAC GCCGAGAAGC TCAGGCAGCG CCGCGCCAAT
TTCGACGACG CCGCGATCAA GCCGTATCTT TCGCTCGACA ATATGATCGC CGCCGCCTTC
GACACCGCAA CCAAGCTGTT CGGCGTCACG TTCAACGAGC GCAACGACAT CCCGGTCTGG
CATCCCGACG TCCGGGTCTG GGAGGTGTTC GATCCCGCAG GCACCCACAA GGGGCTGTTC
TACGGCGATT ACTATGCCCG CCCCTCGAAG CGCTCCGGCG CCTGGATGAC CTCGCTGCGC
GACCAGCAGA AGCTCGATGG CAACGTCGCG CCGCTGATCA TCAACGTCTG CAACTTCGCC
AAGGGTTCGA ACGGCGAACC GGCGCTGCTG TCGCCGGACG ACGCCCGCAC TTTGTTCCAC
GAATTCGGCC ACGGCCTGCA CGGCATGCTG TCCGACGTGG TGTATCCGTC GCTGTCCGGC
ACCGCGGTGT TCACCGATTT CGTCGAATTG CCGTCGCAGC TCTACGAGCA CTGGCAGGAA
CAGCCGCAGG TGCTGCGGCA ATTCGCGACG CATTACCAGA CCGGCGAGCC GCTGCCCGAC
GATCTGTTGC GGCGCTTCCT CGCCGCGCGA AAATTCAACC AGGGCTTCGC CACCGTGGAA
TTCGTCTCTT CGGCGCTGAT CGATCTCGAA TTCCATACCC AGCCGGCTTC GGCGATCGGC
GACATCGCCG AGTTCGAGCG CCGCGAATTG CAAAAGATCG GCATGCCGGA GGAAATCGCG
ATGCGGCACC GGCCGACGCA GTTCGGCCAC ATCTTCTCCG GCGACCATTA TGCCTCGGGC
TATTACAGCT ACATGTGGAG CGAAGTGATG GACGCCGACG CGTTCGGCGC GTTCGAGGAG
GCCGGCGACA TCTTCGATTC CGCGGTGGCC AAGCGGCTGC AGGACGATAT CTACGCCGCG
GGCGGCTCCC GCGATCCGGA GCACGCCTAT ATCGCGTTCC GCGGACGACC GCCCGAGCCC
GACGCGCTGC TGCGCCGCCG CGGCCTGCTC GATGTGCCCG AGGCGGCGTA A
 
Protein sequence
MPDSSVATAN SERLDNPLLQ AWRTPFETPP FAEIAPEHFL PAFEQAFDDH TAEIAAITDD 
PSAPDFANTI TALERSGRLL NRVAAVFYDL VSAHSNPALL AIDTEVSQRM ARHWNPIMMN
AALFGRIALL YGLRSRLKLS GEELRLLERT YTRFHRSGAG LDAKAKARMA EINERLANLG
TAFSHHLLGD EQDWTLELGD NDYDGLSDSF VAAAKAAAAE RGRPGKAVVT LSRSSVEPFL
KSSSRRDLRE KVYKAFTARG NNGNANDNNA AITEILQLRE ETAKLLGFAS FAEYRLEDSM
AKTPQAVRGL LERVWRPARA RALADRDALQ ALIAEEGGNF ALAAWDWRYY AEKLRQRRAN
FDDAAIKPYL SLDNMIAAAF DTATKLFGVT FNERNDIPVW HPDVRVWEVF DPAGTHKGLF
YGDYYARPSK RSGAWMTSLR DQQKLDGNVA PLIINVCNFA KGSNGEPALL SPDDARTLFH
EFGHGLHGML SDVVYPSLSG TAVFTDFVEL PSQLYEHWQE QPQVLRQFAT HYQTGEPLPD
DLLRRFLAAR KFNQGFATVE FVSSALIDLE FHTQPASAIG DIAEFERREL QKIGMPEEIA
MRHRPTQFGH IFSGDHYASG YYSYMWSEVM DADAFGAFEE AGDIFDSAVA KRLQDDIYAA
GGSRDPEHAY IAFRGRPPEP DALLRRRGLL DVPEAA