Gene RPB_4545 details

Gene Information

Locus tag: RPB_4545
Symbol:
ID: 3912362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5139349
End bp: 5141433
Gene Length: 2085 bp
Protein Length: 694 aa
Translation table: 11
GC content: 68%
IMG OID: 637886449
Product: peptidyl-dipeptidase Dcp
Protein accession: YP_488139
Protein GI: 86751643
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
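
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive (an assumption, not stated in the record) and that the final codon of the CDS is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5139349
end_bp = 5141433
gene_length_bp = 2085
protein_length_aa = 694

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is made of whole codons; the last codon is a stop and is not translated.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the listed gene length (2085 bp) and protein length (694 aa) agree with the coordinates.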


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.757808
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAA GCTCCGGACC GATTGCCGCG CCCACGGGCA ATCCGCTGTT GCAGGCCTGG 
ACCACGCCGT TCGAAACCCC GCCCTTCACC GAGATCGTGC CCGAGCATTT CCTGCCGGCG
TTCGAGCGGG CGTTCACCGA CCATGCCGCC GAGATCGCCG CGATCGCCAA CGATCCGACC
GAGCCGGACT TCGCCAACAC CATCACGGCG CTGGAGCGCT CCGGCAAGCT GCTCAACCGG
GTCGCCGCGG TGTTCTACGA CCTGGTCTCG GCGCACTCCA ATCCGGCGCT GCTGGAGATC
GACAAGGACG TGTCGCTGCG GATGGCGCGA CACTGGAATC CGATCATGAT GAACGCCGTG
CTGTTCGGCC GCATCGCGGC GCTGCACGAC AAGCGCGCCG AGCTGAAGCT GACCTCGGAA
GAGCGTCGCC TGCTGGAGCG CACCTACACC CGCTTCCACC GCTCCGGCGC CGGCCTCGAC
GAGGCCGCGA AGGCGCGGAT GGCCGAGATC AACGAGCGGC TGGCGCAGCT CGGCACCAAC
TTCAGCCACC ATCTGCTCGG CGACGAGCAG GACTGGTTCA TGGAGATCGG CGAGGGCGAT
ACCGAGGGGC TGCCGGACAG CTTCGTCGCC GCCGCGCGCG CTGCGGCGGA CGAGCGTGGC
CTGCCCGGCA AGGCGGTGGT GACGCTGTCG CGCTCCTCGG TCGAGCCGTT CCTGAAGATG
TCCGGCCGCC GCGATCTGCG CGAGAAGGTC TACCGCGCCT TCATCGCCCG CGGCGACAAC
GGCAACGCCA ACGACAACAA CGCGCTGATC GGCGAGATCC TCGGCCTGCG CGAGGAGAGC
GCCAAGCTGC TCGGCTATCC GACCTTCGCG GCCTACCGGC TGGAGGATTC GATGGCCAAG
ACGCCGGAAG CGGTGCGCGG CCTGCTGGAG CGGGTGTGGA AGCCGGCGCG CGCCCGCGCG
ATGGCCGACC GCGACGCGCT GCAGGAGCTG GTCACGGAGG AGGGCGGCAA TTTCGAGCTG
GCGCCGTGGG ACTGGCGCTT CTACGCCGAG AAGCTGCGCC AGCGCCGCGC CAATTTCGAC
GACGCCGCGA TCAAGCCGTA TCTGTCGCTC GACAACATGA TCGTCGCGGC CTTCGACACC
GCGACCCGGC TGTTCGGCGT CACGTTCGCC GAGCGCAAGG ACGTGCCGGT GTGGCACCCG
GACGTCCGGG TCTGGGAAGT GAAGGATGCC GACGGCAGCC ATCGCGGGCT GTTCTACGGC
GATTACTATG CCCGGCCGTC GAAGCGCTCC GGCGCCTGGA TGACGTCGCT GCGCGACCAG
CAGAAGCTCG ACGGCGCGGT GGCGCCGCTG ATCATCAATG TCTGCAATTT TTCGAAGGGC
GCCGACGGCG AACCGTCGCT GCTGTCGCCC GACGACGCCC GCACGCTGTT CCACGAATTC
GGCCACGGCC TGCACGGCAT GATGTCGGAC GTGACCTATC CGTCGCTGTC CGGCACCAGC
GTGTTCACCG ATTTCGTCGA ACTGCCCTCG CAGCTCTACG AGCACTGGCA GGAGCGGCCC
GAGGTGCTGC GGCGTTTCGC CCGGCACTAC CAGACCGGCG AGCCGCTGCC CGACGATCTG
CTGCAGCGCT TCATCGCCGC GCGCAAATTC AACCAGGGCT TCGCCACGGT GGAATTCGTG
TCCTCGGCGC TGCTCGACCT CGAATTCCAC ACCCAGCCGG CCGCGAGCGT CGGCGATATC
CGCGATTTCG AGCGCCGCGA GCTCGACAAG ATCGGCATGC CGGACGAGAT CGCGCTGCGC
CACCGGCCGA CCCAGTTCGG CCACATCTTC TCCGGCGATC ACTACGCCTC GGGCTATTAC
AGCTACATGT GGTCCGAAGT GATGGACGCC GACGCCTTCG GCGCCTTCGA GGAGGCCGGC
GACATCTTCG ACCCCAAGGT GGCGAAGCGG CTGCGCGACG ACATCTACGC CTCGGGCGGC
TCACGCGATC CGGAGGAGGC CTATATCGCC TTCCGCGGCC GTGCGCCGGA GCCCGACGCG
CTGCTGCGCC GGCGCGGCCT GCTCGAAACC CCGGAGGCGG CGTAG
 
Protein sequence
MSESSGPIAA PTGNPLLQAW TTPFETPPFT EIVPEHFLPA FERAFTDHAA EIAAIANDPT 
EPDFANTITA LERSGKLLNR VAAVFYDLVS AHSNPALLEI DKDVSLRMAR HWNPIMMNAV
LFGRIAALHD KRAELKLTSE ERRLLERTYT RFHRSGAGLD EAAKARMAEI NERLAQLGTN
FSHHLLGDEQ DWFMEIGEGD TEGLPDSFVA AARAAADERG LPGKAVVTLS RSSVEPFLKM
SGRRDLREKV YRAFIARGDN GNANDNNALI GEILGLREES AKLLGYPTFA AYRLEDSMAK
TPEAVRGLLE RVWKPARARA MADRDALQEL VTEEGGNFEL APWDWRFYAE KLRQRRANFD
DAAIKPYLSL DNMIVAAFDT ATRLFGVTFA ERKDVPVWHP DVRVWEVKDA DGSHRGLFYG
DYYARPSKRS GAWMTSLRDQ QKLDGAVAPL IINVCNFSKG ADGEPSLLSP DDARTLFHEF
GHGLHGMMSD VTYPSLSGTS VFTDFVELPS QLYEHWQERP EVLRRFARHY QTGEPLPDDL
LQRFIAARKF NQGFATVEFV SSALLDLEFH TQPAASVGDI RDFERRELDK IGMPDEIALR
HRPTQFGHIF SGDHYASGYY SYMWSEVMDA DAFGAFEEAG DIFDPKVAKR LRDDIYASGG
SRDPEEAYIA FRGRAPEPDA LLRRRGLLET PEAA
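
The gene and protein sequences above are printed in space-separated blocks of ten. As a minimal sketch, the blocks can be joined and the DNA translated to confirm that the two sequences correspond. The helper names `clean` and `translate` are hypothetical. The codon table is the standard one: translation table 11 uses the same amino acid assignments and differs only in the start codons it permits. Only the first 30 and last 15 nucleotides are checked here to keep the example short.

```python
# Compact standard codon table, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate whole codons from position 0; '*' marks a stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3
    for i in range(0, usable, 3):
        first, second, third = dna[i:i + 3]
        index = (16 * BASES.index(first)
                 + 4 * BASES.index(second)
                 + BASES.index(third))
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence above.
gene_head = "ATGTCTGAAA GCTCCGGACC GATTGCCGCG"
gene_tail = "CCGGAGGCGG CGTAG"

print(translate(clean(gene_head)))  # MSESSGPIAA
print(translate(clean(gene_tail)))  # PEAA*
```

The translated head and tail match the first block (`MSESSGPIAA`) and last block (`PEAA`) of the protein sequence, with the trailing `*` corresponding to the TAG stop codon.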