Gene Rsph17025_0514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0514 
Symbol 
ID5082708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp514197 
End bp516215 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content70% 
IMG OID640482068 
Productpeptidyl-dipeptidase Dcp 
Protein accessionYP_001166725 
Protein GI146276566 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0408972 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0574959 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCATC CGCTGCTTGC GCCCTGGACC GCGCCATTCG CCCTGCCTCC CTTCGCCGAG 
ATCCGCGACG AGCAGTTCGG TCCGGCCTTC GAGGCGGGAC TTGCCGAGGC GCGCGCCAAC
ATCCGCGCCA TCGCCGACAA TCCCGAAGCG CCGAGCTTTG CCAACACGAT CGAGGCGCTT
GAGTTGGCCC AGGAGACGCT CGACCGGGTG GCGGGCGTCT TCTACAACCT CGCCGGGGCC
GACAGCAACG CGGCGCGTGA GGCGCTCCAG CGCGAGCTGG CGCCGAGGAT GTCGGCCTTC
TCCTCCGAGG TGGTGACCAA CCGCCCGCTC TTCCAGCGGA TCGAGACGCT CTGGCAGCAG
CGCGACGGGC TCGGCCTCAC GTCCGAACAG GAGCGGGTGC TGATGCTCTA CCGGCGGATG
TTCGTGCGCT CGGGCGCCCG GCTCGAGGGG GCCGAGGCCG AGCGGCTGAC CGAGGTCAAG
GCGCGGCTGG CGGTGCTGGG CACCACGTTC GCGCAGAACC TGCTGGCCGA TGAGCGCGAG
TGGATGATGC CGCTGGCCGA AGAGGATCTG GAGGGGCTGC CGGAGTTCGT GGTCGAGACC
GCCCGCGCGG CGGGCGCCGA ACGCGGGGCC GAAGGGCCGG TCGTCACGCT CAACCGCTCG
CTGATCGTGC CCTTCCTGCA ATTCTCGCCG CGGCGCGAGC TGCGCCGGCG CGCCTATGAG
GCCTGGGTTT CGCGGGGGGC CAACGGCAAC GCCACCGACA ACCGCGCCGT GGCGGCCGAG
ATCCTGGCGC TGCGCGAGGA GCGGGCGAAG CTCCTCGGCT ATCCGGGCTT TGCGGCCTAC
AAGCTCGAGA CCGAAATGGC CAAGACCCCC GACGCGGTGC GAGAGCTTCT GCTGCGCGTC
TGGGAGCCTG CCAAGGCGCG GGCCGAGGCG GACGGGGCCG TGCTCGAGGC GATGATGCAC
CGCGACGGGA TCAACGGCGA TCTCGAACCC TGGGACTGGC GCTACTATTC CGAGAAGCGC
CGCGCGGCCG AGTTCGACCT CGACGAGGCG GCGCTGAAAC CCTACCTGCC GCTCGAGCGG
ATGATCGAGG CGGCCTTCGA CTGCGCGCAC CGCCTCTTCG GGCTGGAATT CCGGCCGCTC
GACGTGCCGC TCTACCACCC GGACGTGCGC GCCTGGGAGG TGACGCGCGA GGGCCGGCAC
ATGGCGGTCT TCCTCGGCGA CTGGTTCGCG CGCGCCTCGA AACGCTCCGG CGCCTGGTGC
TCGACCATGC GGGGGCAGCG CAAGCTTGGC GGCGAGGTGC GGCCCATCGT GGTCAATGTC
TGCAACTTCG CCAAGGGCGA GCCGGCGCTG CTGTCGTGGG ACGATGCGCG CACGCTCTTT
CACGAGTTCG GCCACGCGCT GCACCAGATG CTCTCGGACG TGACCTACGG CTACATCTCG
GGCACCTCGG TTGCGCGCGA TTTCGTCGAA CTGCCGAGCC AGCTTTACGA ACATTGGCTC
GAGGTGCCCG AGGTGCTGGA ACGGCACGCG CGCCACTGGC AGACGGACGA GCCGATGCCG
GCCGAGACGC GGGAACGGCT GCTCGCCGCC TCGACCTACG ACCAGGGCTT TGCGACCGTC
GAGTTCATCT CGTCGGCCAT GGTGGATCTG GCGTTCCACG AGGGTGAGGC CCCGGCCGAT
CCGATGGCGC GGCAGGCCGA GGTGCTCGAG AGCCTCGGGA TGCCCCGGGC AATCCGTATG
CGCCACGCGA CGCCGCACTT TGCGCATGTC TTCACTGGCG ACGGCTATTC CGCGGGCTAC
TACAGTTACA TGTGGTCCGA GGTGATGGAT GCGGACGCCT TCGCCGCCTT CGAGGAGGCG
GGAGGCGCCT TCAGCCCCGA GATGGCACGG CGGCTCGAGC GGCATGTGCT GTCGGCCGGC
GGGTCGGATG AGGCAGAGGC GCTCTACACC GCCTTCCGCG GCCGGATGCC GGGGGTGGAG
GCGCTGCTTC GCGGCCGCGG ACTGCTCGAC GCCGCCTGA
 
Protein sequence
MTHPLLAPWT APFALPPFAE IRDEQFGPAF EAGLAEARAN IRAIADNPEA PSFANTIEAL 
ELAQETLDRV AGVFYNLAGA DSNAAREALQ RELAPRMSAF SSEVVTNRPL FQRIETLWQQ
RDGLGLTSEQ ERVLMLYRRM FVRSGARLEG AEAERLTEVK ARLAVLGTTF AQNLLADERE
WMMPLAEEDL EGLPEFVVET ARAAGAERGA EGPVVTLNRS LIVPFLQFSP RRELRRRAYE
AWVSRGANGN ATDNRAVAAE ILALREERAK LLGYPGFAAY KLETEMAKTP DAVRELLLRV
WEPAKARAEA DGAVLEAMMH RDGINGDLEP WDWRYYSEKR RAAEFDLDEA ALKPYLPLER
MIEAAFDCAH RLFGLEFRPL DVPLYHPDVR AWEVTREGRH MAVFLGDWFA RASKRSGAWC
STMRGQRKLG GEVRPIVVNV CNFAKGEPAL LSWDDARTLF HEFGHALHQM LSDVTYGYIS
GTSVARDFVE LPSQLYEHWL EVPEVLERHA RHWQTDEPMP AETRERLLAA STYDQGFATV
EFISSAMVDL AFHEGEAPAD PMARQAEVLE SLGMPRAIRM RHATPHFAHV FTGDGYSAGY
YSYMWSEVMD ADAFAAFEEA GGAFSPEMAR RLERHVLSAG GSDEAEALYT AFRGRMPGVE
ALLRGRGLLD AA