Gene Rsph17029_2249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2249 
Symbol 
ID4896399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2381523 
End bp2383541 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content70% 
IMG OID640112843 
Productpeptidyl-dipeptidase Dcp 
Protein accessionYP_001044124 
Protein GI126463010 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0488613 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.482949 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAACC CGTTGCTCGC CCCCTGGACC ACGCCCTTTG CCCTGCCGCC CTTCGCCGAG 
ATCCGCGACG ACCAGTTCGG TCCCGCCATC GAGGCGGGGC TCGCGGAGGC GCGCCGGGCG
ATCGCGGCCA TCGCGGACAA TCCCGAAGCG CCCACCTTCG CCAATACGAT CGAGGCGCTG
GAGCTGGCCG AGGAGACGCT CGACCGGGTG GCGGGGGTGT TCTACAACCT CGCCGGCGCC
GACAGCAACG CGGCGCGCGA GGCGCTGCAG CGCGAGCTTG CGCCGAAGAT GTCGGCCTTT
TCCTCCGAGA TCGTGAACAA CCGCCCGCTC TTCCGGCGGA TCGAGACGCT CTGGCAGGGG
CGGGAGGCGC TGGGCCTTTC GCCCGAGCAG GAGCGGGTGC TGATGCTCTA CCGGCGGATG
TTCGTGCGTT CGGGCGCCGA GCTGGAGGGG GCCGAGGCCG AGAAGTTGAC AGCTGTCAAG
GCGCGGCTCG CGGTGCTGGG CACCACCTTC TCGCAGAACC TGCTGGCCGA TGAGCGCGAC
TGGACGATGA CGCTCGCCGA GGAGGATCTG GAGGGGCTGC CCGATTTCGT GATCGAGACG
GCGCGGGCGG CGGGCGCCGA GCGCGGCGCT GGAGGGCCGA TTGTCACGCT CAACCGCTCG
CTGATCGTGC CCTTCCTGCA GTTCTCGCCC CGGCGCGAGC TGCGGCGGCG GGCTTACGAG
GCCTGGGTGG CGCGGGGGGC GAACGGCAAT GCCTCCGACA ACCGGGCGGT CGCGGCGGAG
ATCCTTGCGC TGCGGTCGGA GCGGGCGCAG CTGCTGGGCT ACCCGGACTT TGCCTCCTAC
AAGCTCGAGA CCGAGATGGC CAAGACGCCG GAGGCGGTGC GCGAGCTGCT GCTGCGGGTC
TGGACACCCG CCAAGGCCCG CGCCGAGGCG GATCGGGCGG TGCTCGAGGC CATGATGCAC
CGCGACGGGA TCAACGCCGA TCTCGAGCCC TGGGACTGGC GCTATTATTC CGAGAAGCGC
CGCGCGGCCG AGTTCGATCT CGATGAGGCG GCGCTGAAGC CCTACCTGCC GCTGGAGCGG
ATGATCGAGG CGGCCTTCGA CTGCGCACGG CGCCTGTTCG GGCTGGAGTT CCGGCCGCTC
GACGTGCCGC TCTATCACCC GGACGTGCGC GCCTGGGAGG TCACGCGCGA GGGGCGGCAC
ATGGCGGTGT TCCTCGGCGA CTATTTTGCG CGGTCCTCGA AACGGTCGGG CGCCTGGTGT
TCGACCATGC GCGGGCAGCG CAGGCTCGGC GGCGAGGTGC GGCCGATCGT GGTCAATGTC
TGCAACTTCG CCAAGGGCGA GCCCGCGCTT CTGTCCTGGG ACGATGCGCG GACGCTGTTC
CACGAGTTCG GCCATGCGCT GCACCAGATG CTGTCGGACG TGACCTACGG CTTCATCTCG
GGCACGTCGG TGGCGCGCGA CTTCGTGGAG CTGCCGAGCC AGCTCTACGA GCACTGGCTC
GAGGTGCCGG AGGTGCTGGA GGCCCATGCG CGGCACTGGC AGACCGGGGC GCCGATGCCG
GCCGAGATGC GCGAGCGGCT GCTCGCGGCC TCGACCTACG ATCAGGGCTT TGCCACGGTC
GAATTCATCT CTTCCGCGAT GGTGGATCTG GCCTTCCACG AGGGCGCGCC CCCGGCCGAT
CCGATGGCGA AGCAGGCGGA GGTGCTGGCG GGGCTCGGGA TGCCGAAGGC GATCCGGATG
CGCCACGCGA CGCCGCATTT CGCGCATGTC TTCTCTGGCG ACGGCTATTC CGCGGGCTAT
TACAGTTACA TGTGGTCCGA GGTGATGGAT GCGGACGCGT TCGCGGCCTT CGAGGAGGCG
GGCGGGGCCT TCGATCCCGA GATGGCGCGC AAGCTCGAGC GGCATGTGCT CTCGGCCGGC
GGATCGGAGG AGGCGGAGGC GCTCTATACC GCGTTCCGCG GCCGGATGCC GGGGGTGGAG
GCGCTGCTGC GGGGCCGGGG GCTGCTCGAC GCCGCGTGA
 
Protein sequence
MPNPLLAPWT TPFALPPFAE IRDDQFGPAI EAGLAEARRA IAAIADNPEA PTFANTIEAL 
ELAEETLDRV AGVFYNLAGA DSNAAREALQ RELAPKMSAF SSEIVNNRPL FRRIETLWQG
REALGLSPEQ ERVLMLYRRM FVRSGAELEG AEAEKLTAVK ARLAVLGTTF SQNLLADERD
WTMTLAEEDL EGLPDFVIET ARAAGAERGA GGPIVTLNRS LIVPFLQFSP RRELRRRAYE
AWVARGANGN ASDNRAVAAE ILALRSERAQ LLGYPDFASY KLETEMAKTP EAVRELLLRV
WTPAKARAEA DRAVLEAMMH RDGINADLEP WDWRYYSEKR RAAEFDLDEA ALKPYLPLER
MIEAAFDCAR RLFGLEFRPL DVPLYHPDVR AWEVTREGRH MAVFLGDYFA RSSKRSGAWC
STMRGQRRLG GEVRPIVVNV CNFAKGEPAL LSWDDARTLF HEFGHALHQM LSDVTYGFIS
GTSVARDFVE LPSQLYEHWL EVPEVLEAHA RHWQTGAPMP AEMRERLLAA STYDQGFATV
EFISSAMVDL AFHEGAPPAD PMAKQAEVLA GLGMPKAIRM RHATPHFAHV FSGDGYSAGY
YSYMWSEVMD ADAFAAFEEA GGAFDPEMAR KLERHVLSAG GSEEAEALYT AFRGRMPGVE
ALLRGRGLLD AA