Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_0514 |
Symbol | |
ID | 5082708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 514197 |
End bp | 516215 |
Gene Length | 2019 bp |
Protein Length | 672 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640482068 |
Product | peptidyl-dipeptidase Dcp |
Protein accession | YP_001166725 |
Protein GI | 146276566 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0408972 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0574959 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCATC CGCTGCTTGC GCCCTGGACC GCGCCATTCG CCCTGCCTCC CTTCGCCGAG ATCCGCGACG AGCAGTTCGG TCCGGCCTTC GAGGCGGGAC TTGCCGAGGC GCGCGCCAAC ATCCGCGCCA TCGCCGACAA TCCCGAAGCG CCGAGCTTTG CCAACACGAT CGAGGCGCTT GAGTTGGCCC AGGAGACGCT CGACCGGGTG GCGGGCGTCT TCTACAACCT CGCCGGGGCC GACAGCAACG CGGCGCGTGA GGCGCTCCAG CGCGAGCTGG CGCCGAGGAT GTCGGCCTTC TCCTCCGAGG TGGTGACCAA CCGCCCGCTC TTCCAGCGGA TCGAGACGCT CTGGCAGCAG CGCGACGGGC TCGGCCTCAC GTCCGAACAG GAGCGGGTGC TGATGCTCTA CCGGCGGATG TTCGTGCGCT CGGGCGCCCG GCTCGAGGGG GCCGAGGCCG AGCGGCTGAC CGAGGTCAAG GCGCGGCTGG CGGTGCTGGG CACCACGTTC GCGCAGAACC TGCTGGCCGA TGAGCGCGAG TGGATGATGC CGCTGGCCGA AGAGGATCTG GAGGGGCTGC CGGAGTTCGT GGTCGAGACC GCCCGCGCGG CGGGCGCCGA ACGCGGGGCC GAAGGGCCGG TCGTCACGCT CAACCGCTCG CTGATCGTGC CCTTCCTGCA ATTCTCGCCG CGGCGCGAGC TGCGCCGGCG CGCCTATGAG GCCTGGGTTT CGCGGGGGGC CAACGGCAAC GCCACCGACA ACCGCGCCGT GGCGGCCGAG ATCCTGGCGC TGCGCGAGGA GCGGGCGAAG CTCCTCGGCT ATCCGGGCTT TGCGGCCTAC AAGCTCGAGA CCGAAATGGC CAAGACCCCC GACGCGGTGC GAGAGCTTCT GCTGCGCGTC TGGGAGCCTG CCAAGGCGCG GGCCGAGGCG GACGGGGCCG TGCTCGAGGC GATGATGCAC CGCGACGGGA TCAACGGCGA TCTCGAACCC TGGGACTGGC GCTACTATTC CGAGAAGCGC CGCGCGGCCG AGTTCGACCT CGACGAGGCG GCGCTGAAAC CCTACCTGCC GCTCGAGCGG ATGATCGAGG CGGCCTTCGA CTGCGCGCAC CGCCTCTTCG GGCTGGAATT CCGGCCGCTC GACGTGCCGC TCTACCACCC GGACGTGCGC GCCTGGGAGG TGACGCGCGA GGGCCGGCAC ATGGCGGTCT TCCTCGGCGA CTGGTTCGCG CGCGCCTCGA AACGCTCCGG CGCCTGGTGC TCGACCATGC GGGGGCAGCG CAAGCTTGGC GGCGAGGTGC GGCCCATCGT GGTCAATGTC TGCAACTTCG CCAAGGGCGA GCCGGCGCTG CTGTCGTGGG ACGATGCGCG CACGCTCTTT CACGAGTTCG GCCACGCGCT GCACCAGATG CTCTCGGACG TGACCTACGG CTACATCTCG GGCACCTCGG TTGCGCGCGA TTTCGTCGAA CTGCCGAGCC AGCTTTACGA ACATTGGCTC GAGGTGCCCG AGGTGCTGGA ACGGCACGCG CGCCACTGGC AGACGGACGA GCCGATGCCG GCCGAGACGC GGGAACGGCT GCTCGCCGCC TCGACCTACG ACCAGGGCTT TGCGACCGTC GAGTTCATCT CGTCGGCCAT GGTGGATCTG GCGTTCCACG AGGGTGAGGC CCCGGCCGAT CCGATGGCGC GGCAGGCCGA GGTGCTCGAG AGCCTCGGGA TGCCCCGGGC AATCCGTATG CGCCACGCGA CGCCGCACTT TGCGCATGTC TTCACTGGCG ACGGCTATTC CGCGGGCTAC TACAGTTACA TGTGGTCCGA GGTGATGGAT GCGGACGCCT TCGCCGCCTT CGAGGAGGCG GGAGGCGCCT TCAGCCCCGA GATGGCACGG CGGCTCGAGC GGCATGTGCT GTCGGCCGGC GGGTCGGATG AGGCAGAGGC GCTCTACACC GCCTTCCGCG GCCGGATGCC GGGGGTGGAG GCGCTGCTTC GCGGCCGCGG ACTGCTCGAC GCCGCCTGA
|
Protein sequence | MTHPLLAPWT APFALPPFAE IRDEQFGPAF EAGLAEARAN IRAIADNPEA PSFANTIEAL ELAQETLDRV AGVFYNLAGA DSNAAREALQ RELAPRMSAF SSEVVTNRPL FQRIETLWQQ RDGLGLTSEQ ERVLMLYRRM FVRSGARLEG AEAERLTEVK ARLAVLGTTF AQNLLADERE WMMPLAEEDL EGLPEFVVET ARAAGAERGA EGPVVTLNRS LIVPFLQFSP RRELRRRAYE AWVSRGANGN ATDNRAVAAE ILALREERAK LLGYPGFAAY KLETEMAKTP DAVRELLLRV WEPAKARAEA DGAVLEAMMH RDGINGDLEP WDWRYYSEKR RAAEFDLDEA ALKPYLPLER MIEAAFDCAH RLFGLEFRPL DVPLYHPDVR AWEVTREGRH MAVFLGDWFA RASKRSGAWC STMRGQRKLG GEVRPIVVNV CNFAKGEPAL LSWDDARTLF HEFGHALHQM LSDVTYGYIS GTSVARDFVE LPSQLYEHWL EVPEVLERHA RHWQTDEPMP AETRERLLAA STYDQGFATV EFISSAMVDL AFHEGEAPAD PMARQAEVLE SLGMPRAIRM RHATPHFAHV FTGDGYSAGY YSYMWSEVMD ADAFAAFEEA GGAFSPEMAR RLERHVLSAG GSDEAEALYT AFRGRMPGVE ALLRGRGLLD AA
|
| |