Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PG0503 |
Symbol | dpp |
ID | 2552314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | + |
Start bp | 542100 |
End bp | 544271 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637149268 |
Product | dipeptidyl aminopeptidase IV |
Protein accession | NP_904798 |
Protein GI | 34540319 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.472186 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAC CGGTAATAAT TCTGCTACTT GGCATCGTAA CCATGTGTGC CATGGCTCAG ACGGGGAACA AACCCGTAGA TTTGAAAGAA ATCACGAGCG GAATGTTTTA TGCTCGCAGT GCAGGGAGTG GAATACGCTC CATGCCGGAT GGAGAACACT ATACGGAGAT GAACCGTGAG CGAACGGCTA TCATTCGCTA CAATTATGCC TCCGGCAAGG CGGTAGATAC GCTCTTCAGT GTCGAACGAG CACGCGAATG CCCGTTTAAA CAAATACAGA ACTACGAGGT AAGCAGTACC GGACATCATA TTTTGCTCTT TACGGATATG GAGAGCATCT ATCGGCATTC GTATCGTGCT GCCGTCTACG ACTATGATGT TCGCCGCAAT TTGGTAAAAC CACTGAGCGA GCATGTCGGC AAAGTGATGA TCCCTACATT CAGCCCCGAT GGCCGGATGG TAGCGTTCGT CAGAGACAAT AACATCTTTA TCAAGAAATT CGATTTCGAC ACGGAAGTAC AGGTTACTAC CGATGGGCAG ATCAACTCTA TTCTTAACGG AGCGACGGAT TGGGTGTACG AAGAAGAGTT CGGTGTGACC AATCTGATGA GCTGGAGTGC GGACAATGCT TTTCTGGCTT TTGTGCGCAG CGATGAATCT GCCGTCCCCG AATATCGAAT GCCTATGTAT GAGGACAAAC TTTATCCCGA AGACTATACC TATAAGTATC CCAAGGCAGG GGAGAAGAAT AGTACCGTCT CCCTGCATCT CTATAATGTG GCGGATCGGA ATACCAAGTC GGTAAGCCTG CCGATCGATG CGGATGGATA TATTCCCCGA ATTGCTTTCA CGGACAACGC GGATGAGTTG GCTGTCATGA CACTCAACCG TTTGCAGAAC GACTTCAAAA TGTACTATGT GCATCCGAAG AGTCTCGTCC CCAAGCTGAT ACTACAGGAT ATGAACAAGC GATATGTGGA TAGCGATTGG ATTCAGACCT TGAAGTTTAC GACCGGAGGT GGATTCGCCT ATGTGAGCGA AAAGGATGGG TTTGCCCATA TCTATCTCTA CGATAATAAG GGGGTAATGC ACCGTCGGAT TACCTCAGGA AATTGGGATG TGACCAAACT GTACGGAGTG GATGCTTCGG GAACGGTCTT CTACCAGTCG GCGGAAGAAA GCCCCATCCG TCGAGCTGTC TATGCCATAG ATGCCAAAGG CAGGAAAACA AAGCTCAGCC TGAATGTAGG CACGAATGAT GCTCTCTTTA GTGGCAATTA TGCATACTAT ATTAACACGT ATAGCAGTGC TGCTACCCCA GCGGTGGTTT CGGTATTCAG AAGCAAAGGT GCCAAAGAGC TGCGTACACT GGAGGATAAC GTTGCTCTCC GTGAACGGCT GAAAGCCTAT CGTTACAACC CGAAGGAGTT TACCACTATC AAAACTCAAT CGGGTCTTGA ACTGAATGCC TGGATCGTGA AGCCTATTGA TTTCGATCCC TCTCGCCACT ATCCTGTCCT GATGGTACAG TATAGCGGTC CCAACTCCCA GCAGGTATTG GATCGCTATT CATTCGATTG GGAACACTAC CTTGCATCGA AAGGTTACGT CGTGGCATGT GTGGATGGGC GTGGCACCGG TGCTCGCGGC GAAGAATGGC GCAAGTGTAC CTACATGCAA CTCGGTGTAT TCGAAAGCGA TGATCAGATA GCAGCGGCCA CTGCTATAGG ACAGCTGCCC TATGTGGATG CAGCTCGTAT CGGCATATGG GGGTGGAGCT ATGGCGGCTA TACCACACTA ATGAGTTTGT GTCGGGGAAA TGGTACATTC AAAGCGGGGA TAGCCGTTGC TCCTGTGGCA GACTGGCGTT TCTACGATTC GGTTTACACC GAACGCTTCA TGCGTACACC CAAGGAGAAT GCTTCCGGAT ACAAGATGTC TTCTGCTCTT GATGTGGCAA GCCAATTACA AGGAAACCTC TTGATCGTAA GCGGATCGGC AGACGACAAT GTTCATCTTC AGAACACGAT GCTTTTTACA GAGGCACTGG TTCAGGCCAA TATCCCCTTC GACATGGCTA TCTATATGGA CAAGAACCAT AGTATATACG GGGGGAATAC CCGCTATCAT CTCTATACTC GCAAAGCAAA GTTTTTGTTC GACAATCTTT AA
|
Protein sequence | MKRPVIILLL GIVTMCAMAQ TGNKPVDLKE ITSGMFYARS AGSGIRSMPD GEHYTEMNRE RTAIIRYNYA SGKAVDTLFS VERARECPFK QIQNYEVSST GHHILLFTDM ESIYRHSYRA AVYDYDVRRN LVKPLSEHVG KVMIPTFSPD GRMVAFVRDN NIFIKKFDFD TEVQVTTDGQ INSILNGATD WVYEEEFGVT NLMSWSADNA FLAFVRSDES AVPEYRMPMY EDKLYPEDYT YKYPKAGEKN STVSLHLYNV ADRNTKSVSL PIDADGYIPR IAFTDNADEL AVMTLNRLQN DFKMYYVHPK SLVPKLILQD MNKRYVDSDW IQTLKFTTGG GFAYVSEKDG FAHIYLYDNK GVMHRRITSG NWDVTKLYGV DASGTVFYQS AEESPIRRAV YAIDAKGRKT KLSLNVGTND ALFSGNYAYY INTYSSAATP AVVSVFRSKG AKELRTLEDN VALRERLKAY RYNPKEFTTI KTQSGLELNA WIVKPIDFDP SRHYPVLMVQ YSGPNSQQVL DRYSFDWEHY LASKGYVVAC VDGRGTGARG EEWRKCTYMQ LGVFESDDQI AAATAIGQLP YVDAARIGIW GWSYGGYTTL MSLCRGNGTF KAGIAVAPVA DWRFYDSVYT ERFMRTPKEN ASGYKMSSAL DVASQLQGNL LIVSGSADDN VHLQNTMLFT EALVQANIPF DMAIYMDKNH SIYGGNTRYH LYTRKAKFLF DNL
|
| |