Gene PG0503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPG0503 
Symboldpp 
ID2552314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePorphyromonas gingivalis W83 
KingdomBacteria 
Replicon accessionNC_002950 
Strand
Start bp542100 
End bp544271 
Gene Length2172 bp 
Protein Length723 aa 
Translation table11 
GC content49% 
IMG OID637149268 
Productdipeptidyl aminopeptidase IV 
Protein accessionNP_904798 
Protein GI34540319 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.472186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAC CGGTAATAAT TCTGCTACTT GGCATCGTAA CCATGTGTGC CATGGCTCAG 
ACGGGGAACA AACCCGTAGA TTTGAAAGAA ATCACGAGCG GAATGTTTTA TGCTCGCAGT
GCAGGGAGTG GAATACGCTC CATGCCGGAT GGAGAACACT ATACGGAGAT GAACCGTGAG
CGAACGGCTA TCATTCGCTA CAATTATGCC TCCGGCAAGG CGGTAGATAC GCTCTTCAGT
GTCGAACGAG CACGCGAATG CCCGTTTAAA CAAATACAGA ACTACGAGGT AAGCAGTACC
GGACATCATA TTTTGCTCTT TACGGATATG GAGAGCATCT ATCGGCATTC GTATCGTGCT
GCCGTCTACG ACTATGATGT TCGCCGCAAT TTGGTAAAAC CACTGAGCGA GCATGTCGGC
AAAGTGATGA TCCCTACATT CAGCCCCGAT GGCCGGATGG TAGCGTTCGT CAGAGACAAT
AACATCTTTA TCAAGAAATT CGATTTCGAC ACGGAAGTAC AGGTTACTAC CGATGGGCAG
ATCAACTCTA TTCTTAACGG AGCGACGGAT TGGGTGTACG AAGAAGAGTT CGGTGTGACC
AATCTGATGA GCTGGAGTGC GGACAATGCT TTTCTGGCTT TTGTGCGCAG CGATGAATCT
GCCGTCCCCG AATATCGAAT GCCTATGTAT GAGGACAAAC TTTATCCCGA AGACTATACC
TATAAGTATC CCAAGGCAGG GGAGAAGAAT AGTACCGTCT CCCTGCATCT CTATAATGTG
GCGGATCGGA ATACCAAGTC GGTAAGCCTG CCGATCGATG CGGATGGATA TATTCCCCGA
ATTGCTTTCA CGGACAACGC GGATGAGTTG GCTGTCATGA CACTCAACCG TTTGCAGAAC
GACTTCAAAA TGTACTATGT GCATCCGAAG AGTCTCGTCC CCAAGCTGAT ACTACAGGAT
ATGAACAAGC GATATGTGGA TAGCGATTGG ATTCAGACCT TGAAGTTTAC GACCGGAGGT
GGATTCGCCT ATGTGAGCGA AAAGGATGGG TTTGCCCATA TCTATCTCTA CGATAATAAG
GGGGTAATGC ACCGTCGGAT TACCTCAGGA AATTGGGATG TGACCAAACT GTACGGAGTG
GATGCTTCGG GAACGGTCTT CTACCAGTCG GCGGAAGAAA GCCCCATCCG TCGAGCTGTC
TATGCCATAG ATGCCAAAGG CAGGAAAACA AAGCTCAGCC TGAATGTAGG CACGAATGAT
GCTCTCTTTA GTGGCAATTA TGCATACTAT ATTAACACGT ATAGCAGTGC TGCTACCCCA
GCGGTGGTTT CGGTATTCAG AAGCAAAGGT GCCAAAGAGC TGCGTACACT GGAGGATAAC
GTTGCTCTCC GTGAACGGCT GAAAGCCTAT CGTTACAACC CGAAGGAGTT TACCACTATC
AAAACTCAAT CGGGTCTTGA ACTGAATGCC TGGATCGTGA AGCCTATTGA TTTCGATCCC
TCTCGCCACT ATCCTGTCCT GATGGTACAG TATAGCGGTC CCAACTCCCA GCAGGTATTG
GATCGCTATT CATTCGATTG GGAACACTAC CTTGCATCGA AAGGTTACGT CGTGGCATGT
GTGGATGGGC GTGGCACCGG TGCTCGCGGC GAAGAATGGC GCAAGTGTAC CTACATGCAA
CTCGGTGTAT TCGAAAGCGA TGATCAGATA GCAGCGGCCA CTGCTATAGG ACAGCTGCCC
TATGTGGATG CAGCTCGTAT CGGCATATGG GGGTGGAGCT ATGGCGGCTA TACCACACTA
ATGAGTTTGT GTCGGGGAAA TGGTACATTC AAAGCGGGGA TAGCCGTTGC TCCTGTGGCA
GACTGGCGTT TCTACGATTC GGTTTACACC GAACGCTTCA TGCGTACACC CAAGGAGAAT
GCTTCCGGAT ACAAGATGTC TTCTGCTCTT GATGTGGCAA GCCAATTACA AGGAAACCTC
TTGATCGTAA GCGGATCGGC AGACGACAAT GTTCATCTTC AGAACACGAT GCTTTTTACA
GAGGCACTGG TTCAGGCCAA TATCCCCTTC GACATGGCTA TCTATATGGA CAAGAACCAT
AGTATATACG GGGGGAATAC CCGCTATCAT CTCTATACTC GCAAAGCAAA GTTTTTGTTC
GACAATCTTT AA
 
Protein sequence
MKRPVIILLL GIVTMCAMAQ TGNKPVDLKE ITSGMFYARS AGSGIRSMPD GEHYTEMNRE 
RTAIIRYNYA SGKAVDTLFS VERARECPFK QIQNYEVSST GHHILLFTDM ESIYRHSYRA
AVYDYDVRRN LVKPLSEHVG KVMIPTFSPD GRMVAFVRDN NIFIKKFDFD TEVQVTTDGQ
INSILNGATD WVYEEEFGVT NLMSWSADNA FLAFVRSDES AVPEYRMPMY EDKLYPEDYT
YKYPKAGEKN STVSLHLYNV ADRNTKSVSL PIDADGYIPR IAFTDNADEL AVMTLNRLQN
DFKMYYVHPK SLVPKLILQD MNKRYVDSDW IQTLKFTTGG GFAYVSEKDG FAHIYLYDNK
GVMHRRITSG NWDVTKLYGV DASGTVFYQS AEESPIRRAV YAIDAKGRKT KLSLNVGTND
ALFSGNYAYY INTYSSAATP AVVSVFRSKG AKELRTLEDN VALRERLKAY RYNPKEFTTI
KTQSGLELNA WIVKPIDFDP SRHYPVLMVQ YSGPNSQQVL DRYSFDWEHY LASKGYVVAC
VDGRGTGARG EEWRKCTYMQ LGVFESDDQI AAATAIGQLP YVDAARIGIW GWSYGGYTTL
MSLCRGNGTF KAGIAVAPVA DWRFYDSVYT ERFMRTPKEN ASGYKMSSAL DVASQLQGNL
LIVSGSADDN VHLQNTMLFT EALVQANIPF DMAIYMDKNH SIYGGNTRYH LYTRKAKFLF
DNL