Gene PG0137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPG0137 
SymbolpepD-1 
ID2552227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePorphyromonas gingivalis W83 
KingdomBacteria 
Replicon accessionNC_002950 
Strand
Start bp159244 
End bp160698 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content51% 
IMG OID637148946 
Productaminoacyl-histidine dipeptidase 
Protein accessionNP_904479 
Protein GI34540000 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01893] aminoacyl-histidine dipeptidase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.49846 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACAG AGATCAAAAC CCTCAAACCA CAAGCTGTCT GGGAGTACTT CTACGACCTG 
ACACAAATAC CCCGACCTAC CGGACAGATG GACGAGGTGA CCAAGTACGT ATTCGACTTC
GGTAAGAGGC TTGGTTTGGA GACCGAGCAG GACGAGGTGG GCAATGTCAT CATTCGCAAG
CCGGCCACAC CGGGCATGGA AAACAAACCG ATCGTGACCC TCCAATCGCA CTTGGATATG
GTGCCTCAAA AGAATTCGGA CATCGATCAT GATTTCACCA AAGACCCCAT CGATGCATAT
ATCGACGGAG AGTGGGTGAA AGCTCGAGGC ACTACTCTGG GAGCCGATAA CGGTATCGGA
GTGGCTTATG CCATGGCGGC TATGGCAGAC CCAATGCTCA AACATGGTCC GCTGGAAGCT
CTTTTCACTA TTAATGAAGA AGTGGGAATG GACGGTGCCA ATGGTCTCAA GCCCGGGTTC
TCTCGTGGTG ACATCCTGCT CAATATAGAT TCGGAGGAAG AAGGCAAGCT CTTCGTAGGC
TGTGCGGGAG GTATTGATGT CAATGTCACG CTCGAATATA AGGAGGAAGA GCTGGTTTCG
GACGAAGAAA TAGGCGTAAA GATATCTTTG ACAGGACTCA AAGGCGGACA TTCGGGTGTC
GATATTCACC TCGGACGTGC CAATGCCAAT AAACTGATGT TCAGATTCTT GAAAGAGGCT
GTAGGATTCT ATGGTGCTCG CTTGGCATGG GTAGAGGGAG GATCGCTCCG CAATGCTATT
CCGCGCGAAG CGTTTGCCGT TATCACCATT CAGGAAGAAG AGGCCGAAGC TGTCTGGGAA
CTGGTATCCG ACTACCAGGA TTTGTTCCGT AAGGAATTCA GCGGCATAGA GGAAAACATC
AAGTTCGAGG CAAGTCGTAT CGAATGTCCC CGAATGATTA TTCCCGAAGA GATCCAAGAC
TGTCTGATCA ATGCCGTGGA AGCCTGCGTC AATGGCCCGC AGTCCATGCT CCAAGATTTC
CCCGGTACGG TAGAGTCATC ATCCAATCTG GCGTTGATCA CGGCTAAGGA AGGGTCAATC
TCCGTACGTT TCCTGGTGCG TAGCTCTTCC GAATCGCACA AGATGTGGGT AGCATCGGCC
ATCGAGAGCG TTTTCTCATT GGCAGGAGCA CGAGTGGAAT TCGACGCCTC GTACAACGGC
TGGCAACCTA ATATCCAGTC GCACATTCTC GAAGTGATGA GCAAAGTCTT CGAAGAGTAT
TACGGCCAAA AGCCCGAAGT ACAGGTGATG CATGCCGGTC TGGAATGCGG TATCATCCAA
GGCGTAATGC CGGATATGGA CATGATCTCG GTAGGTCCCG AACTCCAATC GCCACACTCT
CCGGACGAAA GGATTCATAT CGAATCCGTA GCTCGCACAT GGGAAGTACT GGTAAAAGTA
CTGGAGCGAG TGTAA
 
Protein sequence
MSTEIKTLKP QAVWEYFYDL TQIPRPTGQM DEVTKYVFDF GKRLGLETEQ DEVGNVIIRK 
PATPGMENKP IVTLQSHLDM VPQKNSDIDH DFTKDPIDAY IDGEWVKARG TTLGADNGIG
VAYAMAAMAD PMLKHGPLEA LFTINEEVGM DGANGLKPGF SRGDILLNID SEEEGKLFVG
CAGGIDVNVT LEYKEEELVS DEEIGVKISL TGLKGGHSGV DIHLGRANAN KLMFRFLKEA
VGFYGARLAW VEGGSLRNAI PREAFAVITI QEEEAEAVWE LVSDYQDLFR KEFSGIEENI
KFEASRIECP RMIIPEEIQD CLINAVEACV NGPQSMLQDF PGTVESSSNL ALITAKEGSI
SVRFLVRSSS ESHKMWVASA IESVFSLAGA RVEFDASYNG WQPNIQSHIL EVMSKVFEEY
YGQKPEVQVM HAGLECGIIQ GVMPDMDMIS VGPELQSPHS PDERIHIESV ARTWEVLVKV
LERV