Gene PG1004 details

Gene Information

Locus tag: PG1004
Symbol:
ID: 2553035
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 1060523
End bp: 1062802
Gene Length: 2280 bp
Protein Length: 759 aa
Translation table: 11
GC content: 52%
IMG OID: 637149712
Product: prolyl oligopeptidase family protein
Protein accession: NP_905227
Protein GI: 34540748
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
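
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the function name `check_lengths` is purely illustrative.

```python
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Cross-check the coordinate, gene-length and protein-length fields."""
    # Assumption: coordinates are 1-based and inclusive.
    span = end_bp - start_bp + 1
    # Assumption: the final codon is a stop codon and is not counted
    # in the protein length.
    coding_codons = gene_len_bp // 3 - 1
    return {
        "span_matches_gene_length": span == gene_len_bp,
        "multiple_of_three": gene_len_bp % 3 == 0,
        "protein_length_matches": coding_codons == protein_len_aa,
    }


# Values copied from the Gene Information fields.
result = check_lengths(1060523, 1062802, 2280, 759)
print(result)
```

Under these assumptions all three checks pass: 1062802 - 1060523 + 1 = 2280, and 2280 / 3 - 1 = 759.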


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTAA AAATCCTATT ACTAACAATC CTGACATTAG GAGCAATGAC TGTGCATGCA 
CAAAAGATCA CAGGCGACTG GAAAGGAATG CTCTCCATTC CGCAAGCCAA CATGGAGCTG
GAACTCATAT TCCACATCAC CGGAGAGGGT GCCGACCTCT CCACGACGAT GGATGTACCT
GCTCAGGGAG CAACCGGCAT ACCCGTAGAG AAGACCTCCT TTGCCGATGG CAAACTGACA
CTCTCCGCAG CTGCCCTTCA GTTCACATTC AAGGGCACCC TGTCCGGCAA TACGATAGAA
GGCAATGTAG AGCAGATGGG CTTCAGCCTG CCGCTTACGC TACAACGATT CGAATCCAAA
TTGCCCGGCA ATACAGCCTT GCCTTCGACC GAAGAAGAGC TTAAGGCACT GGCAGCTTTG
GACAAGGGCA ACTACAAATA CAAGGTAGAA GACTACTTTG CCAAACCCAA GGCTTCCGCT
TTTCAGCTAA GCCCCAACGG CAAGTACCTC TCATACATGG AAAAGGACGA TGCCGGCAAA
CGCCATGTCT ATGTCAAGGA AATTGCCACC GGCACCGTCA AGCGTGCCAT CGAAGAAAAG
GACGAACTGA TCAAAGGCTA CGGATGGATC AACGACGAAC GTCTCTTCTT TGTCATGGAC
AAAGGAGGGA ATGAGAACTA TCACCTCTTT GCTTCGAATA TCGACGGCAG CAATACCCGC
GATCTCACCC CCTTTGACGG AGTGAAGGCT TCGATCCTCA ACATGCTCAA AGAGCAGAAG
GACTACATGA TCATATCCAT GAACAAAAAC AATCCGCAGA TCTTCGAACC CTACAAACTG
AATGTAGTAA CAGGCGAGCT GACCCAGCTC TACGAGAATA AGGATGCGGC CAACCCCATT
CAAGGTTACG AGTTCGACAA GGACGGCGAA CTGCGTGGAT ACAGCCGCCT CGTAAACGGG
ATCGAATCCG AGTTGTACTA CAAGGATTTG GCTACGGGCG AGTTCCGTCT GCTGAAGAAA
ACACACTGGG ACGACACCTT CGGAGTCATC GCGTTCAACT ATGCCTCCAA AAACAAAGAC
GAAGCCTATG TACTGACCAA CCTGGACAGC GACAAGACTC GTATCGTACT CTACGACCTG
AAGCAGAACA AGATCATCCG CGAGATCTTC GCCAACGAAG ACTACGACGT CAGCGGCCTG
CACCTCTCTC GTAAGAGAAA CTACGAAATA GACCTCATGG CCTACGAAGG CGAGAAGTCC
GTAGTCGTAC CCGTAAGTGC CACCTACAAA GAGCTGCACA AGCTGATGGA AAAGGAATTC
AAGGGCAAAG AATTCTCCGT GGTCGATTAC GATGATGATG AGACCATCCT GCTTATCGCC
GTACAAAGCG ACAAGCTATA CGGCACCTAC TACCAGTTCG ATACGCGCAC CAAGAAGTTT
ACCCTCCTCT ATGACCTGAT GCCTCAGCTC AAGGAGGAAG ATATGGCCGA GATGCGCCCC
ATCAAATTCA AGAGCCGCGA CGGACTCACT ATCCATGGCT TTATCACTCT GCCGAAAGCA
GCCCTCGAAG GGAAGAAAGT ACCCCTGATC GTCAATCCGC ATGGAGGCCC CCAAGGCATA
CGCGACTCAT GGGGCTTCAA TCCCGAGACC CAGCTCTTCG CCAGCCGCGG ATATGCCACC
CTGCAAGTCA ATTTCCGCAT CTCAGGCGGA TACGGCAAGG AATTCCTCCG TGCCGGATTC
AAACAGATCG GTCGCAAAGC CATGGACGAT GTGGAGGACG GTGTGCGCTA TGCTATCAGC
CAAGGTTGGG TGGATCCTGA CAGGATCGCC ATATACGGTG CCAGCCACGG TGGTTATGCC
ACGCTGATGG GTCTGGTGAA AACACCCGAT CTCTATGCCT GCGGTGTGGA TTACGTAGGT
GTATCGAACA TTTACACCTT CTTCGACTCC TTCCCAGAAT ATTGGAAGCC GTTTAAGGAA
ATGGTCAAGG AAATTTGGTA CGACCTCGAC AATCCGGAGG AAGCAGCTAT CGCCAAGGAA
GTGTCCCCCT TCTTCCAGAT CGACAAGATC AATAAGCCAC TGTTCGTCGT ACAGGGAGCC
AACGACCCGC GCGTGAATAT CAACGAGTCC GATCAGATCG TAACGGCACT GCGTGCCCGC
GGATTCGAAG TACCCTATAT GGTGAAGTAC AACGAAGGCC ACGGATTCCA TCGTGAAGAA
AACTCCATGG AGCTATACCG TGCCATGCTC GGTTTCTTCG CCAAACACCT GAAGAAATAA
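
The GC content field in the record could be recomputed from the gene sequence listing. The following is a minimal sketch, assuming the sequence is pasted as plain text in the space-separated blocks of ten shown above; the helper name `gc_fraction` is hypothetical and is not part of the source database.

```python
def gc_fraction(sequence_text):
    """Return the fraction of G and C bases in a whitespace-formatted sequence."""
    # Remove the spaces and line breaks used to lay out the listing.
    seq = "".join(sequence_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Illustration on the first ten-base block of the listing only.
first_block = "ATGAAACTAA"
print(f"{gc_fraction(first_block):.0%}")  # prints "20%"
```

Passing the complete listing to the same function gives a whole-gene value that can be compared with the reported percentage.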
 
Protein sequence
MKLKILLLTI LTLGAMTVHA QKITGDWKGM LSIPQANMEL ELIFHITGEG ADLSTTMDVP 
AQGATGIPVE KTSFADGKLT LSAAALQFTF KGTLSGNTIE GNVEQMGFSL PLTLQRFESK
LPGNTALPST EEELKALAAL DKGNYKYKVE DYFAKPKASA FQLSPNGKYL SYMEKDDAGK
RHVYVKEIAT GTVKRAIEEK DELIKGYGWI NDERLFFVMD KGGNENYHLF ASNIDGSNTR
DLTPFDGVKA SILNMLKEQK DYMIISMNKN NPQIFEPYKL NVVTGELTQL YENKDAANPI
QGYEFDKDGE LRGYSRLVNG IESELYYKDL ATGEFRLLKK THWDDTFGVI AFNYASKNKD
EAYVLTNLDS DKTRIVLYDL KQNKIIREIF ANEDYDVSGL HLSRKRNYEI DLMAYEGEKS
VVVPVSATYK ELHKLMEKEF KGKEFSVVDY DDDETILLIA VQSDKLYGTY YQFDTRTKKF
TLLYDLMPQL KEEDMAEMRP IKFKSRDGLT IHGFITLPKA ALEGKKVPLI VNPHGGPQGI
RDSWGFNPET QLFASRGYAT LQVNFRISGG YGKEFLRAGF KQIGRKAMDD VEDGVRYAIS
QGWVDPDRIA IYGASHGGYA TLMGLVKTPD LYACGVDYVG VSNIYTFFDS FPEYWKPFKE
MVKEIWYDLD NPEEAAIAKE VSPFFQIDKI NKPLFVVQGA NDPRVNINES DQIVTALRAR
GFEVPYMVKY NEGHGFHREE NSMELYRAML GFFAKHLKK
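
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below assumes the listing is read in frame from its first base. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may start translation, so a standard codon table is used here. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text):
    """Translate a whitespace-formatted DNA listing, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Unknown or ambiguous codons are reported as "X".
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listing.
first_row = "ATGAAACTAA AAATCCTATT ACTAACAATC CTGACATTAG GAGCAATGAC TGTGCATGCA"
print(translate(first_row))  # prints "MKLKILLLTILTLGAMTVHA"
```

The output matches the first twenty residues of the protein sequence listing.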