Gene PG2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPG2003 
Symboldgt 
ID2552100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePorphyromonas gingivalis W83 
KingdomBacteria 
Replicon accessionNC_002950 
Strand
Start bp2091738 
End bp2093081 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content54% 
IMG OID637150585 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionNP_906075 
Protein GI34541596 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.140408 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATGA TGCAGTGGGA GCGATTGCTC TCCGACAAAA GGGTGGGCAT GGAGCACTAC 
CACCAACCGA AACAATCGGC TCAGCCATGG GAGAGAACGG ACTTCGAACG AGATTATGAC
CGCATGGTCT TTTCTTCACC TTTTCGCCGA CTGCAAAACA AGGCACAGGT ATTCCCCCTT
GCAGGCAATA TATTCGTACA CAATCGTCTT ACGCACAGTC TCGAAGTGAG CTGTGTGGGA
CGCTCGCTCG GCAATAATAT CACACGCGGA CTGAAAGCGC GGTATGGCGA ATTGCCATGG
GAGTCGGGGG CCATCAGTGC CATCGTGTCG GCTGCCTGTC TGGCTCACGA CATGGGCAAT
CCTCCTTTCG GTCACAGCGG CGAGCGTGCC ATTTCGGCTT ATTTCCGCGA AGGAAAGGGT
AGGGTATGGG AGGACGCAGT CCGCAAGGAA GGAGGTCGCT GGGAAGACTT CCTGCACTTC
GAAGGCAATG CAAACGCTTT CCGTCTGCTG ACGCATCAGT TCGAAGGCCG TCGGAAAGGA
GGCTTTGCCC TTACGTACAG TACGGTGGCT TCGATCGTGA AGTATCCATA CAGCAGCGAG
ATGGCAGGCA AGGCCGGCAA ATTCGGATTC TTCGCCACGG AAGAGGATAC TTTCCTTATG
ATCGCCGAAG AGCTTGGCAT GCTATGCCGA AACGAACATC CGGTGAAATA CGTTCGCTAT
CCGCTGGTCT ATCTGGTGGA GGCAGCCGAC GACATCTGCT ACCAAATCAT GGATATAGAG
GATGCTTATA AGCTCCGTAT CCTTACTTAT GCCGAGACGG AAAACCTCTT TCTGGCTTTT
TGTCCCGAGG AGGAACTCGG GCATATACGG GGAGTACTCG ATCATATCAC GGATGCCAAT
GAGCAGATCG CTTACCTCCG TTCGCGTGTG ATCAGTCTGT TGGTCGAATC ATGCACCCGG
GTCTTTCTGG ATCATGAGGA CGAGATATTG TCCGGTACCT TCTCCGGCAC TCTCATCGGA
GCCATGGAAC CACGGCTGAA AGAGGCTTAC GGGGCATGTG CCCGAATGGC CTATTCGAAG
ATCTACGTGG CACGCGACGT GGTGGATGTG GAGCTTGCCG GACATCAGAT ATTCGGTGCG
CTGATAGATA AGATGATGCA GGCCCTCACC AATCCGGATC ATGCCTATAG CCGGACGCTG
CTCAGTCGGG TAAGTACGCA GTACAACATA CGGGAGGAGA GCCTCTACGG TAAGATACAG
TGCACGCTGG ACTATATATC CGGCATGACC GATATATACG CGCTGGATCT GTACCGCAAG
ATCACGGGAA TGAACTTGCG CTAA
 
Protein sequence
MTMMQWERLL SDKRVGMEHY HQPKQSAQPW ERTDFERDYD RMVFSSPFRR LQNKAQVFPL 
AGNIFVHNRL THSLEVSCVG RSLGNNITRG LKARYGELPW ESGAISAIVS AACLAHDMGN
PPFGHSGERA ISAYFREGKG RVWEDAVRKE GGRWEDFLHF EGNANAFRLL THQFEGRRKG
GFALTYSTVA SIVKYPYSSE MAGKAGKFGF FATEEDTFLM IAEELGMLCR NEHPVKYVRY
PLVYLVEAAD DICYQIMDIE DAYKLRILTY AETENLFLAF CPEEELGHIR GVLDHITDAN
EQIAYLRSRV ISLLVESCTR VFLDHEDEIL SGTFSGTLIG AMEPRLKEAY GACARMAYSK
IYVARDVVDV ELAGHQIFGA LIDKMMQALT NPDHAYSRTL LSRVSTQYNI REESLYGKIQ
CTLDYISGMT DIYALDLYRK ITGMNLR