Gene PG2109 details

Gene Information

Locus tag: PG2109
Symbol: thiE
ID: 2551490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 2217599
End bp: 2219542
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table: 11
GC content: 58%
IMG OID: 637150687
Product: thiamine-phosphate pyrophosphorylase
Protein accession: NP_906169
Protein GI: 34541690
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase
        [COG0352] Thiamine monophosphate synthase
TIGRFAM ID: [TIGR00097] phosphomethylpyrimidine kinase
            [TIGR00693] thiamine-phosphate pyrophosphorylase
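
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2217599
END_BP = 2219542
GENE_LENGTH_BP = 1944
PROTEIN_LENGTH_AA = 647


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_cds_length(protein_length: int) -> int:
    """Coding length in bp: three bases per residue plus one stop codon."""
    return 3 * (protein_length + 1)


# Both values should agree with the listed gene length if the record is consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_cds_length(PROTEIN_LENGTH_AA) == GENE_LENGTH_BP)
```

If either comparison failed for another record, likely causes would include a spliced or pseudo gene, or a different coordinate convention than the one assumed here.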


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.671125
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCGT GGCTGATCAC TCCCGAAAGG CTTACGAGCG AGCAAATCGG CCTGCTCGAC 
CACTTTTTCG ACCGGGGCTT GGAGCGGCTG CATCTTCGTT TGCCCAGTGC CGGCGAAGAG
GACTATTCGC AGGTTATCGA AGCTGTGGCA GCTTGCTATC GCCGGAGAAT CGTGGTGCAC
GACCATCCCC GGCTCGTCGG CCGCTACGGA CTGTGCGGTT TGCATCTGCC CGAACGGGTC
TGGAAAGGCA TGACCGATCG GCCACAGCTC CCTGATGGTT GTACGGTGTC AGCTTCGTGC
CACGCGATAG AGGATATAGA GTCCTTCCCC TTTCGACTCG ACTACTGTTT CTTAAGCCCC
GTGTTCGACA GTCTGTGCAA GGTGGGCTAT GCAGGACGAT TTTCCCCCGA TTCTTTGGGC
GATCGTCTGC AGCGACTCCA TCTGCCCGTG GTGGCACTGG GCGGCATTAC GCCCGATCGT
CTGCCACAGC TTCGTAGAGC AGGCTTTGCC TCGGCAGCTG CACTCGGCTA TGTGTGGTTG
GTCGAAGGCC GGGAACTGAT GCGTTGGCAG GAATTGTGTA CGCCGGCTGT TATCTGTGTC
GGAGGAGTGG ATCCGTCGGC CGGAGCCGGC ATTACGGCAG ATGTGCGGAC GGCCGAAAAC
ATGGGCGTGC GGGCTTATAC CGTGGCTACG GCTATTACCT TTCAGGGGAG TGGCAGCTAT
CGGGGGGAGC GATGGGTGGA TTCGGCGGAC ATCATCCGGC AGATCGAATC TTTGTCGGCC
GAGATGGAGC CTGCCGTCGC CAAGATCGGC CTGATCCGCG ACTCGGACAC CCTCTCTCTC
GTAGTGGATT GTTTGAAAAA GGTTTTCCCC TCGATTCGGA TCGTATGGGA TCCCGTACTC
AGAGCTTCGG CCGACAGCTC CGCGGGACAA GCGGATCGTT TCAATCTTGA AGATATAACG
GCTTTGTCCC GGATAGACTT CATCACGCCG AATCTTCCCG AAGCTCGCCA CCTTTTGGGC
TGCGAACCGG ACGATGAGAC CCTGTTAGAC TTCTATCGAA GGAGCGGTGT CGGCCTCGTC
CTCAAGGGGG GGCATGCCGG AGAATCGGTC GTAACGGATC GGATCGTTTA CGACGGTCGA
TGCGAAGCTC TCCGACTCTT ACGCGGCGGT ACGGGGAAGC ATGGCACGGG TTGTGCCCAC
AGCACCGCTT TTGCAGCTGC TTTGGCCTTG GAGCAAGAAC CTTTTACGGC TGCCGGGATG
GCACAGTTAT ATGTCAGTCG ACTACGTGAG CGTGCTTCGG GGTTGCTCGC TATGCACAAA
GACCTGCCGG TCGATCCGGT CGTCAGGCTG ATGAGCGAAA TAGATTTGCA GTTCATCACG
CACCGTCAGC CCGACCTGTC CGAACTCGAA GAAGCGGAGG CCGTCTGCCG TATAGGTGTG
CGCTGGGTAC AGCTTCGGAT GAAGGAGGCT TCGGACGAAG AGATGCTTCA CACGGCTTGT
GCCGTCAAGG CTGTCTGCCG TCACCACGGA GCACTTTTTG TCGTCAATGA CCGTGTCGAA
ATAGCCCGTC AGGTGGATGC TGACGGCGTA CACTTGGGCA AAGAGGATAT GGCGATAGTC
GAAGCGCGTC GCATCCTCGG TTCGAATAAG ATCATAGGAC GCACATGCAA TACGATGGAG
GATGTGCGCC GAGCATATGC CGAAGGAGCC GACTACGTGG GTATAGGCCC GTATCGCTAT
ACGGAGACGA AGCAGCGTTT AGCTCCCGTC CTCGGACTCG AAGGCTACAA AGCCATCGCC
GCCTGTATGC AAGCCGAAGG CATCCGACTG CCGGCCTTTG CCATCGGTGG GATAGAGGAT
GCAGACATTC CCCTCATTCG CGACTGTGGC ATAGGAGGTA TTGCCGTGAG CGGCAGCCTT
ATCAGGAAAA TAAAAAAGAA CTAA
 
Protein sequence
MKPWLITPER LTSEQIGLLD HFFDRGLERL HLRLPSAGEE DYSQVIEAVA ACYRRRIVVH 
DHPRLVGRYG LCGLHLPERV WKGMTDRPQL PDGCTVSASC HAIEDIESFP FRLDYCFLSP
VFDSLCKVGY AGRFSPDSLG DRLQRLHLPV VALGGITPDR LPQLRRAGFA SAAALGYVWL
VEGRELMRWQ ELCTPAVICV GGVDPSAGAG ITADVRTAEN MGVRAYTVAT AITFQGSGSY
RGERWVDSAD IIRQIESLSA EMEPAVAKIG LIRDSDTLSL VVDCLKKVFP SIRIVWDPVL
RASADSSAGQ ADRFNLEDIT ALSRIDFITP NLPEARHLLG CEPDDETLLD FYRRSGVGLV
LKGGHAGESV VTDRIVYDGR CEALRLLRGG TGKHGTGCAH STAFAAALAL EQEPFTAAGM
AQLYVSRLRE RASGLLAMHK DLPVDPVVRL MSEIDLQFIT HRQPDLSELE EAEAVCRIGV
RWVQLRMKEA SDEEMLHTAC AVKAVCRHHG ALFVVNDRVE IARQVDADGV HLGKEDMAIV
EARRILGSNK IIGRTCNTME DVRRAYAEGA DYVGIGPYRY TETKQRLAPV LGLEGYKAIA
ACMQAEGIRL PAFAIGGIED ADIPLIRDCG IGGIAVSGSL IRKIKKN
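
The gene and protein sequences above are printed in blocks of ten characters, so they need whitespace removed before any analysis. The sketch below shows one way to translate the gene sequence and compute its GC fraction in plain Python. It assumes the sequence is given 5' to 3' in the coding frame, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Note that an alternative start codon such as GTG or TTG would be rendered here as V or L rather than M.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G). Amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon (one or two bases) is ignored.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence above.
print(translate("ATGAAACCGT GGCTGATCAC"))  # MKPWLI
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick way to confirm that the two listings belong together.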