Gene PG2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPG2107 
SymbolthiH 
ID2551801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePorphyromonas gingivalis W83 
KingdomBacteria 
Replicon accessionNC_002950 
Strand
Start bp2215636 
End bp2216748 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content56% 
IMG OID637150685 
Productthiamine biosynthesis protein ThiH 
Protein accessionNP_906167 
Protein GI34541688 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTCT ATGATCACTT GCACGCCTCC CCCTATACTT GGGAGCGAGT GGGTAACCTC 
TTGGCCGATG CTACGGCTGC CGATGTGGAG CGTTCGTTGG CATACCGACG CCGCACCATC
CAAGACTTTA TCACGCTTAT TTCGCCTGCA GCCATCCCTT ATCTGGAGCA GATGGCCCGG
CAGGCGCATG CCCTGACGGT AGAGCGATTC GGCCACACGA TGCAGCTCTA TATCCCCCTC
TACCTGTCGA ATATCTGTAG CAATGCCTGT GTATACTGCG GATTCAGTCG GTCGAACAAA
ATCCATCGCC GCCGACTCAC TGCCGAAGAG GTGGATCGGG AAGCGGAAGC GATCCTTCGG
CTCGGATACA AACACCTCCT TCTCGTGTCG GGAGAGTCCG AGAAAGCCAC TCCGGCAAGC
TACTATGAGA AAATGACGCG GCGACTGCGT CCACTCTTCT CCCAACTCTC GCTCGAAGTT
CAGCCCCTGA CCACGGAAGA ATATGCCCGT CTGCATGAAG CCGGTATCGG AGCCGTCTAT
GTGTATCAGG AGACATACAA CGAGCAGGCA TACCCCACCT ATCACCCTGC CGGCCGGAAG
GCGGACTATC GCTACCGCTT GGAGACTCCC GACAGAATCG GCCGAGCCAA TATGCAAAAG
ATCGGAATAG GGGCACTGCT GGGACTGGAG AATTGGCGTG TGGACTCCGT TTTCACTGCT
TTGCACCTGC GGTATCTGGA ACAGACGTAT TGGAAGAGCA AGTTTTCCAT CTCGCTGCCT
CGTCTGCGTC CTGCCACGGG CGGCTGGGAG CCTAAAGATC CTATTGACGA TGTCGGTATG
GTACAGCTTA TTACTGCCTT CCGTCTGTTG GATAAGGATG TCGAGATCAG CCTGTCCACA
CGGGAGAGTC GTGAGTTTCG TGACCACGTG ATGCCGCTCG GTATCACCTC GGTCAGTGCC
GGCAGCAAGA CCGAACCCGG AGGATATGCC GAAGAGAATG CCGATCTGGA GCAATTCGCC
ATCAACGATG CCCGCAGTCC GGCCGAAATG GCTGCCGATC TTCGCCGACT TGGCTACGAG
CCGGTTTGGA AAGACTGGGA TGCTTTCATG TAA
 
Protein sequence
MTFYDHLHAS PYTWERVGNL LADATAADVE RSLAYRRRTI QDFITLISPA AIPYLEQMAR 
QAHALTVERF GHTMQLYIPL YLSNICSNAC VYCGFSRSNK IHRRRLTAEE VDREAEAILR
LGYKHLLLVS GESEKATPAS YYEKMTRRLR PLFSQLSLEV QPLTTEEYAR LHEAGIGAVY
VYQETYNEQA YPTYHPAGRK ADYRYRLETP DRIGRANMQK IGIGALLGLE NWRVDSVFTA
LHLRYLEQTY WKSKFSISLP RLRPATGGWE PKDPIDDVGM VQLITAFRLL DKDVEISLST
RESREFRDHV MPLGITSVSA GSKTEPGGYA EENADLEQFA INDARSPAEM AADLRRLGYE
PVWKDWDAFM