Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PG2107 |
Symbol | thiH |
ID | 2551801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | - |
Start bp | 2215636 |
End bp | 2216748 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637150685 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | NP_906167 |
Protein GI | 34541688 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTCT ATGATCACTT GCACGCCTCC CCCTATACTT GGGAGCGAGT GGGTAACCTC TTGGCCGATG CTACGGCTGC CGATGTGGAG CGTTCGTTGG CATACCGACG CCGCACCATC CAAGACTTTA TCACGCTTAT TTCGCCTGCA GCCATCCCTT ATCTGGAGCA GATGGCCCGG CAGGCGCATG CCCTGACGGT AGAGCGATTC GGCCACACGA TGCAGCTCTA TATCCCCCTC TACCTGTCGA ATATCTGTAG CAATGCCTGT GTATACTGCG GATTCAGTCG GTCGAACAAA ATCCATCGCC GCCGACTCAC TGCCGAAGAG GTGGATCGGG AAGCGGAAGC GATCCTTCGG CTCGGATACA AACACCTCCT TCTCGTGTCG GGAGAGTCCG AGAAAGCCAC TCCGGCAAGC TACTATGAGA AAATGACGCG GCGACTGCGT CCACTCTTCT CCCAACTCTC GCTCGAAGTT CAGCCCCTGA CCACGGAAGA ATATGCCCGT CTGCATGAAG CCGGTATCGG AGCCGTCTAT GTGTATCAGG AGACATACAA CGAGCAGGCA TACCCCACCT ATCACCCTGC CGGCCGGAAG GCGGACTATC GCTACCGCTT GGAGACTCCC GACAGAATCG GCCGAGCCAA TATGCAAAAG ATCGGAATAG GGGCACTGCT GGGACTGGAG AATTGGCGTG TGGACTCCGT TTTCACTGCT TTGCACCTGC GGTATCTGGA ACAGACGTAT TGGAAGAGCA AGTTTTCCAT CTCGCTGCCT CGTCTGCGTC CTGCCACGGG CGGCTGGGAG CCTAAAGATC CTATTGACGA TGTCGGTATG GTACAGCTTA TTACTGCCTT CCGTCTGTTG GATAAGGATG TCGAGATCAG CCTGTCCACA CGGGAGAGTC GTGAGTTTCG TGACCACGTG ATGCCGCTCG GTATCACCTC GGTCAGTGCC GGCAGCAAGA CCGAACCCGG AGGATATGCC GAAGAGAATG CCGATCTGGA GCAATTCGCC ATCAACGATG CCCGCAGTCC GGCCGAAATG GCTGCCGATC TTCGCCGACT TGGCTACGAG CCGGTTTGGA AAGACTGGGA TGCTTTCATG TAA
|
Protein sequence | MTFYDHLHAS PYTWERVGNL LADATAADVE RSLAYRRRTI QDFITLISPA AIPYLEQMAR QAHALTVERF GHTMQLYIPL YLSNICSNAC VYCGFSRSNK IHRRRLTAEE VDREAEAILR LGYKHLLLVS GESEKATPAS YYEKMTRRLR PLFSQLSLEV QPLTTEEYAR LHEAGIGAVY VYQETYNEQA YPTYHPAGRK ADYRYRLETP DRIGRANMQK IGIGALLGLE NWRVDSVFTA LHLRYLEQTY WKSKFSISLP RLRPATGGWE PKDPIDDVGM VQLITAFRLL DKDVEISLST RESREFRDHV MPLGITSVSA GSKTEPGGYA EENADLEQFA INDARSPAEM AADLRRLGYE PVWKDWDAFM
|
| |