Gene Information |
Locus tag | Shel_16500 |
Symbol | thiH |
ID | 8395540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Slackia heliotrinireducens DSM 20476 |
Kingdom | Bacteria |
Replicon accession | NC_013165 |
Strand | - |
Start bp | 1855460 |
End bp | 1856914 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644986404 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_003144018 |
Protein GI | 257064346 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
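The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes that Start bp and End bp are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 1855460, End bp 1856914.
length_bp = gene_length(1855460, 1856914)
print(length_bp, protein_length(length_bp))  # prints "1455 484"
```

Both results match the Gene Length (1455 bp) and Protein Length (484 aa) fields.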
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGGAAC ACGTCTACAA CCCGAGTTCG CCGCATGCCG ACGAATTCAT CAATCACCAG GAGATTCTCG ATACGCTTCA GTACGCCCAG GAGCACAAGG ACGACCTTGA GCTGTGCCGT AGCATCCTGA AGAAGGCACA CCCCAACCTG GCGCCGAAGA AGGAGCATTG CACCTGCATC ACGCATCGCG AGGCGGCTGT GCTGTTGGCC TGCGAGGACC CCGAAATCAA CGAGGAAATC AAGACGCTGG CGCGTCAGAT CAAGCTTGCC TACTATGGCA ATCGCATCGT GCTGTTCGCG CCGCTGTACC TTTCCAACTA CTGCGTGAAC GGTTGCCTGT ACTGCCCATA CCACGCCAAG AACCGCGAGA TCCCCCGCCG CAAGCTGACT CAGGACGAGA TCAGGGCGGA AGTCATCGCA CTGCAGGACA TGGGGCACAA GCGCCTGGCC ATCGAAGCGG GCGAGGATCC CAAGCACAAT CCCATCGAGT ACATCCTGGA GTCGATGCAG ACCATCTACT CCATCAAGCA CAAGAACGGC GCCATCCGCC GTGTGAACGT CAACATCGCG GCCACGACCG TCGAAGAGTA CCGCATGCTC AAAGAGGCCG AGATCGGCAC GTACATCCTG TTCCAGGAAA CCTATAACCG CGCCCGTTAC GAGGAGCTGC ATCCCACGGG ACCGAAGTCC GATTACGAAT GGCACACCGA GGCGCATGAC CGTGCCCAGG AGGCCGGCAT CGACGACGTG GGTCTGGGCG TTCTGTTCGG TCTGGAAGGC TACGCCTACG AGTTCTGCGG ACTCATCATG CATGCCGAGC ACCTCGAGGC GCGTTTCGGC GTGGGTCCGC ACACCATCAG CGTGCCCCGC GTGAAGCCCG CCATGGACAT TGACCCCGAC GTGTTCGACA ACGGTATTCC CGACGAGATG TTCGAGAAGA TCATCGCCCT CATCCGCATC ACCGTGCCTT ACACCGGCAT GATCATCAGC ACCCGCGAGT CGGAGGCCGT TCGTTCTGCC GCGCTGCAGT ACGGCATCTC GCAGATTTCG GGCGGTTCGC GCACCAGCGT GGGCGGCTAC ACCGAGGAGG AGCGTCCCCA CGACACCGAG CAGTTCGACG TGTCCGACCA GCGTACGCTC GACGAGGTCA TTGCCTGGCT TATGGATTGC GGCCACATCC CCAGCTTCTG CACGGCATGC TACCGCGCAG GGCGCACGGG CGACCGCTTC ATGAGCTTCT GCAAATCGGG CGAGATTCTG AACTACTGCC ATCCGAACGC GCTTATGACG CTGTCCGAGT ACCTGGTCGA CTACGCGACC CCGGCAACGG CCGAGCGCGG CTGGGAGATG ATTCGCGAGG AGCTTACCAA GATCCCCGAC GCCCGCAGGC GCGAGCTGTG CGCGGCCCAC ATCGAGGAGA TCCGCACCGG CAACGCCCGC GACTTCAGGT TCTAG
Protein sequence | MTEHVYNPSS PHADEFINHQ EILDTLQYAQ EHKDDLELCR SILKKAHPNL APKKEHCTCI THREAAVLLA CEDPEINEEI KTLARQIKLA YYGNRIVLFA PLYLSNYCVN GCLYCPYHAK NREIPRRKLT QDEIRAEVIA LQDMGHKRLA IEAGEDPKHN PIEYILESMQ TIYSIKHKNG AIRRVNVNIA ATTVEEYRML KEAEIGTYIL FQETYNRARY EELHPTGPKS DYEWHTEAHD RAQEAGIDDV GLGVLFGLEG YAYEFCGLIM HAEHLEARFG VGPHTISVPR VKPAMDIDPD VFDNGIPDEM FEKIIALIRI TVPYTGMIIS TRESEAVRSA ALQYGISQIS GGSRTSVGGY TEEERPHDTE QFDVSDQRTL DEVIAWLMDC GHIPSFCTAC YRAGRTGDRF MSFCKSGEIL NYCHPNALMT LSEYLVDYAT PATAERGWEM IREELTKIPD ARRRELCAAH IEEIRTGNAR DFRF
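The relationship between the two sequences above can be sketched in a few lines of Python. The gene sequence appears to be listed in coding orientation (it begins with ATG and ends with TAG) even though the gene lies on the minus strand, so the sketch translates it directly without reverse-complementing. The helper names are hypothetical. The codon table is written in the usual TCAG ordering; translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons, which this sketch does not handle specially.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        first, second, third = (BASES.index(base) for base in s[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * first + 4 * second + third]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGACGGAAC ACGTCTACAA CCCGAGTTCG"
print(translate(prefix))  # prints "MTEHVYNPSS"
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.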