Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_0842 |
Symbol | thiH |
ID | 4461047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | - |
Start bp | 1044286 |
End bp | 1045692 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639701604 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_844974 |
Protein GI | 116748287 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCGC AAAAAGGAGA TTTCATCGAC GACCGAAGGA TCGTGCAATT GATCGAGTCC GCGAAGGCCG GTGTGTCGAG GGAGGAAGTC GTGCGCATCA TCGAGAAGGC TTCCCAGTCG GACGGTCTCA CCCCCGCCGA AGTTGCGGTT CTGCTCGAAG TTCAGGCGCC CGACCTGCTC GAAATGATCT ACCGGACCGC CCAGGACATC AAGGAGCAGA TCTACGGGAA GAGGCTGGTA TTGTTCGCCC CTCTCTACAT CAGCGATCAC TGCGTCAACA ATTGCGTCTA TTGCGGCTAC AGGCGCCACA ACCGTTTTGA GCGTCGAAAA CTCACGATGG ATGAAATCAG AAAAGAAATC TCCGTTCTCG AAGAGATGGG GCACAAGCGC ATCGCTCTCG AGTGCGGCGA ACATCCCGAA CAATGTCCCA TCGACTACGT GCTGGATGCG ATCAGGACCA TTTATGAAGT CAAGGTGAAG AACGGCAGCA TCCGGCGTGT GAACGTGAAT ATCGCCGCAA CGAGCATCGA AAACTTCAGG CTGTTAAAGG AGGCCGGAAT CGGGACATAC ATCCTTTTCC AGGAGACCTA CCACCGTCCC ACCTACGCGA GGGTGCACCC GTCCGGTCCA AAGAGGAACT ACGACTGGCA CACCTGCGCT TTCGACCGCG CCATGGAGGG AGGCATTGAC GACGTGGGGT TCGGCGTGCT GTTCGGGCTT TACGACTACA AGTATGAAGT CCTTGCACTG TTGCTGCATG CCATGCACCT GGAAGAGGCC TTCGGGGTCG GGCCTCACAC CATTTCCGTT CCGCGGCTGA GACCGGCGGC CGGCGTCGAC TTGAATAAAT TCCCCCACCT GGTCGCAGAC CGCGAATTCA AGAAAATAAT CGCCGTGCTT CGACTTGCTG TGCCTTATAC CGGGATGATT CTCTCCACGC GGGAGCCTGC CCGGTTCCGC GATGAGCTCA TTTCGGTGGG TATCTCGCAG ATCAGCGCCG GTTCTTGTAC GGGGGTCGGC GGGTACTGCA AGGACAACAA TCTGGACCGG GAAGAACAGA GGCTGCAGTT CGCCATTGAA GATCATCGGA CCATGGATGA AGTCATCATG AGCGTCTGCG AATCAGGCTA CATCCCGAGC TTCTGCACGG CCTGCTATCG AAAGGGGCGC ACCGGGGACC GTTTCATGCA GCTTGCCAAG ACCGGGCAAA TTCAGGATGT CTGCCAACCC AACGCCATCC TCACCTTCAA GGAATTCCTG CTCGACTACG CGGGCCCCGA ACTGAAGGCC GCGGGTGAGT CGGCCATTCA TCGACATCAC CAGTTGATCG CCAACCGGAA GGTGCGACGA ATCACCGAGC AGAGATTGGC TGAGATCGAA CACGGCACAA GGGATCTCTA TTTCTGA
|
Protein sequence | MAAQKGDFID DRRIVQLIES AKAGVSREEV VRIIEKASQS DGLTPAEVAV LLEVQAPDLL EMIYRTAQDI KEQIYGKRLV LFAPLYISDH CVNNCVYCGY RRHNRFERRK LTMDEIRKEI SVLEEMGHKR IALECGEHPE QCPIDYVLDA IRTIYEVKVK NGSIRRVNVN IAATSIENFR LLKEAGIGTY ILFQETYHRP TYARVHPSGP KRNYDWHTCA FDRAMEGGID DVGFGVLFGL YDYKYEVLAL LLHAMHLEEA FGVGPHTISV PRLRPAAGVD LNKFPHLVAD REFKKIIAVL RLAVPYTGMI LSTREPARFR DELISVGISQ ISAGSCTGVG GYCKDNNLDR EEQRLQFAIE DHRTMDEVIM SVCESGYIPS FCTACYRKGR TGDRFMQLAK TGQIQDVCQP NAILTFKEFL LDYAGPELKA AGESAIHRHH QLIANRKVRR ITEQRLAEIE HGTRDLYF
|
| |