Gene Sfum_0842 details

Gene Information

Locus tag: Sfum_0842
Symbol: thiH
ID: 4461047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 1044286
End bp: 1045692
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 57%
IMG OID: 639701604
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_844974
Protein GI: 116748287
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
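
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute to the protein; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1044286
end_bp = 1045692

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 1407 bp
print(protein_length, "aa")   # 468 aa
```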


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCGC AAAAAGGAGA TTTCATCGAC GACCGAAGGA TCGTGCAATT GATCGAGTCC 
GCGAAGGCCG GTGTGTCGAG GGAGGAAGTC GTGCGCATCA TCGAGAAGGC TTCCCAGTCG
GACGGTCTCA CCCCCGCCGA AGTTGCGGTT CTGCTCGAAG TTCAGGCGCC CGACCTGCTC
GAAATGATCT ACCGGACCGC CCAGGACATC AAGGAGCAGA TCTACGGGAA GAGGCTGGTA
TTGTTCGCCC CTCTCTACAT CAGCGATCAC TGCGTCAACA ATTGCGTCTA TTGCGGCTAC
AGGCGCCACA ACCGTTTTGA GCGTCGAAAA CTCACGATGG ATGAAATCAG AAAAGAAATC
TCCGTTCTCG AAGAGATGGG GCACAAGCGC ATCGCTCTCG AGTGCGGCGA ACATCCCGAA
CAATGTCCCA TCGACTACGT GCTGGATGCG ATCAGGACCA TTTATGAAGT CAAGGTGAAG
AACGGCAGCA TCCGGCGTGT GAACGTGAAT ATCGCCGCAA CGAGCATCGA AAACTTCAGG
CTGTTAAAGG AGGCCGGAAT CGGGACATAC ATCCTTTTCC AGGAGACCTA CCACCGTCCC
ACCTACGCGA GGGTGCACCC GTCCGGTCCA AAGAGGAACT ACGACTGGCA CACCTGCGCT
TTCGACCGCG CCATGGAGGG AGGCATTGAC GACGTGGGGT TCGGCGTGCT GTTCGGGCTT
TACGACTACA AGTATGAAGT CCTTGCACTG TTGCTGCATG CCATGCACCT GGAAGAGGCC
TTCGGGGTCG GGCCTCACAC CATTTCCGTT CCGCGGCTGA GACCGGCGGC CGGCGTCGAC
TTGAATAAAT TCCCCCACCT GGTCGCAGAC CGCGAATTCA AGAAAATAAT CGCCGTGCTT
CGACTTGCTG TGCCTTATAC CGGGATGATT CTCTCCACGC GGGAGCCTGC CCGGTTCCGC
GATGAGCTCA TTTCGGTGGG TATCTCGCAG ATCAGCGCCG GTTCTTGTAC GGGGGTCGGC
GGGTACTGCA AGGACAACAA TCTGGACCGG GAAGAACAGA GGCTGCAGTT CGCCATTGAA
GATCATCGGA CCATGGATGA AGTCATCATG AGCGTCTGCG AATCAGGCTA CATCCCGAGC
TTCTGCACGG CCTGCTATCG AAAGGGGCGC ACCGGGGACC GTTTCATGCA GCTTGCCAAG
ACCGGGCAAA TTCAGGATGT CTGCCAACCC AACGCCATCC TCACCTTCAA GGAATTCCTG
CTCGACTACG CGGGCCCCGA ACTGAAGGCC GCGGGTGAGT CGGCCATTCA TCGACATCAC
CAGTTGATCG CCAACCGGAA GGTGCGACGA ATCACCGAGC AGAGATTGGC TGAGATCGAA
CACGGCACAA GGGATCTCTA TTTCTGA
 
Protein sequence
MAAQKGDFID DRRIVQLIES AKAGVSREEV VRIIEKASQS DGLTPAEVAV LLEVQAPDLL 
EMIYRTAQDI KEQIYGKRLV LFAPLYISDH CVNNCVYCGY RRHNRFERRK LTMDEIRKEI
SVLEEMGHKR IALECGEHPE QCPIDYVLDA IRTIYEVKVK NGSIRRVNVN IAATSIENFR
LLKEAGIGTY ILFQETYHRP TYARVHPSGP KRNYDWHTCA FDRAMEGGID DVGFGVLFGL
YDYKYEVLAL LLHAMHLEEA FGVGPHTISV PRLRPAAGVD LNKFPHLVAD REFKKIIAVL
RLAVPYTGMI LSTREPARFR DELISVGISQ ISAGSCTGVG GYCKDNNLDR EEQRLQFAIE
DHRTMDEVIM SVCESGYIPS FCTACYRKGR TGDRFMQLAK TGQIQDVCQP NAILTFKEFL
LDYAGPELKA AGESAIHRHH QLIANRKVRR ITEQRLAEIE HGTRDLYF
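
The relationship between the two sequences above can be illustrated with a minimal sketch that strips the 10-base grouping and translates codon by codon. It is not the database's own method. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code, and it ignores alternative start codons because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate complete codons from position 0, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as displayed above.
first_line = "ATGGCAGCGC AAAAAGGAGA TTTCATCGAC GACCGAAGGA TCGTGCAATT GATCGAGTCC"
print(translate(first_line))  # MAAQKGDFIDDRRIVQLIES
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the reported GC content.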