Gene Sfum_1822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_1822 
SymbolthiH 
ID4459867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp2223987 
End bp2225123 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content64% 
IMG OID639702589 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_845942 
Protein GI116749255 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.113959 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCT ACAATGAAAT CAAACAGTAC AAGTGGTCGG ACATCGGCCG CGACCTGAAA 
AGCCGGTCGC GATCGGACGT GGAAAGGGCA CTCGGGTCTC GGCCCGTGAA CCTGGACGGG
CTGATCAGCC TGCTCTCCCC GGCCGCCGAA CCTTTGCTCG AAGAGATGGC GCAAGAAGCG
CACAGGCTGA CGATCCGCAG GTTCGGCAAC GTGATCTCCA TGTTCGCGCC GCTCTACATT
TCCAACGTGT GCATGAATCG CTGCGCCTAC TGCGGTTTCA ACGCCGGCAA CCCGGTCGCC
CGCCTGACGC TCACCGTCGA GCAGATCGAA GCGGAAGGCA GGGCCATCAG GGCGCTCGGC
TTCCGGCACC TGCTCCTGGT TTCCGGAGAG GCCCCCAAGA TCGTCACCAT GGACTACCTG
AAGAGCGCGC TGGACGTGTT GCGCCCGTTG TTCCCGTCCC TTTCCGTCGA GATTTTCCCC
CTGGACACCG CCGCGTACGC CGAGCTCATC GACCATGGCC TGGACGGACT GGTCGTATTT
CAGGAAACCT ACGACGAGGA GTTGTACGGG AAAGTCCACC TGGGGGGGAA GAAGAGGGAC
TACCGGTGGC GGCTGGAAAC CCCGGACCGG GGCGGGTCGG CAGGCTTTCG CCGGCTCGGT
CTGGGCGCCC TCCTCGGGCT CAACGACTGG CGAGTGGAAG CGTTTTTCCT CGCCCTGCAC
GCACAGTACC TGCTGCGCAC CTACTGGAAG TCGCAGATCG GCATTTCCTT TCCGCGCCTG
CGACCGGCTG CCGGGGCCTT CCAGCCGGCT CATCCCGTTT CCGATATCGA CTTTGTGCAA
CTGCTGACCG CCCTGCGGCT GTTCCTGCCC GACGCCTCCC TGGTGCTCTC GACCCGCGAA
CCCGCGTCCC TGAGGGACCA CCTGGTGCCG CTGGGCATCA CCACCATGAG CGCCGGGTCT
CACACGGAAC CGGGCGGGTA CAGTCGCGAA TCGGAGGCCG AAGCGCAGTT CGAGATCGCC
GACAGACGAT CTCCCGAGGA GGTGGCGAAC ATGCTCAGGG AAAAAGGCTA CGAGCCGGTG
TGGAAGGACT GGGACAGCAT CTTCCTCACC CCGGGACGTG AAACCGCCGC CGCCTGA
 
Protein sequence
MSFYNEIKQY KWSDIGRDLK SRSRSDVERA LGSRPVNLDG LISLLSPAAE PLLEEMAQEA 
HRLTIRRFGN VISMFAPLYI SNVCMNRCAY CGFNAGNPVA RLTLTVEQIE AEGRAIRALG
FRHLLLVSGE APKIVTMDYL KSALDVLRPL FPSLSVEIFP LDTAAYAELI DHGLDGLVVF
QETYDEELYG KVHLGGKKRD YRWRLETPDR GGSAGFRRLG LGALLGLNDW RVEAFFLALH
AQYLLRTYWK SQIGISFPRL RPAAGAFQPA HPVSDIDFVQ LLTALRLFLP DASLVLSTRE
PASLRDHLVP LGITTMSAGS HTEPGGYSRE SEAEAQFEIA DRRSPEEVAN MLREKGYEPV
WKDWDSIFLT PGRETAAA