Gene Sfum_1841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_1841 
SymbolthiH 
ID4459852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp2247708 
End bp2249180 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content61% 
IMG OID639702608 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_845961 
Protein GI116749274 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0513393 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.233409 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGCTC GATCTTCTCT CTGTCGACGC GAAACGGTGC TCAACGTGAA CGCTCGGGCG 
GGCAGCATGG ATCATTCGAC GCAAGGCGGC GCATTCATCG ACGAACGCGT CATTTTTCAG
ACACTGGAAG GTCTCGGTCC GGTCGCCGAC CCGGTGCACG TGCGGGAAAT CCTGGCGAAA
GCGCGCGAGC TCAAGGGGCT GAATCCCGAG GAAGTGGCGG TGCTGACGGT GGTCACGGAC
CGGGGGTTGC TGGAGGAGCT CTTCACCGCC GCCCGGTTTA TCAAGGAGAC CATCTACGGG
CCCCGGATCG TCCTGTTCGC GCCTCTGTAC ATTTCCAATC TATGTCATAA CGAATGCCTC
TATTGTGCAT TCCGGGCAAG CAACCGGGAA GTGCACCGCC GCGCCCTGAA CCAAGATGAA
ATCGCAAATG AAATCAAGCT GTTGGTGGAA CAGGGACATA AGCGGGTGCT CGTCGTCTCC
GGCGAGAGCT ATCCCCGGGA GGACGGCTTC GACTACATCA TCAAGTCTAT CGAAACCGTT
TACAAAACGC GCAGCGGTCC CGGAGAAATC CGACGGGTGA ACGTGAACCT GGCGCCATGC
ACGGTGGAAC AGTTCAAACA GCTCAAAGCC GCCGGCATCG GGACTTTTCA GCTGTTCCAG
GAAACCTATC ACCGTCGCAC CTATGACGTC ATGCATCCGG GCGGTCGCAA GCGCGACTAC
GATTGGCGCG TCACGGCCTT CGACCGCGCC ATGCGGGGCG GGATCGACGA CGTGGGGATG
GGCCTTCTGT TCGGGCTATA TGACTGGCGA TTCGAGGTCC TGGCCCTGTT GCAGCACGCA
CGGCACCTGG AAGAGGTCTT CGGCGTTGGT CCGCATACCA TCAGCGTCCC GCGCATGGAG
CCGGCAGTCG GCTCCGAAAT CGCCGCGAAT CCTCCGCGCC CGGTGAGTGA CGACGATTTT
CTGAAGATCG TCGCCATCCT GCGCATGGCG GTGCCTTACA CGGGAATGAT CATGTCCACG
CGCGAAACAC CGGAAACCCG GCGTGCGACG CTGGCGTTGG GGATCTCGCA GATCTCCGCC
GGAAGCCGCA CCAATCCCGG GGGCTATTCC GACGGCGTCC AGGAAACCGA CGCCCAGTTC
CAACTCGGCG ATCACCGTCC CTTGAACGAG GTGATCCGGG ATCTGGCCGA CATGGGTTAC
ATTCCGTCTT TTTGCACGGC CTGCTACCGG CTGGGACGCA CCGGGCACGA TTTCATGGAA
CTGGCCAAGC CGGGAGACAT CAAGTACCGC TGCGACCCGA ACGCTCTGTC GACTTTCCTG
GAATACCTGC TCGACTACGC CTCGCCGGAT ACCGTCGCCG CCGGCGAAAG GCTCATCGAG
AAACAGCTGG CGCGCATGGA CGACAGGCTG CGTCGAACGG CTTCGAAGAT GCTCGACAAG
GTGCGTGGCG GGCGGCGCGA CGTTTACATT TGA
 
Protein sequence
MPARSSLCRR ETVLNVNARA GSMDHSTQGG AFIDERVIFQ TLEGLGPVAD PVHVREILAK 
ARELKGLNPE EVAVLTVVTD RGLLEELFTA ARFIKETIYG PRIVLFAPLY ISNLCHNECL
YCAFRASNRE VHRRALNQDE IANEIKLLVE QGHKRVLVVS GESYPREDGF DYIIKSIETV
YKTRSGPGEI RRVNVNLAPC TVEQFKQLKA AGIGTFQLFQ ETYHRRTYDV MHPGGRKRDY
DWRVTAFDRA MRGGIDDVGM GLLFGLYDWR FEVLALLQHA RHLEEVFGVG PHTISVPRME
PAVGSEIAAN PPRPVSDDDF LKIVAILRMA VPYTGMIMST RETPETRRAT LALGISQISA
GSRTNPGGYS DGVQETDAQF QLGDHRPLNE VIRDLADMGY IPSFCTACYR LGRTGHDFME
LAKPGDIKYR CDPNALSTFL EYLLDYASPD TVAAGERLIE KQLARMDDRL RRTASKMLDK
VRGGRRDVYI