Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_1841 |
Symbol | thiH |
ID | 4459852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 2247708 |
End bp | 2249180 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639702608 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_845961 |
Protein GI | 116749274 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0513393 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.233409 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGGCTC GATCTTCTCT CTGTCGACGC GAAACGGTGC TCAACGTGAA CGCTCGGGCG GGCAGCATGG ATCATTCGAC GCAAGGCGGC GCATTCATCG ACGAACGCGT CATTTTTCAG ACACTGGAAG GTCTCGGTCC GGTCGCCGAC CCGGTGCACG TGCGGGAAAT CCTGGCGAAA GCGCGCGAGC TCAAGGGGCT GAATCCCGAG GAAGTGGCGG TGCTGACGGT GGTCACGGAC CGGGGGTTGC TGGAGGAGCT CTTCACCGCC GCCCGGTTTA TCAAGGAGAC CATCTACGGG CCCCGGATCG TCCTGTTCGC GCCTCTGTAC ATTTCCAATC TATGTCATAA CGAATGCCTC TATTGTGCAT TCCGGGCAAG CAACCGGGAA GTGCACCGCC GCGCCCTGAA CCAAGATGAA ATCGCAAATG AAATCAAGCT GTTGGTGGAA CAGGGACATA AGCGGGTGCT CGTCGTCTCC GGCGAGAGCT ATCCCCGGGA GGACGGCTTC GACTACATCA TCAAGTCTAT CGAAACCGTT TACAAAACGC GCAGCGGTCC CGGAGAAATC CGACGGGTGA ACGTGAACCT GGCGCCATGC ACGGTGGAAC AGTTCAAACA GCTCAAAGCC GCCGGCATCG GGACTTTTCA GCTGTTCCAG GAAACCTATC ACCGTCGCAC CTATGACGTC ATGCATCCGG GCGGTCGCAA GCGCGACTAC GATTGGCGCG TCACGGCCTT CGACCGCGCC ATGCGGGGCG GGATCGACGA CGTGGGGATG GGCCTTCTGT TCGGGCTATA TGACTGGCGA TTCGAGGTCC TGGCCCTGTT GCAGCACGCA CGGCACCTGG AAGAGGTCTT CGGCGTTGGT CCGCATACCA TCAGCGTCCC GCGCATGGAG CCGGCAGTCG GCTCCGAAAT CGCCGCGAAT CCTCCGCGCC CGGTGAGTGA CGACGATTTT CTGAAGATCG TCGCCATCCT GCGCATGGCG GTGCCTTACA CGGGAATGAT CATGTCCACG CGCGAAACAC CGGAAACCCG GCGTGCGACG CTGGCGTTGG GGATCTCGCA GATCTCCGCC GGAAGCCGCA CCAATCCCGG GGGCTATTCC GACGGCGTCC AGGAAACCGA CGCCCAGTTC CAACTCGGCG ATCACCGTCC CTTGAACGAG GTGATCCGGG ATCTGGCCGA CATGGGTTAC ATTCCGTCTT TTTGCACGGC CTGCTACCGG CTGGGACGCA CCGGGCACGA TTTCATGGAA CTGGCCAAGC CGGGAGACAT CAAGTACCGC TGCGACCCGA ACGCTCTGTC GACTTTCCTG GAATACCTGC TCGACTACGC CTCGCCGGAT ACCGTCGCCG CCGGCGAAAG GCTCATCGAG AAACAGCTGG CGCGCATGGA CGACAGGCTG CGTCGAACGG CTTCGAAGAT GCTCGACAAG GTGCGTGGCG GGCGGCGCGA CGTTTACATT TGA
|
Protein sequence | MPARSSLCRR ETVLNVNARA GSMDHSTQGG AFIDERVIFQ TLEGLGPVAD PVHVREILAK ARELKGLNPE EVAVLTVVTD RGLLEELFTA ARFIKETIYG PRIVLFAPLY ISNLCHNECL YCAFRASNRE VHRRALNQDE IANEIKLLVE QGHKRVLVVS GESYPREDGF DYIIKSIETV YKTRSGPGEI RRVNVNLAPC TVEQFKQLKA AGIGTFQLFQ ETYHRRTYDV MHPGGRKRDY DWRVTAFDRA MRGGIDDVGM GLLFGLYDWR FEVLALLQHA RHLEEVFGVG PHTISVPRME PAVGSEIAAN PPRPVSDDDF LKIVAILRMA VPYTGMIMST RETPETRRAT LALGISQISA GSRTNPGGYS DGVQETDAQF QLGDHRPLNE VIRDLADMGY IPSFCTACYR LGRTGHDFME LAKPGDIKYR CDPNALSTFL EYLLDYASPD TVAAGERLIE KQLARMDDRL RRTASKMLDK VRGGRRDVYI
|
| |