Gene Information |
Locus tag | Sfum_1822 |
Symbol | thiH |
ID | 4459867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 2223987 |
End bp | 2225123 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639702589 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_845942 |
Protein GI | 116749255 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
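The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention, but both are consistent with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS of this length, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2223987, 2225123)
print(length_bp)                  # 1137
print(protein_length(length_bp))  # 378
```

Both results agree with the Gene Length (1137 bp) and Protein Length (378 aa) fields.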
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.113959 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCTTCT ACAATGAAAT CAAACAGTAC AAGTGGTCGG ACATCGGCCG CGACCTGAAA AGCCGGTCGC GATCGGACGT GGAAAGGGCA CTCGGGTCTC GGCCCGTGAA CCTGGACGGG CTGATCAGCC TGCTCTCCCC GGCCGCCGAA CCTTTGCTCG AAGAGATGGC GCAAGAAGCG CACAGGCTGA CGATCCGCAG GTTCGGCAAC GTGATCTCCA TGTTCGCGCC GCTCTACATT TCCAACGTGT GCATGAATCG CTGCGCCTAC TGCGGTTTCA ACGCCGGCAA CCCGGTCGCC CGCCTGACGC TCACCGTCGA GCAGATCGAA GCGGAAGGCA GGGCCATCAG GGCGCTCGGC TTCCGGCACC TGCTCCTGGT TTCCGGAGAG GCCCCCAAGA TCGTCACCAT GGACTACCTG AAGAGCGCGC TGGACGTGTT GCGCCCGTTG TTCCCGTCCC TTTCCGTCGA GATTTTCCCC CTGGACACCG CCGCGTACGC CGAGCTCATC GACCATGGCC TGGACGGACT GGTCGTATTT CAGGAAACCT ACGACGAGGA GTTGTACGGG AAAGTCCACC TGGGGGGGAA GAAGAGGGAC TACCGGTGGC GGCTGGAAAC CCCGGACCGG GGCGGGTCGG CAGGCTTTCG CCGGCTCGGT CTGGGCGCCC TCCTCGGGCT CAACGACTGG CGAGTGGAAG CGTTTTTCCT CGCCCTGCAC GCACAGTACC TGCTGCGCAC CTACTGGAAG TCGCAGATCG GCATTTCCTT TCCGCGCCTG CGACCGGCTG CCGGGGCCTT CCAGCCGGCT CATCCCGTTT CCGATATCGA CTTTGTGCAA CTGCTGACCG CCCTGCGGCT GTTCCTGCCC GACGCCTCCC TGGTGCTCTC GACCCGCGAA CCCGCGTCCC TGAGGGACCA CCTGGTGCCG CTGGGCATCA CCACCATGAG CGCCGGGTCT CACACGGAAC CGGGCGGGTA CAGTCGCGAA TCGGAGGCCG AAGCGCAGTT CGAGATCGCC GACAGACGAT CTCCCGAGGA GGTGGCGAAC ATGCTCAGGG AAAAAGGCTA CGAGCCGGTG TGGAAGGACT GGGACAGCAT CTTCCTCACC CCGGGACGTG AAACCGCCGC CGCCTGA
Protein sequence | MSFYNEIKQY KWSDIGRDLK SRSRSDVERA LGSRPVNLDG LISLLSPAAE PLLEEMAQEA HRLTIRRFGN VISMFAPLYI SNVCMNRCAY CGFNAGNPVA RLTLTVEQIE AEGRAIRALG FRHLLLVSGE APKIVTMDYL KSALDVLRPL FPSLSVEIFP LDTAAYAELI DHGLDGLVVF QETYDEELYG KVHLGGKKRD YRWRLETPDR GGSAGFRRLG LGALLGLNDW RVEAFFLALH AQYLLRTYWK SQIGISFPRL RPAAGAFQPA HPVSDIDFVQ LLTALRLFLP DASLVLSTRE PASLRDHLVP LGITTMSAGS HTEPGGYSRE SEAEAQFEIA DRRSPEEVAN MLREKGYEPV WKDWDSIFLT PGRETAAA
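The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), and it assumes the sequence begins in frame at its first base. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
fragment = clean("ATGAGCTTCT ACAATGAAAT CAAACAGTAC")
print(translate(fragment))  # MSFYNEIKQY
```

The printed residues match the first block of the listed protein sequence. Passing the full gene sequence through `clean` and then `gc_content` gives a value that can be compared against the 64% in the GC content field.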