Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal_2027 |
Symbol | thiH |
ID | 4844265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS155 |
Kingdom | Bacteria |
Replicon accession | NC_009052 |
Strand | + |
Start bp | 2346834 |
End bp | 2347949 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640119246 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001050398 |
Protein GI | 126174249 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTTG TCGCGGAATT TGCCAAGATC CCACGGGATA AACTGTTACT CGATTTGTAT TCTTGCACAG CCCAAGATGT CGAGCGGGCG TTAGTGAGTC CTGCAGGGGA TTTACGTAGC TTACTGGCCT TATTATCACC TGCGGCGGAA CCTTATATCG AAACCATGGC GCAGCACTCT GCTGCGCTTA CGCGGCAGCG ATTTGGCGCG AATCTGGGCA TGTATTTACC GCTATACGTC TCGAATCTGT GCGCTAATGA GTGTGATTAC TGTGGCTTTA GCATGAGTAA CAAACTCAAA CGCAAGACCT TGAATGAGCT GGAGTTGATG GCCGAAATGG CCATTATCAA AGATAGGGGC TTCGATTCTA TTTTACTGGT GTCGGGTGAG CATGAAACTA AGGTTGGTAT CGATTATTTC AAGCAAATGT TGCCGCTGGT AAAGCAGCAA TTTAGCCATT TGGCGATGGA GGTCCAGCCC ATGAGTGAGG ACCATTATTG CCAGTTAGTC GCGCTAGGTT TAGATGCCGT GATGATTTAT CAAGAGACCT ATCAGCCCGA GACTTATGCT CGTCACCATT CCCGAGGTAA AAAAATGGAT TTTGCTTATC GCTTAGCAAC ACCTGACAGA GTTGCGGCGG CGGGCGTTGA TAAAATTGGT CTTGGGGTAT TACTGGGGCT GGATGATTGG CGTTTAGATG CGTTAATGAT GGGATATCAT CTCGACTATT TAGAGCGACG CTATTGGCGT ACCCGCTTTA GTATTTCGCT ACCAAGGTTA AGGCCTTGTA CTGGCGGGAT CACCCCAAAA GTCATGCTAT CGGATCTAGG TTTAGTGCAA ATGATTTGTG CATTTAGACT TTTTAATCAA CAGCTTGACA TCAGCATGTC GACAAGGGAA AGCCCTGAGC TGAGGGATAA TCTTTTACCA CTTGGGATCA CTCAAATCAG TGCGGGCAGT TCGACACAAC CGGGCGGCTA TCAAGCGCCT GACAGTCAAC TCGATCAATT TGAGATAAGC GATGATCGCA GTGTCGAGCA AGTTATTGAA CAGATGCAAC GGCAGGGTTT TAATCCCGTA TTTAAGGATT GGGAAGCCAA TTGGATCACA GGATAA
|
Protein sequence | MSFVAEFAKI PRDKLLLDLY SCTAQDVERA LVSPAGDLRS LLALLSPAAE PYIETMAQHS AALTRQRFGA NLGMYLPLYV SNLCANECDY CGFSMSNKLK RKTLNELELM AEMAIIKDRG FDSILLVSGE HETKVGIDYF KQMLPLVKQQ FSHLAMEVQP MSEDHYCQLV ALGLDAVMIY QETYQPETYA RHHSRGKKMD FAYRLATPDR VAAAGVDKIG LGVLLGLDDW RLDALMMGYH LDYLERRYWR TRFSISLPRL RPCTGGITPK VMLSDLGLVQ MICAFRLFNQ QLDISMSTRE SPELRDNLLP LGITQISAGS STQPGGYQAP DSQLDQFEIS DDRSVEQVIE QMQRQGFNPV FKDWEANWIT G
|
| |