Gene Information |
Locus tag | Sbal195_2434 |
Symbol | thiH |
ID | 5754193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | - |
Start bp | 2878836 |
End bp | 2879951 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641288728 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001554862 |
Protein GI | 160875546 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
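
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2878836, 2879951)
print(length)                  # 1116, matching the Gene Length field
print(protein_length(length))  # 371, matching the Protein Length field
```

The strand ("-") does not affect either calculation; it only determines which strand of the replicon is read.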
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.360626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTTG TAGCGGAATT TGCCAACATC CCACGGGATA AACTGTTACT CGATTTGTAT TCTTGCACTG CCCAAGATGT TGAGCGGGCG CTGGTGAGTC CAGCAGGGGA TTTACGTAGC TTATTGGCCT TGCTATCACC TGCGGCGGAA CCTTATATCG AAACCATGGC GCAGCACTCT GCGGCGCTAA CGCGGCAGCG ATTTGGCGCG AATCTGGGCA TGTATTTACC GCTATACGTC TCGAATCTGT GCGCTAATGA GTGTGATTAC TGTGGCTTTA GCATGAGTAA CAAACTCAAA CGCAAGACCT TGAATGAGCA GGAGTTGATG GCCGAAATGG CCATTATCAA AGATAGGGGC TTCGATTCTA TTTTACTGGT GTCAGGTGAG CATGAAACTA AGGTCGGCAT CGATTATTTC AAGCAAATGT TGCCGCTGGT AAAGCAGCAA TTTAGCCATT TGGCGATGGA GGTCCAGCCC ATGAGTGAGG ACCATTATTG CCAGTTAGTC GCGCTAGGTT TAGATGCTGT GATGATTTAT CAGGAGACCT ATCAGCCCGA AACTTATGCT CGTCACCATT CTCGAGGGAA AAAAATGGAT TTTGCCTATC GTTTAGCAAC ACCGGACAGA GTTGCGGCGG CGGGCGTCGA TAAAATTGGT CTTGGGGTAT TACTCGGGCT GGATGATTGG CGTTTAGATG CGTTAATGAT GGGATATCAT CTCGACTATT TAGAGCGACG CTATTGGCGC ACCCGCTTTA GTATTTCGCT ACCAAGGTTA CGACCTTGTA CTGGCGGGAT CACCCCAAAA GTCATGCTAT CGGATCTAGG TTTAGTGCAA ATGATTTGTG CATTTAGACT TTTTAATCAA CAGCTTAACA TCAGCATGTC GACAAGGGAA AGCCCTGAGC TGAGGGATAA TCTTTTGCCA CTTGGGATCA CTCAAATCAG TGCGGGCAGT TCGACACAAC CGGGCGGCTA TCAAGCGCCT GACAGTCAAC TCGATCAATT TGAGATAAGC GATGATCGCA GTGTCGAGCA AGTTATCGAA CAGATGCAAC GGCAGGGTTT TAATCCCGTA TTTAAGGATT GGGAAGCTAA TTGGATCACA GGATAA
|
Protein sequence | MSFVAEFANI PRDKLLLDLY SCTAQDVERA LVSPAGDLRS LLALLSPAAE PYIETMAQHS AALTRQRFGA NLGMYLPLYV SNLCANECDY CGFSMSNKLK RKTLNEQELM AEMAIIKDRG FDSILLVSGE HETKVGIDYF KQMLPLVKQQ FSHLAMEVQP MSEDHYCQLV ALGLDAVMIY QETYQPETYA RHHSRGKKMD FAYRLATPDR VAAAGVDKIG LGVLLGLDDW RLDALMMGYH LDYLERRYWR TRFSISLPRL RPCTGGITPK VMLSDLGLVQ MICAFRLFNQ QLNISMSTRE SPELRDNLLP LGITQISAGS STQPGGYQAP DSQLDQFEIS DDRSVEQVIE QMQRQGFNPV FKDWEANWIT G
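
The gene sequence above is shown in coding orientation (it starts with ATG and ends with a stop codon), so it can be translated directly and its GC fraction compared with the GC content field. The following is a rough sketch, not the database's own method: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is built from the standard amino acid assignment string that translation table 11 shares with the standard code.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between the 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon.

    Note: alternative start codons allowed by table 11 are not remapped to M
    here; this record starts with ATG, so that case does not arise.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first three 10-base groups of the gene sequence give the first ten residues.
print(translate("ATGAGTTTTG TAGCGGAATT TGCCAACATC"))  # MSFVAEFANI
```

Passing the full gene sequence to `translate` and removing the spaces from the protein sequence field allows a direct string comparison of the two.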