Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sputcn32_1927 |
Symbol | thiH |
ID | 5078547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella putrefaciens CN-32 |
Kingdom | Bacteria |
Replicon accession | NC_009438 |
Strand | + |
Start bp | 2208026 |
End bp | 2209141 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640499088 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001183449 |
Protein GI | 146293025 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTTG TAGCGGAATT TGCCAACATC CCACGGGATA AACTGTTACT CGATTTGTAT TCTTGCACAG CCCAAGATGT CGAGCGGGCG TTAGTGAATC CAGCAGGGGA TTTACGCAGC TTACTGGCCT TGTTATCACC TGCGGCTGAA CCTTATATTG AAACCATGGC GCAGCAATCT GCGGCGCTTA CGCGGCAGCG ATTTGGCGCG AATCTTGGCA TGTATTTACC GCTATATGTC TCGAATCTGT GCGCTAATGA ATGTGATTAC TGTGGTTTTA GCATGAGTAA TAAACTCAAA CGCAAGACCT TGAATGAACA GGAGTTGATG GCCGAAATGG CCATTATCAA AGACAGGGGC TTCGATTCTA TTTTACTGGT GTCGGGTGAG CATGAAACTA AGGTCGGTAT CGATTATTTC AAGCAAATGT TGCCGCTGGT GAAGCAGCAA TTTAGCCATG TGGCGATGGA GGTCCAACCC ATGAGTGAGG ACCATTATCG CCAGTTAGTC GCGCTAGGTT TAGATGCTGT GATGATTTAT CAGGAGACCT ATCAGCCTGA GACTTATGCT CGTCACCATT CCCGAGGTAA AAAAATGGAT TTTGCCTATC GCTTAGCAAC ACCGGACAGA GTTGCGGCGG CGGGGGTCGA TAAAATTGGT CTTGGGGTAT TACTGGGGCT GGATGATTGG CGTTTAGATG CGTTAATGAT GGGATATCAT ATTGACTATT TAGAGCGTCG CTATTGGCGC ACTCGCTTTA GTATTTCGCT GCCAAGGTTA AGGCCTTGTA CTGGCGGGAT CACTCCAAAA GTTATTCTAT CGGATTTAGG TTTAGTACAA ATGATTTGTG CATTTAGACT TTTTAATCAT CAGCTTGATA TCAGCATGTC GACAAGGGAA AGCCCTGAGC TGAGGGATAA TCTTTTGCCA CTTGGAATTA CTCACATCAG TGCGGGCAGC TCGACACAAC CGGGCGGCTA TCAAGCACCT GACAGTCAAC TCGATCAATT TGAGATAAGC GATGATCGCA GCGTCGAGCA AGTCATCGAA CAGATGCAAC GGCAGGGTTT TAATCCAATA TTCAAGGATT GGGAATCTGC ATGGATAAAC AGCTAG
|
Protein sequence | MSFVAEFANI PRDKLLLDLY SCTAQDVERA LVNPAGDLRS LLALLSPAAE PYIETMAQQS AALTRQRFGA NLGMYLPLYV SNLCANECDY CGFSMSNKLK RKTLNEQELM AEMAIIKDRG FDSILLVSGE HETKVGIDYF KQMLPLVKQQ FSHVAMEVQP MSEDHYRQLV ALGLDAVMIY QETYQPETYA RHHSRGKKMD FAYRLATPDR VAAAGVDKIG LGVLLGLDDW RLDALMMGYH IDYLERRYWR TRFSISLPRL RPCTGGITPK VILSDLGLVQ MICAFRLFNH QLDISMSTRE SPELRDNLLP LGITHISAGS STQPGGYQAP DSQLDQFEIS DDRSVEQVIE QMQRQGFNPI FKDWESAWIN S
|
| |