Gene Information |
Locus tag | Swoo_2587 |
Symbol | thiH |
ID | 6116884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella woodyi ATCC 51908 |
Kingdom | Bacteria |
Replicon accession | NC_010506 |
Strand | - |
Start bp | 3164124 |
End bp | 3165233 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641634117 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001760959 |
Protein GI | 170726933 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
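The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a single stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3164124
end_bp = 3165233

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon (3 nt) that is not translated.
stop_codon_nt = 3
protein_length = (gene_length - stop_codon_nt) // 3

print("Gene length (bp):", gene_length)
print("Protein length (aa):", protein_length)
```

Under those assumptions the computed values agree with the Gene Length (1110 bp) and Protein Length (369 aa) fields.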
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.914294 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0149307 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTTC TTAGTTATTT ACAAGGGCTA TCACGTGAAA AGTTATCTCT AAAACTTTAC TCATGCACGG GGAAAGATGT TGAGCAGGCG TTGCAGCATC CGGAAGGTTC CTTAGAGAGC CTACTGGCGC TTTTATCTCC AGCTGCAGAG CCTTATCTGG AACAGATGGC GCAAACCTCC GCACGTCTCA CTCGCCAACG TTTTGGGGCG AATATTGGTA TGTATTTGCC GCTATACCTC TCTAACCTGT GTGCGAATGA GTGTGATTAT TGTGGCTTTA GCATGAGCAA TCGCATTAAG CGTAAGACCT TGAGTCTAGA TGAGTTAAAT GCTGAGATGC GAGTGGTAAA AGCGATGGAT TATGACTCAA TATTGCTGGT TTCAGGTGAA CATGAAACTA AAGTTGGCAT TGATTACTTT AGCTCAGTTC TGCCAAGCGT TAAAAGTCAG TTCAGTTATG TGGCGATGGA GGTTCAACCA CTCAAAGAGG CTGAATATTC CCAGTTGGCA GGACTTGGGC TCGATGCCGT GATGATCTAT CAGGAAACTT ACAACCCGCA AACCTATGCC GAGCACCACA CACGAGGAAA TAAACAAAAT TTTGAGTATC GTCTCGAGAC TCCTGAAAGG GTGGCTAGAG CAGGTGTCGA TAAAATTGGT TTAGGTGTAC TGCTTGGTTT AGATGACTGG CGTTTGGATG CACTGTTATT GGGCCACCAT TTAACTTATC TTGAGTCTCA TTTTTGGCGA AGCCGCTACA GTGTATCGTT ACCGAGATTA CGCCCCTGCA CAGGGGGGAT ATCACCTAAA GTAGAGCTAA CTGATAGAGG GTTGGTACAA CTAATTTGCG CATTTCGACT CTTTAATCAT CAGCTTGAGA TCAGTCTATC GACTAGAGAG TCGGCTGAGT TACGTAATAA TTTGTTCGGT TTAGGGGTGA CTCAGTTAAG CGCAGGCAGT TCAACACAGC CCGGTGGCTA CTTATTGCCT GATACTCAGC TTGATCAGTT TGAGATAAGT GATGAGCGTA CACCTGTTGA AGTTTGCGTG GCAATGAGAG ATAGGGGGTT TAATCCTGTT TGGAAAGACT GGGAATCAGC TTGGGTATAG
|
Protein sequence | MSFLSYLQGL SREKLSLKLY SCTGKDVEQA LQHPEGSLES LLALLSPAAE PYLEQMAQTS ARLTRQRFGA NIGMYLPLYL SNLCANECDY CGFSMSNRIK RKTLSLDELN AEMRVVKAMD YDSILLVSGE HETKVGIDYF SSVLPSVKSQ FSYVAMEVQP LKEAEYSQLA GLGLDAVMIY QETYNPQTYA EHHTRGNKQN FEYRLETPER VARAGVDKIG LGVLLGLDDW RLDALLLGHH LTYLESHFWR SRYSVSLPRL RPCTGGISPK VELTDRGLVQ LICAFRLFNH QLEISLSTRE SAELRNNLFG LGVTQLSAGS STQPGGYLLP DTQLDQFEIS DERTPVEVCV AMRDRGFNPV WKDWESAWV
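The sequences above are displayed in space-separated blocks of 10 characters. The following sketch shows one way to clean such a field and run basic checks (GC percentage, ATG start, stop codon at the end). The helper names are hypothetical, the ATG-only start check is a simplification, and the demo uses only the first and last 10-nt blocks of the Gene sequence field rather than the full sequence.

```python
# Stop codons assumed here: TAA, TAG, TGA (as in translation table 11).
STOP_CODONS = {"TAA", "TAG", "TGA"}

def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group residues in blocks of 10."""
    return "".join(raw.split()).upper()

def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

def starts_with_atg(seq: str) -> bool:
    """True if the sequence begins with ATG (simplified start check)."""
    return seq.startswith("ATG")

def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS

# First and last 10-nt blocks of the Gene sequence field, for illustration only.
first_block = clean_sequence("ATGAGCTTTC")
last_block = clean_sequence("TTGGGTATAG")

print(starts_with_atg(first_block), ends_with_stop(last_block))
print(gc_percent(first_block))
```

To check the whole gene, paste the full Gene sequence field into `clean_sequence` and compare the result of `gc_percent` with the GC content field of the record.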