Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssed_2018 |
Symbol | thiH |
ID | 5613840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sediminis HAW-EB3 |
Kingdom | Bacteria |
Replicon accession | NC_009831 |
Strand | + |
Start bp | 2442223 |
End bp | 2443347 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640932904 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001473755 |
Protein GI | 157375155 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.705115 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTCT TCTCTTATCT GCAAAACCTT GATCGGGAGA AGCTGAAGCT CAGGCTCTAT TCGACAACCA GAGATGACGT TGAAAGGGCA CTCATATCAC CTGTAGGTTC ACTCGACAGC CTATTGGCCT TACTTTCCCC TGCGGCAGAA GATTTTCTAG AAGAGATGGC GCAGCGATCA AAGGCCCTGA CAAGGCAAAG GTTTGGCGCT AATGTCGGTA TGTATCTGCC TCTATACCTG TCAAATTTAT GTGCTAATGA GTGTGATTAT TGCGGTTTTA GCATGAGTAA CAAACTGAAA AGAGAGACCC TGAGTCTTAA AGGTATCGAT GCCGAGATGG CCGTGATTAA AAAATCTGGC TACGACTCGA TACTGCTGGT TTCGGGAGAA CACGAAACAA AAGTCGGAAT AGATTATTTT AAAACTGTCC TACCGAGAGT AAAACAAAAT TTCAGTTACC TTGCGATGGA AGTTCAACCG CTTAAGGAGA GCGAATACTC TGATTTGGTC GGCTTAGGGT TAGATGCGGT GATGATCTAT CAAGAGACTT ATAACCCGAA TACCTACGCA AAACATCACA CCAGAGGTAA AAAGCGGGAT TTTGGCTATC GATTGGGTAC CCCGGAAAGA GTGGCCAGAG CCGGCGTCGA TAAAATAGGT ATCGGCGTGC TACTGGGTTT AGATGATTGG CGCTTAGATG CACTGCTATT GGGCCATCAC CTCTCTTATC TCGAGTCCAG GTTTTGGCGT TCTCGTTACA GTGTCTCATT GCCCAGGTTA AGGCCCTGCA CGGGAGGTAT CACTCCCAAA GTAGAGTTGA CAGACAAAGG CCTGGTTCAG TTGATTTGTG CGTTCAGACT GTTTAATCAA CAGCTTGAGA TCAGTCTTTC TACACGTGAG TCTGCACAAT TACGCGATAA TTTGTTTGAA CTTGGGATCA CCAATATCAG TGCAGGAAGT TCGACTCAAC CCGGTGGTTA TGTTAAGCCG GATACTCAGT TAAACCAGTT CGATATTAGC GATGAACGAT CGGCGCAAGA GGTGGGTGCT GCGATCAAGG CCAAAGGCTT GAACCCGGTT TGGAAGGACT GGGAGTCGGC GTGGGTCCCA ACTCATTCTG CGTGA
|
Protein sequence | MSFFSYLQNL DREKLKLRLY STTRDDVERA LISPVGSLDS LLALLSPAAE DFLEEMAQRS KALTRQRFGA NVGMYLPLYL SNLCANECDY CGFSMSNKLK RETLSLKGID AEMAVIKKSG YDSILLVSGE HETKVGIDYF KTVLPRVKQN FSYLAMEVQP LKESEYSDLV GLGLDAVMIY QETYNPNTYA KHHTRGKKRD FGYRLGTPER VARAGVDKIG IGVLLGLDDW RLDALLLGHH LSYLESRFWR SRYSVSLPRL RPCTGGITPK VELTDKGLVQ LICAFRLFNQ QLEISLSTRE SAQLRDNLFE LGITNISAGS STQPGGYVKP DTQLNQFDIS DERSAQEVGA AIKAKGLNPV WKDWESAWVP THSA
|
| |