Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sden_1775 |
Symbol | thiH |
ID | 4018254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella denitrificans OS217 |
Kingdom | Bacteria |
Replicon accession | NC_007954 |
Strand | + |
Start bp | 2098071 |
End bp | 2099180 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637955788 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_562782 |
Protein GI | 91793131 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTCA CTGATGTTTT CTCAGCAATT TCACAAGATG ATTTGTTGTT GCAGCTTTAC AGTAAAAATA CTGCCGATGT AGAACGGGCG TTAATCGCCC CACAAGGCAA GCTTGAAAGC CTAATGACAC TATTGTCACC TGCAGCAGAG CCATTTATCG AAACGATGGC TACAGAGTCG GTGAGGTTAA CTCGGCAGCG TTTTGGCAAT AACTTAGGCC TCTACTTGCC ACTGTATTTG TCAAATTTGT GCGCCAATGA GTGTGATTAC TGCGGCTTTA CCATGAGCAA TAAAATAAAG CGCAAAACAT TAAACGAGTC TGAGTTACTG GCAGAAATAA GCATTATCAA GGCCAGAGGA TTCGATTCTA TCTTGCTGGT TTCCGGTGAA CATGAAAGCA AAGTGGGTAT GGGTTATTTT TCTTGGGCGA TCCCTATAGT TAAGGCCCAT TTTAGCTATG TAGCCATAGA AGTGCAGCCA TTAAGTGAAG CTGATTATAG TCATCTGAAA GAGTTGGGTG TCGATGCCGT GATGGTCTAT CAGGAAACGT ATCGGCCAGT AACTTACGCT AAGCATCATA CCCGAGGTCA GAAGAAGGAT TTTCATCATA GGCTAACAAC CCCTGACAGA GCCGCAAAAG CGGGTATCGA TAAAGTGGGC CTAGGCGTAT TATTGGGTTT AGATGATTGG CGATTAGATG CCTTGTTAAT GGGGCATCAT ATTGATTATT TGGAAAAAAA TTATTGGCGA AGCCGTTACA GTATTTCTTT ACCTAGGCTG CGCCCTTGCA CTGGAGGCGT TAATCCTAAA GTACCGCTGA CTGATCTTGG TTTAGTGCAA TTGATTTGTG CCTTTAGATT GTTTAATCCT CAGCTGGAAA TTAGCTTGTC TACCCGTGAA ACCCCCAGCT TACGGGACAG TTTATTGCCC CTTGGGGTGA CACATTTAAG TGCGGGGAGC TCGACACAAC CAGGGGGGTA TCAAGCACCT CAAACTCAGC TGGATCAATT TGAAATTAGC GATGGCCGTC CCGTGGATGC CGTTGTTGCA CAAATACAAC AACAGGGATT GAACCCAGTG TGGAAAGACT GGGAGGCGGG TTGGCATTAA
|
Protein sequence | MSFTDVFSAI SQDDLLLQLY SKNTADVERA LIAPQGKLES LMTLLSPAAE PFIETMATES VRLTRQRFGN NLGLYLPLYL SNLCANECDY CGFTMSNKIK RKTLNESELL AEISIIKARG FDSILLVSGE HESKVGMGYF SWAIPIVKAH FSYVAIEVQP LSEADYSHLK ELGVDAVMVY QETYRPVTYA KHHTRGQKKD FHHRLTTPDR AAKAGIDKVG LGVLLGLDDW RLDALLMGHH IDYLEKNYWR SRYSISLPRL RPCTGGVNPK VPLTDLGLVQ LICAFRLFNP QLEISLSTRE TPSLRDSLLP LGVTHLSAGS STQPGGYQAP QTQLDQFEIS DGRPVDAVVA QIQQQGLNPV WKDWEAGWH
|
| |