Gene Information |
Locus tag | SO_2440 |
Symbol | thiH |
ID | 1170155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella oneidensis MR-1 |
Kingdom | Bacteria |
Replicon accession | NC_004347 |
Strand | - |
Start bp | 2553968 |
End bp | 2555095 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637344273 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | NP_718030 |
Protein GI | 24373987 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
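The coordinate and length fields above agree with one another, which a little arithmetic can confirm. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2553968
end_bp = 2555095

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp)     # 1128
print(protein_length_aa)  # 375
```

Both results match the Gene Length and Protein Length fields.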
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTTG TCGACCATTT TGCCCGCATC GAACGGGATA AGTTATTGCT GGCGTTGTAT TCATGCACTG CTGCTGATGT TGAACGTGCC CTAGTTCAAC CAGAAGGCAA CTTACAGAGT CTGCTCGCTT TGCTGTCTCC TGCAGCAGAG CCCTATATCG AAGAGATGGC GCAGCGCTCG GCGGCGCTCA CTCGGCAACG CTTTGGGGCC AATATCGGAC TGTACTTACC ACTGTACTTG TCCAATCTTT GCGCTAACGA ATGCGATTAC TGTGGCTTTA GCATGAGTAA TAAACTGAAG CGTAAAGTAC TGAATGAACA GGAACTCGCC GCCGAAATAG CGATTATTAA ATCCCGCGGC TTTGATTCTA TCTTGCTGGT GTCGGGTGAG CATGAAACTA AGGTCGGGAT GGATTACTTT CAGCGGGTTT TACCGTTGGT AAAACAGCAG TTTAGTTATT TAGCCATGGA GGTGCAGCCG CTTGAGGAAA GCCATTATCG TAAGCTCGTC GAGCAGGGAC TCGATGCGGT GATGGTGTAT CAAGAAACCT ATCAAGCCGA GACTTATGCT AAACATCACA CCCGAGGTAA AAAACAGGAC TTCGCCTATC GGCTTGCCAC GCCCGATCGC GTTGCCCGTG CGGGTGTCGA TAAGATAGGC CTAGGTGTGT TACTGGGCTT AGATGACTGG CGGTTAGACG CGTTGATGAT GGGCTATCAT CTTGATTATT TAGAGCGGCA CTATTGGCGC ACCCGTTTTA GCATTTCGTT ACCACGATTG CGACCTTGTA CAGGAGGCAT TGCACCAAAA GTACATTTAA CCGATCTGGG ACTCGTGCAA ATGATCTGCA CCTTTAGACT TTTTAATCAA CAACTTGATA TCAGTTTATC GACACGCGAG GCGCCATCAC TTCGGGATAA TTTACTGCCA CTTGGGATAA CGCAAATGAG TGCGGGTAGT TCAACCCAGC CTGGCGGTTA CCAAGTTCCC GACAGTCAGC TCGATCAGTT TGAGATCAGT GATGATCGAA CTGTTGAGCA GGTCATTACT CAAATGCGAC TTAAGGGTTT TAATCCAGTC TTTAAAGATT GGGAATCCGC TTGGATTGTA CCTAAAATGC GAGTTTAA
|
Protein sequence | MSFVDHFARI ERDKLLLALY SCTAADVERA LVQPEGNLQS LLALLSPAAE PYIEEMAQRS AALTRQRFGA NIGLYLPLYL SNLCANECDY CGFSMSNKLK RKVLNEQELA AEIAIIKSRG FDSILLVSGE HETKVGMDYF QRVLPLVKQQ FSYLAMEVQP LEESHYRKLV EQGLDAVMVY QETYQAETYA KHHTRGKKQD FAYRLATPDR VARAGVDKIG LGVLLGLDDW RLDALMMGYH LDYLERHYWR TRFSISLPRL RPCTGGIAPK VHLTDLGLVQ MICTFRLFNQ QLDISLSTRE APSLRDNLLP LGITQMSAGS STQPGGYQVP DSQLDQFEIS DDRTVEQVIT QMRLKGFNPV FKDWESAWIV PKMRV
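The sequences above are displayed in space-separated blocks of ten letters, so the spacing has to be removed before any analysis. The sketch below shows one way to do that, to compute a GC fraction, and to translate a coding sequence. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first two blocks of the gene sequence are used as input.

```python
# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence shown above.
head = "ATGAGTTTTG TCGACCATTT"
print(translate(head))  # MSFVDH
```

The printed residues match the start of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein sequence fields.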