Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SO_3923 |
Symbol | thiH |
ID | 1171562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella oneidensis MR-1 |
Kingdom | Bacteria |
Replicon accession | NC_004347 |
Strand | + |
Start bp | 4070138 |
End bp | 4071577 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637345683 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | NP_719454 |
Protein GI | 24375411 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACAC ACGAGCATCA CTCCATTACA CTTTCGGACT ACAATCCCAA CGTCAACTTT ATCGACGATA AAGCGATTTG GCAGACCATT GAAGACGCCA GTGATCCAAG TCGCGAGCAA GTTCTCGCCA TTCTCGACAA GGCGCGCCAG TGTGAAGGCT TAAGCATTAG CGAGACCGCC CTTTTGCTGC AAAACCAAGA TAAGACCTTG GATGAAATGC TTTTTAGCGT CGCCCGTGAG ATTAAAAACA CTATTTACGG CAACCGTATT GTGATGTTTG CACCGCTGTA TGTATCGAAT CATTGCGCCA ACAGTTGTAG TTATTGCGGC TTTAACGCCG ATAACCATGA GCTCAAACGT AAAACCTTAA AACAGGATGA GATCCGCCAA GAGGTTGCGA TCCTTGAAGA AATGGGCCAC AAGCGGATCC TTGCAGTCTA TGGCGAACAT CCTCGCAACA ATGTGCAAGC CATTGTTGAA AGTATTCAAA CCATGTACAG CGTTAAGCAG GGCAAGGGCG GAGAAATACG CCGTATCAAC GTCAACTGCG CGCCAATGAG TGTGGAGGAC TTTAAGCAAC TTAAAACCGC GGCGATAGGC ACTTATCAAT GCTTCCAAGA AACCTATCAT CAAGACACCT ACAGCCAAGT CCATCTTAAA GGTAAAAAAA CCGACTTTTT ATACCGCCTC TACGCCATGC ACAGGGCGAT GGAAGCAGGA ATTGACGATG TCGGCATTGG CGCCCTCTTT GGCCTGTATG ATCATAGATT CGAGCTCCTT GCCATGCTCA CCCATGTTCA GCAACTCGAA AAAGACTGTG GCGTTGGCCC ACACACTATC TCCTTTCCGC GGATTGAACC CGCCCATGGC TCTGCTATCA GTGAAAAGCC GCCCTATGAG GTCGATGATG ACTGCTTTAA GCGCATTGTT GCCATCACTC GCCTTGCCGT GCCTTATACA GGGTTAATTA TGAGCACGCG GGAAAGTGCA GCGCTGCGCA AAGAACTATT AGAACTCGGG GTTTCACAAA TCAGCGCAGG CTCGCGTACC GCGCCGGGTG GATATCAAGA CAGCAAACAA AATCAACATG ATGCCGAGCA ATTCAGCCTT GGTGACCACC GAGAAATGGA CGAAATCATC TATGAATTAG TCACCGACTC GGATGCCATC CCCTCCTTCT GCACTGGCTG TTACCGCAAA GGGCGAACTG GCGATCATTT TATGGGATTA GCCAAACAGC AGTTTATTGG TAAATTCTGC CAGCCCAATG CATTGATCAC CTTTAAGGAA TATTTGAACG ATTACGCCAG TGAAAAGACC CGCGAGGCTG GCAATGCGCT GATAGAGCGA GAGCTGGCTA AAATGAGCCC GTCACGGGCA CGCAATGTGC GCGGCTGTTT GCAAAAAACC GATGCGGGTG AACGGGATAT CTATCTGTAA
|
Protein sequence | MSTHEHHSIT LSDYNPNVNF IDDKAIWQTI EDASDPSREQ VLAILDKARQ CEGLSISETA LLLQNQDKTL DEMLFSVARE IKNTIYGNRI VMFAPLYVSN HCANSCSYCG FNADNHELKR KTLKQDEIRQ EVAILEEMGH KRILAVYGEH PRNNVQAIVE SIQTMYSVKQ GKGGEIRRIN VNCAPMSVED FKQLKTAAIG TYQCFQETYH QDTYSQVHLK GKKTDFLYRL YAMHRAMEAG IDDVGIGALF GLYDHRFELL AMLTHVQQLE KDCGVGPHTI SFPRIEPAHG SAISEKPPYE VDDDCFKRIV AITRLAVPYT GLIMSTRESA ALRKELLELG VSQISAGSRT APGGYQDSKQ NQHDAEQFSL GDHREMDEII YELVTDSDAI PSFCTGCYRK GRTGDHFMGL AKQQFIGKFC QPNALITFKE YLNDYASEKT REAGNALIER ELAKMSPSRA RNVRGCLQKT DAGERDIYL
|
| |