Gene Information |
Locus tag | Shewmr4_1886 |
Symbol | thiH |
ID | 4252460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 2245438 |
End bp | 2246553 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638118497 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_734017 |
Protein GI | 113970224 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
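The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(2245438, 2246553)
print(length, protein_length(length))  # prints "1116 371"
```

Under these assumptions the result agrees with the Gene Length (1116 bp) and Protein Length (371 aa) rows.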
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTTG TCGACCAATT TGCCCGCATT GAACGGGATA AGTTATTGCT GGCGTTATAT TCCTGCACGG CAGCGGAGGT TGAGCGCGCC CTGATGCAAC CCGAGGGTAA TCTAGAGAGT TTACTCGCCT TGTTGTCTCC AGCAGCAGAG CCCTATATCG AAGAGATGGC GCAGCGCTCG GCGGCGCTCA CTCGGCAACG CTTTGGGGCC AATATTGGAC TCTATTTGCC GTTATATCTG TCAAATCTGT GTGCCAACGA GTGCGACTAT TGCGGCTTTA GCATGAGCAA TAAGTTAAAG CGTAAAGTGC TCAATGAGCA GGAAATTGCG GCCGAAATGG CGATTATTAA ATCCCGTGGT TTTGACTCCA TCTTACTGGT GTCGGGTGAG CATGAAACCA AAGTGGGGAT AGATTACTTT AAGCGCGTGT TACCCATTGT AAAACAGCAG TTTAGTTATT TGGCTATGGA GGTTCAGCCG CTTGAAGAGA TTGATTATCG CCAGCTTGTC GAGCTAGGGC TTGATGCTGT GATGGTGTAT CAGGAAACCT ATCAAGCGGC GACCTATGCT AAGCATCACA CCCGAGGCAA TAAGCAGGAC TTTGCGTATC GGCTCGCAAC GCCCGACCGC GTTGCCAGCG CAGGTGTCGA TAAGATTGGC CTAGGCGTGT TATTGGGTTT GGATGACTGG CGACTCGATG CCTTACTGAT GGGGCATCAT TTGGACTATT TAGAACGGCA TTATTGGCGG ACTCGCTTTA GTATTTCGTT ACCTCGTTTG CGGCCTTGTA CCGGCGGCAT AACACCAAAA GTGCATTTAA CCGATCTTGG ACTGGTACAA TTGATCTGTG CCTTCAGGCT TTTTAATCAG CAACTTGATA TCAGTTTATC GACACGCGAG GCGCCATCAC TTCGGGATAA TTTGCTGCCA CTTGGGATAA CACAAATGAG TGCGGGGAGT TCAACGCAAC CTGGTGGTTA TCAGGCGCCA GAGAGCCAAT TAGATCAGTT TGAGATAAGC GATGAACGTA CCGTTGAGCA AGTCATGACT CAGATGCGCC TTCGGGGATT TAATCCGGTT TTTAAGGATT GGGAATCGGC TTGGATTGCG GGTTAG
|
Protein sequence | MSFVDQFARI ERDKLLLALY SCTAAEVERA LMQPEGNLES LLALLSPAAE PYIEEMAQRS AALTRQRFGA NIGLYLPLYL SNLCANECDY CGFSMSNKLK RKVLNEQEIA AEMAIIKSRG FDSILLVSGE HETKVGIDYF KRVLPIVKQQ FSYLAMEVQP LEEIDYRQLV ELGLDAVMVY QETYQAATYA KHHTRGNKQD FAYRLATPDR VASAGVDKIG LGVLLGLDDW RLDALLMGHH LDYLERHYWR TRFSISLPRL RPCTGGITPK VHLTDLGLVQ LICAFRLFNQ QLDISLSTRE APSLRDNLLP LGITQMSAGS STQPGGYQAP ESQLDQFEIS DERTVEQVMT QMRLRGFNPV FKDWESAWIA G
|
| |
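The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares for non-initiator codons), and the special handling of alternative start codons in table 11 is ignored because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGTTTTG TCGACCAATT TGCCCGCATT"))  # prints "MSFVDQFARI"
```

Passing the full gene sequence to `translate` should reproduce the protein sequence row, and `gc_percent` on the same input can be compared against the GC content field, which the record reports rounded to a whole percent.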