Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr7_2092 |
Symbol | thiH |
ID | 4258359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-7 |
Kingdom | Bacteria |
Replicon accession | NC_008322 |
Strand | - |
Start bp | 2470376 |
End bp | 2471491 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 638122760 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_738137 |
Protein GI | 114047587 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.62908 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTTG TTGACCAATT TGCCCGTATT GAACGGGATA AGTTATTGCT GGCGCTATAT TCCTGCACGG CAGTGGAGGT TGAGCGGGCC CTGATGCAAC CCGAGGGTAA TCTAGAAAGT TTACTCGCCT TGTTGTCTCC AGCCGCAGAG CCCTATATCG AAGAGATGGC GCAGCGCTCG GCAGCGCTTA CTCGGCAACG CTTTGGGGCT AATATCGGAC TCTATTTGCC GTTATATCTG TCAAATCTGT GTGCCAACGA GTGCGACTAT TGCGGCTTTA GCATGAGCAA TAAGCTAAAG CGTAAAGTGC TCAATGAGCA GGAAATTGCG GCTGAAATGG CGATAATCAA ATCACGTGGG TTTGACTCCA TCTTACTGGT GTCGGGCGAG CATGAAACTA AAGTGGGGAT GGATTACTTT AAGCGCGTGT TACCCATTGT AAAACAGCAG TTTAGTTATT TAGCAATGGA GGTACAGCCG CTTGATGAGA TTGATTATCG CCAGCTTGTC GAGCTAGGGC TTGATGCTGT GATGGTGTAT CAAGAAACCT ATCAAGCGGC GACCTATGCT AAGCATCACA CTCGAGGCAA TAAGCAGGAC TTTGCGTATC GGCTGGCAAC GCCTGACCGC GTTGCCAGCG CAGGTGTCGA TAAGATTGGC CTAGGCGTGT TATTGGGTTT GGATGACTGG CGACTCGATG CTTTACTGAT GGGTCATCAT TTGGACTATT TAGAACGGCA TTACTGGCGT ACTCGCTTTA GTATTTCGTT ACCGCGTTTA CGACCTTGTA CCGGCGGTAT AACGCCAAAA GTGCATTTAA CCGATCTCGG ACTGGTACAA TTGACCTGTG CCTTCAGGCT TTTTAATCAG CAACTTGATA TCAGTTTATC GACACGCGAG GCGCCATCAC TTCGGGATAA TTTGCTGCCA CTTGGGATAA CACAAATAAG TGCGGGGAGT TCAACGCAAC CTGGTGGTTA TCAGGCGCCA GAGAGTCAAT TAGATCAGTT TGAGATAAGC GATGAACGTA CCGTTGAGCA AGTCATGGCT CAGATGCGCC TTAGGGGATT CAATCCGGTT TTTAAGGATT GGGAATCGGC TTGGATTGCG GGTTAA
|
Protein sequence | MSFVDQFARI ERDKLLLALY SCTAVEVERA LMQPEGNLES LLALLSPAAE PYIEEMAQRS AALTRQRFGA NIGLYLPLYL SNLCANECDY CGFSMSNKLK RKVLNEQEIA AEMAIIKSRG FDSILLVSGE HETKVGMDYF KRVLPIVKQQ FSYLAMEVQP LDEIDYRQLV ELGLDAVMVY QETYQAATYA KHHTRGNKQD FAYRLATPDR VASAGVDKIG LGVLLGLDDW RLDALLMGHH LDYLERHYWR TRFSISLPRL RPCTGGITPK VHLTDLGLVQ LTCAFRLFNQ QLDISLSTRE APSLRDNLLP LGITQISAGS STQPGGYQAP ESQLDQFEIS DERTVEQVMA QMRLRGFNPV FKDWESAWIA G
|
| |