Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewana3_1941 |
Symbol | thiH |
ID | 4479738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. ANA-3 |
Kingdom | Bacteria |
Replicon accession | NC_008577 |
Strand | + |
Start bp | 2307669 |
End bp | 2308784 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639726523 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_869578 |
Protein GI | 117920386 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTTG TCGACCAATT TGCCCGTATT GAACGGGATA AGTTATTGCT GGCGTTATAT TCCTGCACGG CAGTGGAGGT TGAGCGCGCC CTGATGCAAC CCGAGGGCAA TCTACAGAGT TTACTTGCCT TGTTGTCTCC AGCGGCCGAG CCCTATATCG AAGAGATGGC GCAGCGCTCG GCGGCGCTCA CTCGGCAACG CTTTGGGGCC AATATCGGAC TCTATCTGCC CTTATACCTG TCGAACCTGT GTGCCAATGA GTGCGACTAT TGCGGCTTTA GCATGAGCAA TAAGCTAAAG CGCAAAGTGC TCAATGAGCA GGAAATTGCG GCTGAAATGG CGATTATCAA ATTCCGTGGT TTTGACTCCA TCTTGCTGGT GTCGGGAGAG CATGACACCA AAGTGGGGAT GGATTACTTT AAGCGCGTGT TACCCATTGT AAAACAGCAG TTTAGTTATA TGGCCATGGA GGTTCAGCCG CTTGAAGAGG TTGATTATCG TCAGCTTGTC GAGCTAGGGC TGGATGCTGT GATGGTGTAT CAAGAAACCT ATCAAGCGGC GACCTATGCC AAGCATCATA CCCGAGGCAA TAAACAGGAC TTTGCATATC GGCTCGCAAC GCCCGACCGC GTTGCTAGTG CGGGTGTCGA TAAGATAGGC CTAGGTGTGC TACTAGGTTT AGATGACTGG CGGCTCGATG CTTTACTGAT GGGCCATCAT CTGGACTATT TAGAACGGAA TTACTGGCGT ACTCGTTTTA GTATTTCGTT ACCACGTTTG CGGCCTTGTA CCGGCGGTAT AACACCAAAA GTGCATTTAA CCGATCTTGG ACTGGTGCAG ATGATCTGTG CCTTCAGGCT TTTTAATCAG CAACTTGATA TCAGTTTATC GACACGCGAG GCGCCATCGC TTCGGGACAA TTTACTGCCG CTTGGGATAA CACAGATGAG TGCGGGGAGC TCGACACAGC CAGGCGGTTA TCAGGCGCCA GAGAGCCAAT TAGATCAGTT TGAGATAAGC GATGAACGTA CTGTTGAGCA GGTCATGACT CAGATGCGAC TCAGGGGATT TAATCCGGTT TTCAAGGATT GGGAATCGGC TTGGATTGCG CGTTAA
|
Protein sequence | MSFVDQFARI ERDKLLLALY SCTAVEVERA LMQPEGNLQS LLALLSPAAE PYIEEMAQRS AALTRQRFGA NIGLYLPLYL SNLCANECDY CGFSMSNKLK RKVLNEQEIA AEMAIIKFRG FDSILLVSGE HDTKVGMDYF KRVLPIVKQQ FSYMAMEVQP LEEVDYRQLV ELGLDAVMVY QETYQAATYA KHHTRGNKQD FAYRLATPDR VASAGVDKIG LGVLLGLDDW RLDALLMGHH LDYLERNYWR TRFSISLPRL RPCTGGITPK VHLTDLGLVQ MICAFRLFNQ QLDISLSTRE APSLRDNLLP LGITQMSAGS STQPGGYQAP ESQLDQFEIS DERTVEQVMT QMRLRGFNPV FKDWESAWIA R
|
| |