Gene Information |
Locus tag | Sfri_2196 |
Symbol | thiH |
ID | 4279108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella frigidimarina NCIMB 400 |
Kingdom | Bacteria |
Replicon accession | NC_008345 |
Strand | - |
Start bp | 2626048 |
End bp | 2627163 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638134991 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_750880 |
Protein GI | 114563367 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
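The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2626048, 2627163)
print(length_bp)                  # 1116
print(protein_length(length_bp))  # 371
```

Because the strand is "-", the coding sequence is the reverse complement of this interval on the replicon; the interval length is the same on either strand.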
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.119873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTATT CAAGCGTATT TGCGCAATTA GCCCCCGAAG CATTATCGAT GAAGTTGTAT TCAACAACCG CCAAAGACGT TGAGTTGGCT TTGAAAAATC CAAGCGGTAA TCTTGATAGC TTGTTGGCAT TATTATCTCC TGCCGCAATG CCATACATTG AACCGATGGC GAAGCAAGCA GCTCAGCTGA CTCGGCAGCG ATTTGGTGCA AATATCGGCC TTTTCTTACC CTTATATTTA TCAAACTTGT GCGCCAATGA GTGCGATTAT TGTGGATTTA GTATGAGCAA TAAGGTCAAG CGCAAAACTC TTACTCGTGA TGAACTAGCA GCGGAAATGG CGATTATTAA ACAACGTGGC TTTGATTCGA TATTATTGGT GTCTGGCGAA CACGAAACTA AAGTTGGAAT GGAGTATTTT GAGTCGGTGC TACCGTTAGT TAGCAGGGCT TTTAATTACG TGGCAATGGA AGTGCAACCA TTGGAGACTG ATCAGTATCA GCGTCTTGGC AAACTAGGTG TTGATGCCGT TATGGTATAC CAAGAAACAT ATCGTGCTCA TACTTATGCT CAGCATCATA CTCGCGGTAA AAAACAAGAT TTTATCTATC GCTTAGATAC CCCTGATCGA GTGGCAAAAT CGGGTATCGA TAAAATTGGC CTCGGGGTAT TACTTGGGTT AGATGATTGG CGCTTAGATG CATTACTTAT GGGCTTTCAT CTAGATTACT TGGAAAATAC CTACTGGCGC AGTCGCTACA GTATTTCACT ACCTCGTTTA CGTCCATGTA CTGGCGGCAT TACACCGAAA GTTGAATTAA CCGATGCAGG ATTAGTACAA ATGATCTGTG CATTTAGGTT GTTTAACCCC CAGCTTGAAA TCAGTTTATC CACGCGAGAG TTACCATCAT TAAGGGATAA TTTACTCCCT TTAGGAATTA CCCACATGAG TGCAGGCAGT TCAACTCAAC CAGGCGGTTA TATGGCGCCA GACAGCCAAC TTGATCAATT TGAAATCAGT GATAACCGCC CAGTAGAGCA AGTTGTTGAA CAAATGAAGC GACGAGGAAT TAATCCAGTA TGGAAAGATT GGGAGATGGG TTGGGTTAAC AGCTAA
|
Protein sequence | MTYSSVFAQL APEALSMKLY STTAKDVELA LKNPSGNLDS LLALLSPAAM PYIEPMAKQA AQLTRQRFGA NIGLFLPLYL SNLCANECDY CGFSMSNKVK RKTLTRDELA AEMAIIKQRG FDSILLVSGE HETKVGMEYF ESVLPLVSRA FNYVAMEVQP LETDQYQRLG KLGVDAVMVY QETYRAHTYA QHHTRGKKQD FIYRLDTPDR VAKSGIDKIG LGVLLGLDDW RLDALLMGFH LDYLENTYWR SRYSISLPRL RPCTGGITPK VELTDAGLVQ MICAFRLFNP QLEISLSTRE LPSLRDNLLP LGITHMSAGS STQPGGYMAP DSQLDQFEIS DNRPVEQVVE QMKRRGINPV WKDWEMGWVN S
|
| |
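The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the gene sequence for comparison with the listed protein. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TAA). It uses the standard codon assignments, which translation table 11 shares, and does not special-case alternative start codons, which does not matter here because the gene starts with ATG. All function names are illustrative.

```python
# Standard codon assignments, enumerated in NCBI's T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGACGTATT CAAGCGTATT TGCGCAATTA"
print(translate(first_blocks))  # MTYSSVFAQL
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a check against the protein sequence and the GC content field listed in the record.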