Gene Information |
Locus tag | Sama_1855 |
Symbol | thiH |
ID | 4604105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella amazonensis SB2B |
Kingdom | Bacteria |
Replicon accession | NC_008700 |
Strand | - |
Start bp | 2259305 |
End bp | 2260420 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639781231 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_927730 |
Protein GI | 119774990 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
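The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2259305, 2260420)
print(length_bp, protein_length(length_bp))
```

With the values listed for Sama_1855 this gives 1116 bp and 371 aa, matching the Gene Length and Protein Length fields.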
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGG GGTATTTCTC TGAGGCATTT GCGCGCCTGA ACCCCGATAG CTTGCGGCTT AAGCTCTACT CGGCCACAGC ACAGGATGTG GAGGCGGCAC TGCGGGCTCC GGCGGGTAAC CTGAATGCGC TGCTGGCACT CTTGTCGCCT GCCGCCGAGC CTTATCTTGA GCAGATGGCA CAAAAAAGCA TGCAGCTCAC CCGTAAGCGC TTTGGTGCCA GTATCGGTAT GTACCTGCCG CTGTATCTAT CCAATCTGTG TGCAAACGAG TGTGATTACT GCGGCTTTTC AATGAGTAAC CGCATTAAGC GCAAGACCCT TGATGAAACC GAGCTTACCC GCGAAATGGC CGCCATCAAG GCCATGGGCT ATGACTCAGT GCTGCTGGTC TCCGGCGAGC ATGAAACCAA GGTCGGCATG GGCTATTTTC GTAAGGTGTT ACCCGAGGTA AAGCGTGCGT TTTCCTACGT GGCGATGGAA GTGCAGCCCC TAGCCGAGCC GGAGTACCGC GAGCTGGTAA CCCTCGGGCT TGATGCTGTG ATGATTTATC AGGAAACCTA TCAGCGAGCC ACCTACGCTG AGCATCACAC CCGCGGCAAA AAAATGGACT TTATCTGGCG CCTGGATACG CCCGACAGAC TGGCGCTGGC CGGTGTAGAC AAGATTGGCC TTGGGGTGCT GCTGGGGCTT GATGACTGGC GCCTGGATGC GCTGTTGATG GGGTATCACC TTGATTATCT TGAGCGAAAG TACTGGCGCA GCCGCTACAG TATTTCACTG CCAAGACTCA GGCCCTGCAC CGGCGGCGTG GCACCGAAAA CCGAGATAAG CGATAAAGGA TTGGTGCAAC TTATCTGTGC GTTCCGGTTG TTCAACGAAG CGCTGGATAT CAGCCTCTCG ACACGGGAGC GGCCGGACTT TCGCGACAAT CTGTTTGCCC TTGGGATCAC CCAAACCAGC GCTGGCAGTG CTACATCACC GGGAGGATAC TCTGAGCCGG ATACCCATTT GGATCAGTTT GAAATCAGCG ATGACAGAAG TGCAGCCGAC ATCGCAGCCG TGCTGAAGGC GAGGGGACTT AACCCCATCT GGAAAGACTG GGAATCCCAG TGGTAA
|
Protein sequence | MSQGYFSEAF ARLNPDSLRL KLYSATAQDV EAALRAPAGN LNALLALLSP AAEPYLEQMA QKSMQLTRKR FGASIGMYLP LYLSNLCANE CDYCGFSMSN RIKRKTLDET ELTREMAAIK AMGYDSVLLV SGEHETKVGM GYFRKVLPEV KRAFSYVAME VQPLAEPEYR ELVTLGLDAV MIYQETYQRA TYAEHHTRGK KMDFIWRLDT PDRLALAGVD KIGLGVLLGL DDWRLDALLM GYHLDYLERK YWRSRYSISL PRLRPCTGGV APKTEISDKG LVQLICAFRL FNEALDISLS TRERPDFRDN LFALGITQTS AGSATSPGGY SEPDTHLDQF EISDDRSAAD IAAVLKARGL NPIWKDWESQ W
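The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and it assumes the gene sequence is already given in coding orientation, which is consistent with it starting with ATG even though the gene lies on the minus strand. The name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGCCAGG GGTATTTCTC TGAGGCATTT"))
```

Applied to the first 30 bases of the gene sequence, this yields MSQGYFSEAF, the first block of the listed protein sequence.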