Gene Information |
Locus tag | Spea_2378 |
Symbol | thiH |
ID | 5662771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella pealeana ATCC 700345 |
Kingdom | Bacteria |
Replicon accession | NC_009901 |
Strand | - |
Start bp | 2900354 |
End bp | 2901460 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641237000 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001502233 |
Protein GI | 157962199 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
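The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Sanity check of the coordinate and length fields in the record.
# Assumption: Start bp / End bp are 1-based, inclusive positions.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumption: the CDS is unspliced and a whole number of codons long;
    a terminal stop codon does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length_bp = gene_length(2900354, 2901460)
    print(length_bp)                  # 1107
    print(protein_length(length_bp))  # 368
```

Both values agree with the Gene Length and Protein Length fields of the record.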
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTTG TTGATGTGTT TAAAAAGCTG TCTCGTTCAG AGCTTAAACT CAGATTGTAT TCAAGCACAG CTGCGGATGT TGAGTGCGCA ATGCAGAAAC CATCGGGAGA CGTAGATAGT TTACTTGCGC TGTTGTCACC GGCGGCTGAG CCATTTCTTG AGCAGATGGC TCAGCAAGCT GTCGCGTTAA CTCGTCAGCG ATTTGGCGCG AGTATCGGCA TGTATATTCC GCTCTATCTT TCAAATCTCT GTGCCAATGA GTGTGATTAT TGTGGCTTCA CCATGAGTAA CAAAATCAAG CGTAAGACTT TAACTGACAG TGAAATCAAA GACGAGATGC GATCGATTAA GAGCATGGGC TACGATTCAA TTTTACTGGT ATCGGGTGAG CATGAGTCGA AAGTGGGCGT GCCATACTTC AAGCAGGTGT TACCACTGAT CACAGAGCAG TTTAGCCATG TGGCCATGGA GGTGCAGCCG TTGGAAGAGC AAGACTATAG AGAGCTAGTC GCAGAAGGGC TCGATGCGGT GATGCTATAC CAAGAAACCT ATAATCCTGT GACGTATAGC GAGCACCATA CCCGAGGTAA GAAGAAGGAT TTTGGTTATC GGCTCGAGTC TCCCGATAGG GTAGCGAGAG CTGGTGTCGA TAAAATAGGT CTTGGGGTAT TACTCGGCTT AGATGATTGG CGACTCGACG CACTATTAAT GGGGCATCAT TTAGATTACA TGGAGAAAAC TTATTGGCGC AGTCGTTATA GCATTTCGCT TCCAAGGTTG AGGCCTTGTA CTGGTGGGGT AACGCCAAAG GTTGAGCTGA CTGATAAAGG CTTAGTGCAG ATGATCTGTG CCTTTAGATT GTTTAATCAA CAACTAGAGA TTAGTCTATC GACACGCGAA ACGCCTAAGC TACGGGATAA CCTATTTACA CTAGGGGTAA CTAATGTTAG TGCAGGAAGC TCGACGCAAC CCGGCGGCTA TGTTGAACCT AACACCGAGC TGGATCAGTT TGAGATCAGC GATGAGCGCT CACCGCAAGT GGTTGCTAAC GCAATGCTTG AGCGTGGATT AAACCCAGTA TGGAAAGACT GGGAAAGCGG CTGGTGA
Protein sequence | MSFVDVFKKL SRSELKLRLY SSTAADVECA MQKPSGDVDS LLALLSPAAE PFLEQMAQQA VALTRQRFGA SIGMYIPLYL SNLCANECDY CGFTMSNKIK RKTLTDSEIK DEMRSIKSMG YDSILLVSGE HESKVGVPYF KQVLPLITEQ FSHVAMEVQP LEEQDYRELV AEGLDAVMLY QETYNPVTYS EHHTRGKKKD FGYRLESPDR VARAGVDKIG LGVLLGLDDW RLDALLMGHH LDYMEKTYWR SRYSISLPRL RPCTGGVTPK VELTDKGLVQ MICAFRLFNQ QLEISLSTRE TPKLRDNLFT LGVTNVSAGS STQPGGYVEP NTELDQFEIS DERSPQVVAN AMLERGLNPV WKDWESGW
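The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched as follows. This is an illustrative, dependency-free example rather than the method the database itself uses: the helper names are hypothetical, the sequence is assumed to be given in the coding (5' to 3') orientation as displayed, and the codon table is the standard one, whose elongation codons are shared by translation table 11. Table 11 also permits alternative start codons, which this sketch does not handle specially; the gene here starts with ATG, so that does not matter for this record.

```python
# Sketch: clean a grouped sequence, compute GC content, translate a CDS.
# Assumption: helper names are illustrative, not from any library.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces between 10-letter groups and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence in the record.
    prefix = clean_sequence("ATGAGCTTTG TTGATGTGTT TAAAAAGCTG")
    print(translate(prefix))  # MSFVDVFKKL
```

Passing the full gene sequence through `clean_sequence`, then `gc_fraction` and `translate`, gives values that can be compared against the GC content and Protein sequence fields of the record.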