Gene Ssed_2018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_2018 
SymbolthiH 
ID5613840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp2442223 
End bp2443347 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content47% 
IMG OID640932904 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_001473755 
Protein GI157375155 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.705115 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCT TCTCTTATCT GCAAAACCTT GATCGGGAGA AGCTGAAGCT CAGGCTCTAT 
TCGACAACCA GAGATGACGT TGAAAGGGCA CTCATATCAC CTGTAGGTTC ACTCGACAGC
CTATTGGCCT TACTTTCCCC TGCGGCAGAA GATTTTCTAG AAGAGATGGC GCAGCGATCA
AAGGCCCTGA CAAGGCAAAG GTTTGGCGCT AATGTCGGTA TGTATCTGCC TCTATACCTG
TCAAATTTAT GTGCTAATGA GTGTGATTAT TGCGGTTTTA GCATGAGTAA CAAACTGAAA
AGAGAGACCC TGAGTCTTAA AGGTATCGAT GCCGAGATGG CCGTGATTAA AAAATCTGGC
TACGACTCGA TACTGCTGGT TTCGGGAGAA CACGAAACAA AAGTCGGAAT AGATTATTTT
AAAACTGTCC TACCGAGAGT AAAACAAAAT TTCAGTTACC TTGCGATGGA AGTTCAACCG
CTTAAGGAGA GCGAATACTC TGATTTGGTC GGCTTAGGGT TAGATGCGGT GATGATCTAT
CAAGAGACTT ATAACCCGAA TACCTACGCA AAACATCACA CCAGAGGTAA AAAGCGGGAT
TTTGGCTATC GATTGGGTAC CCCGGAAAGA GTGGCCAGAG CCGGCGTCGA TAAAATAGGT
ATCGGCGTGC TACTGGGTTT AGATGATTGG CGCTTAGATG CACTGCTATT GGGCCATCAC
CTCTCTTATC TCGAGTCCAG GTTTTGGCGT TCTCGTTACA GTGTCTCATT GCCCAGGTTA
AGGCCCTGCA CGGGAGGTAT CACTCCCAAA GTAGAGTTGA CAGACAAAGG CCTGGTTCAG
TTGATTTGTG CGTTCAGACT GTTTAATCAA CAGCTTGAGA TCAGTCTTTC TACACGTGAG
TCTGCACAAT TACGCGATAA TTTGTTTGAA CTTGGGATCA CCAATATCAG TGCAGGAAGT
TCGACTCAAC CCGGTGGTTA TGTTAAGCCG GATACTCAGT TAAACCAGTT CGATATTAGC
GATGAACGAT CGGCGCAAGA GGTGGGTGCT GCGATCAAGG CCAAAGGCTT GAACCCGGTT
TGGAAGGACT GGGAGTCGGC GTGGGTCCCA ACTCATTCTG CGTGA
 
Protein sequence
MSFFSYLQNL DREKLKLRLY STTRDDVERA LISPVGSLDS LLALLSPAAE DFLEEMAQRS 
KALTRQRFGA NVGMYLPLYL SNLCANECDY CGFSMSNKLK RETLSLKGID AEMAVIKKSG
YDSILLVSGE HETKVGIDYF KTVLPRVKQN FSYLAMEVQP LKESEYSDLV GLGLDAVMIY
QETYNPNTYA KHHTRGKKRD FGYRLGTPER VARAGVDKIG IGVLLGLDDW RLDALLLGHH
LSYLESRFWR SRYSVSLPRL RPCTGGITPK VELTDKGLVQ LICAFRLFNQ QLEISLSTRE
SAQLRDNLFE LGITNISAGS STQPGGYVKP DTQLNQFDIS DERSAQEVGA AIKAKGLNPV
WKDWESAWVP THSA