Gene Sbal_2027 details


Gene Information

Locus tag: Sbal_2027
Symbol: thiH
ID: 4844265
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS155
Kingdom: Bacteria
Replicon accession: NC_009052
Strand:
Start bp: 2346834
End bp: 2347949
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 47%
IMG OID: 640119246
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_001050398
Protein GI: 126174249
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
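
The Start bp, End bp, Gene Length and Protein Length values listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2346834
end_bp = 2347949

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1116 bp
print(protein_length, "aa")   # 371 aa
```

Under these assumptions the computed values agree with the listed Gene Length and Protein Length.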


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTTG TCGCGGAATT TGCCAAGATC CCACGGGATA AACTGTTACT CGATTTGTAT 
TCTTGCACAG CCCAAGATGT CGAGCGGGCG TTAGTGAGTC CTGCAGGGGA TTTACGTAGC
TTACTGGCCT TATTATCACC TGCGGCGGAA CCTTATATCG AAACCATGGC GCAGCACTCT
GCTGCGCTTA CGCGGCAGCG ATTTGGCGCG AATCTGGGCA TGTATTTACC GCTATACGTC
TCGAATCTGT GCGCTAATGA GTGTGATTAC TGTGGCTTTA GCATGAGTAA CAAACTCAAA
CGCAAGACCT TGAATGAGCT GGAGTTGATG GCCGAAATGG CCATTATCAA AGATAGGGGC
TTCGATTCTA TTTTACTGGT GTCGGGTGAG CATGAAACTA AGGTTGGTAT CGATTATTTC
AAGCAAATGT TGCCGCTGGT AAAGCAGCAA TTTAGCCATT TGGCGATGGA GGTCCAGCCC
ATGAGTGAGG ACCATTATTG CCAGTTAGTC GCGCTAGGTT TAGATGCCGT GATGATTTAT
CAAGAGACCT ATCAGCCCGA GACTTATGCT CGTCACCATT CCCGAGGTAA AAAAATGGAT
TTTGCTTATC GCTTAGCAAC ACCTGACAGA GTTGCGGCGG CGGGCGTTGA TAAAATTGGT
CTTGGGGTAT TACTGGGGCT GGATGATTGG CGTTTAGATG CGTTAATGAT GGGATATCAT
CTCGACTATT TAGAGCGACG CTATTGGCGT ACCCGCTTTA GTATTTCGCT ACCAAGGTTA
AGGCCTTGTA CTGGCGGGAT CACCCCAAAA GTCATGCTAT CGGATCTAGG TTTAGTGCAA
ATGATTTGTG CATTTAGACT TTTTAATCAA CAGCTTGACA TCAGCATGTC GACAAGGGAA
AGCCCTGAGC TGAGGGATAA TCTTTTACCA CTTGGGATCA CTCAAATCAG TGCGGGCAGT
TCGACACAAC CGGGCGGCTA TCAAGCGCCT GACAGTCAAC TCGATCAATT TGAGATAAGC
GATGATCGCA GTGTCGAGCA AGTTATTGAA CAGATGCAAC GGCAGGGTTT TAATCCCGTA
TTTAAGGATT GGGAAGCCAA TTGGATCACA GGATAA
 
Protein sequence
MSFVAEFAKI PRDKLLLDLY SCTAQDVERA LVSPAGDLRS LLALLSPAAE PYIETMAQHS 
AALTRQRFGA NLGMYLPLYV SNLCANECDY CGFSMSNKLK RKTLNELELM AEMAIIKDRG
FDSILLVSGE HETKVGIDYF KQMLPLVKQQ FSHLAMEVQP MSEDHYCQLV ALGLDAVMIY
QETYQPETYA RHHSRGKKMD FAYRLATPDR VAAAGVDKIG LGVLLGLDDW RLDALMMGYH
LDYLERRYWR TRFSISLPRL RPCTGGITPK VMLSDLGLVQ MICAFRLFNQ QLDISMSTRE
SPELRDNLLP LGITQISAGS STQPGGYQAP DSQLDQFEIS DDRSVEQVIE QMQRQGFNPV
FKDWEANWIT G
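
The protein sequence is the translation of the gene sequence under translation table 11, which assigns the same amino acids to codons as the standard genetic code. The sketch below shows one way to reproduce that translation in Python for sequences pasted in the 10-character block layout used here. The helper names `clean` and `translate` are hypothetical and are not part of any tool behind this page.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 36 bases of the gene sequence (both start in frame).
first_bases = "ATGAGTTTTG TCGCGGAATT TGCCAAGATC"
last_bases = "TTTAAGGATT GGGAAGCCAA TTGGATCACA GGATAA"

print(translate(first_bases))  # MSFVAEFAKI
print(translate(last_bases))   # FKDWEANWITG
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives a check of the whole record.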