Gene Sbal195_2434 details


Gene Information

Locus tag: Sbal195_2434
Symbol: thiH
ID: 5754193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS195
Kingdom: Bacteria
Replicon accession: NC_009997
Strand:
Start bp: 2878836
End bp: 2879951
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 47%
IMG OID: 641288728
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_001554862
Protein GI: 160875546
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
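
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2878836
end_bp = 2879951

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1116 bp) and Protein Length (371 aa).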


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.360626
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTTG TAGCGGAATT TGCCAACATC CCACGGGATA AACTGTTACT CGATTTGTAT 
TCTTGCACTG CCCAAGATGT TGAGCGGGCG CTGGTGAGTC CAGCAGGGGA TTTACGTAGC
TTATTGGCCT TGCTATCACC TGCGGCGGAA CCTTATATCG AAACCATGGC GCAGCACTCT
GCGGCGCTAA CGCGGCAGCG ATTTGGCGCG AATCTGGGCA TGTATTTACC GCTATACGTC
TCGAATCTGT GCGCTAATGA GTGTGATTAC TGTGGCTTTA GCATGAGTAA CAAACTCAAA
CGCAAGACCT TGAATGAGCA GGAGTTGATG GCCGAAATGG CCATTATCAA AGATAGGGGC
TTCGATTCTA TTTTACTGGT GTCAGGTGAG CATGAAACTA AGGTCGGCAT CGATTATTTC
AAGCAAATGT TGCCGCTGGT AAAGCAGCAA TTTAGCCATT TGGCGATGGA GGTCCAGCCC
ATGAGTGAGG ACCATTATTG CCAGTTAGTC GCGCTAGGTT TAGATGCTGT GATGATTTAT
CAGGAGACCT ATCAGCCCGA AACTTATGCT CGTCACCATT CTCGAGGGAA AAAAATGGAT
TTTGCCTATC GTTTAGCAAC ACCGGACAGA GTTGCGGCGG CGGGCGTCGA TAAAATTGGT
CTTGGGGTAT TACTCGGGCT GGATGATTGG CGTTTAGATG CGTTAATGAT GGGATATCAT
CTCGACTATT TAGAGCGACG CTATTGGCGC ACCCGCTTTA GTATTTCGCT ACCAAGGTTA
CGACCTTGTA CTGGCGGGAT CACCCCAAAA GTCATGCTAT CGGATCTAGG TTTAGTGCAA
ATGATTTGTG CATTTAGACT TTTTAATCAA CAGCTTAACA TCAGCATGTC GACAAGGGAA
AGCCCTGAGC TGAGGGATAA TCTTTTGCCA CTTGGGATCA CTCAAATCAG TGCGGGCAGT
TCGACACAAC CGGGCGGCTA TCAAGCGCCT GACAGTCAAC TCGATCAATT TGAGATAAGC
GATGATCGCA GTGTCGAGCA AGTTATCGAA CAGATGCAAC GGCAGGGTTT TAATCCCGTA
TTTAAGGATT GGGAAGCTAA TTGGATCACA GGATAA
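
A GC percentage such as the one listed for this gene can be recomputed from the sequence text. The helper below is a minimal sketch, assuming the sequence is pasted as a string in the 10-base block layout used here; `gc_content` is a hypothetical name and the record does not state how its own figure was rounded.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks between 10-base blocks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on a short fragment; paste the full gene sequence to check the whole gene.
fragment = "ATGAGTTTTG TAGCGGAATT TGCCAACATC"
print(round(gc_content(fragment), 1))
```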
 
Protein sequence
MSFVAEFANI PRDKLLLDLY SCTAQDVERA LVSPAGDLRS LLALLSPAAE PYIETMAQHS 
AALTRQRFGA NLGMYLPLYV SNLCANECDY CGFSMSNKLK RKTLNEQELM AEMAIIKDRG
FDSILLVSGE HETKVGIDYF KQMLPLVKQQ FSHLAMEVQP MSEDHYCQLV ALGLDAVMIY
QETYQPETYA RHHSRGKKMD FAYRLATPDR VAAAGVDKIG LGVLLGLDDW RLDALMMGYH
LDYLERRYWR TRFSISLPRL RPCTGGITPK VMLSDLGLVQ MICAFRLFNQ QLNISMSTRE
SPELRDNLLP LGITQISAGS STQPGGYQAP DSQLDQFEIS DDRSVEQVIE QMQRQGFNPV
FKDWEANWIT G
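
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons); `translate` and `CODON_TABLE` are illustrative names, and only the first and last blocks of the gene are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
# Assumption: table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 36 bases of the gene sequence (both in frame).
print(translate("ATGAGTTTTG TAGCGGAATT TGCCAACATC"))
print(translate("TTTAAGGATT GGGAAGCTAA TTGGATCACA GGATAA"))
```

The two outputs match the start (MSFVAEFANI) and end (FKDWEANWITG) of the listed protein sequence.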