Gene Sputcn32_1927 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSputcn32_1927 
SymbolthiH 
ID5078547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella putrefaciens CN-32 
KingdomBacteria 
Replicon accessionNC_009438 
Strand
Start bp2208026 
End bp2209141 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content46% 
IMG OID640499088 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_001183449 
Protein GI146293025 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTTG TAGCGGAATT TGCCAACATC CCACGGGATA AACTGTTACT CGATTTGTAT 
TCTTGCACAG CCCAAGATGT CGAGCGGGCG TTAGTGAATC CAGCAGGGGA TTTACGCAGC
TTACTGGCCT TGTTATCACC TGCGGCTGAA CCTTATATTG AAACCATGGC GCAGCAATCT
GCGGCGCTTA CGCGGCAGCG ATTTGGCGCG AATCTTGGCA TGTATTTACC GCTATATGTC
TCGAATCTGT GCGCTAATGA ATGTGATTAC TGTGGTTTTA GCATGAGTAA TAAACTCAAA
CGCAAGACCT TGAATGAACA GGAGTTGATG GCCGAAATGG CCATTATCAA AGACAGGGGC
TTCGATTCTA TTTTACTGGT GTCGGGTGAG CATGAAACTA AGGTCGGTAT CGATTATTTC
AAGCAAATGT TGCCGCTGGT GAAGCAGCAA TTTAGCCATG TGGCGATGGA GGTCCAACCC
ATGAGTGAGG ACCATTATCG CCAGTTAGTC GCGCTAGGTT TAGATGCTGT GATGATTTAT
CAGGAGACCT ATCAGCCTGA GACTTATGCT CGTCACCATT CCCGAGGTAA AAAAATGGAT
TTTGCCTATC GCTTAGCAAC ACCGGACAGA GTTGCGGCGG CGGGGGTCGA TAAAATTGGT
CTTGGGGTAT TACTGGGGCT GGATGATTGG CGTTTAGATG CGTTAATGAT GGGATATCAT
ATTGACTATT TAGAGCGTCG CTATTGGCGC ACTCGCTTTA GTATTTCGCT GCCAAGGTTA
AGGCCTTGTA CTGGCGGGAT CACTCCAAAA GTTATTCTAT CGGATTTAGG TTTAGTACAA
ATGATTTGTG CATTTAGACT TTTTAATCAT CAGCTTGATA TCAGCATGTC GACAAGGGAA
AGCCCTGAGC TGAGGGATAA TCTTTTGCCA CTTGGAATTA CTCACATCAG TGCGGGCAGC
TCGACACAAC CGGGCGGCTA TCAAGCACCT GACAGTCAAC TCGATCAATT TGAGATAAGC
GATGATCGCA GCGTCGAGCA AGTCATCGAA CAGATGCAAC GGCAGGGTTT TAATCCAATA
TTCAAGGATT GGGAATCTGC ATGGATAAAC AGCTAG
 
Protein sequence
MSFVAEFANI PRDKLLLDLY SCTAQDVERA LVNPAGDLRS LLALLSPAAE PYIETMAQQS 
AALTRQRFGA NLGMYLPLYV SNLCANECDY CGFSMSNKLK RKTLNEQELM AEMAIIKDRG
FDSILLVSGE HETKVGIDYF KQMLPLVKQQ FSHVAMEVQP MSEDHYRQLV ALGLDAVMIY
QETYQPETYA RHHSRGKKMD FAYRLATPDR VAAAGVDKIG LGVLLGLDDW RLDALMMGYH
IDYLERRYWR TRFSISLPRL RPCTGGITPK VILSDLGLVQ MICAFRLFNH QLDISMSTRE
SPELRDNLLP LGITHISAGS STQPGGYQAP DSQLDQFEIS DDRSVEQVIE QMQRQGFNPI
FKDWESAWIN S