Gene SO_2440 details


Gene Information

Locus tag: SO_2440
Symbol: thiH
ID: 1170155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella oneidensis MR-1
Kingdom: Bacteria
Replicon accession: NC_004347
Strand:
Start bp: 2553968
End bp: 2555095
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 48%
IMG OID: 637344273
Product: thiamine biosynthesis protein ThiH
Protein accession: NP_718030
Protein GI: 24373987
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
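
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2553968
end_bp = 2555095
gene_length_bp = 1128
protein_length_aa = 375

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS has one codon per residue plus one stop codon.
codons = gene_length_bp // 3

assert span == gene_length_bp            # 1128 bp
assert gene_length_bp % 3 == 0           # whole number of codons
assert codons - 1 == protein_length_aa   # 376 codons, 375 residues
```

Under those assumptions the three fields agree: 1128 bp is 376 codons, which is 375 residues plus the stop codon.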


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTTG TCGACCATTT TGCCCGCATC GAACGGGATA AGTTATTGCT GGCGTTGTAT 
TCATGCACTG CTGCTGATGT TGAACGTGCC CTAGTTCAAC CAGAAGGCAA CTTACAGAGT
CTGCTCGCTT TGCTGTCTCC TGCAGCAGAG CCCTATATCG AAGAGATGGC GCAGCGCTCG
GCGGCGCTCA CTCGGCAACG CTTTGGGGCC AATATCGGAC TGTACTTACC ACTGTACTTG
TCCAATCTTT GCGCTAACGA ATGCGATTAC TGTGGCTTTA GCATGAGTAA TAAACTGAAG
CGTAAAGTAC TGAATGAACA GGAACTCGCC GCCGAAATAG CGATTATTAA ATCCCGCGGC
TTTGATTCTA TCTTGCTGGT GTCGGGTGAG CATGAAACTA AGGTCGGGAT GGATTACTTT
CAGCGGGTTT TACCGTTGGT AAAACAGCAG TTTAGTTATT TAGCCATGGA GGTGCAGCCG
CTTGAGGAAA GCCATTATCG TAAGCTCGTC GAGCAGGGAC TCGATGCGGT GATGGTGTAT
CAAGAAACCT ATCAAGCCGA GACTTATGCT AAACATCACA CCCGAGGTAA AAAACAGGAC
TTCGCCTATC GGCTTGCCAC GCCCGATCGC GTTGCCCGTG CGGGTGTCGA TAAGATAGGC
CTAGGTGTGT TACTGGGCTT AGATGACTGG CGGTTAGACG CGTTGATGAT GGGCTATCAT
CTTGATTATT TAGAGCGGCA CTATTGGCGC ACCCGTTTTA GCATTTCGTT ACCACGATTG
CGACCTTGTA CAGGAGGCAT TGCACCAAAA GTACATTTAA CCGATCTGGG ACTCGTGCAA
ATGATCTGCA CCTTTAGACT TTTTAATCAA CAACTTGATA TCAGTTTATC GACACGCGAG
GCGCCATCAC TTCGGGATAA TTTACTGCCA CTTGGGATAA CGCAAATGAG TGCGGGTAGT
TCAACCCAGC CTGGCGGTTA CCAAGTTCCC GACAGTCAGC TCGATCAGTT TGAGATCAGT
GATGATCGAA CTGTTGAGCA GGTCATTACT CAAATGCGAC TTAAGGGTTT TAATCCAGTC
TTTAAAGATT GGGAATCCGC TTGGATTGTA CCTAAAATGC GAGTTTAA
 
Protein sequence
MSFVDHFARI ERDKLLLALY SCTAADVERA LVQPEGNLQS LLALLSPAAE PYIEEMAQRS 
AALTRQRFGA NIGLYLPLYL SNLCANECDY CGFSMSNKLK RKVLNEQELA AEIAIIKSRG
FDSILLVSGE HETKVGMDYF QRVLPLVKQQ FSYLAMEVQP LEESHYRKLV EQGLDAVMVY
QETYQAETYA KHHTRGKKQD FAYRLATPDR VARAGVDKIG LGVLLGLDDW RLDALMMGYH
LDYLERHYWR TRFSISLPRL RPCTGGIAPK VHLTDLGLVQ MICTFRLFNQ QLDISLSTRE
APSLRDNLLP LGITQMSAGS STQPGGYQVP DSQLDQFEIS DDRTVEQVIT QMRLKGFNPV
FKDWESAWIV PKMRV
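
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below shows one way to reproduce that in plain Python. The helper `translate` and the table construction are illustrative, not the database's own tooling. Table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts; this gene begins with ATG, so no special handling of alternative start codons is included.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base grouping and line breaks used in the listing)
    is removed before translation.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence listed above.
gene_start = "ATGAGTTTTG TCGACCATTT TGCCCGCATC"
gene_end = "CCTAAAATGC GAGTTTAA"

print(translate(gene_start))  # MSFVDHFARI
print(translate(gene_end))    # PKMRV
```

The two fragments match the first ten and last five residues of the protein sequence. Passing the full gene sequence to `translate` would be the natural way to check the whole record.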