Gene SO_3923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_3923 
SymbolthiH 
ID1171562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp4070138 
End bp4071577 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content49% 
IMG OID637345683 
Productthiamine biosynthesis protein ThiH 
Protein accessionNP_719454 
Protein GI24375411 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACAC ACGAGCATCA CTCCATTACA CTTTCGGACT ACAATCCCAA CGTCAACTTT 
ATCGACGATA AAGCGATTTG GCAGACCATT GAAGACGCCA GTGATCCAAG TCGCGAGCAA
GTTCTCGCCA TTCTCGACAA GGCGCGCCAG TGTGAAGGCT TAAGCATTAG CGAGACCGCC
CTTTTGCTGC AAAACCAAGA TAAGACCTTG GATGAAATGC TTTTTAGCGT CGCCCGTGAG
ATTAAAAACA CTATTTACGG CAACCGTATT GTGATGTTTG CACCGCTGTA TGTATCGAAT
CATTGCGCCA ACAGTTGTAG TTATTGCGGC TTTAACGCCG ATAACCATGA GCTCAAACGT
AAAACCTTAA AACAGGATGA GATCCGCCAA GAGGTTGCGA TCCTTGAAGA AATGGGCCAC
AAGCGGATCC TTGCAGTCTA TGGCGAACAT CCTCGCAACA ATGTGCAAGC CATTGTTGAA
AGTATTCAAA CCATGTACAG CGTTAAGCAG GGCAAGGGCG GAGAAATACG CCGTATCAAC
GTCAACTGCG CGCCAATGAG TGTGGAGGAC TTTAAGCAAC TTAAAACCGC GGCGATAGGC
ACTTATCAAT GCTTCCAAGA AACCTATCAT CAAGACACCT ACAGCCAAGT CCATCTTAAA
GGTAAAAAAA CCGACTTTTT ATACCGCCTC TACGCCATGC ACAGGGCGAT GGAAGCAGGA
ATTGACGATG TCGGCATTGG CGCCCTCTTT GGCCTGTATG ATCATAGATT CGAGCTCCTT
GCCATGCTCA CCCATGTTCA GCAACTCGAA AAAGACTGTG GCGTTGGCCC ACACACTATC
TCCTTTCCGC GGATTGAACC CGCCCATGGC TCTGCTATCA GTGAAAAGCC GCCCTATGAG
GTCGATGATG ACTGCTTTAA GCGCATTGTT GCCATCACTC GCCTTGCCGT GCCTTATACA
GGGTTAATTA TGAGCACGCG GGAAAGTGCA GCGCTGCGCA AAGAACTATT AGAACTCGGG
GTTTCACAAA TCAGCGCAGG CTCGCGTACC GCGCCGGGTG GATATCAAGA CAGCAAACAA
AATCAACATG ATGCCGAGCA ATTCAGCCTT GGTGACCACC GAGAAATGGA CGAAATCATC
TATGAATTAG TCACCGACTC GGATGCCATC CCCTCCTTCT GCACTGGCTG TTACCGCAAA
GGGCGAACTG GCGATCATTT TATGGGATTA GCCAAACAGC AGTTTATTGG TAAATTCTGC
CAGCCCAATG CATTGATCAC CTTTAAGGAA TATTTGAACG ATTACGCCAG TGAAAAGACC
CGCGAGGCTG GCAATGCGCT GATAGAGCGA GAGCTGGCTA AAATGAGCCC GTCACGGGCA
CGCAATGTGC GCGGCTGTTT GCAAAAAACC GATGCGGGTG AACGGGATAT CTATCTGTAA
 
Protein sequence
MSTHEHHSIT LSDYNPNVNF IDDKAIWQTI EDASDPSREQ VLAILDKARQ CEGLSISETA 
LLLQNQDKTL DEMLFSVARE IKNTIYGNRI VMFAPLYVSN HCANSCSYCG FNADNHELKR
KTLKQDEIRQ EVAILEEMGH KRILAVYGEH PRNNVQAIVE SIQTMYSVKQ GKGGEIRRIN
VNCAPMSVED FKQLKTAAIG TYQCFQETYH QDTYSQVHLK GKKTDFLYRL YAMHRAMEAG
IDDVGIGALF GLYDHRFELL AMLTHVQQLE KDCGVGPHTI SFPRIEPAHG SAISEKPPYE
VDDDCFKRIV AITRLAVPYT GLIMSTRESA ALRKELLELG VSQISAGSRT APGGYQDSKQ
NQHDAEQFSL GDHREMDEII YELVTDSDAI PSFCTGCYRK GRTGDHFMGL AKQQFIGKFC
QPNALITFKE YLNDYASEKT REAGNALIER ELAKMSPSRA RNVRGCLQKT DAGERDIYL