Gene Sden_1775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSden_1775 
SymbolthiH 
ID4018254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella denitrificans OS217 
KingdomBacteria 
Replicon accessionNC_007954 
Strand
Start bp2098071 
End bp2099180 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content45% 
IMG OID637955788 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_562782 
Protein GI91793131 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTCA CTGATGTTTT CTCAGCAATT TCACAAGATG ATTTGTTGTT GCAGCTTTAC 
AGTAAAAATA CTGCCGATGT AGAACGGGCG TTAATCGCCC CACAAGGCAA GCTTGAAAGC
CTAATGACAC TATTGTCACC TGCAGCAGAG CCATTTATCG AAACGATGGC TACAGAGTCG
GTGAGGTTAA CTCGGCAGCG TTTTGGCAAT AACTTAGGCC TCTACTTGCC ACTGTATTTG
TCAAATTTGT GCGCCAATGA GTGTGATTAC TGCGGCTTTA CCATGAGCAA TAAAATAAAG
CGCAAAACAT TAAACGAGTC TGAGTTACTG GCAGAAATAA GCATTATCAA GGCCAGAGGA
TTCGATTCTA TCTTGCTGGT TTCCGGTGAA CATGAAAGCA AAGTGGGTAT GGGTTATTTT
TCTTGGGCGA TCCCTATAGT TAAGGCCCAT TTTAGCTATG TAGCCATAGA AGTGCAGCCA
TTAAGTGAAG CTGATTATAG TCATCTGAAA GAGTTGGGTG TCGATGCCGT GATGGTCTAT
CAGGAAACGT ATCGGCCAGT AACTTACGCT AAGCATCATA CCCGAGGTCA GAAGAAGGAT
TTTCATCATA GGCTAACAAC CCCTGACAGA GCCGCAAAAG CGGGTATCGA TAAAGTGGGC
CTAGGCGTAT TATTGGGTTT AGATGATTGG CGATTAGATG CCTTGTTAAT GGGGCATCAT
ATTGATTATT TGGAAAAAAA TTATTGGCGA AGCCGTTACA GTATTTCTTT ACCTAGGCTG
CGCCCTTGCA CTGGAGGCGT TAATCCTAAA GTACCGCTGA CTGATCTTGG TTTAGTGCAA
TTGATTTGTG CCTTTAGATT GTTTAATCCT CAGCTGGAAA TTAGCTTGTC TACCCGTGAA
ACCCCCAGCT TACGGGACAG TTTATTGCCC CTTGGGGTGA CACATTTAAG TGCGGGGAGC
TCGACACAAC CAGGGGGGTA TCAAGCACCT CAAACTCAGC TGGATCAATT TGAAATTAGC
GATGGCCGTC CCGTGGATGC CGTTGTTGCA CAAATACAAC AACAGGGATT GAACCCAGTG
TGGAAAGACT GGGAGGCGGG TTGGCATTAA
 
Protein sequence
MSFTDVFSAI SQDDLLLQLY SKNTADVERA LIAPQGKLES LMTLLSPAAE PFIETMATES 
VRLTRQRFGN NLGLYLPLYL SNLCANECDY CGFTMSNKIK RKTLNESELL AEISIIKARG
FDSILLVSGE HESKVGMGYF SWAIPIVKAH FSYVAIEVQP LSEADYSHLK ELGVDAVMVY
QETYRPVTYA KHHTRGQKKD FHHRLTTPDR AAKAGIDKVG LGVLLGLDDW RLDALLMGHH
IDYLEKNYWR SRYSISLPRL RPCTGGVNPK VPLTDLGLVQ LICAFRLFNP QLEISLSTRE
TPSLRDSLLP LGVTHLSAGS STQPGGYQAP QTQLDQFEIS DGRPVDAVVA QIQQQGLNPV
WKDWEAGWH