Gene Ddes_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDdes_1946 
SymbolthiH 
ID7285662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 
KingdomBacteria 
Replicon accessionNC_011883 
Strand
Start bp2345022 
End bp2346413 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content60% 
IMG OID643582768 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_002480520 
Protein GI220905208 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.236284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGTGG AATCATTGCC CCGGTGGTTT AGTGTGGCTG CCATTGAGCG CGCCCTGGAT 
CGTCAGGATG CCCCGGACGC CATTGAGCTT CGTGATATTC TTGATAAGTC CATGCAGATG
GTTCCGCTGG ACGCAGATGA AATTGTGGCT CTTATGCGGG TTGACGACCC CGTGGAGCAT
GAACGCATCC TCGCTGTTGC CGATGAGGTC AAACAGCGGG TTTACGGGGA CCGCATGGTT
CTTTCCGCGC CGCTGCACCT TTCCAACCAC TGCGGCAGCG AGTGCCTGTA TTGCGCCAAC
CGCAGGGGAA ACGGACAGAT AGAACGCAAA TACATGACAT CGCCGGAAAT GCGTGAAGCT
GCGCTGCGCC TCATTCGTCA GGGGCACAAG CGTATTTTTC TGGTCAGCGG GCAACTGCCC
AACGCCGATG TGGAATATCT GGCCGAGGCC ATCAGCATTC TGTACACGGT ATTTGACGGT
GTGGGCGAAA TCCACAGCGT CAATGTCAAT GTGGGACCGC TTGAAAGCGC CCAGTACGAA
ACGCTGCTTG ACGCCTATGT GGGAACCGTC CTCATCTATC AGGACACCTA TCATGAGGCC
AGCTACCGCG CGGCGCATGT GTCCGGCCCC AAAAGCGATT ACGTCAGGCG TCTTGAGGCT
CCTGACACCG CCTTTGCGGC AGGGGTGCCT GATGTAGGCA TGGGGCTTCT GCTCGGCCTC
GGCCCCTGGC GCTTTGACCT GCTGGCCCTC ATCCAGCATG CGGCCCATCT GGAGCGGGTG
TACGGCATGG GCTGTCGTAC TGTAAGCCTG CACCGTATGC GTCCCGCCCC CGGCAGCCTT
ATGGAGGCTC CCTATCCGGT GAGCGATGCG GACTATCTGC GCTGCGTGGC CCTTGCGCGG
CTGGCCCTGC CCTATGCCGG CCTTATCCTG ACCACCAGGG AACCGTCCGG CCTCTGGCGC
GACGGCTGTA ATGCCGGTGG TTCGCAACTG CTTACGGGCA GCGTGGCAAA TCCTTACGAC
GGCTGGTTTA CCGCTTCAGG CCAGCAGGTT CCTTTTCCCT GTGGTGAAGA TTGCCATGTG
GATGAGGTCG TGCGTTTTCT GCTTGAAGAG GCCCGGCATC TGCCGTCCTT TTGCGCTGCC
TGTCCACGCC TGGGCCGCAG GGGTGAAGAG TTCATTTCCA TGGTGCGTGA ATGCGGCATC
AAGGGGCAGT GCGGGCCGAA CTCGGCGGCG TCGTTTATGG AATTTCTGCT GCATTATGCC
ACCCCCTATA CCCGCATGAT GGGCGAACGC CTTTTGAAGG AAAAGCTGGA CAGGATGCCC
ATTCACGAGC GCGGCGCTGC AGATCGCCTG CTCAAGAAGG TGCGGGCGGG CTGTATGGAC
GAGTTCATCT GA
 
Protein sequence
MSVESLPRWF SVAAIERALD RQDAPDAIEL RDILDKSMQM VPLDADEIVA LMRVDDPVEH 
ERILAVADEV KQRVYGDRMV LSAPLHLSNH CGSECLYCAN RRGNGQIERK YMTSPEMREA
ALRLIRQGHK RIFLVSGQLP NADVEYLAEA ISILYTVFDG VGEIHSVNVN VGPLESAQYE
TLLDAYVGTV LIYQDTYHEA SYRAAHVSGP KSDYVRRLEA PDTAFAAGVP DVGMGLLLGL
GPWRFDLLAL IQHAAHLERV YGMGCRTVSL HRMRPAPGSL MEAPYPVSDA DYLRCVALAR
LALPYAGLIL TTREPSGLWR DGCNAGGSQL LTGSVANPYD GWFTASGQQV PFPCGEDCHV
DEVVRFLLEE ARHLPSFCAA CPRLGRRGEE FISMVRECGI KGQCGPNSAA SFMEFLLHYA
TPYTRMMGER LLKEKLDRMP IHERGAADRL LKKVRAGCMD EFI