Gene Dde_1556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDde_1556 
SymbolthiH 
ID3755995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. G20 
KingdomBacteria 
Replicon accessionNC_007519 
Strand
Start bp1575356 
End bp1576525 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content57% 
IMG OID637782433 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_388048 
Protein GI78356599 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAA TCAGCGCTGA CATGAGTGTG AGAACCGCAC AGGCGGGAGG TTTTTACGAA 
GTTGTCCGCC AGTTGCAGGA TGTGGATGTG CAGGCCGCCT GCATGGCGGC TGACGGATAC
AGAGTGCGGC AGGCTCTGGC AAAGGATACC CTTTCTCCGG AAGACTTTCT GGCTTTGCTT
TCACCCGCGG CCCGTCCGTA CATGGAATCA ATGGCGCATA AGGCCCGTGA TGTCACGGTG
CGCCAGTTCG GCCGCACCAT CCAGCTGTTT ACTCCGCTGT ACCTTTCAAA CTGGTGTACA
AACAGGTGCG TGTATTGCAG CTTCAATGCC TGCAGTGGCA TAGACCGCAT GCAGCTGGAT
GCTGCGGGTG TGCTGCGCGA AGGGCAGGCC ATAGCCGCCA CAGGACTGCG CCATCTGCTG
CTGCTGACCG GCGAGGCTCC GGCAAAGGCT TCCGTTGACT ACATCCGCGA CTGTGTGCGC
GTGTTGCGTC CTCTGTTTCC GTCACTGAGT ATTGAAGTGT ACGCCCTGAC CGAGCCTGAG
TACCGCACAC TTGCCGTTGC CGGTGTGGAC GGCATGACAC TTTTTCAGGA AACATATAAC
GAGGCTTTGT ACCCGTCATT GCATCCTGCC GGTCCCAAAA GCAATTATCA TTTCAGGCTG
GGGGCGCCGG AAAGAGCATG CCGTGCCGGA ATGCGCAATG TGAATATAGG TGCGTTGCTC
GGACTGGATC AGTGGCAGCG TGATGCTTTT ATGACGGGTA TGCATGCCCT TTGGCTGCAG
CACAGGTATC CGGGAGTGGA TATTGCTGTT TCGCTGCCGC GTATGCGTCC TTATGCCGGC
AGTTTTCAGC CGGTGTGTGA CGTGGATGAT CGTGCGCTGG TGCAGATACT GCTGGCCATG
CGGCTTTTTC TGCCGCGGTG CGGCATTACC ATTTCCACGC GCGAACGGCC TGCATTCCGC
GATAATCTGA TACCGCTGGG AGTGACCCGC ATGAGTGCCG GTGTATCCAC CGCTGTGGGC
GGACATGCTG AAAACGAACA TTCCTCTGTA GGGCAGTTTG AAATTTCTGA CCCGCGCAGT
GTGGACGAGG TGGTTGAAGC CGTAAGCCGG GCCGGTTATC AGGCCGTGTT TAAAGACTGG
CATCCGCTGG AAGACAGCTG CGCGGTATGA
 
Protein sequence
MSEISADMSV RTAQAGGFYE VVRQLQDVDV QAACMAADGY RVRQALAKDT LSPEDFLALL 
SPAARPYMES MAHKARDVTV RQFGRTIQLF TPLYLSNWCT NRCVYCSFNA CSGIDRMQLD
AAGVLREGQA IAATGLRHLL LLTGEAPAKA SVDYIRDCVR VLRPLFPSLS IEVYALTEPE
YRTLAVAGVD GMTLFQETYN EALYPSLHPA GPKSNYHFRL GAPERACRAG MRNVNIGALL
GLDQWQRDAF MTGMHALWLQ HRYPGVDIAV SLPRMRPYAG SFQPVCDVDD RALVQILLAM
RLFLPRCGIT ISTRERPAFR DNLIPLGVTR MSAGVSTAVG GHAENEHSSV GQFEISDPRS
VDEVVEAVSR AGYQAVFKDW HPLEDSCAV