Gene Dred_0074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDred_0074 
Symbol 
ID4956451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum reducens MI-1 
KingdomBacteria 
Replicon accessionNC_009253 
Strand
Start bp80653 
End bp81759 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content45% 
IMG OID640179228 
Productthiazole biosynthesis protein ThiH 
Protein accessionYP_001111449 
Protein GI134297953 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.799799 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATTTT ATCAGGAATG CAAGGAATAC CAAAAAAGTG ATTTCAATAG TTTTTTTAAT 
AAGTTAACCG ACGAAGACAT ACGCAGGATT ATTTATAAAT CTCGGTTATC TCCATGGGAT
TATCTGGCTT TGCTGTCGCC GGTGGCAGAA AGACATTTGG AAGAAATGGC CCAGCGTGCT
CAGCAACTTA CATTGCAGCA TTTTGGCAGG TCTATACTAC TCTTTACCCC CCTTTATCTG
GCCAATTACT GTGTAAATCA ATGTGTCTAT TGTGGTTTTG GGGCAAAAAA CTTAATAAAC
AGAAAAAAAC TAACCCTTGA CGAAGTGGCA GCGGAGGCAA AGGCTATTGC AGCAACGGGT
CTTAAACATA TTCTAATTTT AACAGGAGAA TCCAGGGTCC ACTCGTCGGT GCAATACATT
CGAGATTGTG TGGAAGTTTT AAAAGAATAT TTTACTTCTG TTAGTATTGA AATTTATCCT
TTGGAAGAAG AAGAGTATGC TGATTTAATT GCCACAGGGG TTGATGGCTT AACCATGTAT
CAAGAGGTTT ACGATGAAGC TGAGTATGAC AAAATCCACC TGGCGGGACC CAAAAAGAAT
TACCGTTTTC GCTTAGAGGC ACCGGAGCGG GCCTGTCGGG CCGGCATAAG GACGGTCAAT
ATTGGGGCCC TGCTGGGATT CTATGACTGG CGTAGCGAAG CCTTTTTAAC TGGGGTTCAT
GCTAACTATT TACAAAGCCA ATACCCTGCT GTGGAAGTAA GTATTTCACC GCCTCGCATG
CGACCCCATG TGGGAGGGTA TATGCCCCGA GAAAAGATAA CCGATAAAAA CTTAGTGCAG
TATATCCTGG CCTACCGATT GTTTATGCCC AGGGGAGGTA TCACCCTTTC CACCCGGGAA
TCGGCAGAAT TGCGTGACCA TTTGCTGCCG TTGGGAGTTA CCAAAATGTC GGCGGGCTCT
TCTACCAATG TAGGGGGACA TGCCGGAGGA GAACCATCAA CCAGCCAGTT TGATATCTCC
GATGAAAGGG ATGTGCCGGC CATGGTAAAG ATGTTGTATA ACCAAGGTTA TCAACCTGTC
TTTAAAGACT GGCAGATGCT AGGGTGA
 
Protein sequence
MGFYQECKEY QKSDFNSFFN KLTDEDIRRI IYKSRLSPWD YLALLSPVAE RHLEEMAQRA 
QQLTLQHFGR SILLFTPLYL ANYCVNQCVY CGFGAKNLIN RKKLTLDEVA AEAKAIAATG
LKHILILTGE SRVHSSVQYI RDCVEVLKEY FTSVSIEIYP LEEEEYADLI ATGVDGLTMY
QEVYDEAEYD KIHLAGPKKN YRFRLEAPER ACRAGIRTVN IGALLGFYDW RSEAFLTGVH
ANYLQSQYPA VEVSISPPRM RPHVGGYMPR EKITDKNLVQ YILAYRLFMP RGGITLSTRE
SAELRDHLLP LGVTKMSAGS STNVGGHAGG EPSTSQFDIS DERDVPAMVK MLYNQGYQPV
FKDWQMLG