Gene Information |
Locus tag | Dred_0074 |
Symbol | |
ID | 4956451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum reducens MI-1 |
Kingdom | Bacteria |
Replicon accession | NC_009253 |
Strand | + |
Start bp | 80653 |
End bp | 81759 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640179228 |
Product | thiazole biosynthesis protein ThiH |
Protein accession | YP_001111449 |
Protein GI | 134297953 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
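The length fields above follow from the coordinates. As a minimal sketch, assuming Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein, the arithmetic can be checked as follows (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 80653
end_bp = 81759

# Inclusive interval length: both end points belong to the gene.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the last codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the result agrees with the Gene Length (1107 bp) and Protein Length (368 aa) rows.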
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.799799 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATTTT ATCAGGAATG CAAGGAATAC CAAAAAAGTG ATTTCAATAG TTTTTTTAAT AAGTTAACCG ACGAAGACAT ACGCAGGATT ATTTATAAAT CTCGGTTATC TCCATGGGAT TATCTGGCTT TGCTGTCGCC GGTGGCAGAA AGACATTTGG AAGAAATGGC CCAGCGTGCT CAGCAACTTA CATTGCAGCA TTTTGGCAGG TCTATACTAC TCTTTACCCC CCTTTATCTG GCCAATTACT GTGTAAATCA ATGTGTCTAT TGTGGTTTTG GGGCAAAAAA CTTAATAAAC AGAAAAAAAC TAACCCTTGA CGAAGTGGCA GCGGAGGCAA AGGCTATTGC AGCAACGGGT CTTAAACATA TTCTAATTTT AACAGGAGAA TCCAGGGTCC ACTCGTCGGT GCAATACATT CGAGATTGTG TGGAAGTTTT AAAAGAATAT TTTACTTCTG TTAGTATTGA AATTTATCCT TTGGAAGAAG AAGAGTATGC TGATTTAATT GCCACAGGGG TTGATGGCTT AACCATGTAT CAAGAGGTTT ACGATGAAGC TGAGTATGAC AAAATCCACC TGGCGGGACC CAAAAAGAAT TACCGTTTTC GCTTAGAGGC ACCGGAGCGG GCCTGTCGGG CCGGCATAAG GACGGTCAAT ATTGGGGCCC TGCTGGGATT CTATGACTGG CGTAGCGAAG CCTTTTTAAC TGGGGTTCAT GCTAACTATT TACAAAGCCA ATACCCTGCT GTGGAAGTAA GTATTTCACC GCCTCGCATG CGACCCCATG TGGGAGGGTA TATGCCCCGA GAAAAGATAA CCGATAAAAA CTTAGTGCAG TATATCCTGG CCTACCGATT GTTTATGCCC AGGGGAGGTA TCACCCTTTC CACCCGGGAA TCGGCAGAAT TGCGTGACCA TTTGCTGCCG TTGGGAGTTA CCAAAATGTC GGCGGGCTCT TCTACCAATG TAGGGGGACA TGCCGGAGGA GAACCATCAA CCAGCCAGTT TGATATCTCC GATGAAAGGG ATGTGCCGGC CATGGTAAAG ATGTTGTATA ACCAAGGTTA TCAACCTGTC TTTAAAGACT GGCAGATGCT AGGGTGA
Protein sequence | MGFYQECKEY QKSDFNSFFN KLTDEDIRRI IYKSRLSPWD YLALLSPVAE RHLEEMAQRA QQLTLQHFGR SILLFTPLYL ANYCVNQCVY CGFGAKNLIN RKKLTLDEVA AEAKAIAATG LKHILILTGE SRVHSSVQYI RDCVEVLKEY FTSVSIEIYP LEEEEYADLI ATGVDGLTMY QEVYDEAEYD KIHLAGPKKN YRFRLEAPER ACRAGIRTVN IGALLGFYDW RSEAFLTGVH ANYLQSQYPA VEVSISPPRM RPHVGGYMPR EKITDKNLVQ YILAYRLFMP RGGITLSTRE SAELRDHLLP LGVTKMSAGS STNVGGHAGG EPSTSQFDIS DERDVPAMVK MLYNQGYQPV FKDWQMLG
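The two sequence rows are related through translation table 11, and the GC content row can be recomputed from the gene sequence. The sketch below is illustrative only: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, the codon table is written out by hand in the usual T, C, A, G ordering rather than taken from a library, and only the first 30 bases of the gene sequence are used to keep the example short.

```python
# Amino acid assignments for translation table 11, in T, C, A, G base order.
# For elongation codons these are the same as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
gene_prefix = "ATGGGATTTT ATCAGGAATG CAAGGAATAC"
protein_prefix = translate(gene_prefix)
print(protein_prefix)  # MGFYQECKEY
print(gc_fraction("ATGGGATTTT"))  # 0.3
```

The translated prefix matches the first ten residues of the protein sequence row. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence rows to be checked in the same way. This sketch does not handle alternative start codons, which table 11 permits; that is not a concern here because the gene begins with ATG.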