Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ddes_1946 |
Symbol | thiH |
ID | 7285662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 |
Kingdom | Bacteria |
Replicon accession | NC_011883 |
Strand | - |
Start bp | 2345022 |
End bp | 2346413 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643582768 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_002480520 |
Protein GI | 220905208 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.236284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTGG AATCATTGCC CCGGTGGTTT AGTGTGGCTG CCATTGAGCG CGCCCTGGAT CGTCAGGATG CCCCGGACGC CATTGAGCTT CGTGATATTC TTGATAAGTC CATGCAGATG GTTCCGCTGG ACGCAGATGA AATTGTGGCT CTTATGCGGG TTGACGACCC CGTGGAGCAT GAACGCATCC TCGCTGTTGC CGATGAGGTC AAACAGCGGG TTTACGGGGA CCGCATGGTT CTTTCCGCGC CGCTGCACCT TTCCAACCAC TGCGGCAGCG AGTGCCTGTA TTGCGCCAAC CGCAGGGGAA ACGGACAGAT AGAACGCAAA TACATGACAT CGCCGGAAAT GCGTGAAGCT GCGCTGCGCC TCATTCGTCA GGGGCACAAG CGTATTTTTC TGGTCAGCGG GCAACTGCCC AACGCCGATG TGGAATATCT GGCCGAGGCC ATCAGCATTC TGTACACGGT ATTTGACGGT GTGGGCGAAA TCCACAGCGT CAATGTCAAT GTGGGACCGC TTGAAAGCGC CCAGTACGAA ACGCTGCTTG ACGCCTATGT GGGAACCGTC CTCATCTATC AGGACACCTA TCATGAGGCC AGCTACCGCG CGGCGCATGT GTCCGGCCCC AAAAGCGATT ACGTCAGGCG TCTTGAGGCT CCTGACACCG CCTTTGCGGC AGGGGTGCCT GATGTAGGCA TGGGGCTTCT GCTCGGCCTC GGCCCCTGGC GCTTTGACCT GCTGGCCCTC ATCCAGCATG CGGCCCATCT GGAGCGGGTG TACGGCATGG GCTGTCGTAC TGTAAGCCTG CACCGTATGC GTCCCGCCCC CGGCAGCCTT ATGGAGGCTC CCTATCCGGT GAGCGATGCG GACTATCTGC GCTGCGTGGC CCTTGCGCGG CTGGCCCTGC CCTATGCCGG CCTTATCCTG ACCACCAGGG AACCGTCCGG CCTCTGGCGC GACGGCTGTA ATGCCGGTGG TTCGCAACTG CTTACGGGCA GCGTGGCAAA TCCTTACGAC GGCTGGTTTA CCGCTTCAGG CCAGCAGGTT CCTTTTCCCT GTGGTGAAGA TTGCCATGTG GATGAGGTCG TGCGTTTTCT GCTTGAAGAG GCCCGGCATC TGCCGTCCTT TTGCGCTGCC TGTCCACGCC TGGGCCGCAG GGGTGAAGAG TTCATTTCCA TGGTGCGTGA ATGCGGCATC AAGGGGCAGT GCGGGCCGAA CTCGGCGGCG TCGTTTATGG AATTTCTGCT GCATTATGCC ACCCCCTATA CCCGCATGAT GGGCGAACGC CTTTTGAAGG AAAAGCTGGA CAGGATGCCC ATTCACGAGC GCGGCGCTGC AGATCGCCTG CTCAAGAAGG TGCGGGCGGG CTGTATGGAC GAGTTCATCT GA
|
Protein sequence | MSVESLPRWF SVAAIERALD RQDAPDAIEL RDILDKSMQM VPLDADEIVA LMRVDDPVEH ERILAVADEV KQRVYGDRMV LSAPLHLSNH CGSECLYCAN RRGNGQIERK YMTSPEMREA ALRLIRQGHK RIFLVSGQLP NADVEYLAEA ISILYTVFDG VGEIHSVNVN VGPLESAQYE TLLDAYVGTV LIYQDTYHEA SYRAAHVSGP KSDYVRRLEA PDTAFAAGVP DVGMGLLLGL GPWRFDLLAL IQHAAHLERV YGMGCRTVSL HRMRPAPGSL MEAPYPVSDA DYLRCVALAR LALPYAGLIL TTREPSGLWR DGCNAGGSQL LTGSVANPYD GWFTASGQQV PFPCGEDCHV DEVVRFLLEE ARHLPSFCAA CPRLGRRGEE FISMVRECGI KGQCGPNSAA SFMEFLLHYA TPYTRMMGER LLKEKLDRMP IHERGAADRL LKKVRAGCMD EFI
|
| |