Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_2278 |
Symbol | thiH |
ID | 3757289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | - |
Start bp | 2299179 |
End bp | 2300588 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637783169 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_388770 |
Protein GI | 78357321 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.751586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTTTG ATTCCCGTTC CCTGCCAGGC TTCATAGACG AAGAAAAAAT CGAATCTGTG ATTGCCGCAA CGGCCAAACC TGACGCTGTG CGTGTGCGCG AAATTCTCGC CAAGGCACGT GAAGCAAAGG GCCTTGATGC CGAAGAGACC GCAACCCTGC TGCAACTCGA TAACGAAGAA CTGGATGCGG AGCTGTTTGC CACAGCCAAA AAGGTTAAAC AGACCATCTA CGGTAACCGG CTTGTTCTTT TTGCTCCTCT TTATATTACC AACGAATGCT ATAACCGGTG TGCCTATTGC GGATTTAACG CCACAAACAG CGATCTGAAG CGCCGGACTC TCTCGGAAGA TGAAATCCGG GCCGAAGTGG AAGTGCTGGA ACGTCTGGGG CATAAGCGCC TGCTGCTTGT GTACGGAGAG CACCCGCGCC TTGATGCCGA CTGGATGGCA CGCACCATTC AGGTGGTGTA TGATACTGTT TCTGAAAAAA GCGGTGAAAT CCGCCGTGTG AACATCAACT GTGCCCCGCA GACCGTGGAC GGCTTCAGAA AGCTGCACGA TGTGGGCATA GGTACCTACC AGTGTTTTCA GGAAACCTAC CACAAGGCGA CGTATGACAA GGCGCATCTG GGCGGTCCCA AAAAGGATTA CCTGTGGCGG TTGTATGCCA TGCACCGCGC CATGGAGGCC GGCATCGACG ACGTGGGCAT GGGCCCCCTG CTCGGTCTGT ACGACTACCG GTTTGAGATT CTTGCACTGA TGCAGCATGC CGCCGATCTG GAAAAACATT TCGGCGTAGG CCCGCATACC ATCTCTTTCC CCAGGCTGGA ACCGGCCCTC AATGCCGATA TGGCATTCAA TCCGCCGCAC CCGCTCACCG ATTCCCAGTT TAAACGAATG GTTGCCGTGC TCCGGCTGGC AGTGCCGTAT ACAGGGCTTA TTCTCAGCAC GCGTGAAAAT GCAGCCATGC GGCGTGAACT GCTCGAGCTG GGCGTTTCGC AGATCAGTGC GGGTTCGCGC ACCTATCCGG GTGCCTACAG CGACCCGAGC TACGACCGGC CCGATGTGCA GCAGTTCTGC GTAGGCGACA GCCGCAGTCT GGACGAGGTC ATAGCAGAGC TTGTCTCTTT GGGATACCTG CCCTCGTGGT GCACGGCCTG TTACCGTCTG GGCCGTACCG GCGAACACTT TATGGAGCTG GCAAAAAAAG GCTTCATTCA GGAATTCTGC CATCCCAACG CGCTGCTTAC CTTCAATGAA TATCTGCATG ACTACGCTTC TGAATCGACA CGCGAAGCGG GCAGAAAGCT TATTGAAAAA GAGGCGGCAG GCTGTCCGGA AAACAGGCGC GAGCTTGTTG CTTCGCGTCT GCAGCGCATA GACGGCGGCG AGCGCGATTT GTACATCTGA
|
Protein sequence | MSFDSRSLPG FIDEEKIESV IAATAKPDAV RVREILAKAR EAKGLDAEET ATLLQLDNEE LDAELFATAK KVKQTIYGNR LVLFAPLYIT NECYNRCAYC GFNATNSDLK RRTLSEDEIR AEVEVLERLG HKRLLLVYGE HPRLDADWMA RTIQVVYDTV SEKSGEIRRV NINCAPQTVD GFRKLHDVGI GTYQCFQETY HKATYDKAHL GGPKKDYLWR LYAMHRAMEA GIDDVGMGPL LGLYDYRFEI LALMQHAADL EKHFGVGPHT ISFPRLEPAL NADMAFNPPH PLTDSQFKRM VAVLRLAVPY TGLILSTREN AAMRRELLEL GVSQISAGSR TYPGAYSDPS YDRPDVQQFC VGDSRSLDEV IAELVSLGYL PSWCTACYRL GRTGEHFMEL AKKGFIQEFC HPNALLTFNE YLHDYASEST REAGRKLIEK EAAGCPENRR ELVASRLQRI DGGERDLYI
|
| |