Gene Information |
Locus tag | DvMF_0799 |
Symbol | thiH |
ID | 7172688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 963303 |
End bp | 964547 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643539300 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_002435223 |
Protein GI | 218885902 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
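The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the stop codon. Both assumptions fit the numbers in this record but are not stated in it. The function name `cds_lengths` is hypothetical.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a span that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the final codon is the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(963303, 964547)
print(gene_len, protein_len)  # 1245 414
```

The result agrees with the Gene Length (1245 bp) and Protein Length (414 aa) fields.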
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATGT ACGACGTGGT CCGCGAATGG ACCCCCCGCG TGGCGGACGC ACCCCTGCGC GCCTTCATGG ACGCCGCCAC GCCGGACGGC GTGGCCCGCG TGCTGCGCAA GGAACGCCTT TCCCCGCACG ACCTGCTGAC CCTGCTTTCC CCGGCGGCGG CCACCCGACT GGAGGCCATG GCCCTTCGCG CCCGCGAGCT GACGGTGCGC CACTTCGGAC GCACCATCCA GTTCTTCACC CCGCTGTACC TTTCCAACCA CTGCACCAAC CAGTGCCGGT ACTGCGGCTT CAACGCGCGC AACCACATCC CCCGCCAGCG CCTGACGGAC GAGGCAATCG TGGCCGAGGG CCGGGCCATT GCCGCCACCG GGCTGCGCCA CCTGCTGCTG CTCACCGGCG ATGCGCGCCA CGTTTCCGGG CCGGACTACA TCGCCCATGC CGCGCGCCTG CTGGCCCCGC TGTTCCCTTC GCTGTCGGTG GAGGTCTATT CGCTGACGGA CGAGGAATAC GCGCTGCTGG TGGACGCGGG CATCGACGGC ATGACCATGT TCCAGGAAAC CTACAACGAG GCCCTGTACC CGGAACTGCA CCCCGCAGGG CCCAAGCGCG ACTATCATTT CCGACTGGAC GCGCCGGAGC GCGCCGCCCG CGCGGGCATG CGCAGTGTGG GCCTTGGCGC GCTGCTGGGG CTGGACGACT GGCGGCGCGA CGCCTTCTTC ACCGCGCTGC ACGGCCACTG GCTGCAACGC CGGTATCCCC ATGTGGACGT AAGTTTTTCC GTGCCGCGCC TGCGCCCCCA CGCCGGTGCC TTCCAGCCCG CGTACGCGGT ATCCGACCGC GATCTGGTGC AGGTCATCCT GGCCTATCGC ATCTTCATGC CCAGCGCGGG CATTACCGTT TCCACCCGCG AACGGGCGGG CCTGCGCGAC AACCTGATTC CCCTCGGGGT CACCCGCATG TCCGCCGGGG TGAGCACGGC GGTCGGCGGT CACGCCGCGC ATAAGAATGT GGAAGGGCAG GGGGATGGGG ACGGCGCCAC CCCGCAGTTC GAGATTTCCG ACCCGCGCAG TGCCGACGAA ATGGCCTCCG CCATTGCCGC GCGCGGCTAT CAGCCGGTGT ACAAGGACTG GGAATCGGTG CTGGACGGCG GGTACGGGTG TGGGATAGCG TGCGCCGCGC GACGCACCCC GTCCGGTGAA CCTGTTGGGG CACCCACCCC GGCAGCCCCC CGCGCCACGG CCTGA
|
Protein sequence | MSMYDVVREW TPRVADAPLR AFMDAATPDG VARVLRKERL SPHDLLTLLS PAAATRLEAM ALRARELTVR HFGRTIQFFT PLYLSNHCTN QCRYCGFNAR NHIPRQRLTD EAIVAEGRAI AATGLRHLLL LTGDARHVSG PDYIAHAARL LAPLFPSLSV EVYSLTDEEY ALLVDAGIDG MTMFQETYNE ALYPELHPAG PKRDYHFRLD APERAARAGM RSVGLGALLG LDDWRRDAFF TALHGHWLQR RYPHVDVSFS VPRLRPHAGA FQPAYAVSDR DLVQVILAYR IFMPSAGITV STRERAGLRD NLIPLGVTRM SAGVSTAVGG HAAHKNVEGQ GDGDGATPQF EISDPRSADE MASAIAARGY QPVYKDWESV LDGGYGCGIA CAARRTPSGE PVGAPTPAAP RATA
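Both sequences above are printed in space-separated blocks of ten characters, so the spaces have to be removed before the text is used in any analysis. The following is a minimal sketch of that cleanup plus a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence, used as a short sample.
sample = "ATGAGCATGT ACGACGTGGT"
print(clean_sequence(sample))  # ATGAGCATGTACGACGTGGT
print(gc_fraction(sample))     # 0.5
```

Applied to the full gene sequence, the same functions could be used to compare the computed length and GC fraction with the Gene Length and GC content fields.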