Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_1137 |
Symbol | thiH |
ID | 4662297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 1383340 |
End bp | 1384632 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639819366 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_966584 |
Protein GI | 120602184 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.867554 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.132743 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTCT ACGATGAACT GGCCCGCTGG CCTCACGAGA CGCTGGACGC ACTCATCGCG TCGTCCACGG CGGATGACGT CCAGCGCGCG TTGACGGCGA CGCGGCCCGG CCCCGCCGAC CTTGCGGCCC TTCTGTCCCC GGCGGCCATG CCGTACCTCG AGGACATGGC GCAGCGCGCC CATGAGCTTA CCGTGCGGCA TTTCGGGCGC ACCATACAGC TGTTCACCCC GCTGTACCTC GCGAACCATT GCACCAACCA GTGCCGTTAC TGTGGGTTCA ACGCCCGCAA CCATATCCGG CGCGACCAGC TGGACGCGGA AAGGATCATG GTCGAAGGGC AGGCCATCGC CAATACTGGG CTGCGTCAGT TGCTGCTGCT CACGGGCGAT GCCCCGCGCA TCTCTACCGT ATCCTACATC GCCGAGGCGG CGCACAGGCT TCGGCCTCTT TTCCCCTCCA TCGGTGTGGA AGTCTATGCC ATGCAGGTCG AGGAGTATGC CGAACTCGTG GCGGGAGGGG TGGAGTCGCT GACGATGTTC CAGGAGACCT ACAACCCCGG ACTCTACGCA TGGCTGCACC CCGCAGGGCC CAAGCGCGAC TTCCGCTTCC GGCTTGACGC GCCGGAACGC GGCTGCCTCG GCGGGATGCG CAGCGTCGGT CTCGGTGCCT TGCTCGGACT GGACGACTGG CGACGCGATG CGTTCTACAC CGCCATGCAC GGGGCGTGGT TGCAACGGTA CTATCCGGCT ACCGAGGTCA GCTTCTCGGT GCCTCGCATG AGGCCGCATA CGGGCAGCTT CGAGCCGCAG CACCCCGTCT CCGACCATGA ACTGGTGCAG ATTCTCACGG CGTACCGCAT CTTCCTGCCC ATGGCGGGCA TCACGGTATC CAGCCGCGAA GCGGCGGCGT TCCGCGACAA TCTCATTCCC CTTGGCGTGA CGCGCATGTC CGCAGGGGTT TCCACGGCGG TGGGCGGACA TGCCTCGGGC GGTGACGGCA ACGTGGCTTC GACCGAGGCG TCAGCCCTTG CGGCGAGGAT GGATGCCGCA TCGGACGACG CCACAGGATA CTCTCCGGCC CATGCGGCGG CTGAAGGCCT TCGGCAGGGC GATGACGCGG GGCCAAGCCA GTTCGACATC TCTGATGACC GTAGTGTCGA GGAGATGGTA TCTGCCATCA CCGCACGGGG CTACCAGCCG GTGTTCAAGG ACTGGGAACC CCCGCAAGAC AACGTCTACG CCTGTGGCGC ATCGGGCCAT GCCGATGGCA CAGTCCGATG CGAGGCCCGA TAG
|
Protein sequence | MSFYDELARW PHETLDALIA SSTADDVQRA LTATRPGPAD LAALLSPAAM PYLEDMAQRA HELTVRHFGR TIQLFTPLYL ANHCTNQCRY CGFNARNHIR RDQLDAERIM VEGQAIANTG LRQLLLLTGD APRISTVSYI AEAAHRLRPL FPSIGVEVYA MQVEEYAELV AGGVESLTMF QETYNPGLYA WLHPAGPKRD FRFRLDAPER GCLGGMRSVG LGALLGLDDW RRDAFYTAMH GAWLQRYYPA TEVSFSVPRM RPHTGSFEPQ HPVSDHELVQ ILTAYRIFLP MAGITVSSRE AAAFRDNLIP LGVTRMSAGV STAVGGHASG GDGNVASTEA SALAARMDAA SDDATGYSPA HAAAEGLRQG DDAGPSQFDI SDDRSVEEMV SAITARGYQP VFKDWEPPQD NVYACGASGH ADGTVRCEAR
|
| |