Gene Dvul_1137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1137 
SymbolthiH 
ID4662297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1383340 
End bp1384632 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content66% 
IMG OID639819366 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_966584 
Protein GI120602184 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.867554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.132743 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCT ACGATGAACT GGCCCGCTGG CCTCACGAGA CGCTGGACGC ACTCATCGCG 
TCGTCCACGG CGGATGACGT CCAGCGCGCG TTGACGGCGA CGCGGCCCGG CCCCGCCGAC
CTTGCGGCCC TTCTGTCCCC GGCGGCCATG CCGTACCTCG AGGACATGGC GCAGCGCGCC
CATGAGCTTA CCGTGCGGCA TTTCGGGCGC ACCATACAGC TGTTCACCCC GCTGTACCTC
GCGAACCATT GCACCAACCA GTGCCGTTAC TGTGGGTTCA ACGCCCGCAA CCATATCCGG
CGCGACCAGC TGGACGCGGA AAGGATCATG GTCGAAGGGC AGGCCATCGC CAATACTGGG
CTGCGTCAGT TGCTGCTGCT CACGGGCGAT GCCCCGCGCA TCTCTACCGT ATCCTACATC
GCCGAGGCGG CGCACAGGCT TCGGCCTCTT TTCCCCTCCA TCGGTGTGGA AGTCTATGCC
ATGCAGGTCG AGGAGTATGC CGAACTCGTG GCGGGAGGGG TGGAGTCGCT GACGATGTTC
CAGGAGACCT ACAACCCCGG ACTCTACGCA TGGCTGCACC CCGCAGGGCC CAAGCGCGAC
TTCCGCTTCC GGCTTGACGC GCCGGAACGC GGCTGCCTCG GCGGGATGCG CAGCGTCGGT
CTCGGTGCCT TGCTCGGACT GGACGACTGG CGACGCGATG CGTTCTACAC CGCCATGCAC
GGGGCGTGGT TGCAACGGTA CTATCCGGCT ACCGAGGTCA GCTTCTCGGT GCCTCGCATG
AGGCCGCATA CGGGCAGCTT CGAGCCGCAG CACCCCGTCT CCGACCATGA ACTGGTGCAG
ATTCTCACGG CGTACCGCAT CTTCCTGCCC ATGGCGGGCA TCACGGTATC CAGCCGCGAA
GCGGCGGCGT TCCGCGACAA TCTCATTCCC CTTGGCGTGA CGCGCATGTC CGCAGGGGTT
TCCACGGCGG TGGGCGGACA TGCCTCGGGC GGTGACGGCA ACGTGGCTTC GACCGAGGCG
TCAGCCCTTG CGGCGAGGAT GGATGCCGCA TCGGACGACG CCACAGGATA CTCTCCGGCC
CATGCGGCGG CTGAAGGCCT TCGGCAGGGC GATGACGCGG GGCCAAGCCA GTTCGACATC
TCTGATGACC GTAGTGTCGA GGAGATGGTA TCTGCCATCA CCGCACGGGG CTACCAGCCG
GTGTTCAAGG ACTGGGAACC CCCGCAAGAC AACGTCTACG CCTGTGGCGC ATCGGGCCAT
GCCGATGGCA CAGTCCGATG CGAGGCCCGA TAG
 
Protein sequence
MSFYDELARW PHETLDALIA SSTADDVQRA LTATRPGPAD LAALLSPAAM PYLEDMAQRA 
HELTVRHFGR TIQLFTPLYL ANHCTNQCRY CGFNARNHIR RDQLDAERIM VEGQAIANTG
LRQLLLLTGD APRISTVSYI AEAAHRLRPL FPSIGVEVYA MQVEEYAELV AGGVESLTMF
QETYNPGLYA WLHPAGPKRD FRFRLDAPER GCLGGMRSVG LGALLGLDDW RRDAFYTAMH
GAWLQRYYPA TEVSFSVPRM RPHTGSFEPQ HPVSDHELVQ ILTAYRIFLP MAGITVSSRE
AAAFRDNLIP LGVTRMSAGV STAVGGHASG GDGNVASTEA SALAARMDAA SDDATGYSPA
HAAAEGLRQG DDAGPSQFDI SDDRSVEEMV SAITARGYQP VFKDWEPPQD NVYACGASGH
ADGTVRCEAR