Gene DvMF_0799 details


Gene Information

Locus tag: DvMF_0799
Symbol: thiH
ID: 7172688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 963303
End bp: 964547
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 70%
IMG OID: 643539300
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_002435223
Protein GI: 218885902
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
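
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(963303, 964547)
print(length, protein_length_aa(length))  # prints: 1245 414
```

Under these assumptions the computed values agree with the Gene Length (1245 bp) and Protein Length (414 aa) fields of the record.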


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 82
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATGT ACGACGTGGT CCGCGAATGG ACCCCCCGCG TGGCGGACGC ACCCCTGCGC 
GCCTTCATGG ACGCCGCCAC GCCGGACGGC GTGGCCCGCG TGCTGCGCAA GGAACGCCTT
TCCCCGCACG ACCTGCTGAC CCTGCTTTCC CCGGCGGCGG CCACCCGACT GGAGGCCATG
GCCCTTCGCG CCCGCGAGCT GACGGTGCGC CACTTCGGAC GCACCATCCA GTTCTTCACC
CCGCTGTACC TTTCCAACCA CTGCACCAAC CAGTGCCGGT ACTGCGGCTT CAACGCGCGC
AACCACATCC CCCGCCAGCG CCTGACGGAC GAGGCAATCG TGGCCGAGGG CCGGGCCATT
GCCGCCACCG GGCTGCGCCA CCTGCTGCTG CTCACCGGCG ATGCGCGCCA CGTTTCCGGG
CCGGACTACA TCGCCCATGC CGCGCGCCTG CTGGCCCCGC TGTTCCCTTC GCTGTCGGTG
GAGGTCTATT CGCTGACGGA CGAGGAATAC GCGCTGCTGG TGGACGCGGG CATCGACGGC
ATGACCATGT TCCAGGAAAC CTACAACGAG GCCCTGTACC CGGAACTGCA CCCCGCAGGG
CCCAAGCGCG ACTATCATTT CCGACTGGAC GCGCCGGAGC GCGCCGCCCG CGCGGGCATG
CGCAGTGTGG GCCTTGGCGC GCTGCTGGGG CTGGACGACT GGCGGCGCGA CGCCTTCTTC
ACCGCGCTGC ACGGCCACTG GCTGCAACGC CGGTATCCCC ATGTGGACGT AAGTTTTTCC
GTGCCGCGCC TGCGCCCCCA CGCCGGTGCC TTCCAGCCCG CGTACGCGGT ATCCGACCGC
GATCTGGTGC AGGTCATCCT GGCCTATCGC ATCTTCATGC CCAGCGCGGG CATTACCGTT
TCCACCCGCG AACGGGCGGG CCTGCGCGAC AACCTGATTC CCCTCGGGGT CACCCGCATG
TCCGCCGGGG TGAGCACGGC GGTCGGCGGT CACGCCGCGC ATAAGAATGT GGAAGGGCAG
GGGGATGGGG ACGGCGCCAC CCCGCAGTTC GAGATTTCCG ACCCGCGCAG TGCCGACGAA
ATGGCCTCCG CCATTGCCGC GCGCGGCTAT CAGCCGGTGT ACAAGGACTG GGAATCGGTG
CTGGACGGCG GGTACGGGTG TGGGATAGCG TGCGCCGCGC GACGCACCCC GTCCGGTGAA
CCTGTTGGGG CACCCACCCC GGCAGCCCCC CGCGCCACGG CCTGA
 
Protein sequence
MSMYDVVREW TPRVADAPLR AFMDAATPDG VARVLRKERL SPHDLLTLLS PAAATRLEAM 
ALRARELTVR HFGRTIQFFT PLYLSNHCTN QCRYCGFNAR NHIPRQRLTD EAIVAEGRAI
AATGLRHLLL LTGDARHVSG PDYIAHAARL LAPLFPSLSV EVYSLTDEEY ALLVDAGIDG
MTMFQETYNE ALYPELHPAG PKRDYHFRLD APERAARAGM RSVGLGALLG LDDWRRDAFF
TALHGHWLQR RYPHVDVSFS VPRLRPHAGA FQPAYAVSDR DLVQVILAYR IFMPSAGITV
STRERAGLRD NLIPLGVTRM SAGVSTAVGG HAAHKNVEGQ GDGDGATPQF EISDPRSADE
MASAIAARGY QPVYKDWESV LDGGYGCGIA CAARRTPSGE PVGAPTPAAP RATA
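
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below is one possible way to do that and to recompute simple properties such as GC fraction; the helper names are hypothetical, and the stop-codon set is the one used by translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if seq is a whole number of codons and its last codon is a stop."""
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input in the same block layout as the record (not the real gene).
toy = clean_sequence("ATGGCCGCGT\nGA")
print(toy, ends_with_stop(toy), round(gc_fraction(toy), 2))
```

Pasting the full Gene sequence block into `clean_sequence` and passing the result to `gc_fraction` would allow the GC content field to be compared against the sequence itself.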