Gene Dvul_1390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1390 
SymbolthiH 
ID4664913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1691677 
End bp1693071 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content63% 
IMG OID639819620 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_966835 
Protein GI120602435 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00852808 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAGA TGAAGGCTAC CTGGCTGGAC GACGCCGCAC TGGAAGCGAC GCTCGAGCGC 
AACGCTCAAG AAGATGCGGT GAAGGGCCGC GAGGTCATCG CCAAGGCAAG GCTACTCGGC
GGACTCGACC TTGACGACGT GGCGACGCTC ATCGCCCTCC GCGACCCCGA ACTCGTAGAG
GAGATGTTCC AGACCGCACG CGACGTGAAG GAAGAGATCT ACGGTAACCG CCTCGTGCTC
TTTGCGCCGC TCTACATCTC GAACCTGTGC TCCAACGAAT GTCTGTACTG TGCCTTCAGA
CGATCGAACA CCGAACTCGA TCGCAAGGCG CTGGACATGG ATGCCATTGC CGACGAGACA
CGACTCATCG TCCAGCAGGG CCACAAGCGC ATCCTGCTTG TGGCGGGCGA ATCGTACCCG
CGCGAAGGCT TCGACTACGT GCTGCGCGCC ATCGATGCCG TCTATTCGGT ACACGAGGGC
ACAGGCGAGA TACGCCGACT CAATGTCAAC GTCGCACCCC TCACCGTCGA GCAGTTCCGC
GACCTCAAGG CCCGCAACAT CGGGACATAT CAGCTCTTTC AGGAGACGTA CCACCGGGGC
ACCTACGCGA AGGTGCATCT GGCCGGCCCG AAGAAGGACT TCGACTGGCG TGCCACCGCC
ATGGACAGGG CCATGGAGGC GGGTATCGAC GACGTAGGTA TCGGGCCGTT GTTCGGCCTG
TACGACTGGC GCTTCGAAGT GCTCGCCACC CTGCGCCACG CACAGCACCT TGAAGAGGCC
TTCGGCGTGG GATGTCACAC CATCAGCGTG CCTCGTCTCG AACCCGCCTG CGGTTCGGAC
ATGGCGTCGA ATCCTCCCAG ACCCGTCTCC AATGACGATT TCATGCGCCT TGTCGCCATC
CTTCGGCTTG CCGTGCCGTA CACCGGCATC ATCATGTCCA CGCGCGAAAG CGCCGAGATG
CGCACGCAGA CGCTGGCCCT CGGCGTTTCG CAGATATCGG CCGGCAGCCG CACGAACCCC
GGCGGCTATG CCGAGAACGA GCGTGAAGAG GCTGCGCAGT TCCAGCTTGG CGACCACAGG
TCGCTTTCGG AAGTGATCGC CGATGTGGGC AGCATGGGGT ACATCCCCTC GTTCTGTACC
GCCTGCTATC GCATGGGCCG CACTGGGCAC GACTTCATGG ACCTCGCCAA GCCGGGGCTC
ATCAAGCAGA AGTGCGGGCC CAACGCCCTC GCCACCTTCA AGGAGTACCT GCTCGACTAC
GGCACTCCCG AGGCGCGGGC CGCGGGCGAA TCGGTCATCG CAGCCGACCT CGGCAAACTC
GACGAGAAGA CGCGCCGTGT GGCTGAACGA CTCATCGCCC GTGTGGACGA GGGCCGTCGG
GATGTCTTTG TCTGA
 
Protein sequence
MAEMKATWLD DAALEATLER NAQEDAVKGR EVIAKARLLG GLDLDDVATL IALRDPELVE 
EMFQTARDVK EEIYGNRLVL FAPLYISNLC SNECLYCAFR RSNTELDRKA LDMDAIADET
RLIVQQGHKR ILLVAGESYP REGFDYVLRA IDAVYSVHEG TGEIRRLNVN VAPLTVEQFR
DLKARNIGTY QLFQETYHRG TYAKVHLAGP KKDFDWRATA MDRAMEAGID DVGIGPLFGL
YDWRFEVLAT LRHAQHLEEA FGVGCHTISV PRLEPACGSD MASNPPRPVS NDDFMRLVAI
LRLAVPYTGI IMSTRESAEM RTQTLALGVS QISAGSRTNP GGYAENEREE AAQFQLGDHR
SLSEVIADVG SMGYIPSFCT ACYRMGRTGH DFMDLAKPGL IKQKCGPNAL ATFKEYLLDY
GTPEARAAGE SVIAADLGKL DEKTRRVAER LIARVDEGRR DVFV