Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_1390 |
Symbol | thiH |
ID | 4664913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 1691677 |
End bp | 1693071 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639819620 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_966835 |
Protein GI | 120602435 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00852808 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGAGA TGAAGGCTAC CTGGCTGGAC GACGCCGCAC TGGAAGCGAC GCTCGAGCGC AACGCTCAAG AAGATGCGGT GAAGGGCCGC GAGGTCATCG CCAAGGCAAG GCTACTCGGC GGACTCGACC TTGACGACGT GGCGACGCTC ATCGCCCTCC GCGACCCCGA ACTCGTAGAG GAGATGTTCC AGACCGCACG CGACGTGAAG GAAGAGATCT ACGGTAACCG CCTCGTGCTC TTTGCGCCGC TCTACATCTC GAACCTGTGC TCCAACGAAT GTCTGTACTG TGCCTTCAGA CGATCGAACA CCGAACTCGA TCGCAAGGCG CTGGACATGG ATGCCATTGC CGACGAGACA CGACTCATCG TCCAGCAGGG CCACAAGCGC ATCCTGCTTG TGGCGGGCGA ATCGTACCCG CGCGAAGGCT TCGACTACGT GCTGCGCGCC ATCGATGCCG TCTATTCGGT ACACGAGGGC ACAGGCGAGA TACGCCGACT CAATGTCAAC GTCGCACCCC TCACCGTCGA GCAGTTCCGC GACCTCAAGG CCCGCAACAT CGGGACATAT CAGCTCTTTC AGGAGACGTA CCACCGGGGC ACCTACGCGA AGGTGCATCT GGCCGGCCCG AAGAAGGACT TCGACTGGCG TGCCACCGCC ATGGACAGGG CCATGGAGGC GGGTATCGAC GACGTAGGTA TCGGGCCGTT GTTCGGCCTG TACGACTGGC GCTTCGAAGT GCTCGCCACC CTGCGCCACG CACAGCACCT TGAAGAGGCC TTCGGCGTGG GATGTCACAC CATCAGCGTG CCTCGTCTCG AACCCGCCTG CGGTTCGGAC ATGGCGTCGA ATCCTCCCAG ACCCGTCTCC AATGACGATT TCATGCGCCT TGTCGCCATC CTTCGGCTTG CCGTGCCGTA CACCGGCATC ATCATGTCCA CGCGCGAAAG CGCCGAGATG CGCACGCAGA CGCTGGCCCT CGGCGTTTCG CAGATATCGG CCGGCAGCCG CACGAACCCC GGCGGCTATG CCGAGAACGA GCGTGAAGAG GCTGCGCAGT TCCAGCTTGG CGACCACAGG TCGCTTTCGG AAGTGATCGC CGATGTGGGC AGCATGGGGT ACATCCCCTC GTTCTGTACC GCCTGCTATC GCATGGGCCG CACTGGGCAC GACTTCATGG ACCTCGCCAA GCCGGGGCTC ATCAAGCAGA AGTGCGGGCC CAACGCCCTC GCCACCTTCA AGGAGTACCT GCTCGACTAC GGCACTCCCG AGGCGCGGGC CGCGGGCGAA TCGGTCATCG CAGCCGACCT CGGCAAACTC GACGAGAAGA CGCGCCGTGT GGCTGAACGA CTCATCGCCC GTGTGGACGA GGGCCGTCGG GATGTCTTTG TCTGA
|
Protein sequence | MAEMKATWLD DAALEATLER NAQEDAVKGR EVIAKARLLG GLDLDDVATL IALRDPELVE EMFQTARDVK EEIYGNRLVL FAPLYISNLC SNECLYCAFR RSNTELDRKA LDMDAIADET RLIVQQGHKR ILLVAGESYP REGFDYVLRA IDAVYSVHEG TGEIRRLNVN VAPLTVEQFR DLKARNIGTY QLFQETYHRG TYAKVHLAGP KKDFDWRATA MDRAMEAGID DVGIGPLFGL YDWRFEVLAT LRHAQHLEEA FGVGCHTISV PRLEPACGSD MASNPPRPVS NDDFMRLVAI LRLAVPYTGI IMSTRESAEM RTQTLALGVS QISAGSRTNP GGYAENEREE AAQFQLGDHR SLSEVIADVG SMGYIPSFCT ACYRMGRTGH DFMDLAKPGL IKQKCGPNAL ATFKEYLLDY GTPEARAAGE SVIAADLGKL DEKTRRVAER LIARVDEGRR DVFV
|
| |