Gene Information |
Locus tag | Dde_1556 |
Symbol | thiH |
ID | 3755995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 1575356 |
End bp | 1576525 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637782433 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_388048 |
Protein GI | 78356599 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
| |
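The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1575356, 1576525)
print(length_bp, protein_length(length_bp))  # 1170 389
```

Both results agree with the Gene Length and Protein Length fields listed in the record.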
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAA TCAGCGCTGA CATGAGTGTG AGAACCGCAC AGGCGGGAGG TTTTTACGAA GTTGTCCGCC AGTTGCAGGA TGTGGATGTG CAGGCCGCCT GCATGGCGGC TGACGGATAC AGAGTGCGGC AGGCTCTGGC AAAGGATACC CTTTCTCCGG AAGACTTTCT GGCTTTGCTT TCACCCGCGG CCCGTCCGTA CATGGAATCA ATGGCGCATA AGGCCCGTGA TGTCACGGTG CGCCAGTTCG GCCGCACCAT CCAGCTGTTT ACTCCGCTGT ACCTTTCAAA CTGGTGTACA AACAGGTGCG TGTATTGCAG CTTCAATGCC TGCAGTGGCA TAGACCGCAT GCAGCTGGAT GCTGCGGGTG TGCTGCGCGA AGGGCAGGCC ATAGCCGCCA CAGGACTGCG CCATCTGCTG CTGCTGACCG GCGAGGCTCC GGCAAAGGCT TCCGTTGACT ACATCCGCGA CTGTGTGCGC GTGTTGCGTC CTCTGTTTCC GTCACTGAGT ATTGAAGTGT ACGCCCTGAC CGAGCCTGAG TACCGCACAC TTGCCGTTGC CGGTGTGGAC GGCATGACAC TTTTTCAGGA AACATATAAC GAGGCTTTGT ACCCGTCATT GCATCCTGCC GGTCCCAAAA GCAATTATCA TTTCAGGCTG GGGGCGCCGG AAAGAGCATG CCGTGCCGGA ATGCGCAATG TGAATATAGG TGCGTTGCTC GGACTGGATC AGTGGCAGCG TGATGCTTTT ATGACGGGTA TGCATGCCCT TTGGCTGCAG CACAGGTATC CGGGAGTGGA TATTGCTGTT TCGCTGCCGC GTATGCGTCC TTATGCCGGC AGTTTTCAGC CGGTGTGTGA CGTGGATGAT CGTGCGCTGG TGCAGATACT GCTGGCCATG CGGCTTTTTC TGCCGCGGTG CGGCATTACC ATTTCCACGC GCGAACGGCC TGCATTCCGC GATAATCTGA TACCGCTGGG AGTGACCCGC ATGAGTGCCG GTGTATCCAC CGCTGTGGGC GGACATGCTG AAAACGAACA TTCCTCTGTA GGGCAGTTTG AAATTTCTGA CCCGCGCAGT GTGGACGAGG TGGTTGAAGC CGTAAGCCGG GCCGGTTATC AGGCCGTGTT TAAAGACTGG CATCCGCTGG AAGACAGCTG CGCGGTATGA
Protein sequence | MSEISADMSV RTAQAGGFYE VVRQLQDVDV QAACMAADGY RVRQALAKDT LSPEDFLALL SPAARPYMES MAHKARDVTV RQFGRTIQLF TPLYLSNWCT NRCVYCSFNA CSGIDRMQLD AAGVLREGQA IAATGLRHLL LLTGEAPAKA SVDYIRDCVR VLRPLFPSLS IEVYALTEPE YRTLAVAGVD GMTLFQETYN EALYPSLHPA GPKSNYHFRL GAPERACRAG MRNVNIGALL GLDQWQRDAF MTGMHALWLQ HRYPGVDIAV SLPRMRPYAG SFQPVCDVDD RALVQILLAM RLFLPRCGIT ISTRERPAFR DNLIPLGVTR MSAGVSTAVG GHAENEHSSV GQFEISDPRS VDEVVEAVSR AGYQAVFKDW HPLEDSCAV