Gene Information |
Locus tag | Daud_0163 |
Symbol | thiH |
ID | 6027285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | + |
Start bp | 186717 |
End bp | 188192 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641593015 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001716359 |
Protein GI | 169830377 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGACCA CTACGGAAAA AGCCTGGCTG GACGAACGCC TGGCATACAT CAAGGCCTAT GAGGCTTCCG AAAAAAGGCC GAGCTTCGTG AGCGACGCGG AAATTGAGGC CGTCTTGAAA CGCAAGGCCG ATCCCGAGCG GCTGGAAGTG GAAGAGGTTC TAAGCAAGGC CAAGGAGCTT CACGGACTAA CCCCGGACGA CGCAGCCGTA TTGCTGAACA ACCGGGACCC GGAACTCTGG GCGGAGATTT TTGCGACCGC GCACTGGATC AAGCAGGAGG TCTACGGTAA CCGGATCGTT CTGTTCGCCC CCCTCTACAT TTCCAGTCCG TGCGTAAACA ACTGCGCTTA CTGCGGCTTC CGGCACAGCA ATGATCAGGT CGCCAAAAAG ACCCTTTCGC CGGCCGAACT GGAGGCCGAA GTAAAGGCGC TGATCACCAA GGGGCACAAA CGGCTGATCG TGGTTTATGG TGAGCACCCG GCCAGCGACG TCGATTTTAT GTGCCGGACC ATCGAAACGA TCTACGCCGT CAAAGAAGGC CGGGGCGAGA TCCGCAGGGT CAACGTCAAC GCCGCGCCCC TAACCGTGGA AGAATACCGG CGGCTGAAAG AAGTCGGCAT CGGTACCTAT CAGGTGTTTC AGGAGACGTA CCACCTGACC ACCTACCGGA AGATGCACCC GGCGAACACC CTCAAGGGTT CGTTCCGCTG GCGGTTGTTC GCCCTGCACC GGGCGCAGGA AGCAGGAATC GACGACGTGG CCGTCGGCGC GCTCTTCGGG CTGTATGACT GGCGCTTCGA GGTACTGGGC CTCCTTTACC ACGCCCTGGA CCTGGAGCGG GAGTTCGGCG TGGGCCCGCA CACCATCTCG GTCCCCCGGC TGGAGCCGGC CTTGAACACC CCGCTGACCA CCAGCTCACC CTACCGGGTG GCCGATGAGG ACTTCAAGAA GGCGGTCACC GTACTGCGGT GCGCCGTGCC TTACACCGGT ATCATCCTCA CCTGCCGCGA AAAGCCCGCT TTGCGGCGGG AGGTCATCGC GCTGGGCGTC TCGCAGGTCG ACGCCGGGTC CCGGACCGCC GTGGGCGGCT ACGCGGAGAT GGAACGGGAA CACATCCCCG ACCGCGAGCA GTTCCAACTG GCGGACACCC GCTCCCTGGA CGAGTATATT CTGGAACTGT GCCGGGACGG GTACATCCCT TCCTTCTGCA CGGCGGGATA CCGTACCGGT CGCACCGGCT GCCACTTTAT GTCCTTTGCC AAGCAGGGTT TGATTAAGAA TTTCTGCCTG CCGAACGCCG TGCTCACTTT CAAGGAATAC CTTCTCGACT ACGCTTCGCC CGAAACCAGG GATGCCGGGG AAAAAACCAT CGCCCGGCAC GTCGAAGACT TTGCCCGCCG GATGCCGCAA CGCGCCGAAA AACTGAAAGA GATGCTTTCC CGGATGGAGG GTGGCCAGCG TGACCTGTAC TTTTAA
|
Protein sequence | MLTTTEKAWL DERLAYIKAY EASEKRPSFV SDAEIEAVLK RKADPERLEV EEVLSKAKEL HGLTPDDAAV LLNNRDPELW AEIFATAHWI KQEVYGNRIV LFAPLYISSP CVNNCAYCGF RHSNDQVAKK TLSPAELEAE VKALITKGHK RLIVVYGEHP ASDVDFMCRT IETIYAVKEG RGEIRRVNVN AAPLTVEEYR RLKEVGIGTY QVFQETYHLT TYRKMHPANT LKGSFRWRLF ALHRAQEAGI DDVAVGALFG LYDWRFEVLG LLYHALDLER EFGVGPHTIS VPRLEPALNT PLTTSSPYRV ADEDFKKAVT VLRCAVPYTG IILTCREKPA LRREVIALGV SQVDAGSRTA VGGYAEMERE HIPDREQFQL ADTRSLDEYI LELCRDGYIP SFCTAGYRTG RTGCHFMSFA KQGLIKNFCL PNAVLTFKEY LLDYASPETR DAGEKTIARH VEDFARRMPQ RAEKLKEMLS RMEGGQRDLY F
|
| |
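The coordinate, length, and sequence fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source record: the function names (`gene_length`, `clean`, `translate`) are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the codon assignments are the NCBI standard ones, which translation table 11 shares for elongation codons. Only the first 30 bases of the gene sequence are used as sample input.

```python
# Illustrative helpers (hypothetical names) for sanity-checking a gene record.

BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
# Translation table 11 uses these same assignments for elongation codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        a, b, c = (BASES.index(ch) for ch in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


length = gene_length(186717, 188192)   # Start bp and End bp from the record
protein_length = length // 3 - 1       # subtract one codon for the stop

# First 30 bases of the gene sequence, as formatted in the record.
sample = clean("ATGCTGACCA CTACGGAAAA AGCCTGGCTG")

print(length, protein_length, translate(sample))
```

Under these assumptions the computed lengths agree with the Gene Length and Protein Length fields, and the translated sample matches the first ten residues of the listed protein sequence.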