Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2472 |
Symbol | |
ID | 8384774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2546225 |
End bp | 2547670 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644973546 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_003131369 |
Protein GI | 257053536 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.709563 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGGAA CCCAACTGCA GCGCGCACGC AACGGTGAGA TCACGCCGGC GATGGAGCGC GTCGCCGAGC GGGAGAACCG TGATCCCGAG TACGTCCGCG AGAAGGTCGC CGCGGGCGAG GCAGTGATCC CGAACAACCA CGAGCACGAC TCGCTGGATC CGATGATTAT CGGGAAGGAC TTTTCGACGA AAGTCAACGC CAACATCGGC AACAGCGAAC CCGAAAGCGA TATCGAAACC GAACGGGAGA AACTCCACAC GGCCGTCGAA TACGGCGCGG ACACGGTGAT GGATCTCTCG ACGGGTGGTG ACATTCCGGA ACTTCGCGCC GGCCAGATCG AACACTCGCC GGTCCCGCTC GGGACAGTGC CGATCTACGA GGCGCTAAAG CAGGCCGGAT CACCCGAGGA ACTCACGGCG GACCTCCTCC TGGACGTGAT CGAGTCCCAG GCCGCCCAGG GCACGGACTA TCAGACGATT CATGCGGGGG TTCTCCGGGA GCACTTGCCG CTGACCGACG GCCGGATCAC GGGCATCGTC TCCCGCGGTG GGTCGATTCT CGCCGAATGG ATGGAAAATC ACGGCGAGCA GAACCCGCTG TACACGCATT TCGACGAGAT CTGTGCGATC CTTGCGGAAT ACGACGTGAC GATCAGTCTC GGCGACGGGC TCCGACCCGG GAGCCTGGCA GACGCCAACG ACGACGCCCA GCTGGCGGAA CTGGAGACGC TGGGTGACCT CACCGAGCGC GCCCACGAGC ACGGCGTCCA GGTCATGGTC GAGGGGCCGG GCCACGTCCC CCTAGACGAG ATCGGCGAAC AGGTCGAGTA TCAACAGGAG GTCTGTGACG GCGCGCCGTT CTACCTGCTG GGCCCGCTGG TGACTGACGT TGCGCCGGGG TACGACCATA TCACGAGTGC GATCGGCGCA ACCGAGGCCG CCCGTCACGG CGCGGCGATG CTCTGCTACG TCACACCCAA AGAGCACCTC GGTTTGCCCG ACGCCGAGGA CGTCCGGGAC GGGCTGGCGG CCTACCGGAT CGCCGCTCAT GCGGGCGACG TGGCCGCCGG GAAGCCAGGC GCTCGCGACT GGGACGACGC GCTTTCCCAG GCGCGGTACA ACTTCGACTG GCGCGAGCAG TTCAACCTCG CGCTGGACCC CGACCGAGCC AAGGAGTATC ACGACCAGAC GCTCCCGGAG GACAACTACA AGGAAGCCCG CTACTGCTCG ATGTGTGGCG CGGAATTTTG CTCGATGCGG ATCGACCAGG ACGCCCGTGA GGGGGAGGAG ATGGAGGGAC TGGACGCCGA CGTCGATCTC GCGGACTCGC CCGCAGCCGA CGTCAATCTG CCGCCGACGG GCAAACACGA TACAAGTGGC CTGCCCACGG TGCCCGAGGC GTTGTGTGAC CACGCCGGGG GCGAGGGTTT GCCGGGCGAC GACTGA
|
Protein sequence | MMGTQLQRAR NGEITPAMER VAERENRDPE YVREKVAAGE AVIPNNHEHD SLDPMIIGKD FSTKVNANIG NSEPESDIET EREKLHTAVE YGADTVMDLS TGGDIPELRA GQIEHSPVPL GTVPIYEALK QAGSPEELTA DLLLDVIESQ AAQGTDYQTI HAGVLREHLP LTDGRITGIV SRGGSILAEW MENHGEQNPL YTHFDEICAI LAEYDVTISL GDGLRPGSLA DANDDAQLAE LETLGDLTER AHEHGVQVMV EGPGHVPLDE IGEQVEYQQE VCDGAPFYLL GPLVTDVAPG YDHITSAIGA TEAARHGAAM LCYVTPKEHL GLPDAEDVRD GLAAYRIAAH AGDVAAGKPG ARDWDDALSQ ARYNFDWREQ FNLALDPDRA KEYHDQTLPE DNYKEARYCS MCGAEFCSMR IDQDAREGEE MEGLDADVDL ADSPAADVNL PPTGKHDTSG LPTVPEALCD HAGGEGLPGD D
|
| |