Gene Information |
Locus tag | Hmuk_2307 |
Symbol | |
ID | 8411848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2226397 |
End bp | 2227584 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645020650 |
Product | thiamine biosynthesis protein |
Protein accession | YP_003178126 |
Protein GI | 257388353 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
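
The coordinate and length fields above can be cross-checked against one another. The sketch below is illustrative only and is not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2226397
END_BP = 2227584


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the final stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (1188 bp) and Protein Length (395 aa) fields.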
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.0561858 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCCGC CGGGAGCCGA CATCGTCCTC GTTCGCCACG GCGAGATCGG GACCAAGAGC GAGCAGGTCC GTCGATCCAT GGAGGAGCGG CTGGCGCGGA ACCTCTCGGC CCTGCTCGCG GATCGCGGCG TCGACGGCTC CGTCGAGCGC GAACGGACGC GGCTGTTCGT CCACAGCGAC GAGCCCAGCG CCGCCGTCGG GGCCGCGACA GACACCTTCG GCGTCGTCTC TGCCAGCGCT GCCGTCCGCA CCGAGCCGAC GCTCGACGCC ATCTGTGAGG GGCTGGCAGC GATCGCGCGC GAGCGATTCG ACGGTGGCAC CTTCGCCGTC GACGCCCGCC GGGCGGGCCA GCAGTCCGCC CACGACTTCT CCAGCGAGGA CATCGAGTCC GACGGCGGCG CTGCCGTGTG GGCCGCGATC GAAGCGGCGG GCGGCGATCC GGCGGTCGAT CTCGACGAGC CGGACCTGAC GTTCCACGTC GAGTGTCGTC GGGAGGTCGC GTACCTGTTT CTCGACAAGC AGGCGGGTCC CGGCGGGCTT CCCGTCGGGA CTCAAGAGCC CGTCGTCGTC TTGCTCTCCG GGGGAATCGA CTCTCCCGTC GCGGCCTGGA AGCTGCTCAA GCGCGGCTGT CCCGTCGTGC CGGTGTACGT CGATCTCGGG GCGTACGGCG GTCCGGACCA CCGCGCGCGA GCCCTCTCGA CCGCCGAGAC GCTCGCCAGC TACGTCCCGA ACGTCGACCT CTCGGTGCGG GTCGCCGACG GAGGGGCCGT CGTGGAGCGG CTCGCAGACG AGCTGGACGC ACGGCGGATG CTCGCCCTGC GGCGGTTCAT GCTCGCCGTC GGGGCCCGGG TCGCCGAGCG TACCGACGCC GTCGGCGTGG CGACCGGCGA GGCGATCGGA CAGAAGTCGA GCCAGACGAG CGCCAACCTC GCGGTGACGG ACGTGGCCGT CGACTGTCCC GTCTTCCGGC CGAACCTGAC CGCCGACAAG GCCGACATCA CGCAGCTGGC CCGCGAGATC GGGACCTTCG AGGCGTCGAC GATCCCGACC GGCTGTAACC GCGTCGCGCC GTCGCTGCCC GAGACGAACG CCGACCTGCA CGCCGTCCGC GAACGCGAAC CGGACGACCT GTTCGAGCGT GCGAGGGCCG TCGCTGACCG CGCCGAAGTC GTCGCCCTCG ACCGCTGA
|
Protein sequence | MHPPGADIVL VRHGEIGTKS EQVRRSMEER LARNLSALLA DRGVDGSVER ERTRLFVHSD EPSAAVGAAT DTFGVVSASA AVRTEPTLDA ICEGLAAIAR ERFDGGTFAV DARRAGQQSA HDFSSEDIES DGGAAVWAAI EAAGGDPAVD LDEPDLTFHV ECRREVAYLF LDKQAGPGGL PVGTQEPVVV LLSGGIDSPV AAWKLLKRGC PVVPVYVDLG AYGGPDHRAR ALSTAETLAS YVPNVDLSVR VADGGAVVER LADELDARRM LALRRFMLAV GARVAERTDA VGVATGEAIG QKSSQTSANL AVTDVAVDCP VFRPNLTADK ADITQLAREI GTFEASTIPT GCNRVAPSLP ETNADLHAVR EREPDDLFER ARAVADRAEV VALDR
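
The two sequence fields can be related to the GC content and translation table fields with a short script. This is a minimal sketch, not the database's own method. It assumes the sequences are pasted as plain strings with the 10-character spacing shown above. It uses the amino-acid assignments of translation table 11, which are the same as those of the standard code (the two tables differ in which codons may act as starts), and it does not handle alternative start codons. All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 shares these
# assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence field above.
    prefix = "ATGCACCCGC CGGGAGCCGA CATCGTCCTC"
    print(translate(prefix))
```

Passing the full Gene sequence field to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields.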