Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0002 |
Symbol | |
ID | 8409498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2764 |
End bp | 3687 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645018339 |
Product | ribulose-1,5-biphosphate synthetase |
Protein accession | YP_003175860 |
Protein GI | 257386087 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1635] Flavoprotein involved in thiazole biosynthesis |
TIGRFAM ID | [TIGR00292] thiazole biosynthesis enzyme |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.512704 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAT TCGAGCAGTT CAGCGACGTC GGTGAGGCGG AAGTCACACG CGCGATCGGG CAGGAGTGGA CCGAGGAGTT CATGGACTTC TCGGACTCCG ACGTGATCAT CGTCGGCGGC GGCCCCTCCG GGCTGACCGC GGCCAAGGAA CTGGCCGAGC GCGGCGTCCA GGTCATGGTC GTCGAGAAGA ACAACTACCT CGGCGGGGGC TTCTGGCTTG GCGGGTTCCT GATGAACAAG GTCACCGTTC GAGACCCCGC ACAGAACGTG CTGGACGAAC TCGACGTGGA CTACAAGCAG TCCCAGGACA GCGACGGACT CTACGTCGCC AACGGTCCCG AAGCCTGTTC CGGCCTGATC AAGGCCGCCT GTGACGCCGG TGCGAAGATG CAGAACATGA CGGAGTTCAC CGACATCGTC ATCCGCGAGG ACCACCGCGT CGGCGGGATC GTCATGAACT GGACGCCGGT CCACGCGCTG CCCCGCGAGA TCACCTGCGT CGATCCGATC GCCGTCGAGG CCGACCTCGT CATCGACGCG ACGGGCCACG ACGCGATGGC CGTCAAAAAG CTCGACGAGC GAGGCGTCCT CAACGCGCCC GGTCTCGAAG AGGAAGCCAG CGGCATGGAT TCCACCGGCG ACGACACCTA CGGCGCACCG GGCCACGACT CGCCCGGCCA CGACTCGATG TGGGTCGGCA AGAGCGAGGA CGCCGTCGTC GAGCACACCG GCCTCGCCCA CGACGGCCTC ATCGTCACCG GGATGGCCAC CGCGACCACC TACGGGCTGC CCCGCATGGG TCCGACCTTC GGTGCCATGC TGCTCTCGGG CAAGCGCGCC GCACAGGCCG CGCTGGACGA ACTCGAAGTC GACGCCGAAC CGGTCGACGT GACGGCGACG AACGCGACCC CCGCGGACGA TTAG
|
Protein sequence | MSEFEQFSDV GEAEVTRAIG QEWTEEFMDF SDSDVIIVGG GPSGLTAAKE LAERGVQVMV VEKNNYLGGG FWLGGFLMNK VTVRDPAQNV LDELDVDYKQ SQDSDGLYVA NGPEACSGLI KAACDAGAKM QNMTEFTDIV IREDHRVGGI VMNWTPVHAL PREITCVDPI AVEADLVIDA TGHDAMAVKK LDERGVLNAP GLEEEASGMD STGDDTYGAP GHDSPGHDSM WVGKSEDAVV EHTGLAHDGL IVTGMATATT YGLPRMGPTF GAMLLSGKRA AQAALDELEV DAEPVDVTAT NATPADD
|
| |