Gene Information |
Locus tag | Hmuk_1097 |
Symbol | |
ID | 8410616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 1049345 |
End bp | 1050556 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645019433 |
Product | homoserine O-acetyltransferase |
Protein accession | YP_003176931 |
Protein GI | 257387158 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2021] Homoserine acetyltransferase |
TIGRFAM ID | [TIGR01392] homoserine O-acetyltransferase |
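The coordinate and length fields above are mutually consistent, which a short sketch can check. It assumes 1-based, inclusive coordinates and a single terminal stop codon; the record states neither, and the function name `cds_lengths` is illustrative only.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Start bp and End bp as listed in the record.
gene_len, protein_len = cds_lengths(1049345, 1050556)
print(gene_len, protein_len)  # 1212 403
```

Under those assumptions the result matches the Gene Length (1212 bp) and Protein Length (403 aa) fields.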
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.076735 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTAG ACCACCAGAC CCTCTCGCTG GGCGAGTTCG AGTTCGCCTG TGGCGAGACG ATCCCCGAGC TCGAAGTCGC CTACGAGACC TACGGCGAGT TCGACGGCGA CAACGCCGTC CTCGTCTGTC ACGCGCTCAC CGGCAGCGCA CACGTCGCCG GCCGCGGTCG CTTCGAAGAC TCCGATCAGG CCTTCGCCTG GTGGGACAAC ATCGTCGGCC CGGGCAAGGC CATCGACACG ACCGAGTACT ACGTGATCTG CGTGAACGAC CCCGGCTCCT GTTATGGCAC GTCCGGGCCG GCCTCGACGA ACCCCGAGAC GGGCGAGCCC TACGGGACGG CGTTCCCGCC GGTGACCGTC GGCGACTGGA CGGAGGCCCA GCGGGCCGTA CTGGACGAGT TGGGGGTCCC CCACCTCCAC GCGGTCGTCG GCGGTAGCGT CGGCGGCATG AACGCACTGG ACTGGGTCAA GCGCCATCCT GATCACGTCG AGCGCGTCGT CGCCATCGCC GCCGCCGCGC GACTCGATTC GCAGTGTCTC GCGCTGGACG GCATCGCCCG CCGAGCGATC ACGACCGACG ACGACTGGAA CGGGGGCGAC TACTACGGCG ACGACCGCCC GGACCCGGAC GACGGGCTGG CGCTGGCCCG GGAACTCGGC CACGTGATGT ACCTCTCGAA GGCGACGATG GAGCAGCGCT TCGGCCGCCG GGCCGCCACG AGAGAGTACG AGCGAGCCTT CCCGACGGAC CCGGCGGGCC GCTTTTTCCC CTACCGGGAC GTGGAGTCCT ATCTCGATCA CAACGCCACG AAGTTCGTCG ACCGCTTCGA CGCCAACAGC TACCTCTACC TGACTCGGGC GATGGACAAC TACGACCTCT CGTCGGGCTT CGAGTCCGAC GCGGACGCCG TCGCCGCTTT CGACGGCGAG GCGCTGCTGA TGTCCTTTAC CGGCGACTGG CACTTCACGA CAGCACAGTC CGAGGAGCTG GCCGAGTCGT TCCGCGAGAC CGACACCGCG ACCGCACACC ACGTGGTCGA CTCCGACTAC GGCCACGACG CCTTCCTCGT CGAGCCCGAT CGCGTCGGCC CGCCCGTCGC GGACTTCCTC GCTGACGGCG TCGGTGGCAG CGCCGTCTCC GACACCACCG ACGACGACGA CGGCGACGAC GGTCCCGACC ACGCACCGGT CCACACGAGC CTCTTCTCGT GA
|
Protein sequence | MDVDHQTLSL GEFEFACGET IPELEVAYET YGEFDGDNAV LVCHALTGSA HVAGRGRFED SDQAFAWWDN IVGPGKAIDT TEYYVICVND PGSCYGTSGP ASTNPETGEP YGTAFPPVTV GDWTEAQRAV LDELGVPHLH AVVGGSVGGM NALDWVKRHP DHVERVVAIA AAARLDSQCL ALDGIARRAI TTDDDWNGGD YYGDDRPDPD DGLALARELG HVMYLSKATM EQRFGRRAAT REYERAFPTD PAGRFFPYRD VESYLDHNAT KFVDRFDANS YLYLTRAMDN YDLSSGFESD ADAVAAFDGE ALLMSFTGDW HFTTAQSEEL AESFRETDTA TAHHVVDSDY GHDAFLVEPD RVGPPVADFL ADGVGGSAVS DTTDDDDGDD GPDHAPVHTS LFS
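Both sequences are displayed in space-separated blocks of 10 characters, so the spaces have to be removed before any downstream use. The sketch below is one possible way to do that and to compute a GC fraction for comparison with the GC content field; the helper names are hypothetical, and it is demonstrated only on the first three blocks of the gene sequence.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into blocks of 10."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence shown above.
prefix = clean_sequence("ATGGACGTAG ACCACCAGAC CCTCTCGCTG")
print(len(prefix), gc_fraction(prefix))
```

Passing the full gene sequence to `clean_sequence` and then `gc_fraction` gives a value that can be compared with the reported GC content.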