Gene Hmuk_1097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1097 
Symbol 
ID8410616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1049345 
End bp1050556 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content69% 
IMG OID645019433 
Producthomoserine O-acetyltransferase 
Protein accessionYP_003176931 
Protein GI257387158 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.076735 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTAG ACCACCAGAC CCTCTCGCTG GGCGAGTTCG AGTTCGCCTG TGGCGAGACG 
ATCCCCGAGC TCGAAGTCGC CTACGAGACC TACGGCGAGT TCGACGGCGA CAACGCCGTC
CTCGTCTGTC ACGCGCTCAC CGGCAGCGCA CACGTCGCCG GCCGCGGTCG CTTCGAAGAC
TCCGATCAGG CCTTCGCCTG GTGGGACAAC ATCGTCGGCC CGGGCAAGGC CATCGACACG
ACCGAGTACT ACGTGATCTG CGTGAACGAC CCCGGCTCCT GTTATGGCAC GTCCGGGCCG
GCCTCGACGA ACCCCGAGAC GGGCGAGCCC TACGGGACGG CGTTCCCGCC GGTGACCGTC
GGCGACTGGA CGGAGGCCCA GCGGGCCGTA CTGGACGAGT TGGGGGTCCC CCACCTCCAC
GCGGTCGTCG GCGGTAGCGT CGGCGGCATG AACGCACTGG ACTGGGTCAA GCGCCATCCT
GATCACGTCG AGCGCGTCGT CGCCATCGCC GCCGCCGCGC GACTCGATTC GCAGTGTCTC
GCGCTGGACG GCATCGCCCG CCGAGCGATC ACGACCGACG ACGACTGGAA CGGGGGCGAC
TACTACGGCG ACGACCGCCC GGACCCGGAC GACGGGCTGG CGCTGGCCCG GGAACTCGGC
CACGTGATGT ACCTCTCGAA GGCGACGATG GAGCAGCGCT TCGGCCGCCG GGCCGCCACG
AGAGAGTACG AGCGAGCCTT CCCGACGGAC CCGGCGGGCC GCTTTTTCCC CTACCGGGAC
GTGGAGTCCT ATCTCGATCA CAACGCCACG AAGTTCGTCG ACCGCTTCGA CGCCAACAGC
TACCTCTACC TGACTCGGGC GATGGACAAC TACGACCTCT CGTCGGGCTT CGAGTCCGAC
GCGGACGCCG TCGCCGCTTT CGACGGCGAG GCGCTGCTGA TGTCCTTTAC CGGCGACTGG
CACTTCACGA CAGCACAGTC CGAGGAGCTG GCCGAGTCGT TCCGCGAGAC CGACACCGCG
ACCGCACACC ACGTGGTCGA CTCCGACTAC GGCCACGACG CCTTCCTCGT CGAGCCCGAT
CGCGTCGGCC CGCCCGTCGC GGACTTCCTC GCTGACGGCG TCGGTGGCAG CGCCGTCTCC
GACACCACCG ACGACGACGA CGGCGACGAC GGTCCCGACC ACGCACCGGT CCACACGAGC
CTCTTCTCGT GA
 
Protein sequence
MDVDHQTLSL GEFEFACGET IPELEVAYET YGEFDGDNAV LVCHALTGSA HVAGRGRFED 
SDQAFAWWDN IVGPGKAIDT TEYYVICVND PGSCYGTSGP ASTNPETGEP YGTAFPPVTV
GDWTEAQRAV LDELGVPHLH AVVGGSVGGM NALDWVKRHP DHVERVVAIA AAARLDSQCL
ALDGIARRAI TTDDDWNGGD YYGDDRPDPD DGLALARELG HVMYLSKATM EQRFGRRAAT
REYERAFPTD PAGRFFPYRD VESYLDHNAT KFVDRFDANS YLYLTRAMDN YDLSSGFESD
ADAVAAFDGE ALLMSFTGDW HFTTAQSEEL AESFRETDTA TAHHVVDSDY GHDAFLVEPD
RVGPPVADFL ADGVGGSAVS DTTDDDDGDD GPDHAPVHTS LFS