Gene Hlac_1850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1850 
Symbol 
ID7400042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1854553 
End bp1855695 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content68% 
IMG OID643708919 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_002566498 
Protein GI222480261 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0860888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.20671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGACG AGACCACTCC GGTTATCGCC GCAGCCTACC GAACGCCGCA GGGACGCGAC 
GGGGGCGTCT ACGCGGACGT CCGCAGCGAG GATCTTTCGA CGCGCCTCAT CGACCACACG
CTCGCGGAGA CCGGGCTGAC CGGCGACCAC GTCGACGACC TGATGTGGGG GGTCGCCCAG
CAGCGGACCG AACAGGACAA CAACGTCGCC CGCGTCATCG CGCTCCTCTC TGACCTCGGT
GAATCGGTAC CGGCGACCTC GATCAACCGC TGGTGCGCCT CCTCGATGCA GGCGATCATC
TCGGCAGCGG ACGCCATCGC GGCCGGGAAC CGCGACTGCA TCATCGCCGG CGGCGTCGAG
AATATGAGTC GCGTCCCGAT GGACGGCGAC TCCTACGAAC ACCTCCACCC CGAGTTGTCG
GAGCAGTACA ACGTCTTTCA GCTCCAGATG GGAATGACCG CCGAGAAGGT CGCCGAGGAG
TACGAGGTGA GCCGCGAGGC CCAAGACGAG TACGCCGCCC GGAGCCACCA GCGTGCCGCC
GAGGCGACGG AGTCGGGACG CTTCGACGAC GAGATCGTCC CCGTGGAGAC CGACGACGGC
CTGATCGACG AAGACGAAGG GATCCGCCCG GACACGACCG CCGAGAAGCT CTCCGGCCTC
TCGCCGGCGT TCACGGGGGA CGGCACGGTG ACCGCGGGGA ACTCCTCGCA GATCTCGGAC
GGCGCGTCGC TGACGCTCGT CACGAGCAAG GCGTTCGCAG AAGACCACGG GCTCGACGTG
CTCGCGGAGG TCGGCACGAA CAACGTCGCC GGCGTCGACC CCACCGTGAT GGGGATCGGC
CCGGTGCCCG CGACGCGCGG CCTGCTTGAC CGCGCCGGTC GGACCATCGA CGACTACGAC
CTCGTCGAGC TCAACGAGGC GTTCGCCTCC CAGTGTGAGT ACTCCCGCCG CGAACTCGGA
ATCGACGAGG AGCAGTACAA CGTCAACGGC GGCGCCATCG CCATCGGCCA CCCGCTCGGC
GCCTCCGGCG CGCGACTCCC CGTCACCCTG ATCCACGAGA TGCAGAAGCG CGACGCCGAC
CGCGGCCTTG CGACCCTCTG TGTCGGCTTC GGACAGGGCG CAGCGATCGA GTTCAGTCGA
TAA
 
Protein sequence
MTDETTPVIA AAYRTPQGRD GGVYADVRSE DLSTRLIDHT LAETGLTGDH VDDLMWGVAQ 
QRTEQDNNVA RVIALLSDLG ESVPATSINR WCASSMQAII SAADAIAAGN RDCIIAGGVE
NMSRVPMDGD SYEHLHPELS EQYNVFQLQM GMTAEKVAEE YEVSREAQDE YAARSHQRAA
EATESGRFDD EIVPVETDDG LIDEDEGIRP DTTAEKLSGL SPAFTGDGTV TAGNSSQISD
GASLTLVTSK AFAEDHGLDV LAEVGTNNVA GVDPTVMGIG PVPATRGLLD RAGRTIDDYD
LVELNEAFAS QCEYSRRELG IDEEQYNVNG GAIAIGHPLG ASGARLPVTL IHEMQKRDAD
RGLATLCVGF GQGAAIEFSR