Gene Hlac_1006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1006 
Symbol 
ID7401901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1002433 
End bp1003599 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content71% 
IMG OID643708072 
ProductThiolase 
Protein accessionYP_002565673 
Protein GI222479436 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.63654 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGACG TACGCGTCGC CGGGGTCGGA CTTACGCACT TTGGGAGCCA TCCGGACCGG 
ACAGGCCGAG ATCTCTTCGC GACAGCCGCG CTACGAGCGT TCGAAGATTC CGGGGTTCCC
CGCGAGGACG TAGAGGAACT GAACTACGGG AACTTCATGG GCTCACTCGC CGAGCACCAG
GGCCATCAGG CGCCGCTGAT GGCCGAGGCG GCCGGCGTGA ACTGCCCCTC GACCCGCTAC
GAGGAGGCGT GTGCCTCGGC CGGCGTCGCG GTGCGCGAGG CCGTCCGGAC CGTCGCCGCG
GGCGACGCCG ACGTGGTGTT GGCCGGCGGG ATGGAGCGCA TGACAAACCT CCCAACCGAC
GAGGTGACCG AGGGGCTCGC TATCGCCGCC GACGACCTGT TCGAGGTGCG AGCGGGAGTG
ACGTTCCCCG GCGCGTACGC GCTGATGGCG ACCGCCTACT TCGACGCGTA CGGCGGGAGC
CGCCGGGATC TGGCCCACAT CGCCTCGAAG AACCACGCGA ACGCGGTCCC CAACGAGTAC
GCTCAGTACC GCCAGGAAGT GCCCGTCGAG AAGGCGTTGG ACGCCCCACC TGTCGCGGAG
CCGCTCCACC TCTACGACGC CTGCCCGATC ACCGACGGCG CGAGCGCGCT CGTGATCGTC
TCCGAGGAGT ACGCGGCTGA GCACGACGTG GACGCCCCGG TCGCGGTGAC GGGGACCGGA
CAGGGGACCG ACCGGATGGC GCTCGCGGAC CGCGAGGAAC TCGCGCGCAC CCCCGCCGCC
GACGACGCGG CCGACGCGGC CTACGCCGAC GCCGGGATCG GCCCCGACGA CGTCGACGTG
GCGGAGGTCC ACGACTGCTT CACCATCGCG GAGGTGCTCG CGCTGGAGTC GCTCGGCTTC
TTCGAACCCG GCGAGGGGAT TTCGGCCGCC CGCAATGGGG TCACGACCGC CGACGGCGAC
CTCCCCGTGA ACCTCTCGGG CGGACTGAAG GCGAAGGGCC ACCCGGTCGG CGCGACCGGC
GGTTCGCAGA TCGCGGAGCT GACTCGGCTC TTGCGAGGTG ACCACCCGAA CAGCGACCAC
GTCGCCGACG CCGAGGTGGG CGTCGCGCAC AACGCCGGCG GGACAGTGGC CAGCGCGGTC
GTCCACGTGC TGGAGGTGGC GGAATGA
 
Protein sequence
MIDVRVAGVG LTHFGSHPDR TGRDLFATAA LRAFEDSGVP REDVEELNYG NFMGSLAEHQ 
GHQAPLMAEA AGVNCPSTRY EEACASAGVA VREAVRTVAA GDADVVLAGG MERMTNLPTD
EVTEGLAIAA DDLFEVRAGV TFPGAYALMA TAYFDAYGGS RRDLAHIASK NHANAVPNEY
AQYRQEVPVE KALDAPPVAE PLHLYDACPI TDGASALVIV SEEYAAEHDV DAPVAVTGTG
QGTDRMALAD REELARTPAA DDAADAAYAD AGIGPDDVDV AEVHDCFTIA EVLALESLGF
FEPGEGISAA RNGVTTADGD LPVNLSGGLK AKGHPVGATG GSQIAELTRL LRGDHPNSDH
VADAEVGVAH NAGGTVASAV VHVLEVAE