Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1006 |
Symbol | |
ID | 7401901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1002433 |
End bp | 1003599 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643708072 |
Product | Thiolase |
Protein accession | YP_002565673 |
Protein GI | 222479436 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.63654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGACG TACGCGTCGC CGGGGTCGGA CTTACGCACT TTGGGAGCCA TCCGGACCGG ACAGGCCGAG ATCTCTTCGC GACAGCCGCG CTACGAGCGT TCGAAGATTC CGGGGTTCCC CGCGAGGACG TAGAGGAACT GAACTACGGG AACTTCATGG GCTCACTCGC CGAGCACCAG GGCCATCAGG CGCCGCTGAT GGCCGAGGCG GCCGGCGTGA ACTGCCCCTC GACCCGCTAC GAGGAGGCGT GTGCCTCGGC CGGCGTCGCG GTGCGCGAGG CCGTCCGGAC CGTCGCCGCG GGCGACGCCG ACGTGGTGTT GGCCGGCGGG ATGGAGCGCA TGACAAACCT CCCAACCGAC GAGGTGACCG AGGGGCTCGC TATCGCCGCC GACGACCTGT TCGAGGTGCG AGCGGGAGTG ACGTTCCCCG GCGCGTACGC GCTGATGGCG ACCGCCTACT TCGACGCGTA CGGCGGGAGC CGCCGGGATC TGGCCCACAT CGCCTCGAAG AACCACGCGA ACGCGGTCCC CAACGAGTAC GCTCAGTACC GCCAGGAAGT GCCCGTCGAG AAGGCGTTGG ACGCCCCACC TGTCGCGGAG CCGCTCCACC TCTACGACGC CTGCCCGATC ACCGACGGCG CGAGCGCGCT CGTGATCGTC TCCGAGGAGT ACGCGGCTGA GCACGACGTG GACGCCCCGG TCGCGGTGAC GGGGACCGGA CAGGGGACCG ACCGGATGGC GCTCGCGGAC CGCGAGGAAC TCGCGCGCAC CCCCGCCGCC GACGACGCGG CCGACGCGGC CTACGCCGAC GCCGGGATCG GCCCCGACGA CGTCGACGTG GCGGAGGTCC ACGACTGCTT CACCATCGCG GAGGTGCTCG CGCTGGAGTC GCTCGGCTTC TTCGAACCCG GCGAGGGGAT TTCGGCCGCC CGCAATGGGG TCACGACCGC CGACGGCGAC CTCCCCGTGA ACCTCTCGGG CGGACTGAAG GCGAAGGGCC ACCCGGTCGG CGCGACCGGC GGTTCGCAGA TCGCGGAGCT GACTCGGCTC TTGCGAGGTG ACCACCCGAA CAGCGACCAC GTCGCCGACG CCGAGGTGGG CGTCGCGCAC AACGCCGGCG GGACAGTGGC CAGCGCGGTC GTCCACGTGC TGGAGGTGGC GGAATGA
|
Protein sequence | MIDVRVAGVG LTHFGSHPDR TGRDLFATAA LRAFEDSGVP REDVEELNYG NFMGSLAEHQ GHQAPLMAEA AGVNCPSTRY EEACASAGVA VREAVRTVAA GDADVVLAGG MERMTNLPTD EVTEGLAIAA DDLFEVRAGV TFPGAYALMA TAYFDAYGGS RRDLAHIASK NHANAVPNEY AQYRQEVPVE KALDAPPVAE PLHLYDACPI TDGASALVIV SEEYAAEHDV DAPVAVTGTG QGTDRMALAD REELARTPAA DDAADAAYAD AGIGPDDVDV AEVHDCFTIA EVLALESLGF FEPGEGISAA RNGVTTADGD LPVNLSGGLK AKGHPVGATG GSQIAELTRL LRGDHPNSDH VADAEVGVAH NAGGTVASAV VHVLEVAE
|
| |