Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1008 |
Symbol | |
ID | 5105607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 928012 |
End bp | 929169 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640506907 |
Product | Acetyl-CoA acetyltransferase-like protein |
Protein accession | YP_001191100 |
Protein GI | 146303784 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.011821 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTTAG GTTTCGGAAG TACGGTACAG AAAACCTACC CGGGCACCAC CTTTGAGCTT CTCTCATCCA CACTTGATAA GGCCCTCGAG ATGGCCCACC TAGACAGAGG TAAGATAGAC GGGCTAATTG CGACCTTCCT CCCAGGTACC TTTGACGGTA ACTTGGCCCT GCATTTCTAC ACGGGACAGC TGGCCCAATA CCTTGGAATA AGACCTAGGT TCCTAGATTA CGTGGACTTT GGGGGAGCCT CTGCCCTAGC CATGCTCTAT AGGGCAGAGA AGGCGATTTC TGCAGGTGAC GCCGATAACG TTGTCATAAT CGTGGGCGGG AAGGCGTCTC CCGTGAGGGA GAGGAAAGTG ACTGCAGATT CGGTGGACAG GGCATATCAA GGTATAAGGC TCACTCCCTT CGACGAGGTG TTCAGGGTTT ACGACGACCT GAACCCTGTC ACGGACTACG CACTTGTGGC GACGAGGCAC TCGCACCTCT TCGGGACCAC GGACGAGCAG AGGGCGTCCA TAGCCGTGAA ACAGAGGTTT AATGCCCAGG GAAATCCCAA GGCTATGTAC AAGGACCCCC TCAGATTAGA GGACGTGCTC TCCTCTAGGA TGGTTAGTAC TCCCCTTAGG TTGCTGGAGA TCGTGTATCC AGTTGACGGC TTCCACGTGT TCGTCGTGGG GAAGTCAGGA GGTAAGTCAG ACCTAAGACC CTTGTCCGTG AAGTACTTCG GGGAGGCCCA CTGGCCGGAG ATGCCTCCTG AGCTACCGGA TATAGTGTCA ACGCCCGCAG TCGAGAGTTC CAAGGGGGCC AGGCCACTCC TTGAGAAGAT GGACTGCTTT GAACTTTATG ACTCCTTCAC CATCACGGTC CTCCTTCAGA TAGAGGACAT AGGTCTCGCC GAGAAGGGAA AGGGAGGTAG GTTCGCCCAA GATGTCAACT TCACCTATCA AGGGGAGATC CCCATTAACA CGGGAGGCGG ATCGCTCAAC GTGGGTCAAC CAGCCTACAT GAGCGGTGGG GTGATCCTGG AGGAGGCGCT CATCCAGTTA AATGCCATGG GAGAGGGGAG ACAGGTCAAG GGGGTAGACA TGGTCCTGGT TAACGGGATA GGTGGATGGA ACAGGGCTCA CTCGACTACC CTAGTTCTAG GTGAGTGA
|
Protein sequence | MILGFGSTVQ KTYPGTTFEL LSSTLDKALE MAHLDRGKID GLIATFLPGT FDGNLALHFY TGQLAQYLGI RPRFLDYVDF GGASALAMLY RAEKAISAGD ADNVVIIVGG KASPVRERKV TADSVDRAYQ GIRLTPFDEV FRVYDDLNPV TDYALVATRH SHLFGTTDEQ RASIAVKQRF NAQGNPKAMY KDPLRLEDVL SSRMVSTPLR LLEIVYPVDG FHVFVVGKSG GKSDLRPLSV KYFGEAHWPE MPPELPDIVS TPAVESSKGA RPLLEKMDCF ELYDSFTITV LLQIEDIGLA EKGKGGRFAQ DVNFTYQGEI PINTGGGSLN VGQPAYMSGG VILEEALIQL NAMGEGRQVK GVDMVLVNGI GGWNRAHSTT LVLGE
|
| |