Gene Information |
Locus tag | Msed_0270 |
Symbol | |
ID | 5103890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 227300 |
End bp | 228475 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640506176 |
Product | acetyl-CoA acetyltransferase |
Protein accession | YP_001190371 |
Protein GI | 146303055 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | |
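The coordinates and lengths listed above are mutually consistent, which can be confirmed with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as given in the record for Msed_0270.
length_bp = gene_length(227300, 228475)
print(length_bp, protein_length(length_bp))  # prints: 1176 391
```

Both results agree with the Gene Length and Protein Length fields of the record.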
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00178908 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.140204 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGACCA ACCTAAAGTT CAGGAGGGTC GCGGCGATAG GTGCGGGAAT GACACCCTTC AGAAGGAGGA TGTTGGAGAC ACCACAGGAA ATGGCCTGGG AGGCCTCTAG GAGAGCCCTT GATGAGGCTG GCCTGGAACT TAGGGACATA GACTGTGTGG TAATTGGGAG CGCACCAGAT GCCTTCGACG GAGTTCACAT GAAGGGGGAA TACCTGGCAC ACGGAGCGGG AGGGGTAATG AGACCCAGTA GCAGGGTATA CGTTGGAGGA GCAACGGGGG TCATGACGGC CATTGCAGGA TGGTACCACG TGGCCAGCGG GATGTGCAAG AAAGTGCTCG CTGTGGCAGA GGAAAAGATG AGTACAGGTA GGCCCCATCC CCAAGCCGTT TTTAGGTACA TCTGGGATCC AATCACGGAG AAACCCCTGA ACCCCAACCT AATCTGGATC TTCGCCATGG AGATGCACAG ATACATGTTC GTGAACAAGG TGAGCAAGGA AGAGATTGCC CTAGTCTCGG TGAAGAACAA GAGGAACGCT GCAAGCAATC CCTATGCACA ACTGGGGGGA GAAATCACCG TGGACGATGT GCTGAGAAGC GAGGTTCTAG TGTGGCCAGT GCAACTTCTA GACGTAAGCC CAGTTAGTGA TGGGGCAGCT GCCATGGTTT TCGTGGACGG TGACATTGCG AGGAGATACA CTGATACCCC CATATGGGTT GAGGGAGTTG GATGGACCCT CGATAACACC TCTTGGCCCA ACAGGGAACT TGCGTATCCC AGATACCTTG AGAACGCAGC TAGGATGGCC TACAGGATGG CAGGGATAGA GAGACCTCAG AAGGAGATAG ACGTCGTGGA GCCATATGAT CCCTTCGATT ATAAGGAACT TCATCACCTG GAGGGCCTCA TGCTAGCCAA GAGAGGCGAG GCACCCCTAC TTCTGAAGGA AGGGTTCTTC GATAAGGACG GGGATATACC CAGTAGCCCC TCAGGAGGAC TTCTAGGAGT TGGGAACCCC ATAGCTGCAG CAGGACTAAT GAAGGTCATT AGCATTTACT GGCAACTTAA GGGAACAGCT GGGAAGATGC AGGTGAAAAG GCCAGTTCAC ACTGGCGTAG CCCAGGCGTG GGGTGACCTA ATGCAGGCCT CCACGGTCAT GGTTATGAGA AATTGA
Protein sequence | MKTNLKFRRV AAIGAGMTPF RRRMLETPQE MAWEASRRAL DEAGLELRDI DCVVIGSAPD AFDGVHMKGE YLAHGAGGVM RPSSRVYVGG ATGVMTAIAG WYHVASGMCK KVLAVAEEKM STGRPHPQAV FRYIWDPITE KPLNPNLIWI FAMEMHRYMF VNKVSKEEIA LVSVKNKRNA ASNPYAQLGG EITVDDVLRS EVLVWPVQLL DVSPVSDGAA AMVFVDGDIA RRYTDTPIWV EGVGWTLDNT SWPNRELAYP RYLENAARMA YRMAGIERPQ KEIDVVEPYD PFDYKELHHL EGLMLAKRGE APLLLKEGFF DKDGDIPSSP SGGLLGVGNP IAAAGLMKVI SIYWQLKGTA GKMQVKRPVH TGVAQAWGDL MQASTVMVMR N
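The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how one might strip that formatting, translate the gene, and compare the result with the listed protein. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two differ in permitted start codons), so the table is built by hand rather than taken from a library. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 nucleotides of the gene are used for illustration.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence and first block of the protein sequence.
gene_prefix = "ATGAAGACCA ACCTAAAGTT CAGGAGGGTC"
protein_prefix = "MKTNLKFRRV"
print(translate(gene_prefix) == clean(protein_prefix))  # prints: True
```

Passing the complete gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison with the full protein sequence and the GC content field.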