Gene Msed_0656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0656 
Symbol 
ID5103816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp600914 
End bp602101 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content48% 
IMG OID640506560 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001190755 
Protein GI146303439 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0450431 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.110678 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGACG TTTACATAGT TTCAGCGGTG AGAACTCCAA TAGGTCGCTT TGGAGGATCA 
CTGAAATCAG TGAAGCCACA GATGCTAGGG GCAATCGCCA TAAAGGAAGC GCTGAGGAGG
GCTAACACCG ATCCCTCTCG CGTGGAACTT ACGATAATGG GCAATGTGTT GAGGTCGGGT
CATGGTCAGG ATCTAGCTAG GCAGGCTGCC CTCTTGGCTG GTATACCATG GGAAGTGGAT
GGATATTGTG TGGACATGGT CTGTTCCTCC GGGATGATGG GGGTTACTAA CGCGGCACAA
ATGATCAAGA GCGGTGACGC TGACGTTGTA GTCGCCGGTG GGATGGAATC CATGAGTCAA
TCCATGCTTG CGGTCAACTC AGAAGTTAGA TGGGGTGTTA AGTTTCTATC TGGAAAGAGT
TTGAATTTCA TTGATACAAT GCTGGTTGAC GGGCTTACAG ACCCCTTCAA CCTTAAGCTA
ATGGGGCAAG AGGCTGATAT GGTAGCAAGG GAGAGGGACA TTTCAAGAAG GGAATTGGAC
GAGGTGGCGT TTGAGAGTCA CAGAAGAGCC CACCAGGCGT GGGAGAAGGG TCTGTTCAAG
TCGGAAGTTA TTCCAGTTAA TCTCGATGAG GGTAAATTAG AGAGAGATGA GGGTATCAGG
CCAGACACCA CCATGGAGAA ACTTAGCTCC CTTAAGCCTG CGTTTACAGA AAACGGGTAT
CACACCGCAG GTAACTCGTC TCAAATTTCA GATGGTGCAG TGGCCATGGT CCTCATGAGC
GAGAAGGCAG TGAAGGAATT TGGAGTGGAT CCAGTGGCGA AAATATTGGG TTACTCATGG
GTTGGTATAG AAAGTTGGCG CTTCACCGAG GCTCCACTTT ACTCGGTGAG GAAGTTGCTG
ACAAGGCTGA ACATGAACAT TACTCAATTT GATTATTTTG AAAATAACGA GGCCTTTGCG
GTAAACAATG TTCTCTTTCA CAGATATTTG GGGGTGCCAT ACGATCAATT GAACGTGTTC
GGTGGAGCAA TAGCCTTAGG TCACCCAATA GGAGCTAGTG GAGCAAGAAT TATGGTCACT
CTTCTCAATG TGCTCAGCAA GATGAACGCT ACTAGGGGAA TTGCGAGTAT CTGTCACGGT
GTAGGTGGGT CTACAGCAAT AGCCCTTGAG CTACTTAGAC CCCTTTAA
 
Protein sequence
MPDVYIVSAV RTPIGRFGGS LKSVKPQMLG AIAIKEALRR ANTDPSRVEL TIMGNVLRSG 
HGQDLARQAA LLAGIPWEVD GYCVDMVCSS GMMGVTNAAQ MIKSGDADVV VAGGMESMSQ
SMLAVNSEVR WGVKFLSGKS LNFIDTMLVD GLTDPFNLKL MGQEADMVAR ERDISRRELD
EVAFESHRRA HQAWEKGLFK SEVIPVNLDE GKLERDEGIR PDTTMEKLSS LKPAFTENGY
HTAGNSSQIS DGAVAMVLMS EKAVKEFGVD PVAKILGYSW VGIESWRFTE APLYSVRKLL
TRLNMNITQF DYFENNEAFA VNNVLFHRYL GVPYDQLNVF GGAIALGHPI GASGARIMVT
LLNVLSKMNA TRGIASICHG VGGSTAIALE LLRPL