Gene Sare_2841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2841 
Symbol 
ID5708015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3226423 
End bp3227601 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content71% 
IMG OID641272297 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001537667 
Protein GI159038414 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.623235 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.130609 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGAA CGGCCGTGCT GGGAACCGGG CAAACACAGC ACCAGACACG GCGCACCGAC 
GTGTCGATGG CCGGGCTGTG CCGGGAGGCG ATCGACCGCG CGCTGGCCGA CGCCGGCGTC
GACTGGCCGC AGATCGACGC GGTGGTGCTG GGCAAGGCAC CGGACCTGTT CGAAGGTGTG
ATGATGCCGG AGCTGTTCCT CGCCGACGCG CTGGGAGCCG CCGGCCGCCC GCTGCTGCGG
GTGCACACGG CCGGCTCCGT GGGCGGCGCC ACCGCGATCG TGGCGACCAG CCTGGTCCGG
GCCGGCGTGC ACCACCGTGT GCTCGCGGTC GCGTTCGAGA AGCAGTCGGA ATCCAACGCC
ATGTGGGCGC TGTCCATCCA GCCGCCCTTC ACCGCACCGA TCGGGGCCGG GGCCGGTGGA
TACTTCGCGC CGCACGTCCG CGCCTACATT CGGCGCTCGC ACGCGCCCGA GCACATCGGC
GCGCTGGTCG CGGTGAAGGA CCGACGCAAC GGCGCCCTCA ACCCGTACGC CCACCTGCGC
CAGCCGGACA TCACGCTGGA GTCGGTACGG GCGTCGCGGA TGCTGTGGGA TCCGATCCGG
TACGACGAGA CCTGCCCCTC CTCCGACGGT GCCTGTGCCA TGGTGATCGG CGACCAGGCG
GCAGCCGAGG CGAGCGAGCG TCCGGTGGCC TGGATCCGGG CTACCGTGAT GCGCACCGAA
CCGACCTACT TCGCCGGGAA GGACCACGTC AACCCGAGGG CTGGTGCGGA GGCGGCCCAG
GCGCTGTGGC AGGCGGCTGG CATCACCGAC CCCCTCGATG AGGTCGACGT CGCCGAGTTG
TACGTGCCCT TCTCCTGGTT CGAGCCGATG TGGCTGGAGA ACCTCGGCTT CGCCGAGGCG
GGGCACGGCT GGAAGCTCAC CGAGTCGGGT GAGACCCGGA TCGGCGGGCG GCTGCCGGTC
AACCCGTCCG GCGGGGTGTT GTGTTCCAAC CCGATCGGTG CGTCCGGCAT GCTCCGTTTC
GCCGAGGCGG CCACGCAGGT GATGGGGCGG GCCGGCGAAC GTCAGGTAGC CGGGGCACGC
ACGGCGCTCG GCCACGCGTA CGGCGGCGGA TCGCAGTTCT TCTCGATGTG GGTCGTCAGC
GATACCGCAA CGGCACGCTC CGTGACTCCC CGCAACTGA
 
Protein sequence
MRRTAVLGTG QTQHQTRRTD VSMAGLCREA IDRALADAGV DWPQIDAVVL GKAPDLFEGV 
MMPELFLADA LGAAGRPLLR VHTAGSVGGA TAIVATSLVR AGVHHRVLAV AFEKQSESNA
MWALSIQPPF TAPIGAGAGG YFAPHVRAYI RRSHAPEHIG ALVAVKDRRN GALNPYAHLR
QPDITLESVR ASRMLWDPIR YDETCPSSDG ACAMVIGDQA AAEASERPVA WIRATVMRTE
PTYFAGKDHV NPRAGAEAAQ ALWQAAGITD PLDEVDVAEL YVPFSWFEPM WLENLGFAEA
GHGWKLTESG ETRIGGRLPV NPSGGVLCSN PIGASGMLRF AEAATQVMGR AGERQVAGAR
TALGHAYGGG SQFFSMWVVS DTATARSVTP RN