Gene Sare_3991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3991 
Symbol 
ID5706666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4539925 
End bp4541124 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content68% 
IMG OID641273416 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001538772 
Protein GI159039519 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000808201 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTTCGG TGATCGTCAG CGGCGCTCGA ACCCCGATGG GGCGGCTGCT GGGTAACCTC 
AAGGACGTTC CCGCGACCCG GCTCGGTGCC GTGGCGATAA AGGCGGCGCT CGAGCGCGGC
CAGGTCGCCC CCGACCAGGT TCAGTACGTG ATCATGGGGC AGGTGCTTCA GGCGGGCGCT
GGCCAGATCC CAGCGCGCCA AGCGGCTGCC GAGGCGGGCA TCCCGTTGTC CGTCCCGGCG
CTCACCGTTA ACAAGGTCTG CCTCTCCGGC CTGGACGCGA TCGCTCTGGC CGACCAGTTG
ATCAGAGCCG GTGAGTTCGA TGTCGTCGTG GCCGGCGGCA TGGAGTCGAT GACCAATGCC
CCGCATCTGC TGCTGGGCCA GCGCGGTGGC TACAAGTACG GCGATGTGGT GATCAAGGAT
CACATGGCCC TCGACGGGCT TACCGATGCC TGGGACTGCT GCTCGATGGG AGAGTCGACC
GAACGGCACG GCAGCACCAA GGGCATCAGT CGCGCAGAGC AGGACGCGTT CGCCGCGGCG
AGTCACCAGC GCGCCGCCGC CGCTCAGAAG AACGGGTACT TCGCCGACGA GATCACCCCG
GTGGTCCTCC CACAGCGCAG GGGGGAACCG CTGGTGATCA GCGAGGACGA GGGTATCCGT
CCGGACACCA CCGTCGAGTC GCTGGCAAAG TTGCGTCCGG CTTTCACTCG GGACGGCAGC
ATCACCGCCG GCAGCTCGTC GCCGATTTCC GACGGGGCCG CCGCCGTCGT CGTGATGAGC
AGGGCCAAGG CCAAGGAGCT GGGGCTGAGC TGGCTGGCGG AAATCGGCGC ACACGGCAAC
GTCGCCGGCC CGGACAACTC GCTGCACTCG CAGCCGTCCA ACGCGATCGG GCACGCGCTC
CGGAAGGCTG GCCTGACCAT CGACGATCTT GACCTTATTG AGATCAACGA GGCGTTCGCG
CAGGTGGGCA TCCAGTCGGC CCGTGATCTT GGCGTGAGTC AGGACAAGGT CAACGTCAAT
GGCGGCGCGA TCGCGCTTGG TCACCCGATC GGCATGTCGG GTGCCCGGCT GGTCCTGACC
CTGGCGCTGG AGCTGAAGCG GCGCGGTGGC GGCACCGGGG CGGCGGCGCT CTGCGGCGGT
GGTGGGCAGG GCGATGCGTT GATCATTCAC GTCCCAGCGG GCGCCGAGAG CCAGGGGTGA
 
Protein sequence
MASVIVSGAR TPMGRLLGNL KDVPATRLGA VAIKAALERG QVAPDQVQYV IMGQVLQAGA 
GQIPARQAAA EAGIPLSVPA LTVNKVCLSG LDAIALADQL IRAGEFDVVV AGGMESMTNA
PHLLLGQRGG YKYGDVVIKD HMALDGLTDA WDCCSMGEST ERHGSTKGIS RAEQDAFAAA
SHQRAAAAQK NGYFADEITP VVLPQRRGEP LVISEDEGIR PDTTVESLAK LRPAFTRDGS
ITAGSSSPIS DGAAAVVVMS RAKAKELGLS WLAEIGAHGN VAGPDNSLHS QPSNAIGHAL
RKAGLTIDDL DLIEINEAFA QVGIQSARDL GVSQDKVNVN GGAIALGHPI GMSGARLVLT
LALELKRRGG GTGAAALCGG GGQGDALIIH VPAGAESQG