Gene Sare_2812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2812 
Symbol 
ID5707004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3194542 
End bp3195705 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content71% 
IMG OID641272268 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001537638 
Protein GI159038385 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000975641 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCACTC CGGTGATCGT TGACGCGGTT CGTACGCCGA TCGGGAAACG CGGTGGTTGG 
CTGGCGGGAC TACACGCCGC CGAACTCCTG GGCGCGGCCC AACGTGCCCT CGTCGAACAC
GTGGACCTCG ACCCGGGCGC GGTCGAGCAG GTCGTCGGCG GGTGCGTCAC CCAGAGCGGT
GAACAGTCCA ACAACGTCAC CCGCACCGCC TGGTTGCACG CCGGCCTGCC GTACCAGACA
GGCTGCCTCA CCATCGACGC ACAGTGCGGA TCCTCCCAGC ACGCCGCCCA TCTTGTCGCC
GGGCTCATCG CCACCGACGC CGTCGAGGTG GGCATCGCCT GCGGTGTCGA GGCGATGAGC
CGGGTGCCGC TGCGGGCGAA CCTCGGCGTC GACGTCGGCA CGCCCCGTCC GGCGTCGTGG
CACATCGACC TGCCCAACCA GTACGTCGCC GCCGAGCGGA TCGCGGTACG GCGAGGCTTG
TCCCGCACGA CGGTCGACGA GTTCGGCATG CGCTCGCAGG TCAGGGCGGC CCGGGCCTGG
ACGCAGGGGT ACTACGACCG CGAGGTCGTG GCGGTGCACG CGCCGGCACT CGACGCCGAG
GGACAGCCGA CCGGAGAAAC CCGTGTCATC GACCGGGACC AAGGGCTGCG CGATACCACG
ATGGAGGCGC TGAGCCGGCT GCGGCCGGTG GTCGAGGACG GGCTGCACAC TGCCGGGACC
TCGTCGCAGA TCTCCGACGG CGCCGCGGCG GTCCTGCTCA TGTCCGCCGA CCGGGCCCAC
GCGCTCGGTC TGCGCCCAAG GGCCAGAATC GTCGCCCAGT GCCTGGTCGG CGCCGAACCC
CACTACCACC TGGACGGCCC CGTGCAGGCA ACCGAGCGGG TGCTGGCCCA CGCCGGCATG
AAGATCCAGG ATATTGATCG GTTCGAGGTC AACGAGGCGT TCGCCGCCGT CGTGCTGTCC
TGGCTGTCGG CGCACCAGGC CGACCCGGAG AAGGTGAACG TCAATGGCGG CGCGATCGCG
CTCGGGCATC CGGTGGGCAG TACCGGGGCC CGGCTGCTCA CCACCGCCCT GCACGAGCTG
GAGCGGACGG CTACCCGCAC GGCGTTGATC ACCATGTGCG CTGGCGGCGC CATGTCCACC
GCGACCATCA TCGAACGACT CTGA
 
Protein sequence
MGTPVIVDAV RTPIGKRGGW LAGLHAAELL GAAQRALVEH VDLDPGAVEQ VVGGCVTQSG 
EQSNNVTRTA WLHAGLPYQT GCLTIDAQCG SSQHAAHLVA GLIATDAVEV GIACGVEAMS
RVPLRANLGV DVGTPRPASW HIDLPNQYVA AERIAVRRGL SRTTVDEFGM RSQVRAARAW
TQGYYDREVV AVHAPALDAE GQPTGETRVI DRDQGLRDTT MEALSRLRPV VEDGLHTAGT
SSQISDGAAA VLLMSADRAH ALGLRPRARI VAQCLVGAEP HYHLDGPVQA TERVLAHAGM
KIQDIDRFEV NEAFAAVVLS WLSAHQADPE KVNVNGGAIA LGHPVGSTGA RLLTTALHEL
ERTATRTALI TMCAGGAMST ATIIERL