Gene Apre_1351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1351 
Symbol 
ID8398158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1452355 
End bp1453536 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content40% 
IMG OID644995713 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_003153095 
Protein GI257066839 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.428024 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAAAA AAGTAGTAAT AGCAAGTGCA GCAAGAACAC CCGTAGGAGC TTACGGCGGA 
GCATTCAAAA CAGTTTCAGC AAGAGAATTA GGTGCTGTAG CAGCTAAAGA AGCAATCAAA
AGAGCAGGTA TCAAACCAGA AGATGTTGAT GAATCAATCC TAGGTTGTGT ACTTCAAGCA
GGTAACGGTC AAAACATCGC TCGTCAAATC GCCCTTGATG CAGGTATTCC TAAAGAAAAA
CCAGCTATGA CATTAAATAT AGTTTGTGGA TCAGGACTTA GAAGTGTATC TCTTGCAGCA
CAAATGATTA TGGCAGGAGA TGACGATATA GTTCTTGCAG GTGGTACAGA ATCAATGTCT
CAAGCTCCAT ACCTCCTAAC TGATGAAAGA TGGGGAGCAA GAATGGGAGA TAAGAAAGTT
GTCGATGAAA TGATCAAAGA CGGACTTTGG GATGCATTCA ATGACTACCA CATGGGAGTT
ACTGCAGAAA ATATAGCTGA AAAATTCGGC CTAACAAGAG AAGAACAAGA CGCACTTGCT
GCAGACAGCC AACAAAAAGC TGCTAAAGCT AGAGCTGAAG GAAGATTCAA AGACGAAATA
GTTCCAGTAG AAGTTAAAGG AAGAAAAGGA AAAGTAACTG TAGTTGATGA AGATGAATAC
ATCAAAGAAG GCGTTACAAC AGAAAGTATC TCTAAACTAA GACCAGCTTT CATTAAAGAC
GGTACAGTTA CAGCAGCTAA CGCATCAGGA ATCAACGATG GTGCAGCATG TCTTGTAGTA
ATGAGCGAAG AAAAAGCAAA AGAGTTAGGT GTTAAACCAC TAGCTACAAT CGTAAGCTAC
GCTACAGAAG GTGTTGATCC AAAAATCATG GGTACTGGTC CAATCCCAAC AGTTAGAAAA
GCTCTAGAAA AAGCTGATCT TAAACTTGAA GATATCGACC TAATCGAAGC TAATGAGGCT
TTCGCTGCTC AAGCTCTATC AGTAATCAAA GAACTTGGAT TAAATACAGA TATAGTTAAC
GTTAACGGTG GTGCAATCGC AATTGGTCAC CCTGTTGGAG CAAGTGGAGC AAGAATCCTT
ACAACACTTC TTTACGAAAT GCAAAAGAGA GACTCTAAAA AAGGTATCGC AACCCTATGT
ATAGGTGGCG GTATGGGAAC AGCAGTAGTA GTAGAAAGAT AA
 
Protein sequence
MTKKVVIASA ARTPVGAYGG AFKTVSAREL GAVAAKEAIK RAGIKPEDVD ESILGCVLQA 
GNGQNIARQI ALDAGIPKEK PAMTLNIVCG SGLRSVSLAA QMIMAGDDDI VLAGGTESMS
QAPYLLTDER WGARMGDKKV VDEMIKDGLW DAFNDYHMGV TAENIAEKFG LTREEQDALA
ADSQQKAAKA RAEGRFKDEI VPVEVKGRKG KVTVVDEDEY IKEGVTTESI SKLRPAFIKD
GTVTAANASG INDGAACLVV MSEEKAKELG VKPLATIVSY ATEGVDPKIM GTGPIPTVRK
ALEKADLKLE DIDLIEANEA FAAQALSVIK ELGLNTDIVN VNGGAIAIGH PVGASGARIL
TTLLYEMQKR DSKKGIATLC IGGGMGTAVV VER