Gene SeD_A3352 details

Gene Information

Locus tag: SeD_A3352
Symbol:
ID: 6874207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3227405
End bp: 3228613
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 58%
IMG OID: 642786356
Product: acetyl-CoA acetyltransferase
Protein accession: YP_002216995
Protein GI: 198241745
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
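
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3227405
end_bp = 3228613
reported_gene_length = 1209      # bp
reported_protein_length = 402    # aa

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon that is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```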


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value: 0.378933
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGTGT CATTTTGCGC TGAGGTGGTA ATGAAAGAGG TTGTGATAGT GGGGGCTTTA 
CGGACCCCCA TCGGCTGTTT TCAGGGAACG CTGGCCCGTC ATTCCGCCGT CGAGCTGGGG
AGTATGGTCG TAAAAGCGTT AATTGAGCGC ACCGGCGTGG ACGCTAATGC TATCGATGAG
GTCATCCTCG GTCAGGTACT GACGGCTGGC GCCGGGCAGA ATCCGGCGCG CCAGTCAGCC
ATCAAAGGAG GGCTGCCGAC TACCGTTTCC GCTATTACCA TTAATGACGT TTGTGGTTCC
GGTTTAAAAG CGTTGCACCT GGCGACGCAG GCTATTCAGT GCGGCGAAGC GGACATTGTT
ATTGCGGGCG GGCAGGAGAA TATGAGCCGT GCGCCGCATG TCCTCAATGA CAGCCGTACC
GGCGCGTTGC CTGATGCCGA CAATCTGGTG GATAGTCTGG TACATGATGG CTTATGGGAT
GCCTTCAACG ATTATCACAT TGGCGTGACG GCAGAAAACC TGGCGCGGGA ATATGGCATC
AGCCGCGAGT TACAGGATGC TTATGCGCTC AGTTCGCAGC AAAAAGCCAG GGCCGCGATT
GATACCGGAC GTTTTAAAGA TGAGATTGTC CCCATCGTCA CGCAACGTAA TGGGCAGACC
GCAATTGTCG ATACGGATGA ACAACCGCGG GCCGACGCCA GTGCGGAGGG GTTAGCCCTG
CTGCATCCGG CGTTTGATAG TTTAGGTTCG GTGACGGCGG GTAATGCCTC CTCAATCAAC
GATGGCGCCG CCGCGGTGAT GATGATGAGC GAGGCGAAAG CCCAGGCGCT GGGGCTGCCG
GTGCTGGCGC GCATCCGCGC GTTTGCCAGC GTCGGCGTCG ATCCTGCGCT AATGGGGATT
GCGCCGGTAT ATGCTACCCG GCGCTGTCTG GAGCGTGCCG GCTGGCAACT GACGGAGGTC
GACCTTATCG AAGCCAACGA AGCGTTTGCC GCCCAGGCGT TGTCGGTAGG AAAAATGCTG
GAATGGGATG AGCGACGGGT AAACGTTAAT GGCGGCGCGA TTGCGCTGGG GCATCCCATT
GGCGCTTCCG GTTGTCGAAT TCTGGTTTCA CTGGTGCACG AAATGGTTAA ACGCGACGCG
CGAAAAGGTC TGGCGACGCT GTGTATCGGT GGCGGCCAGG GCGTTGCTTT AACCATTGAA
CGCGATTGA
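
A GC content figure like the one reported for this gene can be recomputed from a sequence printed in 10-base blocks. This is a minimal sketch: the helper names `clean_sequence` and `gc_content` are hypothetical, and the rounding used by the source database is not known. Only the first line of the gene sequence is used here, so paste the full sequence to obtain a whole-gene value.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGGCGGTGT CATTTTGCGC TGAGGTGGTA ATGAAAGAGG TTGTGATAGT GGGGGCTTTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```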
 
Protein sequence
MAVSFCAEVV MKEVVIVGAL RTPIGCFQGT LARHSAVELG SMVVKALIER TGVDANAIDE 
VILGQVLTAG AGQNPARQSA IKGGLPTTVS AITINDVCGS GLKALHLATQ AIQCGEADIV
IAGGQENMSR APHVLNDSRT GALPDADNLV DSLVHDGLWD AFNDYHIGVT AENLAREYGI
SRELQDAYAL SSQQKARAAI DTGRFKDEIV PIVTQRNGQT AIVDTDEQPR ADASAEGLAL
LHPAFDSLGS VTAGNASSIN DGAAAVMMMS EAKAQALGLP VLARIRAFAS VGVDPALMGI
APVYATRRCL ERAGWQLTEV DLIEANEAFA AQALSVGKML EWDERRVNVN GGAIALGHPI
GASGCRILVS LVHEMVKRDA RKGLATLCIG GGQGVALTIE RD
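
The protein sequence can be derived from the gene sequence by codon translation. The sketch below is a plain-Python illustration rather than the database's own pipeline. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), so an alternative start codon such as GTG or TTG in the first position would need special handling that is not shown. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last nine bases of the gene sequence in this record.
print(translate("ATGGCGGTGT CATTTTGCGC TGAGGTGGTA"))  # MAVSFCAEVV
print(translate("CGCGATTGA"))                         # RD
```

The two printed fragments match the start (MAVSFCAEVV) and end (RD) of the protein sequence listed in this record.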