Gene GM21_2820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2820 
Symbol 
ID8138163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3280718 
End bp3281914 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content68% 
IMG OID644870422 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_003022611 
Protein GI253701422 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02430] beta-ketoadipyl CoA thiolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones92 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGAAG CGGTTATCGT CGATGCGGTG CGGACGCCGG TGGGGAAATT CGGAGGCGCG 
CTGAAGGACG TCCGCCCGGA CGACCTGGCG GCGCTCTGCA TCATGGAACT GGTGCAGCGG
AACAAGCTCG ACCCCTCCCT CGTCGAGGAT GTGGTGCTCG GCTGCACCAA CCAGGCTGGG
GAGGACAACC GCAACGTGGC GCGCATGGCG GCGCTCCTGG CGGGGCTCCC CCACTCGGTG
GCGGGGCACA CCATCAACCG CCTCTGCGGG TCGGGGCTTA ACGCCATCAA CAGCGCGGCC
CAGGCCATCA AGGTGGGGGA GGGGAAGATC TTCATCGCCG GCGGCACCGA GTCCATGACG
CGCGCCCCCT TCGTCTTCGC CAAGGCCGAT TCCCCCTTCT CGCGCGACAT CAAGGTGTTC
GACTCCACCA TCGGCTGGCG CTTCACCAAC CCCCGGATGA CGGAGCCCTA CGCGAAGGAA
GGGATGGGGG ACACCGCCGA GAACGTGGCG CGCAGCTACG GCATCACCCG CGAGCAGCAG
GACGCGTTCG CGCTGGCGAC CCAGAGGAAA TGGGGCGAGG CTCAGGCCGC GGGGAAGTTC
GAGGACGAGC TGGTTCCGGT CGTCATCCCG CAGAAGAAGG GGGACCCGAA GGTCGTGGAC
CGGGACGAGT TCCCGCGCCC GGACGTGACG CTGGAGCAGC TCGCCAAACT CTCCCCCGCC
TTCAAGAAGG ACGGCAGCGT CACAGCGGGT AATTCCAGCG GCATCAACGA CGGCGCCGCC
GCCGTGCTCC TCATGGAAGG GGAACTCGCC AAAGAGCTCG GTTACCGGCC GCTGGCGCGC
GTTCTCTCCA GCGCCGTCGC CGGCTGCGAC CCCTCGTTCA TGGGGCTCGG CCCGGTTCCG
GCCATCAGGA AAGCCCTGGA ACGTGCGGGG CTGACCATCG GCGACATCGA CCTTTTCGAG
CTGAACGAGG CCTTCGCGGC GCAGGCCATA CCCTGCATGA ACGAGCTCGG GATCGACCCG
GCCCGGGTGA ACGTGAACGG CGGCTCCATC GCCATCGGCC ATCCGCTGGG TTCCACCGGC
GCCCGCATCA CCGCGACCTT GGTGCACGAG ATGCGGCGGC GCAACGCCCG CTACGGGGTG
ATCTCGCTTT GCATCGGCCT CGGGCAGGGG ATCGCCACCG TGGTGGAACG GGTCTGA
 
Protein sequence
MREAVIVDAV RTPVGKFGGA LKDVRPDDLA ALCIMELVQR NKLDPSLVED VVLGCTNQAG 
EDNRNVARMA ALLAGLPHSV AGHTINRLCG SGLNAINSAA QAIKVGEGKI FIAGGTESMT
RAPFVFAKAD SPFSRDIKVF DSTIGWRFTN PRMTEPYAKE GMGDTAENVA RSYGITREQQ
DAFALATQRK WGEAQAAGKF EDELVPVVIP QKKGDPKVVD RDEFPRPDVT LEQLAKLSPA
FKKDGSVTAG NSSGINDGAA AVLLMEGELA KELGYRPLAR VLSSAVAGCD PSFMGLGPVP
AIRKALERAG LTIGDIDLFE LNEAFAAQAI PCMNELGIDP ARVNVNGGSI AIGHPLGSTG
ARITATLVHE MRRRNARYGV ISLCIGLGQG IATVVERV