Gene GM21_2794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2794 
Symbol 
ID8138137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3245695 
End bp3246870 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content66% 
IMG OID644870397 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_003022586 
Protein GI253701397 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value0.456608 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGATG TCTATGTCAT CGAATCACTG CGTACCCCGT TGGGCTCCTT CGGCGGCTCG 
CTCAGCGACG TAGAGGCGCC GCGTCTCGCC GCCACCGTCA TCGCCGGCAT CATGGATCGC
ACCGGGCTTC CCGCTGAAGC CATCGACGAG GTGATCGTCG GCCAGGTTCT CTCGGGCGGC
TCGGGCCAGG CTCCCGCGCG TCAGGCCATG CGTTATGCGG GGCTCCCCGA TACGGTCCCC
GCACTCACCA TCAACAAGGT CTGCGGCTCC GGCCTTAAAG CCATCATGCT CGGCGCCGGT
TCCATCAAGC TCGGCGATGC CCAGGTGGTG CTCGCCGGCG GCATGGAGAA CATGTCCCTT
GCCCCCTACG CGCTCAGCAA GGGTCGCTAC GGCTACCGCA TGGGGAACGG CGAGATGCTG
GACCTGCTGG TCCACGACGG CCTGACCGAC CCCTACAGCG GCAAGCACAT GGGGGTGATA
GCGGAGCAAA GCGCCGGAGA GAACAAGCTG GACCGCGCAC TCCAGGACAA CTTCGCGCTC
GCCTCCTATC AGAAGGCTCA GGCGGCCCTG AAAGACGGGA TCTTCAACGA CGAGATCGTC
CCGGTGGTGA AGAAGACGCG CCAGGGCGAG GTGGTCGTCC GCGAGGACGA GGAGCCGCAC
AAGGTCGACT TCAAGAAACT CCCGGAACTC CGCGCGGCGT TCCAAAAGGA CGGCACCATC
ACCGCGGGTA ACGCCTCCAC CATCAACGAC GGCGCCGCCA TAGCGCTTCT GGCGAGCGGC
GAGGCCGTGG CCAAATACGG GCTGAAGCCC AAAGCCCGCC TGGTCGCCTA CGCCACCAAC
AGCGTCCATC CCGACCAGTT CGCCGAAGCC CCGGTAGGCG CCATCGCCAA GGTCTGCGCC
AAGGCGGGGC TCAACACCGA CGACATCGAC CTCTTCGAGA TCAACGAGGC CTTTGCCGCG
GTCCCGCTTA TCGCCTGCCA GAGGCTCCGT CTCGACCCTG AGCGCGTCAA CGTAAACGGC
GGCGCGGTGG CGCTCGGCCA CCCGCTGGGT GCAAGCGGCG CGCGCATCAC CGCCACCTTG
GTGCGGGAAC TTCACAAACG CAAATTACGC TACGGTCTCG CCACGCTCTG CATCGGCGGC
GGCGAGGCGG TCGCGGTAAT CCTGGAACGG GTTTAG
 
Protein sequence
MSDVYVIESL RTPLGSFGGS LSDVEAPRLA ATVIAGIMDR TGLPAEAIDE VIVGQVLSGG 
SGQAPARQAM RYAGLPDTVP ALTINKVCGS GLKAIMLGAG SIKLGDAQVV LAGGMENMSL
APYALSKGRY GYRMGNGEML DLLVHDGLTD PYSGKHMGVI AEQSAGENKL DRALQDNFAL
ASYQKAQAAL KDGIFNDEIV PVVKKTRQGE VVVREDEEPH KVDFKKLPEL RAAFQKDGTI
TAGNASTIND GAAIALLASG EAVAKYGLKP KARLVAYATN SVHPDQFAEA PVGAIAKVCA
KAGLNTDDID LFEINEAFAA VPLIACQRLR LDPERVNVNG GAVALGHPLG ASGARITATL
VRELHKRKLR YGLATLCIGG GEAVAVILER V