Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2820 |
Symbol | |
ID | 8138163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3280718 |
End bp | 3281914 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644870422 |
Product | acetyl-CoA acetyltransferase |
Protein accession | YP_003022611 |
Protein GI | 253701422 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | [TIGR01930] acetyl-CoA acetyltransferases [TIGR02430] beta-ketoadipyl CoA thiolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 92 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGAAG CGGTTATCGT CGATGCGGTG CGGACGCCGG TGGGGAAATT CGGAGGCGCG CTGAAGGACG TCCGCCCGGA CGACCTGGCG GCGCTCTGCA TCATGGAACT GGTGCAGCGG AACAAGCTCG ACCCCTCCCT CGTCGAGGAT GTGGTGCTCG GCTGCACCAA CCAGGCTGGG GAGGACAACC GCAACGTGGC GCGCATGGCG GCGCTCCTGG CGGGGCTCCC CCACTCGGTG GCGGGGCACA CCATCAACCG CCTCTGCGGG TCGGGGCTTA ACGCCATCAA CAGCGCGGCC CAGGCCATCA AGGTGGGGGA GGGGAAGATC TTCATCGCCG GCGGCACCGA GTCCATGACG CGCGCCCCCT TCGTCTTCGC CAAGGCCGAT TCCCCCTTCT CGCGCGACAT CAAGGTGTTC GACTCCACCA TCGGCTGGCG CTTCACCAAC CCCCGGATGA CGGAGCCCTA CGCGAAGGAA GGGATGGGGG ACACCGCCGA GAACGTGGCG CGCAGCTACG GCATCACCCG CGAGCAGCAG GACGCGTTCG CGCTGGCGAC CCAGAGGAAA TGGGGCGAGG CTCAGGCCGC GGGGAAGTTC GAGGACGAGC TGGTTCCGGT CGTCATCCCG CAGAAGAAGG GGGACCCGAA GGTCGTGGAC CGGGACGAGT TCCCGCGCCC GGACGTGACG CTGGAGCAGC TCGCCAAACT CTCCCCCGCC TTCAAGAAGG ACGGCAGCGT CACAGCGGGT AATTCCAGCG GCATCAACGA CGGCGCCGCC GCCGTGCTCC TCATGGAAGG GGAACTCGCC AAAGAGCTCG GTTACCGGCC GCTGGCGCGC GTTCTCTCCA GCGCCGTCGC CGGCTGCGAC CCCTCGTTCA TGGGGCTCGG CCCGGTTCCG GCCATCAGGA AAGCCCTGGA ACGTGCGGGG CTGACCATCG GCGACATCGA CCTTTTCGAG CTGAACGAGG CCTTCGCGGC GCAGGCCATA CCCTGCATGA ACGAGCTCGG GATCGACCCG GCCCGGGTGA ACGTGAACGG CGGCTCCATC GCCATCGGCC ATCCGCTGGG TTCCACCGGC GCCCGCATCA CCGCGACCTT GGTGCACGAG ATGCGGCGGC GCAACGCCCG CTACGGGGTG ATCTCGCTTT GCATCGGCCT CGGGCAGGGG ATCGCCACCG TGGTGGAACG GGTCTGA
|
Protein sequence | MREAVIVDAV RTPVGKFGGA LKDVRPDDLA ALCIMELVQR NKLDPSLVED VVLGCTNQAG EDNRNVARMA ALLAGLPHSV AGHTINRLCG SGLNAINSAA QAIKVGEGKI FIAGGTESMT RAPFVFAKAD SPFSRDIKVF DSTIGWRFTN PRMTEPYAKE GMGDTAENVA RSYGITREQQ DAFALATQRK WGEAQAAGKF EDELVPVVIP QKKGDPKVVD RDEFPRPDVT LEQLAKLSPA FKKDGSVTAG NSSGINDGAA AVLLMEGELA KELGYRPLAR VLSSAVAGCD PSFMGLGPVP AIRKALERAG LTIGDIDLFE LNEAFAAQAI PCMNELGIDP ARVNVNGGSI AIGHPLGSTG ARITATLVHE MRRRNARYGV ISLCIGLGQG IATVVERV
|
| |