Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0644 |
Symbol | |
ID | 8135959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 775709 |
End bp | 777313 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644868261 |
Product | thiolase, putative |
Protein accession | YP_003020476 |
Protein GI | 253699287 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 4.4394300000000005e-24 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACGCAGC AATTCAAGCC TCGGGAGGTC TATGTCGCCT CCTCTTTCAT GGCTCCGGTC GGGCGCTATA ACGGGCGCGA ACGCGAAGCC CTGAGCTTTT TGGAGATGGC CGAAAAGGCG GGAGAGGTTT TCGCCGGCAG CCGGCTCAAG CGCTCCGACA TAAACGCCGT CGTCGTCGGC TGCCAGAACC CGGTCGCCTT CTCGGGGGTC GACAACACGG CGGCCAAGAT CGCCGGCGTC CTCGGGATCT CCGGCGCCAA ATCGGTGCTG ATCGACACCG CCTCCTCCTC GGGCGCGTCG GCTCTCGAAT ACGCCTACCT GCAGATCGCC TCCGGCCGCT GCGACCACGT CCTCGCCATC GGGATCCAGA AGATGAGCGA CGTCCCCACC GGGCAGGCCA CCCGCATCGT CGCCGGGGTG ATCGACAAGG ACGAGGCGGA GTTCGGGCTC TCCATGCCGG CCTGCGGGGC ACTCGTGGCG CGCTCCCTGA TCGAGCGGCT GAAGCTCTCC ACCGACGAGT GGACCGCCTT CTCCGCCCTT TTGACCCAGC GGGCGCACCG CTTTGCCGCG CGCAACCCCG AGGCGCACCT GGGCTTCGAG ATCCCGCTTC AGGATTACTA CCGCCAGATC GTCACCGGCA AGAACTACCG CTACTGGTGG CCTTTGCGCT ACCACGACTT CTGCCCCATG TCGGACGGGG TCGCCGCCGT GCTCCTCTCG GCGACGCCGC ACGAGGTGAT CGTCTCCGGG GTGGGGAGCG CCACCGACAT CCCCACCATC GCCGACCGCC CCTACTTCCA CAGCTTCCCC GCCACCGTGC GCGCCGCGGC CGAGGCCTAC GCCATGGCCG GGATCAAGAA GATCTCCGAC TTCGCCGGAA AGATCCACGT GAACATGCAC GATCCGTTCA ACGGCTTCGG GCCGATCAAC ATGGTGGACC TGGGCTTCGT GCACCGGCGC CGGATCGTGG AAGCGCTTTT GAACGACGAG CTTACCGGCG AGCATGGGGC CTTCCCGACC AACATAACCG GCGGACTCAA GGGGCGCGGC CATCCGCTTG GTGCCACCGG CATGATCCAG ATCGTCGAGA ACCACCGGCT CATCACCTCC GGCCGCTTCC AGATGGGGCT CGCCCACTCA ATCGGCGGAC CGATCAACAA CAACGTGGTG ACGCTCCTGG AGAGGAGCAG CCACTACCGG GGGCGCTCGC GCCCGGCACT CACCCCCTGG GGGCTCCCCC CCCTTGGGCG CATGAAGCCA AAGCAGATGA ACGTCGGCGA GCTCTTGAAA GGGTCGGGCG AGGTGCAGGG GCGTTTCGTC GCCGCCACCA CCCGCTTCGA TTTCAAGACC GGTGACCCGG AGGGGATCAT CATCATCGTT TCCTGCCTGG TGAACGCGAC CCGCCACTCC TTCCTCTTCG GGGTAGGCGG CGAGCACTAC CGGCAGGTGG TGCAACTCAG GTCGGGGGAC CAGGTGAGCC TGGAGCAAAG CGAGGAGGGG ATCCTGGTGA ACCGGATCCC GGTCAGGAAG TTCTACCAAA GGAGCATGAG CGGCGTGCTG GAGCTGGCCG GAAACGGCTG GAAGAAGCTC ACCGGTGGGA GCTAG
|
Protein sequence | MTQQFKPREV YVASSFMAPV GRYNGREREA LSFLEMAEKA GEVFAGSRLK RSDINAVVVG CQNPVAFSGV DNTAAKIAGV LGISGAKSVL IDTASSSGAS ALEYAYLQIA SGRCDHVLAI GIQKMSDVPT GQATRIVAGV IDKDEAEFGL SMPACGALVA RSLIERLKLS TDEWTAFSAL LTQRAHRFAA RNPEAHLGFE IPLQDYYRQI VTGKNYRYWW PLRYHDFCPM SDGVAAVLLS ATPHEVIVSG VGSATDIPTI ADRPYFHSFP ATVRAAAEAY AMAGIKKISD FAGKIHVNMH DPFNGFGPIN MVDLGFVHRR RIVEALLNDE LTGEHGAFPT NITGGLKGRG HPLGATGMIQ IVENHRLITS GRFQMGLAHS IGGPINNNVV TLLERSSHYR GRSRPALTPW GLPPLGRMKP KQMNVGELLK GSGEVQGRFV AATTRFDFKT GDPEGIIIIV SCLVNATRHS FLFGVGGEHY RQVVQLRSGD QVSLEQSEEG ILVNRIPVRK FYQRSMSGVL ELAGNGWKKL TGGS
|
| |