Gene GM21_0644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0644 
Symbol 
ID8135959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp775709 
End bp777313 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content66% 
IMG OID644868261 
Productthiolase, putative 
Protein accessionYP_003020476 
Protein GI253699287 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value4.4394300000000005e-24 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACGCAGC AATTCAAGCC TCGGGAGGTC TATGTCGCCT CCTCTTTCAT GGCTCCGGTC 
GGGCGCTATA ACGGGCGCGA ACGCGAAGCC CTGAGCTTTT TGGAGATGGC CGAAAAGGCG
GGAGAGGTTT TCGCCGGCAG CCGGCTCAAG CGCTCCGACA TAAACGCCGT CGTCGTCGGC
TGCCAGAACC CGGTCGCCTT CTCGGGGGTC GACAACACGG CGGCCAAGAT CGCCGGCGTC
CTCGGGATCT CCGGCGCCAA ATCGGTGCTG ATCGACACCG CCTCCTCCTC GGGCGCGTCG
GCTCTCGAAT ACGCCTACCT GCAGATCGCC TCCGGCCGCT GCGACCACGT CCTCGCCATC
GGGATCCAGA AGATGAGCGA CGTCCCCACC GGGCAGGCCA CCCGCATCGT CGCCGGGGTG
ATCGACAAGG ACGAGGCGGA GTTCGGGCTC TCCATGCCGG CCTGCGGGGC ACTCGTGGCG
CGCTCCCTGA TCGAGCGGCT GAAGCTCTCC ACCGACGAGT GGACCGCCTT CTCCGCCCTT
TTGACCCAGC GGGCGCACCG CTTTGCCGCG CGCAACCCCG AGGCGCACCT GGGCTTCGAG
ATCCCGCTTC AGGATTACTA CCGCCAGATC GTCACCGGCA AGAACTACCG CTACTGGTGG
CCTTTGCGCT ACCACGACTT CTGCCCCATG TCGGACGGGG TCGCCGCCGT GCTCCTCTCG
GCGACGCCGC ACGAGGTGAT CGTCTCCGGG GTGGGGAGCG CCACCGACAT CCCCACCATC
GCCGACCGCC CCTACTTCCA CAGCTTCCCC GCCACCGTGC GCGCCGCGGC CGAGGCCTAC
GCCATGGCCG GGATCAAGAA GATCTCCGAC TTCGCCGGAA AGATCCACGT GAACATGCAC
GATCCGTTCA ACGGCTTCGG GCCGATCAAC ATGGTGGACC TGGGCTTCGT GCACCGGCGC
CGGATCGTGG AAGCGCTTTT GAACGACGAG CTTACCGGCG AGCATGGGGC CTTCCCGACC
AACATAACCG GCGGACTCAA GGGGCGCGGC CATCCGCTTG GTGCCACCGG CATGATCCAG
ATCGTCGAGA ACCACCGGCT CATCACCTCC GGCCGCTTCC AGATGGGGCT CGCCCACTCA
ATCGGCGGAC CGATCAACAA CAACGTGGTG ACGCTCCTGG AGAGGAGCAG CCACTACCGG
GGGCGCTCGC GCCCGGCACT CACCCCCTGG GGGCTCCCCC CCCTTGGGCG CATGAAGCCA
AAGCAGATGA ACGTCGGCGA GCTCTTGAAA GGGTCGGGCG AGGTGCAGGG GCGTTTCGTC
GCCGCCACCA CCCGCTTCGA TTTCAAGACC GGTGACCCGG AGGGGATCAT CATCATCGTT
TCCTGCCTGG TGAACGCGAC CCGCCACTCC TTCCTCTTCG GGGTAGGCGG CGAGCACTAC
CGGCAGGTGG TGCAACTCAG GTCGGGGGAC CAGGTGAGCC TGGAGCAAAG CGAGGAGGGG
ATCCTGGTGA ACCGGATCCC GGTCAGGAAG TTCTACCAAA GGAGCATGAG CGGCGTGCTG
GAGCTGGCCG GAAACGGCTG GAAGAAGCTC ACCGGTGGGA GCTAG
 
Protein sequence
MTQQFKPREV YVASSFMAPV GRYNGREREA LSFLEMAEKA GEVFAGSRLK RSDINAVVVG 
CQNPVAFSGV DNTAAKIAGV LGISGAKSVL IDTASSSGAS ALEYAYLQIA SGRCDHVLAI
GIQKMSDVPT GQATRIVAGV IDKDEAEFGL SMPACGALVA RSLIERLKLS TDEWTAFSAL
LTQRAHRFAA RNPEAHLGFE IPLQDYYRQI VTGKNYRYWW PLRYHDFCPM SDGVAAVLLS
ATPHEVIVSG VGSATDIPTI ADRPYFHSFP ATVRAAAEAY AMAGIKKISD FAGKIHVNMH
DPFNGFGPIN MVDLGFVHRR RIVEALLNDE LTGEHGAFPT NITGGLKGRG HPLGATGMIQ
IVENHRLITS GRFQMGLAHS IGGPINNNVV TLLERSSHYR GRSRPALTPW GLPPLGRMKP
KQMNVGELLK GSGEVQGRFV AATTRFDFKT GDPEGIIIIV SCLVNATRHS FLFGVGGEHY
RQVVQLRSGD QVSLEQSEEG ILVNRIPVRK FYQRSMSGVL ELAGNGWKKL TGGS