Gene Gmet_2058 details

Gene Information

Locus tag: Gmet_2058
Symbol:
ID: 3738652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 2312207
End bp: 2313406
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 61%
IMG OID: 637779352
Product: thiolase
Protein accession: YP_385012
Protein GI: 78223265
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases; [TIGR02430] beta-ketoadipyl CoA thiolase
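
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration and not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon excluded from the protein.
    """
    gene_length = end_bp - start_bp + 1
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Start bp and End bp copied from the record for Gmet_2058.
print(cds_lengths(2312207, 2313406))  # (1200, 399)
```

Under these assumptions the result agrees with the listed gene length of 1200 bp and protein length of 399 aa.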


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.35513
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.039203
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGAAG CAGTAATTGT CGATGCTGTC CGTACTCCGG TGGGAAAATT CAACGGCGCC 
CTGAAAAACG TCCGCTCTGA CGACCTGGCC GCCCACTGTA TTTCCGAACT GGTGAAGCGT
AACAATCTTG ATCCGAACCT GGTCGAAGAT GTGGTGCTCG GTTGCACCAA CCAGGCGGGC
GAGGACAACC GGAACGTCGG CCGGATGGCG GCGCTTCTGG CCGGTCTGCC GTATTCGGTC
GCGGGGCAGA CCATCAACCG TCTCTGTGCC TCGGGCCTGA ATGCCATCAA CAGCGCAGCC
CATGCGATTA AACTCGGCGA AGGTGATGTC TTTATCGCTG GCGGTACCGA ATCCATGACC
CGTGCCCCCT TTGTCATGGC CAAGTCCGAA TCCCCTTTCT CGCGCGATAT CAGGGTGTTT
GACAGCGTCA TCGGCTGGCG GTTCACCAAC CCGAAGATGA CTGAACCATA TGCCAAGGAA
GGAATGGGCG AAACCGCCGA GAACGTGGCG GTGCGGTATG GCCTCACCCG CCAGGAGCAG
GACGAGTTTG CCCTGGAGAC CCAACGCAAA TGGGCTGCCG CCGATGCGGC CGGCAAGTTC
AATGACGAGA TCGTTCCCGT CGTTATCCCC CAGAAGAAGG GGGATCCGAT CATCGTCTCC
AGGGATGAAT TCCCTCGCGG CAACGATGTC ACCATGGAGC AGCTTGCCAA GCTGCCGGCT
GCCTTCAGAA AGGAGGGCAC CGTCACCGCC GGCAACTCCA GCGGCATCAA CGACGGCGCC
GCAGCGCTCC TCCTCATGGA GGCAGAAACC GCCAAGAAGC TCGGCTACAA GCCGCTTGCC
AGGGTCGTCG CCAGTGCGGT TGCCGGTTGC GATCCCTCGT ACATGGGGCT CGGCCCCATC
CCGGCGATCC AGAAGGTGCT GCAACGGTCC GGCCTGAAAA TCGAAGATAT TGACCTCTTC
GAGCTGAACG AGGCCTTTGC CGCCCAGTCC ATCCCCTGCA TCCGCGAACT GGGGATCGAT
CCGGCCAAGG TGAACGTCAA CGGCGGCTCC ATCGCCATCG GCCACCCCCT CGGCTCCACC
GGCGCCCGGA TCACCGCCAC GCTGGTCCAT GAGATGAAGC GCCGTGGCTC CCGCTACGGT
CTCGTGTCCC TCTGTATCGG TGTCGGACAG GGAATTGCGA CGATCTTCGA ACGCGTGTAA
 
Protein sequence
MREAVIVDAV RTPVGKFNGA LKNVRSDDLA AHCISELVKR NNLDPNLVED VVLGCTNQAG 
EDNRNVGRMA ALLAGLPYSV AGQTINRLCA SGLNAINSAA HAIKLGEGDV FIAGGTESMT
RAPFVMAKSE SPFSRDIRVF DSVIGWRFTN PKMTEPYAKE GMGETAENVA VRYGLTRQEQ
DEFALETQRK WAAADAAGKF NDEIVPVVIP QKKGDPIIVS RDEFPRGNDV TMEQLAKLPA
AFRKEGTVTA GNSSGINDGA AALLLMEAET AKKLGYKPLA RVVASAVAGC DPSYMGLGPI
PAIQKVLQRS GLKIEDIDLF ELNEAFAAQS IPCIRELGID PAKVNVNGGS IAIGHPLGST
GARITATLVH EMKRRGSRYG LVSLCIGVGQ GIATIFERV
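
The gene and protein sequences above can be related to each other, and to the listed GC content, with a short script. The following is a minimal sketch, not the method the database used. It builds the codon-to-amino-acid mapping of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence shown above (60 bases).
first_row = "ATGCGTGAAG CAGTAATTGT CGATGCTGTC CGTACTCCGG TGGGAAAATT CAACGGCGCC"
print(translate(first_row))  # MREAVIVDAVRTPVGKFNGA
```

The translated first row matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` in the same way would allow the listed GC content and the complete protein sequence to be checked.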