Gene M446_2398 details


Gene Information

Locus tag: M446_2398
Symbol:
ID: 6129395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 2665511
End bp: 2666725
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 75%
IMG OID: 641642622
Product: beta-ketoadipyl CoA thiolase
Protein accession: YP_001769290
Protein GI: 170740635
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases; [TIGR02430] beta-ketoadipyl CoA thiolase
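
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2665511
END_BP = 2666725
GENE_LENGTH_BP = 1215
PROTEIN_LENGTH_AA = 404


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS includes its stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1215
print(expected_protein_length(GENE_LENGTH_BP))  # 404
```

Under those assumptions, both derived values agree with the Gene Length and Protein Length fields.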


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.681376
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones: 11
Fosmid unclonability p-value: 0.0712183
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGCGA CGCGGGACGT CTTCATCTGC GATTACCTGC GCACGCCGGT CGGCCGCTAC 
GGCGGCGCGC TGGCGGAAGT GCGGGCGGAC GACCTCGCCG CGGTGCCCCT CGCCGCCCTG
GTGCATCGCC ACCCCTCCCT GAAGGAGGGC GTCGAGGAGG TCTTCCTCGG CTGCGCGAAC
CAGGCCGGCG AGGATAACCG CAACGTGGCC CGCATGGCCC TTCTGCTCGC GGGCCTGCCC
GAGACCGTGC CGGGCGCGAC GCTCAACCGG CTCTGCGCCT CGGGCCTCGA CGCGGTCGGG
GCCGCCGCGC GGGCGATCAG GGCCGGCGAC ATCGACCTCG CCCTCGCAGG CGGCGTCGAA
TCGATGACTC GCGCGCCCTT CGTGATGGGC AAGAGCGAGG CGCCGTGGCA GCGCCAGGCC
GAGATCCACG ACACGACGAT CGGCTGGCGC TTCATCAACC CGGTGCTCAA GGCGCAGTAC
GGCGTCGATT CGATGCCCGA GACGGCCGAG AACGTGGCGG AGGATTACCA GATCAGCCGC
GCGGACCAGG ATGCCTTCGC GCTGCGCTCG CAGCAGCGGG CGGCGCGCGC GCAGGCGGAC
GGGACCTTCG CGCAGGAGAT CGTGCCGGTC GAGATCCCGA CCCGGCGGGG CGAGCACCGG
CGCGTCGACA CCGACGAGCA TCCGCGGCCC GACACGGACG CGCAGGCCCT GGCGCGGCTC
AAACCCTTCG TGCGGCGGGA CGGGACCGTG ACGGCCGGCA ACGCCTCGGG CGTCAATGAC
GGCGCCGCCG CCCTGGTCCT GGCGAGCGCC GAGGCCGCGC GGCGCCACGG CCTGACCCCG
CTCGCCCGGG TGCTCGGCCT CGCCTCGGCC GGCGTGCCGC CGCGCGTGAT GGGGATCGGG
CCGATCCCGG CGGTGGAGAA GCTCTGCGCG CGGCTCGGGC TGAAGCCCGG CGATTTCGAC
GCGGTCGAGC TCAACGAGGC CTTCGCGTCG CAGTCGCTCG CCTGCCTGCG CGGGCTCGGG
CTGCCGGACG ACGCGGAGCA CGTCAACCCG CATGGCGGGG CGATCGCGCT GGGGCACCCG
CTCGGCATGT CGGGCGCACG CATCGCCGGG GCGGTGACGC GGGAACTCGC CCGCCGGGGT
GGGCGGCTCG GGCTCGCCAC GATGTGCGTC GGGGTCGGGC AGGGGGTCGC GCTCGCGGTC
GAGCGGATCT CCTGA
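
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as a string with its 10-base grouping and line breaks intact; `gc_percent` is a hypothetical helper, not part of any database tooling.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence: 7 of its 10 bases are G or C.
print(gc_percent("ATGGGCGCGA"))  # 70.0
```

Running the same function over the full sequence gives a value to compare against the reported GC content; how the database rounds that figure is not stated in the record.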
 
Protein sequence
MGATRDVFIC DYLRTPVGRY GGALAEVRAD DLAAVPLAAL VHRHPSLKEG VEEVFLGCAN 
QAGEDNRNVA RMALLLAGLP ETVPGATLNR LCASGLDAVG AAARAIRAGD IDLALAGGVE
SMTRAPFVMG KSEAPWQRQA EIHDTTIGWR FINPVLKAQY GVDSMPETAE NVAEDYQISR
ADQDAFALRS QQRAARAQAD GTFAQEIVPV EIPTRRGEHR RVDTDEHPRP DTDAQALARL
KPFVRRDGTV TAGNASGVND GAAALVLASA EAARRHGLTP LARVLGLASA GVPPRVMGIG
PIPAVEKLCA RLGLKPGDFD AVELNEAFAS QSLACLRGLG LPDDAEHVNP HGGAIALGHP
LGMSGARIAG AVTRELARRG GRLGLATMCV GVGQGVALAV ERIS