Gene Mnod_3361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_3361 
Symbol 
ID7308727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp3475775 
End bp3476983 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content69% 
IMG OID643601042 
Productbeta-ketoadipyl CoA thiolase 
Protein accessionYP_002498586 
Protein GI220923284 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02430] beta-ketoadipyl CoA thiolase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.408018 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCACG CCTATATTTG CGATTTCGCC CGCACGCCCA TCGGCCGCTA TGGCGGCGCG 
CTCAAGGACG TTCGCGCCGA CGACCTCGCG GCCTATCCGA TCCGCGTGCT GAAGGAGCGC
AACCCGGGTA TCGACTGGGA GGCGGTGGAC GACGTGGTGC TCGGCTGCGC CAACCAGGCC
GGGGAGGACA ACCGGGACGT CGCTCGCATG GCGGCGCTGC TCGCCGGCCT GCCAGTGAGC
GCACCGGGGA CAACGGTCAA CCGGCTGTGC GGCTCGGGCC TCGACGCGGT CGGCATCGCC
GCCCGTGCCA TCATGACGGG CGATGCGGAC CTCATGCTGG CGGGCGGCGT GGAGAGCATG
ACGCGTGCGC CCTTCGTGAT GGGCAAGGCG ACGGAGGCCT TCTCCCGCCA AGCGGAGGTG
TTCGACACCA CCATCGGCTG GCGCTTTGTG AACCCGCTGA TGAAGGCTCA GTACGGCATC
GATTCCATGC CCGAAACCGG CGAGAACGTC GCCGAAGAGT TCCGGATCTC GCGACAGGAC
CAGGACCTCT TCGCTCTTCG CTCGCAGCAG CGCGCAGCGG CGGCCCAGGC CGAGGGGTTC
TTCGACCGCG AGATCGTGGC TCTGGAGGTG AAGGGCAAGA AGGGGGCGGT GATCCGGGTA
GATCGGGACG AGCATCCGCG CCCCGACACC ACGCTGGAGC AGCTGGCGGC CCTGAAGACC
TCGTTTCGCA AGGAGGGCGG CACCGTAACG GCCGGCAACG CCTCCGGCGT GAATGACGGG
GCCGGTGCCC TCATCCTTGC TTCAGAAGAA GCCGCACGGA AGTATGGCCT CACGCCGCGG
GCGCGGGTGG TGTCCGTCGT GCAGGCGGGC GTGCCACCGC GCATCATGGG CATCGGGCCT
GCCCCGGCCA CCCGCAAGCT CCTCGCGAAG AATGGACTCT CTCTGAGCGA GATCGACCTG
ATCGAGCTCA ACGAGGCGTT CGCCTCACAG GCCCTGGCGG TATTGCGCGA GCTCGGGCTG
CCGGACGATG CGGAGCACGT GAACCCTCAC GGCGGAGCCA TCGCGCTAGG CCATCCGCTC
GGCATGTCGG GGGCGCGGCT CGCAATGACG GCCGTGAGTG CGCTCGAGGT CCGCGGCGGC
AAGCGCGCGG TCGCCACGAT GTGCATTGGC GTTGGACAGG GGATCGCCGC CCTCATCGAG
AGGGTGTGA
 
Protein sequence
MSHAYICDFA RTPIGRYGGA LKDVRADDLA AYPIRVLKER NPGIDWEAVD DVVLGCANQA 
GEDNRDVARM AALLAGLPVS APGTTVNRLC GSGLDAVGIA ARAIMTGDAD LMLAGGVESM
TRAPFVMGKA TEAFSRQAEV FDTTIGWRFV NPLMKAQYGI DSMPETGENV AEEFRISRQD
QDLFALRSQQ RAAAAQAEGF FDREIVALEV KGKKGAVIRV DRDEHPRPDT TLEQLAALKT
SFRKEGGTVT AGNASGVNDG AGALILASEE AARKYGLTPR ARVVSVVQAG VPPRIMGIGP
APATRKLLAK NGLSLSEIDL IELNEAFASQ ALAVLRELGL PDDAEHVNPH GGAIALGHPL
GMSGARLAMT AVSALEVRGG KRAVATMCIG VGQGIAALIE RV