Gene Mnod_2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_2083 
Symbol 
ID7305017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp2181196 
End bp2182377 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content74% 
IMG OID643599816 
Product3-oxoadipate enol-lactonase 
Protein accessionYP_002497371 
Protein GI220922070 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
[COG0599] Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit 
TIGRFAM ID[TIGR02425] 4-carboxymuconolactone decarboxylase
[TIGR02427] 3-oxoadipate enol-lactonase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.603861 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCTGA TCCAGGCCAA CGGCACGACC CTGAACTACG AGCTCTCGGG CCCGAGCGGC 
GCACCCGTCG TGGCCTTCTC GAACTCGCTC GGCACGGCGC TGGCGATGTG GGACCCGCTC
GTGCCGGCCC TGCGCGGCCG CTACCGCGTC CTGCGCTACG ACACCCGCGG CCACGGCGCC
TCGCAGGTGC GCGACGAGTC CGCCTCCGTG GAGGATCTCG CCGACGACCT CCTCGGCCTC
CTCGACGCGC TCGGGATCGC GCGCGCCCAC ATCGTCGGCC TCTCGCTCGG CGGCATGACC
GGCCAGGCGC TGGCGATGCG CGCGCCCGAG CGGGTGCAGA GCCTGACGCT CATGGCGACC
GCCGCCCACA TGCCGACCGA GGCCTCGTGG AACGAGCGCG CCGAGACGGT GCGGGCGCAG
GGCACCGCCG CCATCGTGGA CGTGACCATG GAACGCTGGT TCACGCCGGA CTTCCCCCGG
ACCGCACCCG CCCTGGTCGA TCCGGTGCGC CGGCAATTCC TCGGAACCGA CCGGGCCGGC
TACGCGGTCT GCTGCCATGC CATCGGCCGG ATGGATCTGC GCCCCGGGCT CGGCCGCATT
GAGGCCCCGA CCCTGGTGAT CGCCGGGCGC GACGACCCCT CGACCCCGCC CGCGAAGTCG
GAGGAGATCT GCGAGGGCAT CCGGCACGCG GAGCTGGTGG TGCTGCCCGC AGCCCGCCAT
CTCCTCGCCA TCGAGCGCCC CGAGGCCGCC GCCGCCCACC TCCTCGCCTT CCTCGACCGT
CATCGCGGGG CCGCCGAGGC AGCGACCGGG GCGGTGCCCT TCACGGAGGG CCTGTCGAAC
CGGCGGGGGG TGCTGGGCGA GGCGCATGTC GACCGCTCGC TCGCGGCGGC CGGCACCTTC
GCGGGGCCCT GGCAGGACTT CATCACCCGC ATCGCCTGGG GCGAGATCTG GGGCGATCCG
CGGCTGCCGT GGAAGACGCG CTCGCTCGTG ACCCTCGCCC TCATGGTGGC GCTCGGCCGC
GAGGAGGAGT TCAAGCTCCA CGTGCGGCCG GCGCTTGCGA ACGGCGTGAC GCCGTCCGAG
CTCCAGGCGC TGCTGCTTCA GGCCGCCGTC TATGCGGGCG TGCCCGCAGT CAACGGGGCC
TTCCGCTGGG CGAAGGACGT GCTCGGCGAC GAGCTGGAGT GA
 
Protein sequence
MPLIQANGTT LNYELSGPSG APVVAFSNSL GTALAMWDPL VPALRGRYRV LRYDTRGHGA 
SQVRDESASV EDLADDLLGL LDALGIARAH IVGLSLGGMT GQALAMRAPE RVQSLTLMAT
AAHMPTEASW NERAETVRAQ GTAAIVDVTM ERWFTPDFPR TAPALVDPVR RQFLGTDRAG
YAVCCHAIGR MDLRPGLGRI EAPTLVIAGR DDPSTPPAKS EEICEGIRHA ELVVLPAARH
LLAIERPEAA AAHLLAFLDR HRGAAEAATG AVPFTEGLSN RRGVLGEAHV DRSLAAAGTF
AGPWQDFITR IAWGEIWGDP RLPWKTRSLV TLALMVALGR EEEFKLHVRP ALANGVTPSE
LQALLLQAAV YAGVPAVNGA FRWAKDVLGD ELE