Gene M446_2088 details

Gene Information

Locus tag: M446_2088
Symbol:
ID: 6134769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 2334397
End bp: 2335491
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 76%
IMG OID: 641642317
Product: mandelate racemase/muconate lactonizing protein
Protein accession: YP_001768985
Protein GI: 170740330
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
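
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2334397, 2335491)
print(length, protein_length_aa(length))  # prints: 1095 364
```

Under those assumptions, the computed values agree with the Gene Length and Protein Length fields of this record.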


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0445619
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCCCG AGGCCCCGAT CGACGCCGTC ACGGCCCGGG CCTACGCGGT CCCGACCGAC 
GCGGCCGAAG CGGACGGCAC CTTCGCGTGG CACCGCACCA CCCTGGTGGT GGTCGAGGTC
GCGGCGGGGG ACCGGAGAGG CCTCGGCTAC ACCTACTCGG ATGCGGGGAA TGCCGGGCTC
GTCAAGGGCA CGCTCTCCCC CCTGCTGAAG GGGGGCGACC CCTTCGACGT CCCGGCGCTC
ACCGCGCGGC TCCGGCGCCG GGTGCGCAAT CTCGGCCGCG CGGGCCTCGC CGCCACCGCC
ATCTCGGCCC TCGACGCGGC CCTGTGGGAC CTGAAGGCCC GCCTCCTCGA CCTGCCGCTC
GTGCGCCTCC TCGGGGCGGC GCGGCCGGCC GTCGCGATCT ACGGCAGCGG CGGCTTCACC
AGCTACGGCG AGGCGCGGCT CACCGCCCAG CTCGCCGGCT TCGTCGAGCG GGACGGCTGC
CGCGCCGTGA AGATGAAGAT CGGCAGCGAT CCCGACCGCG ACCCCGACCG GATGCGCGCG
GCGCGGGCGG CGATCGGCGG GGCGGCGCTG TTCATCGACG CGAACGGCGC CTTCTCGCCC
CGCGCCGCCC TGGCCATGGC CGAGACGGCG GCGGGCCTCG GCGTGCGCTG GTTCGAGGAG
CCGGTCTCCA GCGACGACCG CGCGGGCCTG CGCTTCGTGC GCGAGCGGGT GCCGGCCGGC
ATCGACGTGG CGGCCGGAGA ATACGCCTAC TCCCTCGACG ACGTCCGGCA CATGCTGGAG
GCGGGCGCCG TCGACGTGCA GCAGGCGGAC GCGACGCGCT GCGGCGGGGT CTCGGGCTTC
CTGGCGGCGG GCGCGCTCTG CGAGGCGCAC CACACCGACC TGTCGGGCCA CTGCGCCCCC
GCCCTCCACC TCCACCCGGC CTGCGCCGCC GCGCGGGTCC GCCACCTCGA ATGGTTCCAC
GACCACGTCC GCATCGAGTC GATGCTGTTC GACGGGGCGC CGGTCCCGCG GGAGGGGCGG
ATCGCGCCGG ACCTCGGCCG GCCCGGCCAC GGCCTCACCT TCAAGCACCA GGACGCGGAG
CGCTATGCTG TCTGA
 
Protein sequence
MRPEAPIDAV TARAYAVPTD AAEADGTFAW HRTTLVVVEV AAGDRRGLGY TYSDAGNAGL 
VKGTLSPLLK GGDPFDVPAL TARLRRRVRN LGRAGLAATA ISALDAALWD LKARLLDLPL
VRLLGAARPA VAIYGSGGFT SYGEARLTAQ LAGFVERDGC RAVKMKIGSD PDRDPDRMRA
ARAAIGGAAL FIDANGAFSP RAALAMAETA AGLGVRWFEE PVSSDDRAGL RFVRERVPAG
IDVAAGEYAY SLDDVRHMLE AGAVDVQQAD ATRCGGVSGF LAAGALCEAH HTDLSGHCAP
ALHLHPACAA ARVRHLEWFH DHVRIESMLF DGAPVPREGR IAPDLGRPGH GLTFKHQDAE
RYAV
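
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the method the database used: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). Whitespace in the 10-base blocks is stripped before processing.

```python
from itertools import product

# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence above.
print(translate("ATGCGCCCCG AGGCCCCGAT CGACGCCGTC"))  # prints: MRPEAPIDAV
print(translate("CGCTATGCTG TCTGA"))                  # prints: RYAV
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared with the protein sequence and the GC content field of this record.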