Gene M446_4541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4541 
Symbol 
ID6132698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4997700 
End bp4998680 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content78% 
IMG OID641644681 
Productmandelate racemase/muconate lactonizing protein 
Protein accessionYP_001771316 
Protein GI170742661 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.222249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCGCC TCGCGGTCGC GGTCGAGACC TGGCCCATCG CCGGGCGCTT CACCATCTCG 
CGCGGCAGCC GCACCGAGGC CGTGGTGGTG GTGGCCAGCG TAACGGACGG CACCCGTACG
GGCCGCGGCG AGGGGGTGCC CTATCCCCGC TACGGCGAGA GCGTCGAGGG CGTGCGCGAC
CTGATCGCCG CGCAGGGCGA GGCGGTGGCG GCGGGCGCGA CCCGGGCCGA CCTGCTCGCC
CGGATGCCGG CGGGCGCGGC CCGCAACGCC CTCGACTGCG CCCTCTGGGA CTACGAGGCC
AAGGCGGCGG GGCGGCCGGC CCACGCCCTC GCGGGGCTCG CGCCGCCGCG CCCGGTCACC
ACCGCCTACA CGCTCAGCCT CGGCGCACCC GAGGAGATGG AGGCGGCGGC CCGCGCCGCC
GCGGCGCGGC CCCTCCTCAA GGTCAAGCTC GGCGGCGAGG GCGACCCGGC CCGCATCGCC
GCCGTGCGCC GCGGGGCGCC CGCCTCCCGC CTCGTCGTCG ATGCCAACGA GGCATGGCGG
CCGCGCAACC TCGCCGAGAA CATGGCGGCC TGCGCGGCGG CGGGGGTGGA GCTGATCGAG
CAGCCGCTCC CCGCCGGCGA GGACGAGGCG CTCGCCGGGC TCGCGCGCAC GATCCCGCTC
TGCGCCGACG AGAGCCTGCA CCCGGGCGCC GGGCTCGACG GGCTCGCCGG CCGCTACGAC
GCGATCAACA TCAAGCTCGA CAAGGCGGGC GGGCTCACGC CGGCCCTCGC CCTCGCCCGG
GCCGCGCGGG AGCAGGGCCT CGCGATCATG GTCGGCTGCA TGGTCGGCAC CTCGCTCGCC
ATGGCGCCCG CCATGCTGCT CGCGGGCTTC GCGACCTTCG TCGACCTCGA CGGCCCCCTG
CTCCTCGCCC GGGACCGCGA GCCGGGGCTG CGCTTCGAGG GCAGCCTCGT CCACCCGCCC
GATCCGGCCC TGTGGGGGTG A
 
Protein sequence
MRRLAVAVET WPIAGRFTIS RGSRTEAVVV VASVTDGTRT GRGEGVPYPR YGESVEGVRD 
LIAAQGEAVA AGATRADLLA RMPAGAARNA LDCALWDYEA KAAGRPAHAL AGLAPPRPVT
TAYTLSLGAP EEMEAAARAA AARPLLKVKL GGEGDPARIA AVRRGAPASR LVVDANEAWR
PRNLAENMAA CAAAGVELIE QPLPAGEDEA LAGLARTIPL CADESLHPGA GLDGLAGRYD
AINIKLDKAG GLTPALALAR AAREQGLAIM VGCMVGTSLA MAPAMLLAGF ATFVDLDGPL
LLARDREPGL RFEGSLVHPP DPALWG