Gene M446_6204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_6204 
Symbol 
ID6129866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp6821525 
End bp6822880 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content74% 
IMG OID641646301 
Productglucarate dehydratase 
Protein accessionYP_001772906 
Protein GI170744251 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID[TIGR03247] glucarate dehydratase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.254671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACCG CCGCCGGCCC CGCGCAGGGC CTCTCGCGCA CGCCGACCAT CGCCGCCATG 
CGGGTCGTCC CGGTGGCCGG CCGCGACAGC ATGCTGCTCA ACCTGTCGGG CGCGCACGGG
CCGTTCTTCA CCCGCAACCT CGTGGTGCTG ACCGACTCGA CCGGCGCCAC CGGCCTCGGC
GAGGTGCCGG GAGGCGAGCG CATCCGGCAG ACCCTGGAGG ATGCGCGCGG CCTCGTCCTC
GGCCAGCCCC TCGGCGCCTG GAACCGCGTG CTCGCCGCCA TGCGGGCGCG CTTCGCCGAT
CGCGACGCGG GGGGGCGGGG TCTCCAGACC TTCGACCTGC GCGTCGCCAT CCACGCGGTC
ACGGCGGTGG AATCCGCCTT CCTCGACCTG CTCGGCCAGT TCCTCGGCGT GCCGGTGGCG
GCCCTCCTCG GCGAGGGCCA GCAGCGCGAC GCGGTCGAGA TGCTGGGCTA CCTGTTCTAC
GTCGGCGACC GCCGCCGCAC GAATCTGGCC TACCGGGAGC CCGAGGCGCG CGACGGCTGG
CTGCGGCTGC GCGACGAGGA GGCGCTGACG CCCGATGCCG TGGTGCGGCT CGCCGAGGCC
GCGCACGAGC GCTACGGCTT CAACGATTTC AAGCTCAAGG GCGGGGTGCT GGCGGGCGAG
CGGGAGATCG AGGCGGTGAC CGCCCTCGCC AAGCGCTTCC CGCAGGCCCG CGTCACCCTC
GACCCGAACG GGGCGTGGTC GCTCGAGGAG GCGATCCGCC TGTGCAGGGG CCGGGGCGAC
GTGCTCGCCT ACGCGGAGGA TCCCTGCGGG GCCGAGAACG GCTTCTCCGG CCGCGAGGTC
ATGGCCGAGT TCCGCCGCGC CACCGGGCTG CCCACCGCCA CCAACATGAT CGCGACCGAC
TGGCGCCAGA TGGTGCACGC GCTCCAGCTC GGCGCCGTCG ACATTCCCCT GGCCGACCCG
CATTTCTGGA CGCTGGCCGG GGCGGTGCGC GTCGCCCAGA CCTGCCGCGA CCACGGCCTC
ACCTGGGGCT CGCACTCGAA CAACCACTTC GATGTGTCGC TGGCCATGTT CACCCACGCC
GCCGCGGCCG CGCCCGGCAA GGTCACGGCG ATCGACACGC ACTGGATCTG GCAGGACGGC
CAGCGCCTGA CCCGGGAGCC GCCGGAGATC CGCGGCGGCC TGGTGCGGGT GCCGGAGCGG
CCGGGCCTCG GCGTCGCGCT CGACTGGGAG GCGGTCGAGG CGGCCCACGC CCTCTACGAG
CGCCACGGGC TCGGCGCCCG CGACGACGCC GCGGCGATGC AGTTCCTGAT TCCCGGCTGG
ACCTTCGATC CCAAGACACC CTGCCTCGTG CGCTGA
 
Protein sequence
METAAGPAQG LSRTPTIAAM RVVPVAGRDS MLLNLSGAHG PFFTRNLVVL TDSTGATGLG 
EVPGGERIRQ TLEDARGLVL GQPLGAWNRV LAAMRARFAD RDAGGRGLQT FDLRVAIHAV
TAVESAFLDL LGQFLGVPVA ALLGEGQQRD AVEMLGYLFY VGDRRRTNLA YREPEARDGW
LRLRDEEALT PDAVVRLAEA AHERYGFNDF KLKGGVLAGE REIEAVTALA KRFPQARVTL
DPNGAWSLEE AIRLCRGRGD VLAYAEDPCG AENGFSGREV MAEFRRATGL PTATNMIATD
WRQMVHALQL GAVDIPLADP HFWTLAGAVR VAQTCRDHGL TWGSHSNNHF DVSLAMFTHA
AAAAPGKVTA IDTHWIWQDG QRLTREPPEI RGGLVRVPER PGLGVALDWE AVEAAHALYE
RHGLGARDDA AAMQFLIPGW TFDPKTPCLV R