Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_2677 |
Symbol | solA |
ID | 6133688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 2970304 |
End bp | 2971479 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641642891 |
Product | N-methyltryptophan oxidase |
Protein accession | YP_001769550 |
Protein GI | 170740895 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01377] sarcosine oxidase, monomeric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.895142 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGTCA CCTCCCGCCG CACCGACGTC GCCGTGATCG GCCTGGGCGC GGTCGGCGCC GCCATCCTGT ACCAGCTCGC CCGCCGCGGC GTGCGCGTCC TCGGCCTCGA CCGCTTCAGC CCGCCCCACA CCCAGGGATC GAGCCACGGC GAGACGCGGA TCACCCGGGA GGGCGTCGGC GAGGGCGAGG CCTATCGGCC GCTGGTGTGG GAGAGCCACC GCATCTGGCG CGACCTCGAG GCGCGCACCG GCGAGAAGCT GCTGACCGCC TGCGGCCTCG TCCTGATCGC CCCGGCCGAC GGCGCCTCTC TCGCCCACGG CCGCTCGGAC TTCGTCGGGC GCTCGGCCGC GGCCGCGCGG GCGGGCGGAA TCGCGCACGA GGTGATCGAC GCGGCCGAAC TGGCGCGGCG CTTTCCCCAC TTCGCCGGGC TGCGCGGCGG CGAGCAGGCC TACCTGGAGC CCGGCGGCGG CTACGTGGCG CCCGAGCGCT GCATCGCCGC GCAGTTGCGG CTCGCCGCGG CGCACGGCGC CGAGATCCGC ACGGACGCGC CCGTGCGGGC GCTGCGGCCG GAGGCGGCCG GCGTGCGCGT CGAGACCGAC GCGGGCGCGG TCACGGCCGA CCGCGTCGTG GTGGCGGCGG GCGCGTGGCT GGGGCCGCTC CTCGGTCCCC CGTTCGACGC GCTCCTCACC GTGAACAGGC AGGTGCTGCA CTGGTTTCCG GTCGATCCCG GCGCCCCGTT CCCGGCGCGC GCGCCGGCCT TCATCTGGAT CGCGCCGTCC CCCGGGGTGG ATTTCCTGTA CGGTTTCCCG CCGCTTCCCG GCGAGGCCCG GGTCAAGCTC GGGACCGAGC AGTTCGTCCG CCCGAGCCGG GCCGAGGAGG CCCGCGCCGC GGATCCGGCC GAGGCCGAGG CCTTCTACGC CTCGCACGTC GCCGGGCGCC TCGCCGGGGT CGCGCCCGGG GCGGTCGCCT CCGCCGCCTG CCTCTATACG GTGACGCCCG ACCGGGACTT CCTGATCGAC GACCATCCGG AGAGCGACCG CATCCTCGCG GTCTCGGCCT GTTCGGGCCA CGGCTTCAAG CACTCGGCCG GGCTCGGGGC CGCCCTGGCC GAGCGCCTCG TCACCGGCCG CTCCCCGGTC GACCTCGCGC CGTTCGGGCT CGCCCGGTTC CGCTGA
|
Protein sequence | MPVTSRRTDV AVIGLGAVGA AILYQLARRG VRVLGLDRFS PPHTQGSSHG ETRITREGVG EGEAYRPLVW ESHRIWRDLE ARTGEKLLTA CGLVLIAPAD GASLAHGRSD FVGRSAAAAR AGGIAHEVID AAELARRFPH FAGLRGGEQA YLEPGGGYVA PERCIAAQLR LAAAHGAEIR TDAPVRALRP EAAGVRVETD AGAVTADRVV VAAGAWLGPL LGPPFDALLT VNRQVLHWFP VDPGAPFPAR APAFIWIAPS PGVDFLYGFP PLPGEARVKL GTEQFVRPSR AEEARAADPA EAEAFYASHV AGRLAGVAPG AVASAACLYT VTPDRDFLID DHPESDRILA VSACSGHGFK HSAGLGAALA ERLVTGRSPV DLAPFGLARF R
|
| |