Gene Information |
Locus tag | Mnod_8586 |
Symbol | |
ID | 7300813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium nodulans ORS 2060 |
Kingdom | Bacteria |
Replicon accession | NC_011892 |
Strand | + |
Start bp | 327004 |
End bp | 330003 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643597588 |
Product | sarcosine oxidase, alpha subunit family |
Protein accession | YP_002495164 |
Protein GI | 220919861 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
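The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the 3000 bp length reported here) and that the CDS ends in one stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that does not appear in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(327004, 330003)      # Start bp, End bp from the record
    print(length)                              # gene length in bp
    print(expected_protein_length(length))     # protein length in aa
```

With the values in this record, the sketch reproduces the listed Gene Length (3000 bp) and Protein Length (999 aa).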
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.523076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAGCC ACCGCCTCCC GAACCGCGGC CGCGTCGATC ATCGGCGCCC GATCCGCTTC AGCTTCGACG GCAAGGACTA CCAGGGACTT GCCGGCGACA CCCTCGCCTC GGCGCTGCTC GCGAACGGCG TCCACCTGAT CGGCCGGTCC TTCAAGTACC ACCGGCCCCG CGGCGTGGTC TCCGCCGGCT CGGACGAGCC GAACGCGCTC TTGGGCACCC ACCGCGGCCC TGGCCGGTTC GAGCCGAACA CCCGCGCCAC GATCCAGGAA CTCCGCGACG GCCTCGTGGC GACGAGCCAG AACCGCTGGC CCTCGCTCGC CTTCGACGTC GGGTCCATCA ATGACCGGCT GGGCTCGCTG TTCTCGGCCG GCTTCTATTA CAAGACGTTC ATGTGGCCCC GCGCCTTCTG GGACCGCGTC TACGAGCCGG TGATCCGCAA CGCCGCCGGC CTCGGCGTCT CGCCCACCGA GCCCGATGCC GACCGCTATG CCAGCCGCTT CGCCCATACC GACGTGCTTA TCGTCGGCGC CGGCCCCGCC GGCCTCGCTG CGGCGCTCGC CGCCGGCCGC TCCGGCGCGT CCGTCATGCT TGTCGACGAG ACCGCCGAGC CGGGCGGCAG CCTCCTGTCC GAACCCGCCG TCGCCATCGA CGGCAAGCCG GCCTGGGACT GGCTCGCCGC CACCCTCGCC GAACTGGCCG CGCTGCCGAA CGTCACCGTC ATGACTCGGA CGACGGCGAT CGGCTACTAC CACCAGAACC TCGTCGGCCT CGCCCAGCGG CTCACCGACC ACCTCGCCGC CCCACCCGCC GACGCCCCGC GCGAGCGGCT CTGGAAGGTC CGCGCCGGCC AGGTCGTGCT CGCCCAGGGC GCGCTGGAGA AGCCGCTGGT CTTCGACGGC AACGACCGCC CGGGCGTCAT GCTGGCCGGC GCCGCGCAGA CCTACCTCAA CCGCTACGGC GTGAGGGTCG GCGACCGGAC CGCGATCGTC ACCGCCCACG ACAGCGCATG GTACGCCGCC TTCGACCTCG CCGAGGCCGG TGCGAAACCC GTGGCCATCG TCGACATCCG CCCGATCGTG GATCTGGCCC TGACCGACAA AGCCCGCGCC TTGGGGATCG AGCTCCTGCT CGGCCACACC GTCACCGGGA CCGAGGGCCG CCTGCGGGTG AAGTCCCTCC GCGTCAACCC GGTCCGGAAC GGCAAGGCCG GCGCCGCCCG CCGCATCGCC TGCGACGCGG TGCTGATGTG CGGCGGCTGG ACCCCGTGCC TGCACCTCTT CTCCCATACC AAGGGCAGCC TCGCCTGGGA CGAGACGCTG CAGGCGTTCC TGGCGGACAA GAAGTCCGAG GCCGTCCACA TCGCCGGTGC CGGCCGCGGC CTCTGGGGCA TCGCCGCCGC GCTCACCGAC GGCGCCGCCG CCGGAGCGCG GGCCGCCCGC GACGCCGGCC GCGCGGCCGA GGCGCAGGCC CACCGCGTCA CCGCCGACCG CACCGGTTCG GGCATCACGC TCAAGGAGCT CCCGACCGAC CGCAACCCGG CCGCCGCCAA GGCCTTCATC GATTTCCAGA ACGACGTCAC CGCCAAGGAC ATCCGCCTCG CCGTCCGCGA GGGCATGCGC TCGATCGAGC ACGTGAAGCG CTACACCACC AACGGCATGG CGACCGATCA GGGCAAGATG TCGAACATCA ACGGCCTGAT GATCGCTGCC GACGCGCTCG GCAAACAGCC GCCGCAGGTC GGCCTGACCA CCTTCCGGCC GCCCTACACG CCGACGACCT TCGGCACCTT CGCCGGCTAC 
CACCAGGGCG CCACTTTCGA GGTCACGCGC AAGACGCCCA TCGACCCCTG GGCCGAGGCC AACGGCGCGG TCTTCGAGCC GGTCTCCCTC TGGCGCCGCG CCTGGTACTT CCCGAAGGCG GACGAGGACA TGCATGCGGC GGTCGCCCGC GAGTGCCGTG CCACCCGCGC CTCGCTCGGC ATCTTCGACG CCTCGACGCT CGGCAAGATC GAGGTCGTCG GCCCGGACGC CGTCACCTTC ATGGAGCGGA TGTACACGAA CCCCTGGGCG AAGCTCGGAA TCGGCCGCTG CCGCTATGGC CTGCTGCTGG GCGAGGACGG CTTCATCCGT GACGACGGCG TCATCGGCCG CCTCGCCGCC GACCGCTTCC ACGTCACGAC GACGACCGGT GGCGCGGCGC GCGTCCTCAC CATGATGGAG GACTACCTCC AGACCGAGTG GCCCGACCTC AAGGTCTGGC TCACCTCCAC CACCGAGCAA TGGGCGACCG TCGCGCTGAA CGGCCCGAAC GCCCGTAAAC TGCTCGAGCC GCTGGTGAAG GGTCTCGACC TCTCCGACGC CGCCTTCCCG CATATGTCGG TGGCGAAATG CACGGTCGCC GGCTTCCCGG CCCGGCTCTT CCGCGTCTCC TTCACCGGCG AGCTCGGCTT CGAGGTCAAC GTCCCCGCCC GCCACGGCCG CGCCCTCTGG GAGAAGCTGA TGGCTGCCGG GCGGCAGTAC GACATCTGTC CCTACGGGAC CGAGACCATG CACGTGCTGC GCGCCGAGAA GGGCTACATC ATCGTCGGCC AGGACACCGA CGGCACGCTG ACCCCGGACG ACGCTGGCCT CTCCTGGGCG ATCGGCAAGG CCAAGCATGA TTTCGTCGGC AAGCGCTCGC TCGTCCGCCC CGACATGGTG GCGAAGGGCC GCAAGCAGCT CGTCGGTCTC CTGACCGAGG ACCCGAAGAC AATCCTCCAG GAAGGCGCCC AGATCGTCGC CGACCCGAAC CAGCCGAAGC CGATGACCAT GCTCGGCCAC GTGACCTCCT CCTACTGGAG CGAGGCGCTC GGCCGCTCGA TCGCCATGGC CGTCATCGCC GACGGCCGCG CCCGCGACGG GGAGATGCTG CACATCCCGA TGCCGGACCG GATTCTCAAG GCCCGCGTCG TCAAGAGCAC CGTGTTCTAC GACCCCGAAG GCACCCGCCT CAGCGTCTGA
|
Protein sequence | MTSHRLPNRG RVDHRRPIRF SFDGKDYQGL AGDTLASALL ANGVHLIGRS FKYHRPRGVV SAGSDEPNAL LGTHRGPGRF EPNTRATIQE LRDGLVATSQ NRWPSLAFDV GSINDRLGSL FSAGFYYKTF MWPRAFWDRV YEPVIRNAAG LGVSPTEPDA DRYASRFAHT DVLIVGAGPA GLAAALAAGR SGASVMLVDE TAEPGGSLLS EPAVAIDGKP AWDWLAATLA ELAALPNVTV MTRTTAIGYY HQNLVGLAQR LTDHLAAPPA DAPRERLWKV RAGQVVLAQG ALEKPLVFDG NDRPGVMLAG AAQTYLNRYG VRVGDRTAIV TAHDSAWYAA FDLAEAGAKP VAIVDIRPIV DLALTDKARA LGIELLLGHT VTGTEGRLRV KSLRVNPVRN GKAGAARRIA CDAVLMCGGW TPCLHLFSHT KGSLAWDETL QAFLADKKSE AVHIAGAGRG LWGIAAALTD GAAAGARAAR DAGRAAEAQA HRVTADRTGS GITLKELPTD RNPAAAKAFI DFQNDVTAKD IRLAVREGMR SIEHVKRYTT NGMATDQGKM SNINGLMIAA DALGKQPPQV GLTTFRPPYT PTTFGTFAGY HQGATFEVTR KTPIDPWAEA NGAVFEPVSL WRRAWYFPKA DEDMHAAVAR ECRATRASLG IFDASTLGKI EVVGPDAVTF MERMYTNPWA KLGIGRCRYG LLLGEDGFIR DDGVIGRLAA DRFHVTTTTG GAARVLTMME DYLQTEWPDL KVWLTSTTEQ WATVALNGPN ARKLLEPLVK GLDLSDAAFP HMSVAKCTVA GFPARLFRVS FTGELGFEVN VPARHGRALW EKLMAAGRQY DICPYGTETM HVLRAEKGYI IVGQDTDGTL TPDDAGLSWA IGKAKHDFVG KRSLVRPDMV AKGRKQLVGL LTEDPKTILQ EGAQIVADPN QPKPMTMLGH VTSSYWSEAL GRSIAMAVIA DGRARDGEML HIPMPDRILK ARVVKSTVFY DPEGTRLSV
|
| |
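The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a string could be cleaned, checked for GC content, and translated. It is an illustration under stated assumptions, not the method used to produce this record: the function names are hypothetical, the codon table is built from the standard codon-to-amino-acid assignments (which translation table 11 shares for elongation), and the alternative start codons that table 11 allows are not handled, which does not matter for a gene that starts with ATG.

```python
# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str, to_stop: bool = True) -> str:
    """Translate complete codons; unknown codons become 'X'.

    With to_stop=True, translation ends before the first stop codon.
    """
    seq = clean_sequence(text)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    print(translate("ATGACCAGCC ACCGCCTCCC GAACCGCGGC"))
```

Applied to the first three blocks of the gene sequence, the sketch gives the first ten residues of the listed protein sequence. Running `gc_content` and `translate` on the full gene sequence is one way to cross-check the GC content and Protein Length fields.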