Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_4031 |
Symbol | |
ID | 7118036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 4242936 |
End bp | 4244189 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643526750 |
Product | sarcosine oxidase, beta subunit family |
Protein accession | YP_002422759 |
Protein GI | 218531943 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.198896 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCGCT TCTCCCTCAC GTCGCTGCTC TCCGAATCCC TGCGGGGCCA TACCGGCTGG GGCCGCCAGT GGCGTGCCCC CGAGCCCAAG CCCCGCTACG ACGTCGTCAT CGTCGGCGGC GGCGGCCATG GCCTCGCGAC CGCCTATTAT CTCGCCACCG TGCACGGCAT CACCAACGTG GCGGTGCTGG AGAAGGGATG GATCGGCGGC GGCAATACCG GCCGCAACAC CACGATCATC CGCTCCAACT ACCTCTACGA CGAGAGCGCG GCGATGTACG AGCACGCGCT CAAGCTCTGG GAGGGGCTCA GCCAGGAGCT GAACTACAAT ATCATGTTCT CCCAGCGCGG CGTGTTGATG CTCGCGCACA ACATCCACGA CGTTCAGAGC TTCAAGCGCC ACGTCTACGC CAACCGCCTC AACGGCATCG ACAACGAGTG GCTCTCGAAG GAAGAGTGCA AGGAATTCTG CCCGCCGCTC GATATCTCCG GGAGCCTACG CTACCCCGTG CTCGGCGGCG CGCTCCAGCG CCGGGCCGGC ACCGCCCGCC ACGACGCCGT AGCCTGGGGC TATGCCCGCG GCGCCGACAA CCGCGGCGTC GATATTATCC AGAACTGCGA GGTCACCGGC ATCCGCCGCG ATGCCTCCGG TGCGGCGGTG GGCGTCGAGA CGACCCGCGG CTTCATCGGC GCCGGCCGGA TCGGCGTGGT CGCCGCCGGC CACACCTCGA CGCTGATGTC GATGGCCGGC GTGTCGATGC CGCTGGAGAG CTACCCGCTC CAGGCTTTGG TCTCCGAGCC GGTCAAGCCG TGCTTCCCCT GCGTGGTGAT GTCGAACGCG GTCCACGCCT ACCTGTCGCA ATCCGACAAG GGCGAACTGG TGATCGGTGC GGGCACCGAC CAGTACACCT CCTACAGCCA GCAGGGTGGC CTCCACATCA CCACCCACAC GCTCGACGCG ATCTGCGAAC TGTTTCCGCA ATTCACCCGG ATGCGGATGC TGCGCTCCTG GGGCGGCATC GTCGACGTGA CACCGGATCG TTCGCCGATC ATCGGCAAGA CCCCGGTGCC GAACCTGTTC GTCAATTGCG GCTGGGGCAC TGGCGGCTTC AAGGCGACGC CGGGCTCGGG CCACGTCTTC GCCCACACGC TCGCGACGGC TGAGCCGCAC GCGATCAACG CGCCCTTCAC CCTCGACCGG TTCCGCACCG GGCGCCTCAT CGACGAAGCC GCCGCCGCGG CCGTCGCGCA CTGA
|
Protein sequence | MRRFSLTSLL SESLRGHTGW GRQWRAPEPK PRYDVVIVGG GGHGLATAYY LATVHGITNV AVLEKGWIGG GNTGRNTTII RSNYLYDESA AMYEHALKLW EGLSQELNYN IMFSQRGVLM LAHNIHDVQS FKRHVYANRL NGIDNEWLSK EECKEFCPPL DISGSLRYPV LGGALQRRAG TARHDAVAWG YARGADNRGV DIIQNCEVTG IRRDASGAAV GVETTRGFIG AGRIGVVAAG HTSTLMSMAG VSMPLESYPL QALVSEPVKP CFPCVVMSNA VHAYLSQSDK GELVIGAGTD QYTSYSQQGG LHITTHTLDA ICELFPQFTR MRMLRSWGGI VDVTPDRSPI IGKTPVPNLF VNCGWGTGGF KATPGSGHVF AHTLATAEPH AINAPFTLDR FRTGRLIDEA AAAAVAH
|
| |