Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3736 |
Symbol | |
ID | 5833317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4138569 |
End bp | 4139822 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641369526 |
Product | sarcosine oxidase beta subunit family protein |
Protein accession | YP_001641181 |
Protein GI | 163853138 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.42969 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.377029 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCGCT TCTCCCTCAC GTCGCTGCTC TCCGAATCCC TGCGGGGCCA TACCGGCTGG GGCCGCCAGT GGCGTGCCCC CGAGCCCAAG CCCCGCTACG ACGTCGTCAT CGTCGGCGGC GGCGGCCATG GCCTCGCAAC CGCCTATTAT CTCGCCACCG TGCACGGCAT CACCAACGTG GCGGTGTTGG AGAAGGGCTG GATCGGCGGC GGCAATACCG GCCGCAACAC CACGATCATC CGCTCCAACT ACCTCTACGA CGAGAGCGCG GCGATGTACG AGCACGCGCT CAAGCTCTGG GAGGGGCTCA GCCAGGAGCT GAACTACAAC ATCATGTTCT CCCAGCGCGG CGTGTTGATG CTCGCGCACA ACATCCACGA CGTTCAGAGC TTCAAGCGCC ACGTCTACGC CAACCGCCTC AACGGCATCG ACAACGAGTG GCTCTCGAAG GAAGAGTGCA AGGAATTCTG CCCGCCGCTC GATATCTCGG GGAGCCTGCG CTACCCCGTG CTCGGCGGCG CGCTCCAGCG CCGGGCCGGC ACCGCCCGCC ACGACGCGGT GGCCTGGGGC TATGCCCGCG GCGCCGACAA CCGCGGCGTC GATATCATCC AGAACTGTGA GGTCACCGGC ATCCGCCGCG ACGCCTCCGG CGCGGCGGTG GGCGTCGAGA CCACCCGCGG CTTCATCGGC GCCGGCCGGA TCGGCGTGGT CGCCGCCGGC CACACCTCGA CGCTGATGGC GATGGCCGGC GTATCGATGC CGCTGGAGAG CTACCCGCTT CAGGCTTTGG TCTCCGAGCC GGTCAAGCCG TGCTTCCCCT GCGTGGTGAT GTCGAACGCG GTCCATGCCT ACCTGTCGCA ATCCGACAAG GGCGAACTGG TGATCGGTGC GGGCACCGAC CAGTACACCT CCTACAGCCA GCAGGGTGGC CTTCACATCA CCACCCATAC GCTCGACGCG ATCTGCGAAC TGTTTCCGCA ATTCACCCGG ATGCGGATGC TGCGCTCCTG GGGTGGCATC GTCGACGTGA CGCCGGATCG CTCGCCGATC ATCGGCAAGA CCCCGGTGCC GAACCTGTTC GTCAATTGCG GCTGGGGCAC CGGCGGCTTC AAGGCGACGC CGGGCTCGGG CCACGTCTTC GCCCACACGC TCGCGACCGG CACGCCGCAC GCGATCAACG CGCCCTTCAC CCTCGACCGG TTCCGCACCG GGCGCCTCAT CGACGAGGCC GCCGCCGCGG CCGTCGCGCA CTGA
|
Protein sequence | MRRFSLTSLL SESLRGHTGW GRQWRAPEPK PRYDVVIVGG GGHGLATAYY LATVHGITNV AVLEKGWIGG GNTGRNTTII RSNYLYDESA AMYEHALKLW EGLSQELNYN IMFSQRGVLM LAHNIHDVQS FKRHVYANRL NGIDNEWLSK EECKEFCPPL DISGSLRYPV LGGALQRRAG TARHDAVAWG YARGADNRGV DIIQNCEVTG IRRDASGAAV GVETTRGFIG AGRIGVVAAG HTSTLMAMAG VSMPLESYPL QALVSEPVKP CFPCVVMSNA VHAYLSQSDK GELVIGAGTD QYTSYSQQGG LHITTHTLDA ICELFPQFTR MRMLRSWGGI VDVTPDRSPI IGKTPVPNLF VNCGWGTGGF KATPGSGHVF AHTLATGTPH AINAPFTLDR FRTGRLIDEA AAAAVAH
|
| |