Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2117 |
Symbol | soxB |
ID | 5713113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2246632 |
End bp | 2247885 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641268039 |
Product | sarcosine oxidase subunit beta |
Protein accession | YP_001533454 |
Protein GI | 159044660 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCATT TCTCCGCGAT GTCGCTGCTG AAAAATGCGC TTACCGGCCA CAAGAACTGG CCCGAGCAAT GGCCCGACAA ACAGCCCAAG GACGAATACG ACGTCATCAT CGTCGGCGCT GGCGGCCACG GGCTGGGCGC GGCCTACTAC CTGGCCAAGG AGCACGGCAT CACCAATGTC GCCGTGATCG AGAAGGGCTG GCTCGGGGGT GGCAACACCG GCCGCAACAC CACGATCATC CGTTCCAACT ATCTCTATGA CGAGAGCGCG AAGCTCTACG ATCACGCACT CGATCTGTGG GAAAACCTGT CGACCGAGCT GAACTACAAC GTGATGTATT CCAAGCGCGG CGTGATGATG CTGGCCCATA ACGTGCATGA CGTGCAGTCG TTCCAGCGCC ATATCCATGC CAACCGGCTG AACGGCGTCG ACAACCAGTG GCTGACCCCA AAGCAGGCCA AGGAATTCTG CCCGCCGCTC AATATCTCGC CCGATGCGCG CTATCCCGTG ATGGGCGCGG CCCTGCAAAA GCGCGCCGGG ACCGCGCGCC ACGATGCGGT GGCCTGGGGC TATGCGCGGG CCGCGGCCAA GCGCGGCGTC GACATCATCC AGAACTGCCC CGTGATCGCG ATCCGCCGTG CCGCCGATGG CTCGGTCGAG GGCGTCGATA CCGCAAAGGG CTTCATCAAG GCCAAGAAGG TGGCCGTGTC CGCCGCCGGT CACACATCGG TCGTGATGGA CAGCGCGGGC GTGCGGATGC CGCTGGAAAG CTACCCGCTC CAGGCGCTCG TGTCGGAACC GATCAAGCCG GTCTTCCCCT GCGTCGTGAT GTCGAACACC GTGCACGCCT ATATCAGCCA GTCCGACAAG GGCGAGCTGG TGATCGGCTC GGGCACCGAC CAGTACACCA GCTATTCCCA GCGCGGCGGC CTGCCGCTGA TCGAACACAC GGTGTCGGCG ATCTGCGAGG TCTTCCCGAT CTTCAACCGG ATGCGGATGC TGCGCAAATG GGGCGGCATC GTGGACGTGA CCCCCGACCG GTCGGCCATT CTCGGCAAGA CCCCGGTCAA GGGGCTCTAC GTCAACTGCG GTTGGGGCAC GGGTGGCTTC AAGGCGACCC CGGGGGCGGC GCACACGCTG GCCTGGACCG TGGCCAAGGA CGAACCACAC CCGATCAACG CCCCCTTCAC GCTCGAACGC TTCACCACCG GCCGTCTCAT CGACGAAGCC GCCGCAGCCG CTGTTGCGCA CTGA
|
Protein sequence | MTHFSAMSLL KNALTGHKNW PEQWPDKQPK DEYDVIIVGA GGHGLGAAYY LAKEHGITNV AVIEKGWLGG GNTGRNTTII RSNYLYDESA KLYDHALDLW ENLSTELNYN VMYSKRGVMM LAHNVHDVQS FQRHIHANRL NGVDNQWLTP KQAKEFCPPL NISPDARYPV MGAALQKRAG TARHDAVAWG YARAAAKRGV DIIQNCPVIA IRRAADGSVE GVDTAKGFIK AKKVAVSAAG HTSVVMDSAG VRMPLESYPL QALVSEPIKP VFPCVVMSNT VHAYISQSDK GELVIGSGTD QYTSYSQRGG LPLIEHTVSA ICEVFPIFNR MRMLRKWGGI VDVTPDRSAI LGKTPVKGLY VNCGWGTGGF KATPGAAHTL AWTVAKDEPH PINAPFTLER FTTGRLIDEA AAAAVAH
|
| |