Gene Information |
Locus tag | BBta_1949 |
Symbol | soxB |
ID | 5151720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 2013759 |
End bp | 2015009 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640556892 |
Product | sarcosine oxidase, beta subunit |
Protein accession | YP_001238048 |
Protein GI | 148253463 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
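The coordinate and length fields in the Gene Information table are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2013759
end_bp = 2015009

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1251
print(gene_length % 3)  # 0, the CDS is a whole number of codons
print(protein_length)   # 416
```

Under these assumptions the results agree with the listed gene length of 1251 bp and protein length of 416 aa.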
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.572623 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTATT CGCTGTTCTC TCTCGCCCGC CAAGCCCTGC GCGGCCATCG CGACTGGCAG CCGGTCTGGC GCGATGCCGC CCCGCAAGCG AGCTACGACA TCATCATCGT CGGCGGCGGC GGCCATGGGC TGGCCACCGC CTATTACCTC GCCAAGGAGC ACGGCATTCG CAAAATTGCC GTGCTCGAAA AAGGCTGGAT CGGCTCCGGC AATGCCGGCC GCAACACCAC GATCATCCGC TCCAATTATC TGCTGCCCGG CAACGAGCCG TTCTACGAAT GGTCGATGAA GCTGTGGGAG GGCCTGGAGC AGGACCTGAA CTACAACACC ATGGTCAGCC AGCGCGGCGT GCTGAACCTC TATCATTCCG ACGCGCAGCG CGATGCGTTT GCCCGGCGCG GCAATGCGAT GCGGCTGGCG GGCGCCGATG CCGAGCTGCT CGATGCCGAC GCCGTCAGGC GCATGCTGCC ATTTCTCGAT TTTGACAATG CGCGCTTTCC GATCAAGGGC GGCCTGCTGC AGCGGCGCGG CGGCACCGCG CGGCACGATG CGGTCGTCTG GGGCTACGCC CGCGGCGCCA GCGCGCGTGG CGTCGACATC ATCCAGAACT GCGAGGTCAC CGGCTTCGTG CGTGACGGCG ATCGCATCAC CGGGGTCACC ACGACGCGCG GCACGATCCA GGCCGGCAAG GTCGGGCTTG CGGTTGCCGG CAGCAGCTCG CGGGTCGCGG AGTATGCCGG CCTGCGGCTG CCGATCGAAA GCCACGTGCT GCAGGCCTTC GTCTCCGAAG CCATCAAGCC GCTGATCCCC GGCGTGGTCA CCTTCGGCGC CGGTCATTTC TACATCAGCC AGTCCGACAA AGGCGGGCTC GTCTTCGGCG GCGATATCGA CGGCTACAAT TCCTATGCGC AGCGCGGCAA TCTGCCGACC GTGGAGGACG TCTGTGAGGG CGGCATGGCG CTGATGCCGG CGATCGGCCG CGTGCGCATG CTGCGCTCCT GGGGCGGGCT CGTCGACATG TCGATGGACG GCTCGCCGAT CATCGACCGC ACGCCGCTGC AAGGACTCTA TCTCAACGCC GGCTGGTGCT ATGGCGGCTT CAAGGCCACG CCGGGCTCCG GCTGGTGCTT CGCGCATCTG CTGGCGCGCG ATGAGCCGCA TCCGGTCGCG AGCGCCTATC GCCTCGACCG TTTCGCAACC GGCCACCTGA TCGACGAAAA AGGTCAGGGC GCGCAGCCAA ATCTGCACTG A
|
Protein sequence | MRYSLFSLAR QALRGHRDWQ PVWRDAAPQA SYDIIIVGGG GHGLATAYYL AKEHGIRKIA VLEKGWIGSG NAGRNTTIIR SNYLLPGNEP FYEWSMKLWE GLEQDLNYNT MVSQRGVLNL YHSDAQRDAF ARRGNAMRLA GADAELLDAD AVRRMLPFLD FDNARFPIKG GLLQRRGGTA RHDAVVWGYA RGASARGVDI IQNCEVTGFV RDGDRITGVT TTRGTIQAGK VGLAVAGSSS RVAEYAGLRL PIESHVLQAF VSEAIKPLIP GVVTFGAGHF YISQSDKGGL VFGGDIDGYN SYAQRGNLPT VEDVCEGGMA LMPAIGRVRM LRSWGGLVDM SMDGSPIIDR TPLQGLYLNA GWCYGGFKAT PGSGWCFAHL LARDEPHPVA SAYRLDRFAT GHLIDEKGQG AQPNLH
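The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch using only the Python standard library; `translate` and `CODON_TABLE` are hypothetical helper names, not part of any database tool. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the gene sequence is listed in coding orientation, as its leading ATG and trailing TGA suggest. Only the first 30 and last 21 bases of the gene sequence are used.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base grouping spaces
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# First 30 bases and last 21 bases of the gene sequence in this record.
gene_start = "ATGCGCTATT CGCTGTTCTC TCTCGCCCGC"
gene_end = "GCGCAGCCAA ATCTGCACTG A"

print(translate(gene_start))  # MRYSLFSLAR
print(translate(gene_end))    # AQPNLH*
```

Both fragments match the corresponding ends of the listed protein sequence, with the final TGA codon acting as the stop.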