Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3289 |
Symbol | |
ID | 5324173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3481318 |
End bp | 3482568 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640792241 |
Product | sarcosine oxidase beta subunit family protein |
Protein accession | YP_001328946 |
Protein GI | 150398479 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.96044 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTACT CTGCCCTTTC CATCCTCCTC AATGGCCTGC GCGGCAACCG GAACTGGACG CCGGCATGGC GCCAGCCAGA CCCGAAGCCA CATTACGACG TGATCATCGT CGGCGGCGGT GGCCATGGCC TCGCGACTGC CTATTACCTT GCCAAGGAAT TCGGCGTCAC CAATGTCGCG GTGCTGGAAA AGAATTATGT CGGTTCGGGC AATGTCGGCC GCAACACGAC GATCATTCGC TCGAACTACC TGCTCCCCGG GAACAATCCA TTTTACGAGC TCTCCATGCA GCTATGGGAG GGGCTGGAGC AGGATTTTAA TTTCAATGCG ATGGTCTCGC AGCGCGGCGT TCTCAATCTC TATCATTCGG ACGCTCAGCG CGATGCCTAC ACGCGCCGCG GCAATGCGAT GCGGCTTCAC GGGGTAGACG CAGAACTCCT CGATCGGGCG GCCGTACGCC GGATGCTGCC CTTTCTCGAT TTCGACAATG CCCGCTTCCC CATCCAGGGC GGGCTCCTGC AGCGCCGCGG CGGTACCGTG CGCCACGACG CCGTCGCCTG GGGATATGCC CGCGGCGCCG ACAGCCGCGG GGTCGATATC ATCCAGAATT GCGAAGTGAC CGGGATCAGG CGAGAAGACG GGCGAGTCAC CGGCGTCGAG ACCAGCCGCG GCTTCATCGG CTGCGGAAAG CTCGCCCTGG CGGCGGCCGG AAATTCCTCG AAGGTCGCCG AACTGGCGGG ATTGCGCCTG CCGATCGAGA GTCACGTGCT TCAGGCCTTC GTGTCGGAGG GGCTGAAACC GTTCATTGAC GGGGTGGTCA CTTTCGGAGC CGGACATTTC TACGTTTCAC AATCGGACAA GGGCGGCCTC GTCTTTGGCG GCGATCTCGA CGGCTATAAT TCCTACGCTC AGCGCGGCAA CCTGGCGACC GTCGAGCATG TGGCGGAGGC CGGAAAGGCC ATGATTCCGG CATTGTCGCG GGTGCGGGTG CTGCGCTCCT GGGGCGGTAT CATGGATATG AGCATGGACG GCTCGCCGAT CATCGACCGC ACGCCGATCG ACAATCTCTA TCTGAATGCC GGCTGGTGCT ATGGCGGGTT CAAGGCGACC CCTGCCTCAG GATTCTGCTT CGCACATCTC CTCGCCCGAG GCGCGCCGCA AAAGACAGCC GCAGCGTTTC GTCTCGACCG TTTCGAGCGA GGCTACCTCC TTGATGAAAA AGGCCAAGGC GCTCAGCCGA ACCTTCACTG A
|
Protein sequence | MRYSALSILL NGLRGNRNWT PAWRQPDPKP HYDVIIVGGG GHGLATAYYL AKEFGVTNVA VLEKNYVGSG NVGRNTTIIR SNYLLPGNNP FYELSMQLWE GLEQDFNFNA MVSQRGVLNL YHSDAQRDAY TRRGNAMRLH GVDAELLDRA AVRRMLPFLD FDNARFPIQG GLLQRRGGTV RHDAVAWGYA RGADSRGVDI IQNCEVTGIR REDGRVTGVE TSRGFIGCGK LALAAAGNSS KVAELAGLRL PIESHVLQAF VSEGLKPFID GVVTFGAGHF YVSQSDKGGL VFGGDLDGYN SYAQRGNLAT VEHVAEAGKA MIPALSRVRV LRSWGGIMDM SMDGSPIIDR TPIDNLYLNA GWCYGGFKAT PASGFCFAHL LARGAPQKTA AAFRLDRFER GYLLDEKGQG AQPNLH
|
| |