Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_7233 |
Symbol | |
ID | 8022939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | - |
Start bp | 658677 |
End bp | 659927 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644834065 |
Product | sarcosine oxidase, beta subunit family |
Protein accession | YP_002985199 |
Protein GI | 241667115 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0553602 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTATT CCGCTCTTTC GATTTTCTTG AACGGCCTTC GCGGCAACAA AGGCTGGGCG CCGGCCTGGC GCGACCCGGC GCCGAAGCCG CATTACGACG TCATCATCGT CGGCGGCGGC GGGCATGGTC TTGCCACCGC CTACTACCTT GCCAAGGAAT TCGGCGTCAC CAATGTCGCT GTCCTCGAAA AGGGCTATCT CGGCTCCGGC AATATCGGCA GGAACACGAC GATCATCCGC TCGAACTATC TGCTGCCGGG TAATAATCCG TTCTACGAGC TGTCGATGAA ACTTTGGGAG GGGCTGGAGC AGGACTTCAA CTTCAACGCC ATGGTCTCGC AGCGCGGCGT TCTCAACCTC TTCCATTCCG ACGCGCAGCG CGACGCCTAT ACGCGCCGCG GCAACGCCAT GCGGCTGCAT GGCGTCGACG CCGAGCTTCT CGACCGGCAG GCGGTACGCA GGAAATTGCC CTTCCTCGAT TTCGACAATG CCCGTTTCCC CATTATGGGC GGCCTGTTCC AGCCGCGCGG CGGCACGGTG CGCCACGATG CGGTCGCCTG GGGTTATGCG CGAGGCGCCG ATCAGCGCGG CGTCGACATC ATCACCCAGT GCGAGGTGAC CGGCATCCGC AGCGAAAACG GTCGGATTAC CGGCGTCGAG ACCAACAAGG GCTTCATCGG CTGCGGCAAG CTGGCGCTTG CTGCCGCCGG CAACTCCACC GTCGTCGCCG ATATGGCCGG CCTTCGCCTG CCGATCGAAA GCCATGTGCT GCAGGCCTTC GTCTCCGAAG GGCTGAAACC GTTCATCGAC ACGGTCGTCA CCTTCGGCGC CGGGCATTTC TACGCGTCCC AGTCGGACAA GGGCGGCCTG GTCTTCGGCG GTGACATCGA CGGCTACAAT TCCTATGCCC AGCGCGGCAA TCTCGCCTCG GTCGAGCATG TCGCCGAAGC CGGGCTGGCG CTGATTCCGT CGCTGTCGCG CGTGCGTTAT CTGCGCTCCT GGGGTGGCGT CATGGATATG AGCATGGACG GCTCGCCGAT CATCGACCGC ACCCATATCG ACAATCTCTA TCTCAACACC GGCTGGTGTT ACGGCGGCTT CAAGGCCACA CCCGCCTCCG GCTTCTGCTA CGCCCATCTG ATCGCCCGCA ACGCGCCGCA TCAGACCGCC CGCGCCTTCC GGCTCGACCG CTTCGCGCGG GGCTATCCGA TCGACGAAAA GGGCGTCGGC GCCCAGCCCA ATCTGCACTG A
|
Protein sequence | MRYSALSIFL NGLRGNKGWA PAWRDPAPKP HYDVIIVGGG GHGLATAYYL AKEFGVTNVA VLEKGYLGSG NIGRNTTIIR SNYLLPGNNP FYELSMKLWE GLEQDFNFNA MVSQRGVLNL FHSDAQRDAY TRRGNAMRLH GVDAELLDRQ AVRRKLPFLD FDNARFPIMG GLFQPRGGTV RHDAVAWGYA RGADQRGVDI ITQCEVTGIR SENGRITGVE TNKGFIGCGK LALAAAGNST VVADMAGLRL PIESHVLQAF VSEGLKPFID TVVTFGAGHF YASQSDKGGL VFGGDIDGYN SYAQRGNLAS VEHVAEAGLA LIPSLSRVRY LRSWGGVMDM SMDGSPIIDR THIDNLYLNT GWCYGGFKAT PASGFCYAHL IARNAPHQTA RAFRLDRFAR GYPIDEKGVG AQPNLH
|
| |