Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5635 |
Symbol | |
ID | 6977026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 22960 |
End bp | 24210 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643393092 |
Product | sarcosine oxidase, beta subunit family |
Protein accession | YP_002277910 |
Protein GI | 209546020 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTATT CCGCTCTTTC GATTTTTCTC AACGGCCTTC GCGGCAACAA AGGCTGGGCG CCCGCCTGGC GCGACCCGGC GCCGAAGCCG CATTACGACG TCGTCATCGT TGGTGGCGGC GGTCACGGCC TTGCCACCGC CTATTATCTC GCCAAGGAAT TCGGCATCAC CAATGTCGCT GTCCTCGAAA AGGGTTATCT CGGCTCCGGC AACATCGGCA GGAACACGAC GATCATTCGC TCGAACTACC TGCTGCCCGG CAACAATCCC TTCTACGAAC TGTCGATGAA GCTTTGGGAA GGCTTGGAGC AGGACTTCAA TTTCAACGCC ATGGTCTCGC AGCGCGGCGT TCTCAATCTC TTCCATTCCG ATGCGCAGCG TGACGCCTAT ACGCGCCGCG GCAACGCCAT GCGGCTGCAC GGCGTCGACG CCGAGCTTCT CGACCGGCAG GCGGTGCGCA AAAAACTGCC CTTCCTCGAT TTCGACAATG CCCGTTTCCC GGTGATGGGT GCTTTGTACC AACCGCGCGG CGGCACGGTG CGCCACGATG CGGTCGCCTG GGGTTATGCG CGCGGCGCCG ACAGCCGCGG CGTCGACATC ATCACCCAGT GCGAAGTGAC CGGCATCCGC AGCGAAAACG GTAAGGTGAC CGGCGTCGAG ACCAGCCGGG GTTTCATCGG CTGCGGCAAG CTGGCGCTGG CAGCGGCCGG CAATTCCACT GTCGTTGCCG ACATGGCCGG CCTTCGCCTG CCGATCGAAA GCCATGTGCT GCAGGCCTTC GTCTCCGAAG GACTAAAACC CTTCATCGAC AACGTCGTCA CCTTCGGCGC CGGCCATTTC TACGTCTCCC AGTCGGACAA GGGCGGGCTC GTCTTCGGCG GCGATATCGA TGGTTATAAT TCCTATGCCC AGCGCGGCAA TCTGGCCTCG GTCGAGCATG TCGCCGAAGC CGGGCTGGCG ATGATCCCAT CCCTGTCGCG GGTGCGTTAC CTGCGCTCCT GGGGCGGGGT GATGGACATG AGCATGGACG GCTCACCGAT CATCGACCGC ACCCATATAG ACAATCTCTA TCTCAATGCC GGCTGGTGTT ACGGCGGCTT CAAGGCGACG CCGGCCTCCG GCTTCTGCTA CGCCCATCTG ATCGCCCGCA ACACGCCGCA TCAGACGGCC CGCGCCTTCC GGCTCGACCG GTTCGCCCGC GGCTATCCGA TCGACGAAAA GGGCGTCGGC GCCCAGCCCA ATCTGCACTG A
|
Protein sequence | MRYSALSIFL NGLRGNKGWA PAWRDPAPKP HYDVVIVGGG GHGLATAYYL AKEFGITNVA VLEKGYLGSG NIGRNTTIIR SNYLLPGNNP FYELSMKLWE GLEQDFNFNA MVSQRGVLNL FHSDAQRDAY TRRGNAMRLH GVDAELLDRQ AVRKKLPFLD FDNARFPVMG ALYQPRGGTV RHDAVAWGYA RGADSRGVDI ITQCEVTGIR SENGKVTGVE TSRGFIGCGK LALAAAGNST VVADMAGLRL PIESHVLQAF VSEGLKPFID NVVTFGAGHF YVSQSDKGGL VFGGDIDGYN SYAQRGNLAS VEHVAEAGLA MIPSLSRVRY LRSWGGVMDM SMDGSPIIDR THIDNLYLNA GWCYGGFKAT PASGFCYAHL IARNTPHQTA RAFRLDRFAR GYPIDEKGVG AQPNLH
|
| |