Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3358 |
Symbol | |
ID | 6982112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3466614 |
End bp | 3467867 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643398076 |
Product | sarcosine oxidase, beta subunit family |
Protein accession | YP_002282851 |
Protein GI | 209550934 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAAAT ATTCGGTTTT TGCCGTGGCG AGGGAGGCCC TTCGGGGCCA TAAGGGCTGG GAGAAGCAGT GGACTTCGCC TGAGCCGCGC GCCGAATACG ACGTCGTCAT CATCGGCGCC GGTGGTCATG GGCTGGGTGC AGCCTATTAC CTTGCCAAGG AGCACGGCAT CACCAATGTG GCGGTGATCG AAAAAGGCTG GCTCGGCGGC GGCAATACCG GCCGCAACAC CACCATCATC CGCTCGAACT ACCTCTACGA AGAGAGCATG CAGATCTACG AGCATTCGAT GAAGCTCTGG GAAGGGCTTT CGCAGGAGCT CAACTACAAT GTGATGTATT CGCCGCGCGG CGTGATGATG CTCTCGCACA ATATTCACGA CCAGCAGTCG TTCAAGCGGC ATATCCATGC CAACCGGCTC TACGGTATCG ACAATGAATG GCTGACGCCG GAGCAGGCCA AGGCCTATTG CCCGCCGCTC GACATTTCGG CCAGTGCCCG CTACCCGATC AACGGTGCGG CGCTGCAGCG GCGCGGCGGC ACGGCGCGGC ACGATGCGGT TGCCTGGGGT TATGCGCGTG CGGCCTCGGA CCGCGGCGTG CACATCATCC AGAATTGCGA AGTGACCGGT ATCCGCCGCG GCCCCGATGG ACGCGTGACC GGAGTCGAGA CCTCGCGCGG CTTCATCGGC GCCAAAAAGG TCGCCGTGTC GGCCGCCGGC CATACGACGA CGATCATGAA GATGGCCGAT GTGCGCGTGC CGCTGCAATC GAGCCCGCTG CAGGCGCTGG TCTCCGAGCC GCTGAAGCCG ATCTTCCCGT GCGTCGTCAT GTCGAACACG GTGCATGCCT ATATCTCCCA GTCCGACAAG GGAGAGCTCG TCATCGGCGC CGGCACCGAC CAGTATAATT CCTACTCGCA GACCGGCGGG CTGCAGATCA TTACCCATAC GCTCGACGCT ATCTGCGAGC TTTTCCCGAT GTTCCGGCGC GTCAAGATGA TGCGGCAATG GGGCGGCATC GTCGACAATA CGCCGGACCG CTCGGCGATC CAGTCGAAGA CGCCGGTTCC CGGGCTTTAC GTCAATTGCG GCTGGGGCAC CGGCGGCTTC AAGGCGACGC CGGGCTCGGC CAATCTCTTC GCGCATCTGA TTGCCCGCGA CGAGCCGCAC AAGTTCAATG CCGGGCTGAC GCTGGAACGT TTCCGCAGTG GCCGGCTGAT CGACGAGGCG GCGGCGGCAG CGGTGGCACA CTGA
|
Protein sequence | MRKYSVFAVA REALRGHKGW EKQWTSPEPR AEYDVVIIGA GGHGLGAAYY LAKEHGITNV AVIEKGWLGG GNTGRNTTII RSNYLYEESM QIYEHSMKLW EGLSQELNYN VMYSPRGVMM LSHNIHDQQS FKRHIHANRL YGIDNEWLTP EQAKAYCPPL DISASARYPI NGAALQRRGG TARHDAVAWG YARAASDRGV HIIQNCEVTG IRRGPDGRVT GVETSRGFIG AKKVAVSAAG HTTTIMKMAD VRVPLQSSPL QALVSEPLKP IFPCVVMSNT VHAYISQSDK GELVIGAGTD QYNSYSQTGG LQIITHTLDA ICELFPMFRR VKMMRQWGGI VDNTPDRSAI QSKTPVPGLY VNCGWGTGGF KATPGSANLF AHLIARDEPH KFNAGLTLER FRSGRLIDEA AAAAVAH
|
| |