Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3661 |
Symbol | |
ID | 8014507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3706454 |
End bp | 3707707 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826224 |
Product | sarcosine oxidase, beta subunit family |
Protein accession | YP_002977443 |
Protein GI | 241206347 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.616865 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.13922 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAAAT ATTCGGTTTT TGCCGTGGCA CGGGAGGCCC TTCGCGGCCA CAAGGGCTGG GAGAAGCAGT GGACTTCGCC TGAGCCGCGC GCCGAATACG ACGTCGTCAT CATTGGCGGC GGCGGCCACG GGCTGGGCGC TGCCTACTAT CTCGCCAAGG AGCACGGCAT CACCAATGTG GCGGTGATCG AGAAGGGCTG GCTCGGCGGC GGCAATACCG GGCGCAACAC CACCATCATC CGCTCGAACT ATCTCTACGA AGAGAGCATG CACATTTACG AGCATTCGAT GAAGCTCTGG GAAGGGCTTT CCCAGGAGCT CAACTACAAT GTGATGTATT CGCCGCGCGG CGTGATGATG CTTTCGCACA ATATTCACGA CCAGCAGTCC TTCAAGCGGC ATATCCATGC CAACCGGCTC TACGGCATCG ACAATGAATG GCTGACGCCG GAGCAGGCGA GGGCCTATTG TCCGCCGCTC GATATCTCGG CCAGCGCGCG CTACCCGATC AACGGCGCAG CACTGCAGCG GCGCGGCGGC ACGGCCAGGC ACGACGCGGT CGCCTGGGGT TATGCGCGGG CGGCCTCAGA CCGCGGGGTG CACATCATCC AGAATTGTGA AGTGACCGGC ATCCGGCGCG GTCCGGACGG ACAGGTGACC GGGGTCGAGA CCTCGCGTGG CTTCATCGGC GCCAGAAAGA TCGCCGTCTC GGCGGCCGGC CATACGACAA CAATCATGCA GATGGCTGGT GTGCGCGTGC CGCTGCAATC GAGCCCTCTG CAGGCGCTGG TCTCCGAGCC GCTGAAGCCG ATCTTTCCCT GCGTCGTCAT GTCGAACACG GTGCATGCCT ATATCTCCCA GTCCGACAAG GGAGAGCTGG TCATCGGCGC CGGCACCGAC CAGTATAATT CCTATTCCCA GACCGGCGGG CTGCAGATCA TCACGCACAC GCTCGACGCT ATCTGCGAGC TCTTCCCGAT GTTCCGGCGC GTCAAGATGA TGCGGCAATG GGGCGGCATC GTCGACAATA CGCCGGATCG TTCGGCGATC CAGTCGAAGA CGCCGGTTCC CGGGCTTTAC GTCAATTGCG GCTGGGGCAC CGGCGGCTTC AAGGCGACGC CGGGCTCGGC CAATCTCTTC GCGCATCTGA TTGCCCGCGA CGAGCCGCAC AAATTCAATG CCGGGCTGAC GCTGGATCGT TTCCGCAGTG GCCGGCTGAT CGACGAGGCG GCGGCGGCGG CGGTGGCACA CTGA
|
Protein sequence | MRKYSVFAVA REALRGHKGW EKQWTSPEPR AEYDVVIIGG GGHGLGAAYY LAKEHGITNV AVIEKGWLGG GNTGRNTTII RSNYLYEESM HIYEHSMKLW EGLSQELNYN VMYSPRGVMM LSHNIHDQQS FKRHIHANRL YGIDNEWLTP EQARAYCPPL DISASARYPI NGAALQRRGG TARHDAVAWG YARAASDRGV HIIQNCEVTG IRRGPDGQVT GVETSRGFIG ARKIAVSAAG HTTTIMQMAG VRVPLQSSPL QALVSEPLKP IFPCVVMSNT VHAYISQSDK GELVIGAGTD QYNSYSQTGG LQIITHTLDA ICELFPMFRR VKMMRQWGGI VDNTPDRSAI QSKTPVPGLY VNCGWGTGGF KATPGSANLF AHLIARDEPH KFNAGLTLDR FRSGRLIDEA AAAAVAH
|
| |