Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3658 |
Symbol | |
ID | 8014504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3702709 |
End bp | 3705702 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644826221 |
Product | sarcosine oxidase, alpha subunit family |
Protein accession | YP_002977440 |
Protein GI | 241206344 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.259851 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.147827 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCG TGAACCGTAT CGCAGGCAAG GGGCGCCTGA CACCGGCCAG GACCGCGCGC TTCAGCTTCG ACGGCAAGAG TTATACGGCG CTCGAAGGTG ATACCGTCGC CTCGGCGCTG ATCGCCAACG GCGTGCATCT CATCGGCCGT TCGTTCAAAT ATCACCGGGC CCGCGGTATT CTGTCGGCAG GGGCCGAAGA GCCGAACGCG CTGATCGACG TTTCCCGCGA TACGGCGCGC AAGCAGCCGA ACGTGCGCGC CACCGTGCAG GAAGTCTTCG ACGGCATGAT CGTCAGCTCG CAAAACCGCT GGCCCTCGCT TGCCTTCGAC GTCGGTGCGG TCAACAACCT GATGTCGCCG TTCTTTGCCG CTGGTTTCTA CTACAAGACC TTCATGTGGC CGCGCGCCGC CTGGAAACAC GTCTACGAAC CGCTCATCCG TCGTGCCGCC GGCCTCGGCG TCGCACCGAC GGAGGAGGAT CCGGACCATT ATGCCAGCCG CTATGCCCAT TGCGACGTGC TGGTGGTCGG CGCCGGTGTT GCCGGGCTTT CGGCGGCGCT GGCTGCGGCC GAGACCGGCG CCCGCGTGAT CCTTTGCGAC GAGCAGGCGG AAGCGGGCGG GGCATTGCGC TACGATGCCG GCGTGAGGAT CGACGGACAG GACGGCAATA GCTGGGCACA GAAGGCCGTG GCGCGACTGA AGGCGATGGA TAATGTCGAA GTGCTCACCC GCACGACCGC CTTCGGCTAC TACAACCACA ATTTCGTCGG TCTTGCCGAG CGCGTCACCG ATCATATTGC CAAGCCTTCC CGCGATCTGC CGCGCGAGCG GCTTTGGCAG GTCCGGGCCA AGCGGGTGAT CCTTGCCACC GGCGCGATCG AGCGGCACAT GGTGTTTCCG AACAACGACC GGCCGGGCAT CATGCTCGCC TCGGCGGGGC GGATGTATCT CAATCATTAC GGCGTTGCGG TCGGGGCCAA AGTCGGCATC TACACGGCGC ACGACTCGGC CTATGAGGCG GCTTTCGACC TGAAACGATC CGGTGTTTCG ATCGCCGCCA TCGTCGATTG CAGGCAGACG CCGGGAGCGG CGGTGCTGGA GGAGGCGCGT ACGCTCGGCA TCGATGTTCT GGCCGGCCAA TCGGTCGTCA ACACCTCGGG ACGGCTGCGC ATCTCGTCGA TGACGGTGGC GCGCAACGGC GGCGGTTCGC CGCGCAAGAT CGCGGTCGAT GCGCTGCTGG TTTCGGCCGG CTGGACGCCG TCCGTCCATC TGTTCTCGCA GTCGCGCGGC AAAGTGGCCT TCGATGCCGA AAGCCAGCGT TTCCTGCCCG GCTCCTATGC GCAGGACTGC CTCTCGGTCG GCGCGTGCAA CGGCACCGAC GACCTGCAGC GGACGATCGA GGAGTCGCTT GCCGCCGGCG AACTGATGGC GCAGGCCACC GGCAGAAGCA GTGGCGAGAA GATCGCAATT TCGGCCGAAC AGGCCTATGA CTGGACGGGC GGCATGATTG GTGCTGCCGA AGGCGCCGGG CCGAAGACCA ACGCCAAGGC CTTTATCGAT TTCCAGCATG ACGTCTGCGC CAAGGATATC CGCCTTGCCG TGCGCGAGGG CATGCATTCG ATCGAGCATA TCAAGCGCTT CACGACCAAC GGCATGGCCT CGGACCAGGG CAAGCTCTCC AACATGCATG GCCTGGCGAT TGCCGCTGAA ATGCTCGGCA AGGAAATCCC GCAGGTCGGC CTCACCACCT TCCGTGCGCC CTATACGCCG GTTACCTACG GTACGCTGAT CGGTCATTCG CGCGGAGAGC TGTTCGATCC GACGCGCAAG ACGCCGCTGC ATGCCTGGGA AGAAGCCCAT GGCGCGGTCT TCGAGGATGT CGGCAACTGG AAGCGTGCCT GGTTCTATCC GCAGGCCGGC GAGACCATGC ATCAGGCGGT GGCTCGCGAA TGCCGGACGG CACGCGAGGC GGCCGGTATC TTCGACGCCT CGACACTCGG CAAGATCGAG GTGGTGGGGC CGGATGCGGC GGAATTTCTC AACCTCATCT ACACCAATGC CTGGGACACG CTGAAGCCCG GCAAGGCCCG CTACGGTATC ATGACCCGCG AGGACGGTTT CGTTTATGAC GACGGCGTTG TCGGACGCCT GGCGGACGAC CGTTTCCATG TGACGACGAC CACCGGCGGC GCGCCGCGTG TTCTCCATCA CATGGAAGAT TACCTGCAGA CGGAATTCCC GCATCTGAAG GTATGGCTGA CTTCGGTGAC CGAACAATGG GCTGTCATCG CCGTGCAGGG ACCGAAGGCG CGCGAGATCG TCGCGCCGCT GGTCGAAGGG CTCGATCTTT CGAACGAGGC CTTCCCGCAT ATGAGCGTTG CCGAGTGCAC GGTTTGCGGC GTGCCGGCGC GGCTCTTCCG CGTCTCTTTC ACCGGTGAAA CCGGCTTCGA AATCAATGTG CCGGCCGATT ACGGCCAGTC GGTTCTCGAA GCGGTCTGGG CCAATGCTGA GCCGCTCGGC GCCTGCGTCT ACGGCACGGA GACGATGCAC GTTCTTCGCG CCGAGAAGGG TTACATCATC GTCGGGCAGG ATACCGATGG GACGGTGACC CCCGATGACG CCGGACTTTC CTGGGCGGTT TCGAAGAAAA AGACGGATTT CGTCGGCATC CGCGGGTTGA AGCGGCCGGA TCTCGTCAAG GACGGGCGCA AACAGCTCGT CGGCCTCGTC ACCAAGGACC CGAAGCTGGT GCTCGAAGAA GGCGCGCAGA TCGTCGCGAG CCCGAACGAG CCGAAGCCGA TGACCATGCT CGGCCACGTC ACATCAGCCT ATTGGTCGGA CAATTGCGGC AGGTCGATTG CCTTCGCGCT CGTCGCCGGC GGCCGGGCGC GGATGGGCGA AACGCTCTAT GTGCCGATGC CGGACCGGAC GATCGCCGTA GACGTGACGG ATCTGGTATT CTTTGACAAG GAAGGGGGGC GCATCCATGG CTGA
|
Protein sequence | MSGVNRIAGK GRLTPARTAR FSFDGKSYTA LEGDTVASAL IANGVHLIGR SFKYHRARGI LSAGAEEPNA LIDVSRDTAR KQPNVRATVQ EVFDGMIVSS QNRWPSLAFD VGAVNNLMSP FFAAGFYYKT FMWPRAAWKH VYEPLIRRAA GLGVAPTEED PDHYASRYAH CDVLVVGAGV AGLSAALAAA ETGARVILCD EQAEAGGALR YDAGVRIDGQ DGNSWAQKAV ARLKAMDNVE VLTRTTAFGY YNHNFVGLAE RVTDHIAKPS RDLPRERLWQ VRAKRVILAT GAIERHMVFP NNDRPGIMLA SAGRMYLNHY GVAVGAKVGI YTAHDSAYEA AFDLKRSGVS IAAIVDCRQT PGAAVLEEAR TLGIDVLAGQ SVVNTSGRLR ISSMTVARNG GGSPRKIAVD ALLVSAGWTP SVHLFSQSRG KVAFDAESQR FLPGSYAQDC LSVGACNGTD DLQRTIEESL AAGELMAQAT GRSSGEKIAI SAEQAYDWTG GMIGAAEGAG PKTNAKAFID FQHDVCAKDI RLAVREGMHS IEHIKRFTTN GMASDQGKLS NMHGLAIAAE MLGKEIPQVG LTTFRAPYTP VTYGTLIGHS RGELFDPTRK TPLHAWEEAH GAVFEDVGNW KRAWFYPQAG ETMHQAVARE CRTAREAAGI FDASTLGKIE VVGPDAAEFL NLIYTNAWDT LKPGKARYGI MTREDGFVYD DGVVGRLADD RFHVTTTTGG APRVLHHMED YLQTEFPHLK VWLTSVTEQW AVIAVQGPKA REIVAPLVEG LDLSNEAFPH MSVAECTVCG VPARLFRVSF TGETGFEINV PADYGQSVLE AVWANAEPLG ACVYGTETMH VLRAEKGYII VGQDTDGTVT PDDAGLSWAV SKKKTDFVGI RGLKRPDLVK DGRKQLVGLV TKDPKLVLEE GAQIVASPNE PKPMTMLGHV TSAYWSDNCG RSIAFALVAG GRARMGETLY VPMPDRTIAV DVTDLVFFDK EGGRIHG
|
| |