Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_7231 |
Symbol | |
ID | 8022937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | - |
Start bp | 655430 |
End bp | 658387 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644834063 |
Product | sarcosine oxidase, alpha subunit family |
Protein accession | YP_002985197 |
Protein GI | 241667113 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0118066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.098262 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGCT TCCGTCTTTC CTCCGGCGGC CGCATCGACC GCGCCACGAC GCTCGGTTTT ACTTTCGACG GCAGGCCACT CGAAGGCCAT CCGGGCGATA CACTCGCCTC GGCGCTGCTT GCCAACGGCG TCCAGCTCGT CGGCCGCAGC TTCAAATATC ACCGCCCGCG CGGCATCCTG ACGGCGGGCG CTGCCGAACC GAACGCGCTG GTAACGACGG GCAGCGGCGG GCGCACCGAG CCGAATACGC GCGCCACGAT GATCGAGCTC TACCAGGGGC TGACGGCAAA GAGCCAGAAC CGCTGGCCTT CGCTCGGCTT CGATGTCGGC GCGGTCAACG GGCTGCTGTC GCCCTTCCTC AGTGCCGGCT TCTACTACAA GACCTTCATG TGGCCGGCGG CCCTCTGGGA AAAGCTCTAC GAGCCGGTGA TCCGCAAGGC GGCGGGCCTC GGCAGGGCTT CCTACGAGGC CGATCCCGAT AGCTACGAGA AATGCTGGGC GCATTGCGAT CTGCTGGTGA TCGGTGCCGG CCCGACCGGT CTTGCCGCAG CGCTGACCGC CGGCCGCGCC GGTGCGCGCG TCATCCTCGC CGACGAGGGG TCCGAACTTG GCGGCAGCCT GCTTTCCGAC AGCGGCACTA TTGACGGCAA GCCGGCGGAT GCGCTTCTCG GCGAGCTGCT TGCCGAACTG GCGAGCCTGG AGAACGTGCG CCGCCTGCCG CGCATGACGG TGTTCGGCTG GTATGACGAC AATGTCTTTG GCGCCGTCGA ACGGGTGCAG AAGCATGTGG CGATGCCGGA TCCGGAACGC CCCGTGGAAC GGTTGTGGCG CATCGTCGCC CGCCAGGCGA TCCTTGCAAC CGGCGTCGAG GAGCGTCCGC TGGTCTTCGG CGGCAACGAT ATTCCCGGTG TGATGATGGC AGGCGCGATG CGCAGCTATC TCAATCGGCA GGCGGTCGCG CCCGGCAGAA GCACGGTCAT CTTCACCACC AACGATGCCG GATATCGAAC CGCAGCCGAT CTGGAGGCCG CCGGCCTCAC CGTCGCGGCG ATCGTCGACA GCCGCGCGGA TGCGGCCGCG AGCTGGCAGG GCCGCGCCGA AGTGCTGAAG GGCGCCAGGG TCATCGATGC GCTTGGCGGC AAGCGCCTGC GCGGCGTCAG CGTCGAAACC GCAACCGGCA CCCGCCGGAT CGAAGCCGAT GCGCTAGCCA TGTCGGGCGG CTGGAGCCCG ATCATTCATC TTGCCTGCCA TCGCGGTGCG AAACCGCAAT GGTCGGACGA AGCCTCGGCG TTTCTGGCGC CTGTCCAGCA GGAAGGTCTG ACCGTTGCCG GCTCGGCTGC CGGCCTTGCG CAAACCGCGA CCTGCCTCGG CGATGGCGCC GCAAAGGCAG TTGCCGCGTT GAAGGCGATC GGTTTCGCGG CGGCGGAACC GGCCTTTGCC TTGGCGTCCG AGGCGGCTGC ACAGGCAAAG CCGCTCTGGT CGGTCAAGGG CTCGAAGGGC AAGGCCTTCG TCGATTACCA GAACGACGTG CATCTCAAGG ATCTCGGCCT CGCCGTCCGC GAAGGCTACG GTCATGTCGA GCTTGCCAAG CGCTACACCA CATCAGGCAT GGCGACCGAT CAGGGCAAGC TTTCCAACAT CAACGCCATC GGCATCCTTG CCGAGGCGCG CGGCGTGTCG CCCGCCGATG TGGGAACGAC GACTTTCCGG CCCTTCTATA CGCCGGTTTC CTTCGGCGCC TTGACTGGGG CGTCGCGCGG CAAACATTTC CAGCCGGCGC GCAAATCGCC GCTGCATGGC TGGGCTAAGA AGAATGGCGC GATCTTCGTC GAAACCGGCC TCTGGTACCG GTCCGCCTGG TTTCCGCGCG CCGGAGAGAC GACATGGCGT GAAAGCGTCG ACCGCGAGGT GCTGAACATC CGGAAGAATG CCGGCCTCTG CGACGTCTCG ACCCTCGGCA AGATCGAAAT ATCAGGGAGG GATGCCGCGA CCTTCCTCGA TCGCATCTAT TGCAACGGCT TCGCCAAGCT CGCTGTCGGC AAGGCGCGTT ACGGCATCAT GCTGCGTGAG GACGGCTTCA TCTATGATGA CGGCACCACC AGCCGGTTCA GTGACGAGCA TTTCTTCATG ACGACGACGA CCGCACTTGC CGCCGGCGTG TTGACCCATC TCGAATTCTG CGCCCAGACG CTCTGGCCGG AACTCGACGT CTGCTTCGCC TCCTCGACCG ATCAATGGGC GCAGATGGCC GTTGCCGGGC CGAAGTCGCG CGCCATTCTT CAGGAGATCG TCGACGAGGA TTTGTCGGAT GCAGCCTTCC CCTTCATGAG CGCGCGCAAG GTCTCCCTGT TCGGCGGCCA ACTTGAAGGG CGGCTCTTCC GGATTTCCTT CTCGGGCGAA CTCGCCTACG AGCTGGCGGT GCCCGCCGGC TACGGCGAAG GCGTTGCGGA TGCGATCATG GCGGCGGGCG AGAAACATGG CATCTGCGCC TATGGCGCCG AAGCGCTCGG CGTCATGCGC ATCGAAAAGG GCCATGTCAC CCATGCCGAA ATCAACGGCA CGGTGACGCC TGGCGACCTC GGTTTCGGCC GCATGGTTTC ATCAACCAAG CCGGATTTCA TCGGCAAGGC ATTGCTTGCC CGCGAAGGGC TGCAAGATCC CGACCGGCAG CGCCTTGTCG GCGTCAAGCC GCTTAATCCG GCGACCGGTT TCCGCACCGG CTCGCATATT CTCGCCGACG GCGTTGCGGC GACCCTCGAA AACGACCAAG GCTACGTCAC TTCAAGCGCC TTTTCGCCAG GGCTCGGCCA TACGATCGGC CTGGCGCTCG TCAGGCGTGG GCCGGAGCGC ATCGGTGAAA AGGTGACGGT CTGGAACGGC CTGCGCAACG AATTCACCGA TGCGGTGCTC TGCCATCCCG TCTTCATCGA TCCCGAAAAC GAGAAGCTCC ATGCGTGA
|
Protein sequence | MTSFRLSSGG RIDRATTLGF TFDGRPLEGH PGDTLASALL ANGVQLVGRS FKYHRPRGIL TAGAAEPNAL VTTGSGGRTE PNTRATMIEL YQGLTAKSQN RWPSLGFDVG AVNGLLSPFL SAGFYYKTFM WPAALWEKLY EPVIRKAAGL GRASYEADPD SYEKCWAHCD LLVIGAGPTG LAAALTAGRA GARVILADEG SELGGSLLSD SGTIDGKPAD ALLGELLAEL ASLENVRRLP RMTVFGWYDD NVFGAVERVQ KHVAMPDPER PVERLWRIVA RQAILATGVE ERPLVFGGND IPGVMMAGAM RSYLNRQAVA PGRSTVIFTT NDAGYRTAAD LEAAGLTVAA IVDSRADAAA SWQGRAEVLK GARVIDALGG KRLRGVSVET ATGTRRIEAD ALAMSGGWSP IIHLACHRGA KPQWSDEASA FLAPVQQEGL TVAGSAAGLA QTATCLGDGA AKAVAALKAI GFAAAEPAFA LASEAAAQAK PLWSVKGSKG KAFVDYQNDV HLKDLGLAVR EGYGHVELAK RYTTSGMATD QGKLSNINAI GILAEARGVS PADVGTTTFR PFYTPVSFGA LTGASRGKHF QPARKSPLHG WAKKNGAIFV ETGLWYRSAW FPRAGETTWR ESVDREVLNI RKNAGLCDVS TLGKIEISGR DAATFLDRIY CNGFAKLAVG KARYGIMLRE DGFIYDDGTT SRFSDEHFFM TTTTALAAGV LTHLEFCAQT LWPELDVCFA SSTDQWAQMA VAGPKSRAIL QEIVDEDLSD AAFPFMSARK VSLFGGQLEG RLFRISFSGE LAYELAVPAG YGEGVADAIM AAGEKHGICA YGAEALGVMR IEKGHVTHAE INGTVTPGDL GFGRMVSSTK PDFIGKALLA REGLQDPDRQ RLVGVKPLNP ATGFRTGSHI LADGVAATLE NDQGYVTSSA FSPGLGHTIG LALVRRGPER IGEKVTVWNG LRNEFTDAVL CHPVFIDPEN EKLHA
|
| |