Gene Information |
Locus tag | Rleg_1636 |
Symbol | |
ID | 8012707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1626791 |
End bp | 1627921 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644824222 |
Product | Alkanesulfonate monooxygenase |
Protein accession | YP_002975463 |
Protein GI | 241204367 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent |
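
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1626791, 1627921)
print(length_bp, protein_length(length_bp))  # prints: 1131 376
```

Under these assumptions the computed values agree with the Gene Length (1131 bp) and Protein Length (376 aa) fields.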
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0295033 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTTT TCTGGTACAT GTGCGCGCCC GACGGCGCCT ATCCCTGGCA GCCGGAAGGC TCTCGCCAGG TCGACTACGG CTACTACAAG CAGCTTGCCC AGGCCTATGA CCATCTGGGT TACACCGGCG CCCTGTTTGC GACCGGCGCC CACGACGTCT GGGTGCTGGC CAGCGCGCTG CTTGCCCACA CCGAGCGGCT ACGCTTTCTC GTCGCCATCC ATCCCGGTCT CGTCGCGCCG ACATTGCTTG CCAAGATGGC GGCGACCTTT CAGGAATTCG CCCGCGGCCG GCTGCTGATC AATGTCGTCT CGGGCGACGC CAAGATGCTC GGCGCTTATG GCATGACGCT GCCGCATGAC GAGCGCTACG ACATGGCGGA CGAATATCTT CAGCTATGGC ACCGGCTCTT TGCCGGCGAG AGCGTCACTT ATCAGGGCAA GTATTTCTCG ACCGACGGCG CCAAGCTCGC CTTGCCGGTC GGCGAGAGCA TCGCACCGCC GCCGCTCTGG TTCGGCGGCT CCTCGGACAA GGCGCTGGAG GTCGCGGCAA AACATGTGGA CACCTATCTC TCCTGGGGCG AGACGCCCGC CCAGATCGGC GAGAAGGTCG AGGCGGTGAA GGCGCGCGCC GCCCACTACG GGCGTGAACT CGAATACGGC ATCCGCCTCT ATGTCATCGT TCGCGACACC GACGAAAAGG CCTGGGAGGC GGCGGCCGAT CTCTATGGTC GCATGGACGA TGCGGCGATC GCCGCCAACC AGCGCTTCGT CGCAAGAAGC GATTCCGTCG GCCAGCAGCG CATGACCGCA CTGCATGGCG GCCTCAAGCC GGAAAACCTG CGCGATCTCG AAGTTGCGCC GAACCTCTGG GCCGGGATCG GTCTGGTCAG GCCAGGCCCG GGTACGGCGA TCGTCGGTTC GCCGGATACG GTTCTGCGCA CGCTCGAAGC CTATCAGAAG GCTGGCGTCG ACACTTTCAT TCTTTCTGGC ATGCCGCTGC TTGAGGAAGC CTATCGTTTC GGCGAAAAGG TGCTGCCGCG GCTCGATGTC AGCAGGGAAG TCTCGAAGGC GCGGAACTAC ACTTGGTCGA CGCTCTTCGA TCGCGATCTT TCGACCGTCA AGAGCGCCTG A
|
Protein sequence | MNVFWYMCAP DGAYPWQPEG SRQVDYGYYK QLAQAYDHLG YTGALFATGA HDVWVLASAL LAHTERLRFL VAIHPGLVAP TLLAKMAATF QEFARGRLLI NVVSGDAKML GAYGMTLPHD ERYDMADEYL QLWHRLFAGE SVTYQGKYFS TDGAKLALPV GESIAPPPLW FGGSSDKALE VAAKHVDTYL SWGETPAQIG EKVEAVKARA AHYGRELEYG IRLYVIVRDT DEKAWEAAAD LYGRMDDAAI AANQRFVARS DSVGQQRMTA LHGGLKPENL RDLEVAPNLW AGIGLVRPGP GTAIVGSPDT VLRTLEAYQK AGVDTFILSG MPLLEEAYRF GEKVLPRLDV SREVSKARNY TWSTLFDRDL STVKSA
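
The relation between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the 10-character blocks are display spacing only, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean` and `translate` are hypothetical, not part of any database tooling.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAACGTTT TCTGGTACAT GTGCGCGCCC"
print(translate(clean(first_blocks)))  # prints: MNVFWYMCAP
```

The output matches the first block of the protein sequence. Applying the same functions to the full gene sequence should reproduce the complete 376-residue protein.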