Gene Information |
Locus tag | Rleg2_4522 |
Symbol | |
ID | 6977616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 160787 |
End bp | 161857 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643393700 |
Product | Alkanesulfonate monooxygenase |
Protein accession | YP_002278518 |
Protein GI | 209546600 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
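The coordinate and length fields in this record are related by simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate span (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(160787, 161857)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both results match the Gene Length and Protein Length fields of the record.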
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.574553 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCAATC ATTCCGAATT TCTCTGGTAT ATCCCGAACG ACGTCAGGCC CGGCCATCGC GGCGATTCCG CCATCGGCAA CCACAACAGC CTGGAAACGC TGACCAGCCA CGCGAGAGCG CTGGAGGAGC ATGGTTGGAA GGGCGCGCTT ATCGGCACCG GCTGGGGCCG TCCCGACACC TTCACCGTCG CGGCCTCGCT CGCCGCGCGG ACCACCACCT TCGAGCCGCT GATTGCGATC CGTCCGGGCT ATTGGCGGCC GGCGAACTTC GCCTCCGCGG CGGCGACGCT CGACCATCTG ACGGGCGGCC GGGTGCGGAT CAACATCGTC TCGGGCAAGG ACAACCTGCC CGCCTATGGC GATAGCGAAG GCGACCAGGC GCATCGTTAT GACAGGACCA AGGAGTTCAT GCGGCTGGTC CGCAGGCTGT GGACCGAAGA AAACGTCACC TCTGCAGGCG ACCATTTCCG GGTCGCCGAA TCCACCGTGG TGCCGCGCAT CGAGGTTCGC GGCAACCGCC GGCATCCCAA ATTCTATTTC GGCGGCGCCT CGGAAGCGGC CGAACGGGTG GCGGCCACCG AGGCCGATGT CCAACTTTTC TGGGGCGAGC CGCTCGAGGG CGTCCGCGAG CGGATCGGAC GGCTCAAGGG GCTGAGCCGG GCGCTCGACC GCGACCTCCC GCCGCTGGAA TTCGGGCTGC GAATAACGAC GCTGGTCCGC GACACGACGG AACAGGCCTG GGCCGAAGCC GAGGCGAAGG TCGCCGAGAT GGCGAAAAAC AACGGCACCG GCTGGCACGA TCATCAGCGT GTGCTTGCCG TCGGCCAGCA GCGGCTGCTG GCTCTTTACG AGCGCGGCGA TGTTCTCGAC GACAATCTCT ATACAGCGCC GGGCAAATTC GGCGGCGGCG GCGCCGGCAC CACCTGGCTG GTCGGTTCGG CGGCGGATGT GGCGCGGTCG CTGCGCAAAT ATCAGGATCT CGGGGTCACG CATTTCGTGT TGTCCGACAC GCCCTATCTC TCCGAGATCA AGCGGCAGGG CGATCAGCTT TTGCCGCTGC TGCGCGACTG A
Protein sequence | MSNHSEFLWY IPNDVRPGHR GDSAIGNHNS LETLTSHARA LEEHGWKGAL IGTGWGRPDT FTVAASLAAR TTTFEPLIAI RPGYWRPANF ASAAATLDHL TGGRVRINIV SGKDNLPAYG DSEGDQAHRY DRTKEFMRLV RRLWTEENVT SAGDHFRVAE STVVPRIEVR GNRRHPKFYF GGASEAAERV AATEADVQLF WGEPLEGVRE RIGRLKGLSR ALDRDLPPLE FGLRITTLVR DTTEQAWAEA EAKVAEMAKN NGTGWHDHQR VLAVGQQRLL ALYERGDVLD DNLYTAPGKF GGGGAGTTWL VGSAADVARS LRKYQDLGVT HFVLSDTPYL SEIKRQGDQL LPLLRD
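The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It assumes the codon assignments of NCBI translation table 11, which shares its amino acid assignments with the standard code; it does not handle alternative start codons. All function names are hypothetical.

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest (NCBI layout).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
dna = clean_sequence("ATGAGCAATC ATTCCGAATT TCTCTGGTAT")
print(translate(dna))  # MSNHSEFLWY
```

The translated fragment matches the first block of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.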