Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6594 |
Symbol | |
ID | 8022844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | - |
Start bp | 20374 |
End bp | 21405 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644833463 |
Product | aldo/keto reductase |
Protein accession | YP_002984597 |
Protein GI | 241666513 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.688437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0000922081 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATTATC GCAAGCTCGG TCCCAGCGGG ACCGTCGTCA CCGCCTATTG CCTCGGCACC ATGACCTTCG GCGCGGAGGC CGACGAGGCC GCCTCGCACA AGCTGCTCGA CGATTATTTC GCCTGGGGCG GCAATTTCAT CGATACCGCC GATGTCTACA GCGCCGGCAA GTCAGAAGAG ATCATCGGAC GCTGGCTGAA GGCCCGCCCG ACCGAGGCCC GCCAGGCGAT CGTCGCCACC AAGGGACGCT TTCCGATGGG CAACGGACCC AACGATATCG GCCTGTCGCG CCGGCATCTT AGCCAGGCGC TCGACGATTC GCTCCGCCGC CTCGACCTCG AGCAGATCGA CCTCTACCAG ATGCATGCCT GGGACGCGCT GACCCCGATC GAAGAGACGC TGCGTTTCCT CGACGATGCG GTATCCTCCG GCAAGATCGG CTATTACGGC TTCTCCAATT ATGTCGGCTG GCATATCGCC AAGGCCTCGG AGATCGCCAA GGCGCGCGGT TATACCCGTC CGGTGACGCT GCAGCCGCAA TATAACCTGC TGGTGCGCGA AATCGAACTC GAGATCGTCG CGGCCTGCCA GGATGCCGGC ATGGGTCTGT TGCCGTGGTC ACCGCTCGGC GGCGGCTGGC TGACCGGCAA GTACAAGCGC GATGAGATGC CGACCGGCGC CACCCGCCTC GGTGAAAACC CCAATCGCGG CGGCGAATCC TATGCACCGC GCAATGCGCT GGAGAGAACC TGGGCGATCA TCGGCGTCGT CGAAGAGATC GCCAAGGTGC ATGGCGTCAG CATGGCGCAG GTGGCGCTCG CCTGGACAGC GGCGCAGCCG GCGATCACCT CCGTGATCCT CGGCGCCCGC ACACCGGAAC AGCTCGCCGA CAATCTCGGC GCCATGAAGC TCAAACTCTC CGACGACGAG ATGGCGAGAC TGAACGATGT GAGCGCGCCT CAGCCGTTCG ACTATCCCTA CGGCAAGGGC GGCATTAACC AGCGCCATCG CAAGATCGAG GGTGGGCGCT GA
|
Protein sequence | MDYRKLGPSG TVVTAYCLGT MTFGAEADEA ASHKLLDDYF AWGGNFIDTA DVYSAGKSEE IIGRWLKARP TEARQAIVAT KGRFPMGNGP NDIGLSRRHL SQALDDSLRR LDLEQIDLYQ MHAWDALTPI EETLRFLDDA VSSGKIGYYG FSNYVGWHIA KASEIAKARG YTRPVTLQPQ YNLLVREIEL EIVAACQDAG MGLLPWSPLG GGWLTGKYKR DEMPTGATRL GENPNRGGES YAPRNALERT WAIIGVVEEI AKVHGVSMAQ VALAWTAAQP AITSVILGAR TPEQLADNLG AMKLKLSDDE MARLNDVSAP QPFDYPYGKG GINQRHRKIE GGR
|
| |