Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4543 |
Symbol | |
ID | 8015300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4670642 |
End bp | 4671589 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644827120 |
Product | aldo/keto reductase |
Protein accession | YP_002978320 |
Protein GI | 241207224 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.785787 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.78245 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATGC GCCGGCTTGG AAAAACAGGT CTATCGGTCG CGCCGATCGT CATCGGCGGC AATGTCTTCG GCTGGACGGC CGACGAAAAA ACATCCTTCG CCATTCTCGA TGCCTTCTTC GACGCGGGCC TGAACACGAT CGATACGGCC GATGTCTATT CCTCATGGGT TCCCGGCAAC AAGGGCGGCG ATTCCGAGGA GATCATCGGG CGCTGGCTAA GCCAAGCCAA GGTTTCTCGC GACAAGGCCG TCATCGTCAC CAAGGTCGGC TCCGACATGG GACAGGGAAA GACGCTGAAG GAGACCTATA TCCTGAAGGC GGTCGAGGAT TCGCTGCGCC GGCTGCAGAC CGACTATATC GACGTCTATC TCTCGCACTG GCCGGATGAA GATACGCCAC ACGAGGAAAC GCTCGGCGCC TTTGCCAAGC TGAAGCAGCA GGGCAAGATC CGCGCCATCG GCTGCTCGAA CTATGATGCG AAACTCCTCC AGGCTTCCTT CGACGCTGCT GAAAAGGCCG GATTGCCGCG GTACGATGTG TTGCAGCCGG AATATAATCT CTATGAGCGT TCGAGCTTCG AGGGGCCGCT TGCCGATCTC TGCGTCAAAG AGGACATCGG CGTCATCACC TATTTCAGTC TCGCCGCCGG CTTCCTCACC GGCAAATACC GCAGCAAATC CGATATGCAA GGCCGCGCAC GCGAGGGCCG GGTTTCGAAA TATCTCGACG ACAAAGGCCT GCGCATCCTG GCCGCGCTCG ACAGTGTTTC TGCCGAGACC GGTGCCAAGC CGGCGGAAAT TTCGCTCGCC TGGCTGCTGC GCAAGAAGGG CGTGACGGCG CCGATCGCCA GCGCGACCAG CCTTTCGCAG CTTGAAAGCC TGGCGAAATC GGCGACACTT GCGCTTTCCG ATGACGCGAT GGCGCTGCTC GACGAAGCCG GTGCCTGA
|
Protein sequence | MEMRRLGKTG LSVAPIVIGG NVFGWTADEK TSFAILDAFF DAGLNTIDTA DVYSSWVPGN KGGDSEEIIG RWLSQAKVSR DKAVIVTKVG SDMGQGKTLK ETYILKAVED SLRRLQTDYI DVYLSHWPDE DTPHEETLGA FAKLKQQGKI RAIGCSNYDA KLLQASFDAA EKAGLPRYDV LQPEYNLYER SSFEGPLADL CVKEDIGVIT YFSLAAGFLT GKYRSKSDMQ GRAREGRVSK YLDDKGLRIL AALDSVSAET GAKPAEISLA WLLRKKGVTA PIASATSLSQ LESLAKSATL ALSDDAMALL DEAGA
|
| |