Gene Information |
Locus tag | Rleg_1198 |
Symbol | |
ID | 8012307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1173742 |
End bp | 1174785 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644823782 |
Product | aldo/keto reductase |
Protein accession | YP_002975032 |
Protein GI | 241203936 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.5324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.017013 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTACA ATTCGCTAGG CCGCACCGAG ATTTCCGTTT CAGAGATTTG CCTTGGCACC ATGACCTGGG GTTCGCAGAA CAGCGAAGCC GATGCTCATG CGCAGATGGA CTACGCCGTC GAAAAGGGCG TCAATTTCTT CGATACGGCC GAACTTTATC CGACCACCCC GATTTCGGCC GCTACGCAGG GCTGGACGGA AGACTATATC GGCAGCTGGT TCAAGAAGAC CGGCAAGCGC GGCGATATCG TGCTCGCCAC CAAGGTCGCC GGCCGCGGCC GCGACTATAT ACGTGGTGGC GAAGGTGCCG ATGCAAAGAA TATCCGCCTG GCGCTCGAAG CCAGCCTGGC GCGGCTGAAG ACGGATTACG TCGACCTCTA CCAGATCCAC TGGCCGAACC GCGGCCATTT CCATTTCCGT CAGAATTGGA GCTACAATCC CTTCAACCAG AACCGCGACG AGGCCGTCGC CAATATGCTC GACATCCTGG AAACGCTTGG CGTGCTGGTG AAGGAAGGCA AGATCCGCGC GATCGGCCTT TCCAACGAAA CCACCTGGGG CATACAGAAA TATCTGACGC TCGCCGAACA GAAGAGCCTG CCGCGGGTCG CCTGCGTCCA GAACGAATAC AACTTGCTCT ACCGCCATTT CGACCTCGAT CTCGCCGAAC TCTCGCATCA TGAGGATGTC GGGCTGCTCG CCTATTCTCC GCTCGCCGGC GGCATCCTCT CCGGAAAATA TGTCGATGGC GGCAGGCCGA AGGGTTCGCG CGGCTCGATC AACCACGATA TCGGCGGTCG CCTGCAGCCG CTACAGGAGC CGGCGACCAA AGCCTATCTG GAGATCGCTG CAACATACCG CCTCGACCCG GCAGCAATGG CGCTCGCCTT CTGCCTTTCC AGGCCCTTCA TGGCCTCGGC CATCATCGGC GCGACCTCGA TGGAGCAGTT GAAAATCGAT ATCGGCGCGG CCGACATTAC GCTTTCGAAC GAGATCCTGG CGGAAATCGC CAAGGTGCAC CGGCAGTATC CGCTGACGCT TTGA
|
Protein sequence | MKYNSLGRTE ISVSEICLGT MTWGSQNSEA DAHAQMDYAV EKGVNFFDTA ELYPTTPISA ATQGWTEDYI GSWFKKTGKR GDIVLATKVA GRGRDYIRGG EGADAKNIRL ALEASLARLK TDYVDLYQIH WPNRGHFHFR QNWSYNPFNQ NRDEAVANML DILETLGVLV KEGKIRAIGL SNETTWGIQK YLTLAEQKSL PRVACVQNEY NLLYRHFDLD LAELSHHEDV GLLAYSPLAG GILSGKYVDG GRPKGSRGSI NHDIGGRLQP LQEPATKAYL EIAATYRLDP AAMALAFCLS RPFMASAIIG ATSMEQLKID IGAADITLSN EILAEIAKVH RQYPLTL
|
| |
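The length fields in this record can be cross-checked against its coordinates. The sketch below is a minimal illustration, not part of the source database: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon (both consistent with the values listed above).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


def gc_percent(sequence: str) -> float:
    """Percent G+C in a nucleotide string; spaces and other characters are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(1173742, 1174785)
    print(length)                  # Gene Length field
    print(protein_length(length))  # Protein Length field
```

Passing the Gene sequence value to `gc_percent` gives a figure to compare with the GC content field; the space-separated blocks of ten bases can be passed as they are, since non-nucleotide characters are skipped.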