Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1004 |
Symbol | |
ID | 6979723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1025909 |
End bp | 1026904 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643395716 |
Product | aldo/keto reductase |
Protein accession | YP_002280524 |
Protein GI | 209548607 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0837346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.566721 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGACCA AGACCGCAAC ACCGACCACG ATCACGCTCT GGAACGGCCG CGAAATTCCG CGTCTCGGCA TGGGATGCTG GGCGATCGGC GGCCCCTTCT TTGCCGGCGA CACGCCGCTC GGCTGGGGCG AAGTCGACGA CGATGAATCC GTCGAAGCGA TCGGCAGCGC CATCGACCTC GGCATCCGCT TCTTCGATAC CGCCTCGAAT TACGGCGCCG GCCATTCCGA AGAAGTGCTC GGCCGGGCGA TCGGCAATCG CGACGATATT ATTGTTGCCA CCAAATTCGG CTTTGCCACC GATGCCGCCA CCAAGCAGGC TACCGGCGCC TTTGCCGATG AAGCCTTCAT CCGCCGTTCG GTCGAGACCT CGCTGCGCCG CCTCAAGCGT GACCGCCTCG ATCTCCTACA GTTCCACATC AATGATTTTC CGTTGGAACA GTCCGATGCC GTCTTCGACG TGCTGGAGGC GTTGCGCGTC GAAGGCAAGA TCGACGCATT CGGCTGGAGC ACCGATTCTC CCGATCGCGC CGCCCGCCAT GCCGGCCGCC AGGGCTATGT CTCGGTGCAG CATACGATGA ACGTCTTCGA GCCGGTGCCG GAGATGATCG CAGTGATCGA AAGGCAGGAA CTGATCTCGA TCAATCGCGG TCCGCTGGCC ATGGGGCTGC TGACCGGCAA GTTCACCGCC GACAAGGCGG TGGGCGCCAA GGATGTCCGC GGCGCGGCCC TCGACTGGAT GGTCTACTTC AAGGACGGGC GCATGGCCCC GGAATTTGCC GCAAGGCTCG ACGCCGTCCG CGATCTCCTG ACCTCGGGCG GCCGCACACT GACGCAAGGG GCGCTCGCCT GGCTCTGGGC AAAGTCGCCG CGCACCCTCC CCATTCCAGG CTTCCGCACC GTCGCCCAGG TGGAGGAAAA TGCCGGCGCA CTGGAAAAGG GACCGCTGCC GGCCGATGTC ATGGCGGGGA TCGACGCCGC ACTCGGGCAT CAGTGA
|
Protein sequence | MLTKTATPTT ITLWNGREIP RLGMGCWAIG GPFFAGDTPL GWGEVDDDES VEAIGSAIDL GIRFFDTASN YGAGHSEEVL GRAIGNRDDI IVATKFGFAT DAATKQATGA FADEAFIRRS VETSLRRLKR DRLDLLQFHI NDFPLEQSDA VFDVLEALRV EGKIDAFGWS TDSPDRAARH AGRQGYVSVQ HTMNVFEPVP EMIAVIERQE LISINRGPLA MGLLTGKFTA DKAVGAKDVR GAALDWMVYF KDGRMAPEFA ARLDAVRDLL TSGGRTLTQG ALAWLWAKSP RTLPIPGFRT VAQVEENAGA LEKGPLPADV MAGIDAALGH Q
|
| |