Gene Information |
Locus tag | Rleg_2259 |
Symbol | |
ID | 8013262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2265254 |
End bp | 2266042 |
Gene Length | 789 bp |
Protein Length | 262 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644824845 |
Product | short chain dehydrogenase |
Protein accession | YP_002976075 |
Protein GI | 241204979 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
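The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Rleg_2259.
start_bp, end_bp = 2265254, 2266042
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 789 262
```

Both results match the Gene Length (789 bp) and Protein Length (262 aa) fields listed in the record.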
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0792422 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0451006 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGTCCGACA TCACTCTGAA CGCCCCGAAG CTTTTCGATC TCAGCGGCCA GGTTGCCATC GTGACCGGAG CCGGCAGCGG TATTGGTCAG CGCATTGCCA TCGGCCTTGC GCAATGCGGC GCCGACGTGG CGCTGCTCGA CCGCCGAACC GACGACGGGC TGGTCAAGAC GGCCGAACAT ATTCGCGCCG CCGGCCGCCG CTCGATCCAG ATCGCGGCGG ATGTCACGAG CAAATCTTCC CTTGGAGAGG CAATAGCACG GACCGAAGCC GATCTCGGCA CGCTGACGCT TGCAGTCAAC GCGGCAGGCA TCGCTAACGC GAACGCCGCG GAAGAGATGG AGGAGGACCA ATATCAGACG TTGATGGATA TCAACCTGAA AGGTGTCTTT CTTTCCTGCC AGGCCGAGGC TCGCGCCATG CTCAAGAATG GACGCGGCTC CATCGTCAAC ATCGCTTCCA TGTCCGGCGT GATCGTAAAT CGCGGGCTGA GCCAAGCGCA CTATAACGCC TCCAAGGCGG GCGTCATCCA TATGTCGAAG TCCATGGCGA TGGAATGGGT CGACCGCGGC ATTCGCGTCA ACACCATCTC CCCCGGATAC ACGGCAACGC CCATGAACAC CCGTCCGGAG ATGGTCCACC AGACCAAGCT CTTCGAAGAG CAGACGCCGA TGCAGCGCAT GGCAGCGGTG GACGAGATGG TAGGCCCGGC GGTGTTTCTG CTGTCGAATG CAGCAAGTTT CGTGACCGGC GTCGATCTTC TCGTCGACGG CGGTTTCTGC TGCTGGTGA
Protein sequence | MSDITLNAPK LFDLSGQVAI VTGAGSGIGQ RIAIGLAQCG ADVALLDRRT DDGLVKTAEH IRAAGRRSIQ IAADVTSKSS LGEAIARTEA DLGTLTLAVN AAGIANANAA EEMEEDQYQT LMDINLKGVF LSCQAEARAM LKNGRGSIVN IASMSGVIVN RGLSQAHYNA SKAGVIHMSK SMAMEWVDRG IRVNTISPGY TATPMNTRPE MVHQTKLFEE QTPMQRMAAV DEMVGPAVFL LSNAASFVTG VDLLVDGGFC CW
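The sequences above are printed in space-separated blocks of ten characters. The gene starts with GTG, which is an alternative start codon in translation table 11 and is read as methionine at initiation, so the protein begins with M. The sketch below shows one way to strip the block formatting, compute GC content, and check the start and stop codons. The helper names are hypothetical, and only the first and last few blocks of the gene sequence are used for brevity.

```python
# Start codons of NCBI translation table 11 (bacterial, archaeal and plant plastid code).
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: whole codons, a table 11 start codon, and a stop codon at the end."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in TABLE_11_STARTS
        and seq[-3:] in STOPS
    )


# First three and last two blocks of the gene sequence from the record.
head = clean_sequence("GTGTCCGACA TCACTCTGAA CGCCCCGAAG")
tail = clean_sequence("CGGTTTCTGC TGCTGGTGA")
print(len(head), gc_percent(head))
print(head[:3] in TABLE_11_STARTS, tail[-3:] in STOPS)
```

Passing the full gene sequence to `gc_percent` should give a value close to the GC content field in the record, and `clean_sequence` works the same way on the protein sequence if its length needs to be checked against the Protein Length field.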