Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2037 |
Symbol | |
ID | 6980776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2099794 |
End bp | 2100582 |
Gene Length | 789 bp |
Protein Length | 262 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643396759 |
Product | short chain dehydrogenase |
Protein accession | YP_002281547 |
Protein GI | 209549630 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.00988022 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGACA TCACTCTGAA CGCCCCGAAG CTTTTCGATC TCAGCGGCCG GGTTGCCATC GTCACCGGGG CCGGAAGCGG CATCGGGCAG CGCATTGCCA TCGGCCTTGC CCAATGCGGC GCCGACGTGG CCCTGCTCGA CCGTCGAACC GACGACGGAT TGGCCAGGAC GGCTGAACAT ATCAGCGCCG CCGGCCGCCG CTCGATCCAG ATCGCGGCTG ACGTCACGAA CAAATCTTCG CTTGGAGATG CGGTCGCACG CACCGAAGCC GATCTCGGCG CCTTGACGCT TGCCGTCAAC GCGGCTGGCA TCGCCAACGC GAACCCGGCG GAGGAGATGG AGGAGGACCA ATATCAGACG TTGATGGATA TCAACCTGAA AGGCATCTTC CTTTCCTGCC AGGCCGAGGC TCGCGCCATG CTGAAGAACG GATGCGGCTC CATCGTCAAC ATTGCTTCCA TGTCGGGCGT GATCGTCAAC CGGGGGCTGA ACCAAGCGCA TTACAACGCC TCCAAGGCAG GCGTCATCCA TATGTCGAAG TCGTTGGCCA TGGAATGGGT CGGCCGCGGC ATTCGCGTCA ACACCATTTC CCCCGGCTAC ACGGCAACGC CCATGAACAC CCGTCCGGAG ATGGTCCACC AGACCAAGCT CTTCGAAGAG CAGACGCCGA TGCAGCGCAT GGCTGATGTC GACGAGATGG TCGGTCCGGC GGTGTTCCTG CTGTCGAATG CAGCAAGCTT CGTGACCGGC GTCGATCTTC TCGTCGACGG TGGTTTCTGC TGCTGGTGA
|
Protein sequence | MSDITLNAPK LFDLSGRVAI VTGAGSGIGQ RIAIGLAQCG ADVALLDRRT DDGLARTAEH ISAAGRRSIQ IAADVTNKSS LGDAVARTEA DLGALTLAVN AAGIANANPA EEMEEDQYQT LMDINLKGIF LSCQAEARAM LKNGCGSIVN IASMSGVIVN RGLNQAHYNA SKAGVIHMSK SLAMEWVGRG IRVNTISPGY TATPMNTRPE MVHQTKLFEE QTPMQRMADV DEMVGPAVFL LSNAASFVTG VDLLVDGGFC CW
|
| |