Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4705 |
Symbol | |
ID | 6977799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 345984 |
End bp | 346721 |
Gene Length | 738 bp |
Protein Length | 245 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643393878 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_002278696 |
Protein GI | 209546778 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0218272 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.740514 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGAC TGGAAAGTAA GGTCGCCGTC GTGATCGGCG GCGGAAGCGG CATTGGCGCG GCGATTTCGG AGCGCTTTGC GAAGGAAGGC GCGAGCGTCT ACGCCACGAG CCGAAAGGTG AGCGATGAAG AGGCCCGCGG CCGGCAACCG CTTGACCCGG GAAGCATCCA GCCGGTTCGG GCCGATGCCG GCAATCTGGA CGATCTTGCT GCCGTCTTTG AGCACGTTCG CGCCGAGCGC GGACGGATTG ACATCCTGGT GCTGAATGCC GGCATCTCGG AATACTCTGT GCTCGGCAGC ATCACTACCG ACCATTTCGA TCGGATATTC GCGCTGAACG TCCGCTCGTT GCTGTTTGCC GCCCAGGGCG GACTTGATTT GATGGGATCA GGCGGCTCCA TCGTTCTTGT CGGCTCGATC GCCGACGCCG TCGGAACCAA GGGTTACGGC GTCTATAGCG CTAGCAAGGC GGCCGTTCGG TCCTTCGCCC GGACATGGGC CAGCGAACTC GCGCCGAGGG GTATCCGCGT GAACGTCGTC AGCCCGGGCC CGACGGATAC GGCCATGATG GCTGCAACGA CGGAGGAGGT TCGTCAGGCG CTGACCCATC TTATACCTCT GGGACGATTG GGAAGGCCTG ACGAAGTTGC CTCGGCGGCT CTCTTCCTTG CAAGTGACGA AAGCAGCTTC ACGACCGGAG CGGAACTCTG TGTCGATGGC GGCGCCACCC AAGTTTGA
|
Protein sequence | MSRLESKVAV VIGGGSGIGA AISERFAKEG ASVYATSRKV SDEEARGRQP LDPGSIQPVR ADAGNLDDLA AVFEHVRAER GRIDILVLNA GISEYSVLGS ITTDHFDRIF ALNVRSLLFA AQGGLDLMGS GGSIVLVGSI ADAVGTKGYG VYSASKAAVR SFARTWASEL APRGIRVNVV SPGPTDTAMM AATTEEVRQA LTHLIPLGRL GRPDEVASAA LFLASDESSF TTGAELCVDG GATQV
|
| |