Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5146 |
Symbol | |
ID | 6978240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 784337 |
End bp | 785086 |
Gene Length | 750 bp |
Protein Length | 249 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643394274 |
Product | short chain dehydrogenase |
Protein accession | YP_002279092 |
Protein GI | 209547174 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.485132 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGAC TAGCCAACAA GACCGCTCTG ATCACTGGGG GCACCAGCGG CATCGGCCTT GAAACCGCCC GTCAGTTCGT CGCCGAAGGC GCGCGGGTCG CCGTCACTGG CAGCAGCGCA GCCAGCGTGG AGAAGGCCCG CATCGAGCTC GGCGACAACG TCATCGTCAT TCAGTCAGAC GCGGGCGACG TCAACGGCCA GAAGAAGGTC GTCAGCCAAA TCGAGGAGGC GTTCGGTCAC CTCGACATCC TGTTCGTCAA CGCAGGTGTC GCTCAATTCG GACCGGTCGA AAACTGGTCG GAGGCGGACT TTGACAAGTC GTTTGCAACG AACGTGAAGG GTCCCTACTT CCTGATCCAG GGCCTCCTGC CACTGTTCTC GAAGGGTGCC TCCATCGTGC TGAACACGTC GATCAACGCG CATATCGGCA TGCCGAACTC GAGCGTCTAT TCACTAACAA AGGGAGCGCT CCTGACGCTG GCGAAGACCC TCTCGGGCGA ACTGGTTGGT CGTGGCATCC GCGTGAACGC CGTCAGCCCC GGCCCGATCG CTACCCCGCT TTACGGAAAG CTCGGTATGT CCGAGGCGGA CATGAAGGCG ATGGCCGACG GCGTGCAGAA GCAGATTCCG GTCGGTCGCT TCGGTGACGT GTCGGAGGTA GCAAAGACCG TGGTCTTCTT CGCATCTGAC GAGGCGGCTT ATATCGTCGG CAGCGAGCTC GTCATCGACG GTGGCATGAG CAACCTTTAA
|
Protein sequence | MSRLANKTAL ITGGTSGIGL ETARQFVAEG ARVAVTGSSA ASVEKARIEL GDNVIVIQSD AGDVNGQKKV VSQIEEAFGH LDILFVNAGV AQFGPVENWS EADFDKSFAT NVKGPYFLIQ GLLPLFSKGA SIVLNTSINA HIGMPNSSVY SLTKGALLTL AKTLSGELVG RGIRVNAVSP GPIATPLYGK LGMSEADMKA MADGVQKQIP VGRFGDVSEV AKTVVFFASD EAAYIVGSEL VIDGGMSNL
|
| |