Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4625 |
Symbol | |
ID | 8015371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4748223 |
End bp | 4748972 |
Gene Length | 750 bp |
Protein Length | 249 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644827200 |
Product | short chain dehydrogenase |
Protein accession | YP_002978400 |
Protein GI | 241207304 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.402352 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.18483 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACGCC TGCAGAACAA GACAGCCCTC ATCACCGGCG GCACCAGTGG CATCGGTCTC GAGACCGCCC GCCAGTTCAT CGCTGAGGGC GCCCGCGTCG TCGTTACCGG CAGCAGCACG GCAAGCGTCG AGGCGGCCCG CGCCGAATTC GGCGGCAAAG CCACCGTCAT CCAGGCCGAT GCGGGCAATG CGGTCGGCCA GAAGGCCGTC GCCGATCGCG TGAGAGAGGC TTTCGGCACG CTCGACATCC TCTTCGTCAA CGCCGGCGTC GCCGAATTCG GGCCGCTGGA ACAGTGGAGC GAAGCCGCCT TCGACAAGTC GGTTGATATC AACGTCAAAG GACCGTTCTT CCTGATTCAG TCACTGCTGC CGATTTTTTC GAAGCAGGCC GCGATCGTGC TCAACACCTC GATCAACGCC CATATCGGCA TGCCGAACTC CAGCGTCTAT TCGCTGACGA AGGGCGCGCT GCTGACGCTT GCCAAGACAT TGTCGGGCGA ACTGATCGGC CGCGGCATTC GCGTCAACGC CGTCAGCCCC GGCCCGATCG CCACGCCGCT CTACAGCAAG CTCGGGGCGT CGGAAGCGGA TTCCAAGGCG ATGACCGCGC AGATCCAGGC TCAGATCCCC GTCGGCCGCT TCGGAACCCC CGGCGAAGTC GCCAAGACGA TCGTCTTCCT CGCCTCCGAT GAGGCGGCCT ATATCGTCGG CAGCGAACTC ATCATCGACG GCGGGATGAG TAACCTCTGA
|
Protein sequence | MSRLQNKTAL ITGGTSGIGL ETARQFIAEG ARVVVTGSST ASVEAARAEF GGKATVIQAD AGNAVGQKAV ADRVREAFGT LDILFVNAGV AEFGPLEQWS EAAFDKSVDI NVKGPFFLIQ SLLPIFSKQA AIVLNTSINA HIGMPNSSVY SLTKGALLTL AKTLSGELIG RGIRVNAVSP GPIATPLYSK LGASEADSKA MTAQIQAQIP VGRFGTPGEV AKTIVFLASD EAAYIVGSEL IIDGGMSNL
|
| |