Gene Information |
Locus tag | Rleg2_5249 |
Symbol | |
ID | 6978343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 882473 |
End bp | 883429 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643394361 |
Product | short chain dehydrogenase |
Protein accession | YP_002279179 |
Protein GI | 209547261 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
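The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical and not part of the database, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(882473, 883429)
print(length)                     # 957
print(protein_length_aa(length))  # 318
```

Under these assumptions the results agree with the Gene Length (957 bp) and Protein Length (318 aa) rows.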
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.109132 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCGA AAGCCACCGC CGCGAAGCAG CGCGATATCC AGGCTCAGGT CGAAAAGGCC GACAAGAAAG CGAAATTGAA GGCGACCGGC GCCATGCAGG CTGGCGCTCG GCATTATCCT GAACCACCAT TCCCGAAAGT TCATCAAGAC AAGCCCGGCT CCGAAGCCGA TCTTCCGCTC GCCCCGATGT GTGACGCACC CTTCTACGAG GGCTCGGGAA AGCTAAGGGA CAAGATTGCC CTCATAACAG GTGGCGATTC CGGCATCGGT CGCTCCGTTG CGATCCTGTT CGCAAGGGAG GGTGCCGATA TCGCCATTGT TCATCTCGAC GAGGATCAGG ATGCGGCGGA CACGAAGGCG GCTATCGAAA AGGAAGGCCG CCAATGCCTC GTCATCAAGG GCGACGTCAA GGATCCGAAG TTCTGCCGCG AAGCTGTTCA AAGGACTACC GAGCATTTCT CGCGCCTTGA TGTACTTGTA AACAATGCAG CGTTCCAGGT CCACGCCGCT GCCATCGAGG ACCTGACCGA CGAGCACTTC GATGAGACGC TGAAAACCAA CCTCTACGGC TACTTTTACA TGGCGAAGGC GGCAATACCT TACCTGACGA ACGGATCGGC GATCATCAAC ACAGGATCGG TCACCGGACT GGAAGGCTCG AAGGAACTGC TGGACTACTC GATGACGAAG GGTGGTATCC ATGCTTTCAC CAAGGCTCTT TCAAGCCAAC TCGTTCCAAA GGGCATCCGC GTCAACGCTG TCGCGCCGGG GCCTGTCTGG ACGCCGTTGA ATCCTTCGGA CAAGCAGGCC GATGACGTTG CCAAATTCGG CAGTCAGACA ACAATGAAGC GTGCGGCGCA GCCCGAGGAG ATTGCGCCCG CGTACGTCTT CCTCGCCTCC CCGCAGATGT CGAGCTACAT CACCGGCGAG ATTTTGCCGA TCGTCGGCGG ATATTGA
|
Protein sequence | MSSKATAAKQ RDIQAQVEKA DKKAKLKATG AMQAGARHYP EPPFPKVHQD KPGSEADLPL APMCDAPFYE GSGKLRDKIA LITGGDSGIG RSVAILFARE GADIAIVHLD EDQDAADTKA AIEKEGRQCL VIKGDVKDPK FCREAVQRTT EHFSRLDVLV NNAAFQVHAA AIEDLTDEHF DETLKTNLYG YFYMAKAAIP YLTNGSAIIN TGSVTGLEGS KELLDYSMTK GGIHAFTKAL SSQLVPKGIR VNAVAPGPVW TPLNPSDKQA DDVAKFGSQT TMKRAAQPEE IAPAYVFLAS PQMSSYITGE ILPIVGGY
|
| |
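The relationship between the gene sequence and the protein sequence can be reproduced with a small translator. This is a minimal sketch, not the database's own pipeline: the helper names `clean`, `translate` and `gc_fraction` are hypothetical, and the codon table is written out by hand. NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard amino acid string is used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last blocks of the gene sequence from the record above.
print(translate("ATGTCATCGA AAGCCACCGC CGCGAAGCAG"))  # MSSKATAAKQ
print(translate("ATTTTGCCGA TCGTCGGCGG ATATTGA"))     # ILPIVGGY
```

Passing the full Gene sequence value to `translate` should yield the Protein sequence value once its spacing is removed, and `gc_fraction` on the same input gives the figure to compare with the GC content row.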