Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3124 |
Symbol | |
ID | 5324003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3272642 |
End bp | 3273430 |
Gene Length | 789 bp |
Protein Length | 262 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640792074 |
Product | short chain dehydrogenase |
Protein accession | YP_001328785 |
Protein GI | 150398318 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAA TCAGTCTTGA CGCACCGAAA TTGTTCGATC TTGCCGGAAA CGTCGCCCTC GTGACCGGCG CGGGCAGCGG CATCGGCCAA CGCATCGCCA TCGGTCTCGC ACAGTGCGGC GCAGATGTCG GGTTACTAGA CCGACGGACC GATGATGGTC TCGCGAAGAC GGCCGATCTC ATCGAAAAGG CTGGGCGTCG CAGCATTCAG ATCGCCGCTG ATGTCACCAG CGGCGCGGCG CTCGACGAGG CGGTTGCGCG CACAGAAGCG GAGCTGGGCC CGCTGACCCT CGCGCTCAAC GCCGCGGGCA TCGCCAATGC TCATCCGGCG GAGGAGATGG AGGAAACTCA GTTCCAGACG ATGATGGACG TCAATCTGAA AGGCGTTTTC CTCTCCTGCC AGGCGGAAGC GCGCGCCATG TTGAGGAACG GACGCGGGTC GATCGTCAAC ATCGCCTCGA TGTCCGGCGT CATCGTGAAT CGAGGACTTA ACCAGTGCCA CTACAACGCC TCTAAGGCGG GTGTAATTCA CATGTCGAAG TCCATGGCGA TGGAGTGGGT AGGCCGCGGC ATCCGGGTCA ACACCATCAG TCCGGGCTAT ACGGCAACAC CTATGAATAC TAGACCGGAA ATGGTGCACC AGACCAGGCT CTTCGAGGAA CAGACGCCGA TGCAGCGCAT GGCTGGCGTG GACGAAATGG TCGGCCCGGC CGTTTTCCTT CTGTCGGATG CGGCGAGCTT CGTAACCGGA GTAGACCTTC TCGTTGATGG CGGCTTCTGC TGCTGGTGA
|
Protein sequence | MSEISLDAPK LFDLAGNVAL VTGAGSGIGQ RIAIGLAQCG ADVGLLDRRT DDGLAKTADL IEKAGRRSIQ IAADVTSGAA LDEAVARTEA ELGPLTLALN AAGIANAHPA EEMEETQFQT MMDVNLKGVF LSCQAEARAM LRNGRGSIVN IASMSGVIVN RGLNQCHYNA SKAGVIHMSK SMAMEWVGRG IRVNTISPGY TATPMNTRPE MVHQTRLFEE QTPMQRMAGV DEMVGPAVFL LSDAASFVTG VDLLVDGGFC CW
|
| |