Gene Information |
Locus tag | Smed_0223 |
Symbol | |
ID | 5321055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 245863 |
End bp | 247962 |
Gene Length | 2100 bp |
Protein Length | 699 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640789158 |
Product | short chain dehydrogenase |
Protein accession | YP_001325917 |
Protein GI | 150395450 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only; [S] Function unknown |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases); [COG3347] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02632] rhamnulose-1-phosphate aldolase/alcohol dehydrogenase |
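The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp copied from the Smed_0223 record.
length = gene_length(245863, 247962)
print(length, protein_length(length))
```

For this record the sketch gives 2100 bp and 699 aa, matching the Gene Length and Protein Length fields.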
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.133149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.171909 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTCGGCA AGCAACAGGG CGCGCGCCTT GCCAATCTTT GGGACGACGG CAAGGCGGCA GGAATGACCG AGCCTGAAAG GCTCCTCTAT CGCTCGAACC TGCTCGGATC TGACAAGAGG ATCACCAATT ACGGCGGCGG CAACACCTCG GCCAAGGTCA TGGAAAAGGA TCCGTTGACG GGCGAGATGG TCGAGGTCCT CTGGGTCAAG GGCTCCGGCG GCGACGTCGG CACGATCAAG ATGGACGGCT TCGCCACCCT CTATATGGAC AAGCTGCGGG CCCTCGAGGG CATCTATCGC GGGGTCGAAT TCGAAGACGA GATGTTCGGG TATCTGCCGC ATTGCACCTT CAATCTCAAC CCGCGCGCCG CTTCGATCGA CACGCCGCTG CATGCCTATG TTCCGAAGGT GCATGTCGAC CACATGCATC CGGATGCGAT CATTGCCATC GCCGCCTCGA AGAACAGCAA GGAACTGACG AGCAAGATAT TCGGAACTGA GATCGGCTGG CTGCCGTGGA AACGGCCCGG CTACGAACTC GGTCTCTGGC TTGAAAGATT CTGCCGTGAG AACCCGGAAG CGCGCGGCGT CGTGCTCGAA AGCCACGGCC TCTTCACCTG GGGCGACACG GCCAAGCAGG CCTACGAGAC GACGATCGAA ATCATCAATC GCGCCATTGC CTGGTTCGAG ACGGAAAACA CGGGTCCGGC CTTCGGTGGA CCGGCGAAAC CGGCGCTCGC CGGCGGTGAA CGCGTCGCCA TCGCCAGGAA ACTGATGCCT GTCATCCGCG GTCTCATCAG CGGAGCGGAA AGCAAGGTCG GGCACTTCGA CGACAGCGAG GCCGTGCTCG ATTTCGTAAC CTCTACAAAT CTGGAGCCGT TGGCAGCACT CGGCACCAGT TGCCCCGATC ACTTCCTGCG CACCAAGATC CGCCCGCTGA TCGTCGACTT CGACCCGGCC CAGCCGGACC TCGAAAAGAC GCTTGCGGGC CTGCCCGAGG CGATCGCGAC CTATCGCGCC GATTACGCGG CCTATTACGA CCGCTGCAAG CGCGCCGACA GCCCGGCGAT CCGTGATCCG AACGCCGTCG TCTATCTCCT GCCGGGCGTC GGCATGATCA CCTTCGCCAA GGACAAGGCG ACGGCGCGCA TCTCCGCGGA GTTCTACGTC AACGCCATCA ATGTCATGCG CGGCGCGTCC GGCGTCTCGA CCTATGTCGG CCTGCCGGAG CAGGAAGCTT TCGACATCGA ATACTGGCTG CTCGAAGAAG CCAAGCTTCA GCGCATGCCG AAGCCGAAGA GCCTTGCCGG GCGCATCGCT CTCGTCACCG GCGGCGCCGG CGGCATAGGC AAGGCGACAG CCAACCGGCT GATGCAGGAA GGCGCCTGCG TCGTGCTTGC AGATATCGAC GAACGGGCGC TCGAGGCTGC GCAGAACGAG CTTTCGAACC GCTATGGCAA GGATTTCGTC CGCTCCGTCA ACATGAACGT CACCGATGAG GCGGCCGTCG CGTCGGGTTT CGGCGACGCG CTCCTGGCCT TCGGCGGTCT CGATATTCTC GTCTCCAATG CGGGTCTCGC GACGTCGGCT GCTGTCGAGG ACACCACGCT GGCGCTCTGG AACAAGAATA TGGACATTCT CGCGACGGGC TATTTCCTGG TCTCGCGCGA GGCTTTCCGC ATCTTCCGCA ACCAGAAGAC CGGCGGAAAT GTCGTCTTCG TCGCTTCGAA GAACGGGCTT GCGGCCTCGC CCGGAGCTTC TGCCTATTGC ACTGCCAAAG CGGCCGAAAT CCACCTCGCC 
CGCTGCCTGG CGCTCGAGGG GGCTTCGGCG CAGATTCGCG TCAACGTGGT CAATCCCGAC GCGGTACTGC GCGGTTCCAA GATCTGGACC GGCGAATGGA AGGAACAGCG CGCCGCCGCC TACAATATGG ACGTCGATGA ACTGGAGGCG CATTACCGCG AGCGCTCCAT GCTGAAGCTC AGCGTATTCC CGGAAGATAT CGCCGAGGCA ATCTACTTCC TTGCTTCCGA CATGTCGGCG AAGTCCACCG GTAACATCGT CAATGTCGAC GCGGGCAATG CCCAGTCGTT CACGCGCTGA
Protein sequence | MLGKQQGARL ANLWDDGKAA GMTEPERLLY RSNLLGSDKR ITNYGGGNTS AKVMEKDPLT GEMVEVLWVK GSGGDVGTIK MDGFATLYMD KLRALEGIYR GVEFEDEMFG YLPHCTFNLN PRAASIDTPL HAYVPKVHVD HMHPDAIIAI AASKNSKELT SKIFGTEIGW LPWKRPGYEL GLWLERFCRE NPEARGVVLE SHGLFTWGDT AKQAYETTIE IINRAIAWFE TENTGPAFGG PAKPALAGGE RVAIARKLMP VIRGLISGAE SKVGHFDDSE AVLDFVTSTN LEPLAALGTS CPDHFLRTKI RPLIVDFDPA QPDLEKTLAG LPEAIATYRA DYAAYYDRCK RADSPAIRDP NAVVYLLPGV GMITFAKDKA TARISAEFYV NAINVMRGAS GVSTYVGLPE QEAFDIEYWL LEEAKLQRMP KPKSLAGRIA LVTGGAGGIG KATANRLMQE GACVVLADID ERALEAAQNE LSNRYGKDFV RSVNMNVTDE AAVASGFGDA LLAFGGLDIL VSNAGLATSA AVEDTTLALW NKNMDILATG YFLVSREAFR IFRNQKTGGN VVFVASKNGL AASPGASAYC TAKAAEIHLA RCLALEGASA QIRVNVVNPD AVLRGSKIWT GEWKEQRAAA YNMDVDELEA HYRERSMLKL SVFPEDIAEA IYFLASDMSA KSTGNIVNVD AGNAQSFTR
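The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG and ends with TGA) and that the spaces between 10-base blocks are display formatting only. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its set of permitted start codons; this sketch does not handle alternative start codons, which is sufficient here because the gene starts with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order with the third
# base varying fastest. '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between display blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence from the record.
prefix = "ATGCTCGGCA AGCAACAGGG CGCGCGCCTT"
print(translate(prefix))
```

Applied to the first 30 bases, the sketch reproduces the first ten residues of the protein sequence, MLGKQQGARL. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.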