Gene Information |
Locus tag | Smed_2236 |
Symbol | |
ID | 5323097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 2315788 |
End bp | 2316978 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640791174 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_001327903 |
Protein GI | 150397436 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
| |
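The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Smed_2236.
length = gene_length(2315788, 2316978)
print(length)                  # 1191
print(protein_length(length))  # 396
```

Both results agree with the Gene Length (1191 bp) and Protein Length (396 aa) rows.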
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.060795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCATA GGCGCTGGAC GGCTCTCGGA TCAAGCAATA TCCCTGCGAT CCTAATATCC AGTATGGCGG CTGCGGCCCC GTTCCTTGTG AACTTCACCG CATTCGCTGC GCTTGCGCAG GAGGCGCGGG AATTTTCGAC GCAGACAGGA ACCGTTCTCG TCGAGACGCT CGCCTCCGGG CTCGAACATC CCTGGGCGGT TGAGGCCATG CCCGATGGGG CGCTTATCGT TACCGAGCGA CCGGGCCGGC TACGCATCCT GCGCGACGGC AAGCTTTCAG ACGCGATCAA GGGCGTACCC ACAGTGGCCG CCCACGGCCA GGGCGGCCTT CTCGATGTCG CTCTCGATCG GCAATTCGCG ACGAACAGGA CTATTTATCT CACCTTATCC GCGCGCGGCG AAGGCGGCTA CGGCACGGCC CTTGTCCGCG CGGCCCTCTC GCAGGATGGT CGGAGCCTGA CGGATGCGAA GGAGATCTTC CGGATGAACC GGTTCACCCG GAAGGGGCAG CATTTCGGCT CACGCATTGC CATCGACAAG GACGGCAGCC TGTTTTTCGG CATTGGCGAT CGCGGTGAAG GTGAACGCGC CCAGGACCCG CACGACCATG CCGGCTCGGT CCTCCACATC AATGCCGACG GCAGCATCCC CGCCTCCAAC CCGTTTCGTG GCGGTACTGG CGGCCTGGCC GAAATCTGGT CCACCGGACA TCGGAACCCC CAGGGAATTA CCTTTGATCC GGAAGATGGC AAGCTCCTCA CAGTCGAGCA CGGCGCGCGC GGCGGCGACG AAGTGAACAA TCCGCAGCCT GGCAGAAATT ACGGCTGGCC GGTGATCACC TTCGGCAAGG ACTATTCCGG TGTGGAGATC GGCGAAGGCA CGGCGAAGGA AGGCCTGGAG CAGCCGCTCT TTTACTGGGA CCCCTCGATC GCGCCGGGTG CGATTGCCGT ATACCGCGGC AGCATGTTTC CAGAGTGGAA CGGCGATCTC TTGATCGCAG CACTGAAATA CCAGTTGCTT ACCCGCCTCG ACCGCGACGA GACCGGCACG GTCACGGCCG AGGAACGTTT GTTCGACGGC GAATTCGGCC GAATCCGCGA CGTCATCGTC GCTCCCGACG GGGCACTCAT CATGGTCACC GATGAGGAAG ATGGCGAAGT GCTCAGGGTC TCCAAAGCCC CGACACAGTA G
|
Protein sequence | MRHRRWTALG SSNIPAILIS SMAAAAPFLV NFTAFAALAQ EAREFSTQTG TVLVETLASG LEHPWAVEAM PDGALIVTER PGRLRILRDG KLSDAIKGVP TVAAHGQGGL LDVALDRQFA TNRTIYLTLS ARGEGGYGTA LVRAALSQDG RSLTDAKEIF RMNRFTRKGQ HFGSRIAIDK DGSLFFGIGD RGEGERAQDP HDHAGSVLHI NADGSIPASN PFRGGTGGLA EIWSTGHRNP QGITFDPEDG KLLTVEHGAR GGDEVNNPQP GRNYGWPVIT FGKDYSGVEI GEGTAKEGLE QPLFYWDPSI APGAIAVYRG SMFPEWNGDL LIAALKYQLL TRLDRDETGT VTAEERLFDG EFGRIRDVIV APDGALIMVT DEEDGEVLRV SKAPTQ
|
| |
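The gene sequence above is printed in space-separated blocks of 10 bases, so it needs light cleaning before any statistic such as the GC content row can be recomputed. The sketch below shows one way to do that; the helper names are hypothetical, and the completeness check assumes an ATG start and a standard stop codon (TAA, TAG or TGA), which is a simplification because translation table 11 also permits alternative start codons.

```python
def clean(sequence: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the bases."""
    return "".join(sequence.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    bases = clean(sequence)
    if not bases:
        raise ValueError("empty sequence")
    return sum(1 for base in bases if base in "GC") / len(bases)


def looks_like_complete_cds(sequence: str) -> bool:
    """Assumed check: length divisible by 3, ATG start, standard stop codon."""
    bases = clean(sequence)
    return (
        len(bases) % 3 == 0
        and bases.startswith("ATG")
        and bases[-3:] in {"TAA", "TAG", "TGA"}
    )


# Toy input in the same block layout as the record (not the real gene).
toy = "ATGGCC TAG"
print(round(gc_fraction(toy), 3))
print(looks_like_complete_cds(toy))
```

Pasting the full Gene sequence value into `gc_fraction` should give a figure comparable to the GC content row, provided the database computes GC content over the same span.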