Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_0097 |
Symbol | smoM |
ID | 3719929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 1808034 |
End bp | 1809131 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640071301 |
Product | TRAP-T family sorbitol/mannitol periplasmic binding protein SmoM |
Protein accession | YP_353173 |
Protein GI | 77463669 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.410099 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCGTC GTTCATTCAT CACCAAGGCC GCCGTGGGAG GGGCCGCCGC GAGCGCCCTC GCCGCGCCGG CGCTTGCCCA GTCCGCGCCC AAGGTCACCT GGAGGCTCGC CTCCTCCTTC CCGAAATCGC TCGACACGAT CTTCGGCGGC GCCGAGGTGC TGTCGAAGAT GCTCTCCGAG GCCACCGACG GCAACTTCCA GATCCAGGTC TTCTCGGCGG GCGAGCTGGT GCCGGGCCTG CAGGCCGCCG ACGCCGTGAC CGAGGGCACC GTCGAATGCT GCCACACGGT CGGCTACTAT TACTGGGGCA AGGATCCCAC CTTCGCGCTC GCGGCGGCCG TGCCCTTCTC GCTGTCGGCG CGCGGCATCA ACGCCTGGCA CTACCATGGC GGCGGGATCG ACCTCTACAA CGAGTTCCTT TCGCAGCACA ACATCGTGGC CTTCCCGGGC GGCAACACCG GTGTGCAGAT GGGCGGCTGG TTCCGGCGCG AGATCAACAC CGTGGCCGAC ATGCAGGGCC TGAAGATGCG GGTCGGCGGG TTTGCCGGCA AGGTGATGGA GCGTCTGGGC GTCGTGCCGC AGCAGATCGC GGGCGGCGAC ATCTATCCGG CGCTGGAGAA GGGCACGATC GATGCGACCG AATGGGTCGG CCCCTATGAC GACGAGAAGC TCGGCTTCTT CAAGGTGGCG CCCTACTACT ACTATCCCGG CTGGTGGGAA GGCGGCCCGA CCGTCCATTT CATGTTCAAC AAGAGCGCCT ACGAGGGTCT GACCCCGACC TATCAGTCGC TGCTGCGCAC CGCCTGCCAT GCGGCCGATG CGAACATGCT CCAGCTCTAC GACTGGAAGA ACCCGACGGC GATCAAGTCG CTCGTGGCGC AGGGAACGCA GCTCCGTCCC TTCAGCCCCG AGATCCTGCA GGCCTGTTTC GAGGCCGCGA ACGAGGTCTA TGCCGAGATG GAAGCCTCGA ACCCCGCCTT CAAGAAGATC TGGGACTCGA TCAAGGCCTT CCGCTCCGAG CATTACACCT GGGCGCAGAT TGCCGAATAC AACTATGACA CCTTCATGAT GGTGCAGCAG AACGCCGGCA AGCTCTGA
|
Protein sequence | MDRRSFITKA AVGGAAASAL AAPALAQSAP KVTWRLASSF PKSLDTIFGG AEVLSKMLSE ATDGNFQIQV FSAGELVPGL QAADAVTEGT VECCHTVGYY YWGKDPTFAL AAAVPFSLSA RGINAWHYHG GGIDLYNEFL SQHNIVAFPG GNTGVQMGGW FRREINTVAD MQGLKMRVGG FAGKVMERLG VVPQQIAGGD IYPALEKGTI DATEWVGPYD DEKLGFFKVA PYYYYPGWWE GGPTVHFMFN KSAYEGLTPT YQSLLRTACH AADANMLQLY DWKNPTAIKS LVAQGTQLRP FSPEILQACF EAANEVYAEM EASNPAFKKI WDSIKAFRSE HYTWAQIAEY NYDTFMMVQQ NAGKL
|
| |