Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_0091 |
Symbol | smoE |
ID | 3719922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 1801150 |
End bp | 1802460 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640071294 |
Product | ABC sorbitol/mannitol transporter, periplasmic binding protein |
Protein accession | YP_353167 |
Protein GI | 77463663 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.126585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCAA GATTTCGCGC CCTGATGGGC GCGTGCGCCG TGGCTGCGCT CTCGTCCGCC GCCGGCGCCG AAACCATCAC CGTGGCGACT GTCAACAACG GCGACATGAT CCGCATGCAG GGGCTCATGT CCGAGTTCAA CGCGCAGCAC CCCGACATCA CCGTCGAGTG GGTGACGCTC GAGGAAAACG TGCTGCGCCA GAAGGTCACG ACCGACATCG CCACGAAGGG CGGGCAGTTC GACGTGCTGA CCATCGGCAC CTACGAGGTT CCGATCTGGG GCAAGCAGGG CTGGCTCGTG AGCCTGAACG ACCTGCCGCC GGAGTATGAT GCCGACGACA TCCTGCCCGC GATCCGCAAC GGCCTGACCG TCGACGGCGA GCTCTATGCC GCGCCCTTCT ACGGCGAGAG CTCGATGATC ATGTATCGCA AGGACCTGAT GGAGAAGGCG GGGCTGACGA TGCCCGACGC CCCCACCTGG GACTTCGTGA AGGAAGCGGC GCAGAAGATG ACCGACAAGG ATGCCGAGGT CTACGGCATC TGCCTGCGCG GCAAGGCGGG CTGGGGCGAG AACATGGCCT TCCTCACCGC CATGGCCAAC AGCTACGGCG CGCGCTGGTT CGACGAGAAC TGGCAGCCGC AGTTCGACGG CGAGGCTTGG AAGGCCACGC TGACCGACTA TCTCGACATG ATGACGAACT ACGGCCCGCC CGGCGCCTCG AACAACGGCT TCAACGAGAA CCTCGCGCTG TTCCAGCAGG GCAAGTGCGG CATGTGGATC GACGCGACGG TGGCCGCCTC CTTCGTGACC AACCCCGAGG AATCCACGGT GGCCGACAAG GTGGGCTTCG CGCTCGCCCC CGATACCGGC AAGGGCAAGC GGGCCAACTG GCTCTGGGCC TGGAACCTCG CGATCCCGGC GGGCTCGCAG AAGGTCGATG CCGCCAAGCA GTTCATCGCC TGGGCGACCT CGAAGGACTA TGCCGAGCTG GTGGCCTCGA AGGAAGGCTG GGCCAACGTG CCTCCGGGGA CGCGGATCTC GCTCTACGAG AACCCGGAAT ATCAGAAGGT GCCGTTCGCG AAGATGACGC TCGACAGCAT CAACGCGGCT GACCCGACCC ACCCGGCCGT CGATCCGGTG CCTTACGTCG GCGTGCAGTT CGTGGCGATC CCCGAGTTCC AGGGCATCGG CACCGCCGTG GGCCAGCAGT TCTCGGCGGC TCTCGCGGGC TCGATGTCGG CCGAGCAGGC GCTTCAGGCG GCCCAGCAGT TCACGACGCG CGAAATGACC CGCGCGGGCT ACATCAAGTA G
|
Protein sequence | MTARFRALMG ACAVAALSSA AGAETITVAT VNNGDMIRMQ GLMSEFNAQH PDITVEWVTL EENVLRQKVT TDIATKGGQF DVLTIGTYEV PIWGKQGWLV SLNDLPPEYD ADDILPAIRN GLTVDGELYA APFYGESSMI MYRKDLMEKA GLTMPDAPTW DFVKEAAQKM TDKDAEVYGI CLRGKAGWGE NMAFLTAMAN SYGARWFDEN WQPQFDGEAW KATLTDYLDM MTNYGPPGAS NNGFNENLAL FQQGKCGMWI DATVAASFVT NPEESTVADK VGFALAPDTG KGKRANWLWA WNLAIPAGSQ KVDAAKQFIA WATSKDYAEL VASKEGWANV PPGTRISLYE NPEYQKVPFA KMTLDSINAA DPTHPAVDPV PYVGVQFVAI PEFQGIGTAV GQQFSAALAG SMSAEQALQA AQQFTTREMT RAGYIK
|
| |