Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_1681 |
Symbol | |
ID | 5083413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1724797 |
End bp | 1726107 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640483239 |
Product | extracellular solute-binding protein |
Protein accession | YP_001167879 |
Protein GI | 146277720 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.901925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCAA GATTTCGCGC CCTGATGGGG GCGTGCGCCG TGGCTGCGCT TTCGACCGCC GCAGGCGCCG AAACCATCAC CGTGGCGACC GTCAACAACG GCGACATGAT CCGCATGCAG GGGCTCATGT CCGAGTTCAA CGCGCAGCAT CCCGACATCA CGGTCGAGTG GGTCACGCTC GAGGAAAACG TGCTGCGCCA GAAGGTCACG ACCGACATCG CCACCCGGGG CGGGCAGTTC GACGTGCTGA CCATCGGCAC CTACGAGGTG CCGATCTGGG GCAAGCAGGG CTGGCTCGTG AGCCTGAACG ACCTGCCGCC CGAATATGAC GCCGACGACA TCCTGCCCGC GATCCGCAAC GGCCTGACCG TCGATGGCGA GCTCTATGCC GCGCCCTTCT ACGGCGAAAG CTCGATGATC ATGTACCGGA CGGACCTGAT GGAGAAGGCC GGGCTGACCA TGCCCGACGC CCCCACCTGG GAATTCGTCA AGGAAGCCGC CGCCAAGATG ACCGACAAGG ATGCCGAGAT CTACGGCATC TGCCTGCGCG GCAAGGCCGG CTGGGGCGAG AACATGGCGT TCCTGACCGC CATGGCCAAC AGCTACGGCG CGCGCTGGTT CGACGAGAAC TGGCAGCCGC AGTTCGACGG CGAGGCCTGG AAGGCCGCGC TGACCGATTA TCTCGACCTG ATGACGAACC ACGGGCCTCC GGGCGCCTCG AACAACGGCT TCAACGAGAA CCTCGCGCTG TTCCAGCAAG GCAAGTGCGG CATGTGGATC GACGCGACGG TTGCGGCCTC GTTCGTGACC AACCCCGCGG AATCGACCGT GGCCGACCAG GTGGGCTTCG CGCTGGCGCC CGACACCGGC AAGGGCAAGC GGTCCAACTG GCTCTGGGCC TGGAACCTCG CGGTGCCGGC GGGGTCGCAG AAGGTGGATG CCGCCAAGCA GTTCATCGCC TGGGCAACCT CGAAGGACTA CGCCGAGCTG GTCGCCTCGA AGGAGGGCTG GGCCAACGTG CCTCCGGGGA CGCGAGCCTC GCTCTACGAG AACCCGGAAT ACCAGAAGGT GCCCTTCGCG CAGATGACGC TGGAGAGCAT CAACGCGGCT GATCCGACCA ACCCGGCCGT CGATCCGGTG CCTTACGTCG GTATCCAGTT CGTGGCGATC CCCGAGTTCC AGGGCATCGG CACGGCTGTC GGCCAGCAGT TCTCGGCGGC GCTTGCCGGG TCGATGTCGG CCGAACAGGC GCTGGCCGCG GCACAAGCCT TCACAACGCG CGAGATGACC CGCGCCGGCT ACATCAAGTA A
|
Protein sequence | MTARFRALMG ACAVAALSTA AGAETITVAT VNNGDMIRMQ GLMSEFNAQH PDITVEWVTL EENVLRQKVT TDIATRGGQF DVLTIGTYEV PIWGKQGWLV SLNDLPPEYD ADDILPAIRN GLTVDGELYA APFYGESSMI MYRTDLMEKA GLTMPDAPTW EFVKEAAAKM TDKDAEIYGI CLRGKAGWGE NMAFLTAMAN SYGARWFDEN WQPQFDGEAW KAALTDYLDL MTNHGPPGAS NNGFNENLAL FQQGKCGMWI DATVAASFVT NPAESTVADQ VGFALAPDTG KGKRSNWLWA WNLAVPAGSQ KVDAAKQFIA WATSKDYAEL VASKEGWANV PPGTRASLYE NPEYQKVPFA QMTLESINAA DPTNPAVDPV PYVGIQFVAI PEFQGIGTAV GQQFSAALAG SMSAEQALAA AQAFTTREMT RAGYIK
|
| |