Gene Information |
Locus tag | Rsph17029_1727 |
Symbol | |
ID | 4898068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1821242 |
End bp | 1822552 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640112320 |
Product | extracellular solute-binding protein |
Protein accession | YP_001043609 |
Protein GI | 126462495 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
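The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the final codon of the CDS is a stop codon that is not counted in the protein length. The record does not state either convention. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1821242
END_BP = 1822552


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp")  # 1311 bp, matching the Gene Length field
print(length_aa, "aa")  # 436 aa, matching the Protein Length field
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.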
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCAA GATTTCGCGC CCTGATGGGC GCGTGCGCCG TGGCTGCGCT CTCGTCCGCC GCCGGCGCCG AAACCATCAC CGTGGCGACT GTCAACAACG GCGACATGAT CCGCATGCAG GGGCTCATGT CCGAGTTCAA CGCGCAGCAC CCCGACATCA CCGTCGAGTG GGTGACGCTC GAGGAAAACG TACTGCGCCA GAAGGTCACG ACCGACATCG CCACCAAGGG CGGGCAGTTC GACGTGCTGA CCATCGGCAC CTACGAGGTT CCGATCTGGG GCAAGCAGGG CTGGCTCGTG AGCCTGAACG ACCTGCCGCC GGAGTATGAT GCCGACGACA TCCTGCCCGC GATCCGCAAC GGCCTCACCG TCGACGGCGA GCTCTATGCC GCGCCCTTCT ACGGCGAGAG CTCGATGATC ATGTATCGCA AGGACCTGAT GGAGAAGGCG GGGCTGACCA TGCCCGACGC CCCCACCTGG GACTTCGTGA AGGAAGCGGC GCAGAAGATG ACCGACAAGG ATGCCGAGGT CTACGGCATC TGCCTGCGCG GCAAGGCGGG CTGGGGCGAG AACATGGCCT TCCTCACCGC CATGGCCAAC AGCTACGGCG CGCGCTGGTT CGACGAGAAC TGGCAGCCGC AGTTCGATGG CGAGGCCTGG AAGGCCACGC TGACCGACTA TCTCGACATG ATGACGAACT ACGGCCCGCC CGGCGCCTCG AACAACGGCT TCAACGAGAA CCTCGCGCTG TTCCAGCAGG GCAAGTGCGG CATGTGGATC GACGCGACGG TGGCCGCCTC CTTCGTGACC AACCCCGAGG AATCCACGGT GGCCGACAAG GTGGGCTTCG CGCTCGCCCC CGATACCGGC AAGGGCAAGC GGGCCAACTG GCTCTGGGCC TGGAACCTCG CGATCCCGGC GGGCTCGCAG AAGGTCGATG CCGCCAAGCA GTTCATCGCC TGGGCGACCT CGAAGGACTA TGCCGAGCTG GTGGCTTCGA AGGAAGGCTG GGCCAATGTG CCTCCGGGGA CGCGGACCTC GCTCTACGAG AATCCGGAAT ATCAGAAGGT GCCGTTCGCG AAGATGACGC TCGACAGCAT CAACGCGGCT GACCCGACCC ACCCGGCCGT CGATCCGGTG CCTTATGTCG GTGTGCAGTT CGTGGCGATC CCCGAGTTCC AGGGCATCGG CACCGCCGTG GGCCAGCAGT TCTCGGCGGC TCTCGCGGGC TCGATGTCGG CCGCGCAGGC GCTTCAGGCG GCCCAGCAGT TCACGACGCG CGAAATGACC CGCGCGGGCT ACATCAAGTA A
|
Protein sequence | MTARFRALMG ACAVAALSSA AGAETITVAT VNNGDMIRMQ GLMSEFNAQH PDITVEWVTL EENVLRQKVT TDIATKGGQF DVLTIGTYEV PIWGKQGWLV SLNDLPPEYD ADDILPAIRN GLTVDGELYA APFYGESSMI MYRKDLMEKA GLTMPDAPTW DFVKEAAQKM TDKDAEVYGI CLRGKAGWGE NMAFLTAMAN SYGARWFDEN WQPQFDGEAW KATLTDYLDM MTNYGPPGAS NNGFNENLAL FQQGKCGMWI DATVAASFVT NPEESTVADK VGFALAPDTG KGKRANWLWA WNLAIPAGSQ KVDAAKQFIA WATSKDYAEL VASKEGWANV PPGTRTSLYE NPEYQKVPFA KMTLDSINAA DPTHPAVDPV PYVGVQFVAI PEFQGIGTAV GQQFSAALAG SMSAAQALQA AQQFTTREMT RAGYIK
|
| |
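The gene and protein sequences above can be cross-checked with a small translation routine. The sketch below is an illustration only, not the method used to produce this record. It builds the codon table by hand and assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first three and the last three blocks of the gene sequence are used, to keep the example short.

```python
# Codon table in the conventional T, C, A, G ordering of each base position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(sequence: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(sequence.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence (bases 1 to 30).
first_blocks = "ATGACCGCAA GATTTCGCGC CCTGATGGGC"
# Last three blocks; with a 1311 bp gene, base 1291 starts a codon.
last_blocks = "CGCGCGGGCT ACATCAAGTA A"

print(translate(first_blocks))  # MTARFRALMG, the first block of the protein
print(translate(last_blocks))   # RAGYIK, the last six residues of the protein
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein sequence and the GC content field to be compared in the same way.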