Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0832 |
Symbol | |
ID | 4896808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 845506 |
End bp | 846603 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640111416 |
Product | extracellular solute-binding protein |
Protein accession | YP_001042715 |
Protein GI | 126461601 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCACA GACTCACGCG CCGCGGACTG CTCCGCGGCA CCGCCGCGGC GGGCTCGGCC GCCCTCCTCG GCTCGGTCAC GGGCACCCGG CTCTGGGCGC AGGAGCCCGA GAAGCCCGCC GAGCTCATCG TCCGCGCCTG GGGCGGAAGC TGGGTCGAGG CGCTGAAGGC GGGCGTGTCG GATCCCTTCA CCAAGGCCAC CGGCATCGCC GTGCGCCACG ACCTGACCGA GGACAACGAG ATCCAGCCGA AGGTCTGGGC CGCTGTGGCG CAGGGCCGCG TGCCGCCGAT CCACATCAAC TGGGACACGA CGACCAACGC CACCAAGTCG GCGCTGCGCG GCGTGACCGA GGATCTGTCG GGCCTGCCGA ACCTCGCCGC CACCACGGAT CTCGCCAAGC CCGTGGGTCT CGACGGCTAT CCGATCGTGA ACACCTACGG CTATGTCTAT GTGCTGGCCT ATCGTCCCGA GGTCTTCCCG GACGGGCCGC CGAAATCCTG GGAGGTGCTG CTCGAGCCCC GCTTCAAGGG CCGGATCGCG CTCTACAACG ACGGGATCGG CTTCCACTTC CCGGCGCAGG TCGCGGGCGG CGGCAGCCTC GAGGACATTC CGGGCAACAT GCAGCCCGCC TGGGACTTCA TCGCCAAGGT CAAGGCGCAG CAGCCGCTCC TCGGCGAGGA TCCGGACTTC ACCGCCTGGT TCCAGAACGG CGAGATCGAT CTGGCCTGCA CCATCTCGAC CAACGCGCGC GAGGCCCGGA AGAACGGCGT CGACATCGCC TGGACCGTGC CCGAGGAGGG CTGCAAGTTC GACACCGACG GGCTCTGGAT CCCGAAGGGC CTGCCCGAGA ACGAGCTCTA CTGGGCCAAG CAATACATCA ACTTCGCCAT CACGCCCGAG GCGCAGCAGG TCTGGCTCGA CGGGCTCGGC CTGCCGGGCG TGGTGCCGGG CCTGAAGCCG CCCGCCGATC TTGCGAACGA CCCCTCCTAT CCGACGAAGC CCGAGGATTT CGAGCATCTG ATCCGGGTCT CGGCGCAGGT GCAGGTCGAG AACGAGAGCG ACTGGTTCGC GAAGTTCAAG GAGATCATGC AGGGCTGA
|
Protein sequence | MIHRLTRRGL LRGTAAAGSA ALLGSVTGTR LWAQEPEKPA ELIVRAWGGS WVEALKAGVS DPFTKATGIA VRHDLTEDNE IQPKVWAAVA QGRVPPIHIN WDTTTNATKS ALRGVTEDLS GLPNLAATTD LAKPVGLDGY PIVNTYGYVY VLAYRPEVFP DGPPKSWEVL LEPRFKGRIA LYNDGIGFHF PAQVAGGGSL EDIPGNMQPA WDFIAKVKAQ QPLLGEDPDF TAWFQNGEID LACTISTNAR EARKNGVDIA WTVPEEGCKF DTDGLWIPKG LPENELYWAK QYINFAITPE AQQVWLDGLG LPGVVPGLKP PADLANDPSY PTKPEDFEHL IRVSAQVQVE NESDWFAKFK EIMQG
|
| |