Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_4063 |
Symbol | |
ID | 4024580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4515295 |
End bp | 4517094 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637964266 |
Product | extracellular solute-binding protein |
Protein accession | YP_571183 |
Protein GI | 91978524 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.159076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.524387 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGACT TGACTAGACG GAAGACCCAG CCCCGCCGGC TGCGGCTGAT GACGATGGCG AGCGCCGCCG CGCTGATGGC GGCGTCGATG ACGCTGGCGG CGCCGGCGTG GGCCGCCGAT GAGGCGGCGC TGAAGAAATG GATCGACGAG GAGTTTCAAC CTTCGACGTT GTCGAAGGAG GAGCAGCTCA AGGAATTGCA GTGGTTCGCA AAGGCGGCCG AGCCGTTCAA GGGGATGGAT ATCAACGTCG TCTCCGAGAC GATCACCACG CACGAATACG AAGCCAAGAC GCTCGCCAAG GCGTTCTCCG AGATCACCGG CATCAAGCTG AAGCACGATC TGATCCAGGA AGGCGACGTC GTCGAGAAGC TGCAGACCCA GATGCAGTCC GGCAAGAACG TCTATGACGG CTGGATCAAC GATTCGGACC TGATCGGCAC GCATTTCCGC TACGGCCAGG CCATCGCGCT GTCGGACTAC ATGGCCGGCG AGGGCAAGGC CGTCACTCTG CCGACCCTCG ACATCGAGGA CTTTATCGGC CGGTCGTTCA CCACGGCGCC CGACAAGAAG ATGTACCAAC TGCCGGACCA GCAGTTCGCG AACCTTTACT GGTTCCGGTA CGACTGGTTC ACCAACGCGG ACTACAAGGC CAGGTTCAAG GCGAAATACG GCTATGAGCT GGGCGTGCCG GTGAACTGGT CGGCCTATGA GGACATCGCG GAATTCTTCA CCAACGACGT CAAGGAGATC GACGGCGTCA AGGTCTATGG CCACATGGAC TACGGCAAGA AGGACCCGTC GCTCGGCTGG CGCTTCACCG ACGCCTGGCT GTCGATGGCC GGCAATGGCG ACAAGGGCCT CCCGAACGGC CGGCCGGTCG ACGAATGGGG CGTCCGGATG GAAGGCTGCC GTCCGGTCGG CTCCTCGGTG GAGCGCGGCG GCGACACCAA CGGTCCCGCA TCGGTCTATT CGATCGTGAA GTATCTCGAC TGGATGAAGA AGTATGCCCC GCCGCAGGCG CAGGGCATGA CCTTCTCCGA GTCGGGCCCG GTGCCGGCGC AGGGCAACGT CGCCCAGCAG ATGTTCTGGT ACACCGCCTT CACCGCCGAC ATGGTGAAGC CCGGGCTGCC GGTGGTGAAC GCCGACGGCA CGCCGAAGTG GCGGATGGCG CCTTCGCCGA AGGGCGCCTA TTGGAAGGAA GGCATGAAGC TCGGCTATCA GGACGTCGGT TCCGGCACGC TCTTGAAGTC GACCCCGGCG GATCGCCGCA AGGCGGCGTG GCTGTATCTG CAGTTCATCA CCTCGAAGAC GGTGAGCCTG AAGAAGAGCC ATGTCGGTCT CACCTTCATC CGTGAGAGCG ATATCTGGGA CAAGTCGTTT ACGGAACGCG CACCGAAGCT CGGTGGTCTG ATCGAGTTCT ATCGCTCGCC GGCCCGCGTG CAATGGTCGC CGACCGGCAA CAACATCCCG GACTATCCGA AGCTGGCGCA GCTTTGGTGG CAGAACATCG GCGATGCCTC CTCCGGCGCC AAGACCGCGC AGGCGGCGAT GGACTCGCTG GCGGCGGCGC AGGACTCGGT GCTCGAACGC CTCGAGAAGT CGAAGGTGCA GGGCGACTGC GGTCCGAAGC TGAACAAGAA GGAGACCGCC GAGTACTGGT ACGCGAAGTC GGAAAAGGAC GGCAACATCG CTCCGCAGCG CAAGCTCGCC AACGAAAAGC CGAAGGGTGA AACCGTCGAC TACGACACCC TGATCAAGTC CTGGCCGGCC TCGCCACCGA AGCGCGCCGA AGCGAAGTAA
|
Protein sequence | MRDLTRRKTQ PRRLRLMTMA SAAALMAASM TLAAPAWAAD EAALKKWIDE EFQPSTLSKE EQLKELQWFA KAAEPFKGMD INVVSETITT HEYEAKTLAK AFSEITGIKL KHDLIQEGDV VEKLQTQMQS GKNVYDGWIN DSDLIGTHFR YGQAIALSDY MAGEGKAVTL PTLDIEDFIG RSFTTAPDKK MYQLPDQQFA NLYWFRYDWF TNADYKARFK AKYGYELGVP VNWSAYEDIA EFFTNDVKEI DGVKVYGHMD YGKKDPSLGW RFTDAWLSMA GNGDKGLPNG RPVDEWGVRM EGCRPVGSSV ERGGDTNGPA SVYSIVKYLD WMKKYAPPQA QGMTFSESGP VPAQGNVAQQ MFWYTAFTAD MVKPGLPVVN ADGTPKWRMA PSPKGAYWKE GMKLGYQDVG SGTLLKSTPA DRRKAAWLYL QFITSKTVSL KKSHVGLTFI RESDIWDKSF TERAPKLGGL IEFYRSPARV QWSPTGNNIP DYPKLAQLWW QNIGDASSGA KTAQAAMDSL AAAQDSVLER LEKSKVQGDC GPKLNKKETA EYWYAKSEKD GNIAPQRKLA NEKPKGETVD YDTLIKSWPA SPPKRAEAK
|
| |