Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3151 |
Symbol | |
ID | 4023656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 3503196 |
End bp | 3504476 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637963352 |
Product | extracellular solute-binding protein |
Protein accession | YP_570278 |
Protein GI | 91977619 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.345086 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTACGAA ACTGGATGGT GGCGGCGGCC TTCTCGCTCG CCGCCGGCGT TGCCCATGCG CAGACGCAGA CCGAAGTCGT GCTGCAATAT CCCTATCCCG AGCTGTTCAC CGAGACCCAC AAGCAGATCG CCGCCGAATT CGCCAAGGTG CGCCCGGAGA TCAAGGTCAC GCTGCGCGCG CCCTATGAAT CCTACGAGGA AGGCACCCAG AAGATTCTGC GCGAGTCCGT CACCAATCAG CTCCCCGACG TCACCTTCCA GGGCCTGAAC CGCGTTCGCG TGCTGGTCGA CAAGAACATT CCGGCCGAGC TCGACGGCTA CATCGCCGCC GAAAAGGACT TCGACAAGCA GGGCTTCCAT CAGGCGATGT ACGACATCGG CACCGCCAGC GGAAAGGTCT ACGCGCTGCC GTTCGCGATC TCGCTGCCGA TCGTCTACGT CAACCTCGAT CTGGTGAAAC AGGCCGGCGG CGATCCGAGC AATCTGCCGA CGAGCTGGGA CGGCCTGATC GATCTGGCCA AGAAGATCAA GGCGCTGGGT CCGGAAACCA ATGGCATCAC CTATGCCTGG GACATCACCG GCAACTGGCT GTGGCAGGCG CCGGTGTTCT CCCGCGGCGG CAGCATGCTG AACGCCGACG AGACCAAGGT GGCGTTCGAC GGCCCGGAAG GCCAGTTCGC GATGAAACAG ATCGCTCGCC TGGTCACCGA GGGCGGCATG CCGAACCTCG ACCAGCCGTC GATGCGCGCG ACCTTCGCGG CAGGCAAGAC CGGCATCCAC ATCACCTCGA CCTCGGACCT CAACAAGACC ACGCAGATGA TCGCCGGCAA GTTCGCGCTG AAGACCCACA CCTTCCCGGA CGTGCTGAAA CCGAACGGCC GGCTGCCGGC CGGCGGCAAC GTGGTGCTGA TCACCGCCAA GGACAAGGCC AAGCGTGACG CCGCCTGGGA GGTCGTGAAG TTCTGGACCG GGCCGAAGGG CGCCGCGATC ATGGCGGAGA CCACCGGCTA CATGCCGCCG AACAAGCTCG CCAACGACGT CTATCTGAAG GACTTCTACG CGAAGAATCC GAACAACTAC ACCGCGGTCA GCCAACTCGC CCTGCTGACC AAATGGTACG CGTTCCCCGG CGACAACGGC CTGAAGATCA CCGACGTGAT CAAGGACCAC CTCAACTCGA TCGTCAACGG CGCGCGGGCC AAGGAGCCCG AGGCGGTGCT CGCCGACATG ACGAAGGACG TCCAGAAACT GCTGCCGAAA TCGGTCGGCG CCGCGCGCTG A
|
Protein sequence | MLRNWMVAAA FSLAAGVAHA QTQTEVVLQY PYPELFTETH KQIAAEFAKV RPEIKVTLRA PYESYEEGTQ KILRESVTNQ LPDVTFQGLN RVRVLVDKNI PAELDGYIAA EKDFDKQGFH QAMYDIGTAS GKVYALPFAI SLPIVYVNLD LVKQAGGDPS NLPTSWDGLI DLAKKIKALG PETNGITYAW DITGNWLWQA PVFSRGGSML NADETKVAFD GPEGQFAMKQ IARLVTEGGM PNLDQPSMRA TFAAGKTGIH ITSTSDLNKT TQMIAGKFAL KTHTFPDVLK PNGRLPAGGN VVLITAKDKA KRDAAWEVVK FWTGPKGAAI MAETTGYMPP NKLANDVYLK DFYAKNPNNY TAVSQLALLT KWYAFPGDNG LKITDVIKDH LNSIVNGARA KEPEAVLADM TKDVQKLLPK SVGAAR
|
| |