Gene Information |
Locus tag | RPB_1651 |
Symbol | |
ID | 3909928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1880444 |
End bp | 1881994 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637883545 |
Product | extracellular solute-binding protein |
Protein accession | YP_485270 |
Protein GI | 86748774 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
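The coordinate and length fields above can be cross-checked with a little arithmetic. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes a single untranslated stop codon. The record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1880444
end_bp = 1881994

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the result matches the listed gene length of 1551 bp and protein length of 516 aa, with no bases left over outside a complete codon.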
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAGC GCACTTTGAT TGGGCTTGCC CTGGTCTGCG GTCTGAACGC CGTGGCCGTG GCGGAGGAGC CGAAAGTAGG CGGCGTCGTG AATGCCGTGA TTCAGCCCGA GCCTCCGGGC CTGATGCTGG GTCTGGTCCA GAACGGCCCG ACCCAGATGA TCTCCGGCAA CATCTATGAT GGCCTGCTAC GCTATGGTCC GAAACTCGAG CCGCAGCCCG GCCTCGCCGA GAGCTGGACG GTGAGCGAGG ACGGCAAGGT CTACACCTTC AAGCTCAGGA GCGGCGTGAC CTGGCACGAC GGCAAACCGT TCACGTCCGC CGACGTGCTG TTCTCGATCG AATTCCTCAA GCAGACCCAC GCCCGCGCCC GTGGCAACCT CGCCACGCTC GACAAGGTGG AGGCGCCCGA CGCCTCGACC GTCGTGTTCA CGCTGAAGGA GCCGTTCGGG CCGTTCCTCG GCATCTTCGA AGTCGGCTCG ATGCCGATGA TTCCGAAGCA CATCTACGAG GGAACGGATT TCAAGGCCAA TCCCGCCAAC AACACGCCGA TCGGCACCGG CCCCTACATG TTCAAGGAAT GGCAGAAGGG CTCGTTCATC CGGCTGGTGA AGAATCCGAA CTACTACATC AAGGGCAAGC CCCACATCGA CGAGATCTAC TGGCACGTCA TTCCCGACGC GGCGGCGCGG TCGGTGGCGT TCGAGACCGG CAAGATCGAC GTGCTGCCCG GCGGCTCGGT GGAAAACTTC GACGTACCGC GCCTGAGCAA ACTGAAGAAC GTCTGCGTCA CCGGCGCCGG CTGGGAATTC TTCAGCCCGC ATTCCTGGCT GTGGCTCAAC AACCGCTCCG GGCCGACGGC GAACACGAAA TTCCGTCAGG CGCTGATGTT CGCCATCGAC CGCAATTTTG CGCGGGACGT GATCTGGAAC GGCCTCGGCA AGGTCGCGAC GGGCCCCTCC GGCTCCGCGA TCAAATACTA TACCAGCGCC GTGCCGAAAT ACGATCTCGC CCCCGCGAAG GCCAAGGCCC TGCTCAAGGA AGCCGGCTAC AAAGGCGAGA AAGTCCGGAT GCTGCCGCTG CCCTACGGCG AAACCTGGCA GCGCTGGGCC GAAGCGGTGA AGCAGAATCT GCAGGACGTC GGCATCAATG TCGAAATGAT CGCCACCGAC GTCCCCGGTT GGAACCAGAA AGTCTCCGAC TGGGACTACG ACATCGCCTT CACCTATCTG TATCAGTACG GCGACCCCGC GCTCGGCGTC GGCCGCAATT ACGTCAGCAG CCAGATCGCC AAGGGCTCGC CGTTCAACAA CGTCGAAGGT TACTCGAATC CGGAGGTCGA CAAGCTGTTC GCCGAGGGCG CGGTCGCCTT CCCGGATACC AAGCGCGACG AGATCTATGC CAAAGCGCAA AAGATCCTGG TCGAAGACGT GCCGGTCGCA TGGCTGCTCG AGCTGCAATT TCCGACCATC ACCCGCTGCA ATGTCAAGAA CCTCGTCACC ACCGCGATCG GCGTCAACGA CGGTTTCCGC GACGCCTGGC TCGACAAGTA A
|
Protein sequence | MLKRTLIGLA LVCGLNAVAV AEEPKVGGVV NAVIQPEPPG LMLGLVQNGP TQMISGNIYD GLLRYGPKLE PQPGLAESWT VSEDGKVYTF KLRSGVTWHD GKPFTSADVL FSIEFLKQTH ARARGNLATL DKVEAPDAST VVFTLKEPFG PFLGIFEVGS MPMIPKHIYE GTDFKANPAN NTPIGTGPYM FKEWQKGSFI RLVKNPNYYI KGKPHIDEIY WHVIPDAAAR SVAFETGKID VLPGGSVENF DVPRLSKLKN VCVTGAGWEF FSPHSWLWLN NRSGPTANTK FRQALMFAID RNFARDVIWN GLGKVATGPS GSAIKYYTSA VPKYDLAPAK AKALLKEAGY KGEKVRMLPL PYGETWQRWA EAVKQNLQDV GINVEMIATD VPGWNQKVSD WDYDIAFTYL YQYGDPALGV GRNYVSSQIA KGSPFNNVEG YSNPEVDKLF AEGAVAFPDT KRDEIYAKAQ KILVEDVPVA WLLELQFPTI TRCNVKNLVT TAIGVNDGFR DAWLDK
|
| |
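The relationship between the gene sequence and the protein sequence can be sketched in code. The example below is a hedged illustration, not the pipeline used to produce this record. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It assumes the gene sequence is already listed in coding orientation (it begins with ATG and ends with TAA), so no reverse complement is taken despite the minus strand. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, so a standard codon table is used here.

```python
BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order ('*' marks a stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spacing between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
prefix = "ATGCTGAAGC GCACTTTGAT TGGGCTTGCC"
print(translate(prefix))
print(round(gc_fraction(prefix), 3))
```

Passing the full gene sequence to `translate` and `gc_fraction` instead of the 30-base prefix would allow a comparison against the listed protein sequence and GC content.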