Gene RPB_1651 details


Gene Information

Locus tag: RPB_1651
Symbol:
ID: 3909928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1880444
End bp: 1881994
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 62%
IMG OID: 637883545
Product: extracellular solute-binding protein
Protein accession: YP_485270
Protein GI: 86748774
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
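
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 1880444
end_bp = 1881994
gene_length_bp = 1551
protein_length_aa = 516

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Three bases per codon; assumption: the last codon is a stop codon
# and is not counted in the protein length.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
print(span == gene_length_bp, expected_protein_length == protein_length_aa)
```

Under these assumptions the span equals the reported gene length, and the codon count minus one equals the reported protein length.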


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAGC GCACTTTGAT TGGGCTTGCC CTGGTCTGCG GTCTGAACGC CGTGGCCGTG 
GCGGAGGAGC CGAAAGTAGG CGGCGTCGTG AATGCCGTGA TTCAGCCCGA GCCTCCGGGC
CTGATGCTGG GTCTGGTCCA GAACGGCCCG ACCCAGATGA TCTCCGGCAA CATCTATGAT
GGCCTGCTAC GCTATGGTCC GAAACTCGAG CCGCAGCCCG GCCTCGCCGA GAGCTGGACG
GTGAGCGAGG ACGGCAAGGT CTACACCTTC AAGCTCAGGA GCGGCGTGAC CTGGCACGAC
GGCAAACCGT TCACGTCCGC CGACGTGCTG TTCTCGATCG AATTCCTCAA GCAGACCCAC
GCCCGCGCCC GTGGCAACCT CGCCACGCTC GACAAGGTGG AGGCGCCCGA CGCCTCGACC
GTCGTGTTCA CGCTGAAGGA GCCGTTCGGG CCGTTCCTCG GCATCTTCGA AGTCGGCTCG
ATGCCGATGA TTCCGAAGCA CATCTACGAG GGAACGGATT TCAAGGCCAA TCCCGCCAAC
AACACGCCGA TCGGCACCGG CCCCTACATG TTCAAGGAAT GGCAGAAGGG CTCGTTCATC
CGGCTGGTGA AGAATCCGAA CTACTACATC AAGGGCAAGC CCCACATCGA CGAGATCTAC
TGGCACGTCA TTCCCGACGC GGCGGCGCGG TCGGTGGCGT TCGAGACCGG CAAGATCGAC
GTGCTGCCCG GCGGCTCGGT GGAAAACTTC GACGTACCGC GCCTGAGCAA ACTGAAGAAC
GTCTGCGTCA CCGGCGCCGG CTGGGAATTC TTCAGCCCGC ATTCCTGGCT GTGGCTCAAC
AACCGCTCCG GGCCGACGGC GAACACGAAA TTCCGTCAGG CGCTGATGTT CGCCATCGAC
CGCAATTTTG CGCGGGACGT GATCTGGAAC GGCCTCGGCA AGGTCGCGAC GGGCCCCTCC
GGCTCCGCGA TCAAATACTA TACCAGCGCC GTGCCGAAAT ACGATCTCGC CCCCGCGAAG
GCCAAGGCCC TGCTCAAGGA AGCCGGCTAC AAAGGCGAGA AAGTCCGGAT GCTGCCGCTG
CCCTACGGCG AAACCTGGCA GCGCTGGGCC GAAGCGGTGA AGCAGAATCT GCAGGACGTC
GGCATCAATG TCGAAATGAT CGCCACCGAC GTCCCCGGTT GGAACCAGAA AGTCTCCGAC
TGGGACTACG ACATCGCCTT CACCTATCTG TATCAGTACG GCGACCCCGC GCTCGGCGTC
GGCCGCAATT ACGTCAGCAG CCAGATCGCC AAGGGCTCGC CGTTCAACAA CGTCGAAGGT
TACTCGAATC CGGAGGTCGA CAAGCTGTTC GCCGAGGGCG CGGTCGCCTT CCCGGATACC
AAGCGCGACG AGATCTATGC CAAAGCGCAA AAGATCCTGG TCGAAGACGT GCCGGTCGCA
TGGCTGCTCG AGCTGCAATT TCCGACCATC ACCCGCTGCA ATGTCAAGAA CCTCGTCACC
ACCGCGATCG GCGTCAACGA CGGTTTCCGC GACGCCTGGC TCGACAAGTA A
 
Protein sequence
MLKRTLIGLA LVCGLNAVAV AEEPKVGGVV NAVIQPEPPG LMLGLVQNGP TQMISGNIYD 
GLLRYGPKLE PQPGLAESWT VSEDGKVYTF KLRSGVTWHD GKPFTSADVL FSIEFLKQTH
ARARGNLATL DKVEAPDAST VVFTLKEPFG PFLGIFEVGS MPMIPKHIYE GTDFKANPAN
NTPIGTGPYM FKEWQKGSFI RLVKNPNYYI KGKPHIDEIY WHVIPDAAAR SVAFETGKID
VLPGGSVENF DVPRLSKLKN VCVTGAGWEF FSPHSWLWLN NRSGPTANTK FRQALMFAID
RNFARDVIWN GLGKVATGPS GSAIKYYTSA VPKYDLAPAK AKALLKEAGY KGEKVRMLPL
PYGETWQRWA EAVKQNLQDV GINVEMIATD VPGWNQKVSD WDYDIAFTYL YQYGDPALGV
GRNYVSSQIA KGSPFNNVEG YSNPEVDKLF AEGAVAFPDT KRDEIYAKAQ KILVEDVPVA
WLLELQFPTI TRCNVKNLVT TAIGVNDGFR DAWLDK
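
As a spot check that the protein sequence corresponds to the gene sequence, the sketch below translates the first row of the gene sequence. The `translate` helper and the `CODON_TABLE` name are illustrative and not part of any database tool. The amino-acid assignments are the standard ones, which translation table 11 shares with table 1 (the two differ in their permitted start codons, not in codon meanings).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any incomplete trailing codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First row of the gene sequence, copied from the record (60 bases, 20 codons).
first_row = "ATGCTGAAGC GCACTTTGAT TGGGCTTGCC CTGGTCTGCG GTCTGAACGC CGTGGCCGTG"
print(translate(first_row))  # MLKRTLIGLALVCGLNAVAV
```

The output matches the first two ten-residue blocks of the protein sequence. Passing the full gene sequence to the same function would allow the whole protein to be compared.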