Gene RPD_4063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4063 
Symbol 
ID4024580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4515295 
End bp4517094 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content63% 
IMG OID637964266 
Productextracellular solute-binding protein 
Protein accessionYP_571183 
Protein GI91978524 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.159076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.524387 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGACT TGACTAGACG GAAGACCCAG CCCCGCCGGC TGCGGCTGAT GACGATGGCG 
AGCGCCGCCG CGCTGATGGC GGCGTCGATG ACGCTGGCGG CGCCGGCGTG GGCCGCCGAT
GAGGCGGCGC TGAAGAAATG GATCGACGAG GAGTTTCAAC CTTCGACGTT GTCGAAGGAG
GAGCAGCTCA AGGAATTGCA GTGGTTCGCA AAGGCGGCCG AGCCGTTCAA GGGGATGGAT
ATCAACGTCG TCTCCGAGAC GATCACCACG CACGAATACG AAGCCAAGAC GCTCGCCAAG
GCGTTCTCCG AGATCACCGG CATCAAGCTG AAGCACGATC TGATCCAGGA AGGCGACGTC
GTCGAGAAGC TGCAGACCCA GATGCAGTCC GGCAAGAACG TCTATGACGG CTGGATCAAC
GATTCGGACC TGATCGGCAC GCATTTCCGC TACGGCCAGG CCATCGCGCT GTCGGACTAC
ATGGCCGGCG AGGGCAAGGC CGTCACTCTG CCGACCCTCG ACATCGAGGA CTTTATCGGC
CGGTCGTTCA CCACGGCGCC CGACAAGAAG ATGTACCAAC TGCCGGACCA GCAGTTCGCG
AACCTTTACT GGTTCCGGTA CGACTGGTTC ACCAACGCGG ACTACAAGGC CAGGTTCAAG
GCGAAATACG GCTATGAGCT GGGCGTGCCG GTGAACTGGT CGGCCTATGA GGACATCGCG
GAATTCTTCA CCAACGACGT CAAGGAGATC GACGGCGTCA AGGTCTATGG CCACATGGAC
TACGGCAAGA AGGACCCGTC GCTCGGCTGG CGCTTCACCG ACGCCTGGCT GTCGATGGCC
GGCAATGGCG ACAAGGGCCT CCCGAACGGC CGGCCGGTCG ACGAATGGGG CGTCCGGATG
GAAGGCTGCC GTCCGGTCGG CTCCTCGGTG GAGCGCGGCG GCGACACCAA CGGTCCCGCA
TCGGTCTATT CGATCGTGAA GTATCTCGAC TGGATGAAGA AGTATGCCCC GCCGCAGGCG
CAGGGCATGA CCTTCTCCGA GTCGGGCCCG GTGCCGGCGC AGGGCAACGT CGCCCAGCAG
ATGTTCTGGT ACACCGCCTT CACCGCCGAC ATGGTGAAGC CCGGGCTGCC GGTGGTGAAC
GCCGACGGCA CGCCGAAGTG GCGGATGGCG CCTTCGCCGA AGGGCGCCTA TTGGAAGGAA
GGCATGAAGC TCGGCTATCA GGACGTCGGT TCCGGCACGC TCTTGAAGTC GACCCCGGCG
GATCGCCGCA AGGCGGCGTG GCTGTATCTG CAGTTCATCA CCTCGAAGAC GGTGAGCCTG
AAGAAGAGCC ATGTCGGTCT CACCTTCATC CGTGAGAGCG ATATCTGGGA CAAGTCGTTT
ACGGAACGCG CACCGAAGCT CGGTGGTCTG ATCGAGTTCT ATCGCTCGCC GGCCCGCGTG
CAATGGTCGC CGACCGGCAA CAACATCCCG GACTATCCGA AGCTGGCGCA GCTTTGGTGG
CAGAACATCG GCGATGCCTC CTCCGGCGCC AAGACCGCGC AGGCGGCGAT GGACTCGCTG
GCGGCGGCGC AGGACTCGGT GCTCGAACGC CTCGAGAAGT CGAAGGTGCA GGGCGACTGC
GGTCCGAAGC TGAACAAGAA GGAGACCGCC GAGTACTGGT ACGCGAAGTC GGAAAAGGAC
GGCAACATCG CTCCGCAGCG CAAGCTCGCC AACGAAAAGC CGAAGGGTGA AACCGTCGAC
TACGACACCC TGATCAAGTC CTGGCCGGCC TCGCCACCGA AGCGCGCCGA AGCGAAGTAA
 
Protein sequence
MRDLTRRKTQ PRRLRLMTMA SAAALMAASM TLAAPAWAAD EAALKKWIDE EFQPSTLSKE 
EQLKELQWFA KAAEPFKGMD INVVSETITT HEYEAKTLAK AFSEITGIKL KHDLIQEGDV
VEKLQTQMQS GKNVYDGWIN DSDLIGTHFR YGQAIALSDY MAGEGKAVTL PTLDIEDFIG
RSFTTAPDKK MYQLPDQQFA NLYWFRYDWF TNADYKARFK AKYGYELGVP VNWSAYEDIA
EFFTNDVKEI DGVKVYGHMD YGKKDPSLGW RFTDAWLSMA GNGDKGLPNG RPVDEWGVRM
EGCRPVGSSV ERGGDTNGPA SVYSIVKYLD WMKKYAPPQA QGMTFSESGP VPAQGNVAQQ
MFWYTAFTAD MVKPGLPVVN ADGTPKWRMA PSPKGAYWKE GMKLGYQDVG SGTLLKSTPA
DRRKAAWLYL QFITSKTVSL KKSHVGLTFI RESDIWDKSF TERAPKLGGL IEFYRSPARV
QWSPTGNNIP DYPKLAQLWW QNIGDASSGA KTAQAAMDSL AAAQDSVLER LEKSKVQGDC
GPKLNKKETA EYWYAKSEKD GNIAPQRKLA NEKPKGETVD YDTLIKSWPA SPPKRAEAK