Gene RPB_4211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4211 
Symbol 
ID3912019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4783668 
End bp4785464 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content63% 
IMG OID637886114 
Productextracellular solute-binding protein 
Protein accessionYP_487813 
Protein GI86751317 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.64235 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAGTT TCGGAAATCG GACTCTCTCC AGCCGCTTGC GGCTGATGAC GATGGCGAGC 
GCCGCGGCAT TGGTCGCCGC GTCGATGACG CTGGCGGCGC CGGCATGGGC CGCCGACGAT
GCGGTGCTGA AGAAGTGGAT CGACGAGGAG TTTCAGCCCT CGACGCTGTC GAAGGACGAC
CAGCTCAAGG AACTGCAATG GTTCGCCAAG GCGGCCGAGC CGTTCAAGGG CATGGACATC
AACGTCGTCT CCGAGACCAT CACCACCCAC GAATACGAGG CGAAGACGCT CGCGAAGGCG
TTCTCGGAGA TCACCGGCAT CAAGCTCAAG CATGATCTGA TCCAGGAAGG CGACGTGGTC
GAGAAGCTGC AGACCCAGAT GCAGTCCGGC AAGAACGTCT ATGACGGCTG GATCAACGAC
AGCGACCTGA TCGGCACGCA TTTCCGCTAC GGCCAGACCA TCGCGCTGTC CGACTACATG
ACCGGCGAGG GCAAGGACGT CACCGATCCG ATGCTCGATA TCGATGACTT CATCGGCAAG
TCGTTCACCA CCGCGCCCGA CAAGAAGATG TACCAGTTGC CCGACCAGCA GTTCGCCAAT
CTGTACTGGT TCCGCTACGA CTGGTTCACC AATCCGGACT ACAAGGCGAA GTTCAAGGCG
AAATACGGCT ACGAGCTCGG CGTCCCGGTC AACTGGTCGG CCTACGAGGA TATCGCCGAG
TTCTTCACCA ACGACATCAA GGAGATCAAC GGCGTCAAGG TCTATGGTCA CATGGACTAC
GGCAAGAAGG ATCCCTCGCT CGGCTGGCGC TTCACCGACG CCTGGCTGTC GATGGCCGGC
AACGGCGACA AGGGCTTGCC GAACGGTCTG CCGGTCGACG AATGGGGCAT CCGCATGGAA
GGCTGCCGTC CGGTCGGCTC CTCGATGGAG CGCGGCGGCG ACACCAACGG TCCCGCGGCG
GTGTACTCCA TCGTGAAGTA TCTCGACTGG ATGAAGAAGT ATGCGCCGCC GCAGGCGCAG
GGCATGACCT TCTCGGAGTC GGGGCCGGTG CCGGCGCAGG GCAACGTCGC CCAGCAGATG
TTCTGGTACA CCGCCTTCAC CGCCGACATG GTGAAGCCCG GCCTGCCGGT GGTGAACGCC
GACGGCACGC CGAAATGGCG GATGGCCCCC TCGCCGAAGG GCGCCTATTG GAAAGACGGC
ATGAAGCTCG GCTATCAGGA CGTCGGCTCC GGCACGCTCT TGAAGTCGAC GCCGCCGGAT
CGCCGCAAGG CGGCCTGGCT GTATCTGCAG TTCATCACCT CGAAGACGGT GAGCCTGAAG
AAGAGCCATG TCGGTCTCAC CTTCATCCGC GAGAGCGATA TCTGGGACAA ATCGTTCACC
GAGCGCGCGC CCAAGCTCGG TGGCCTGATC GAGTTCTATC GCTCGCCGGC CCGCGTGCAG
TGGTCGCCGA CCGGCAACAA CATCCCGGAC TATCCGAAGC TGGCGCAATT GTGGTGGCAG
AACATCGGCG ATGCATCGTC CGGTGCGAAG ACGCCGCAGG CCGCCATGGA TTCGCTGGCG
GCCGCGCAGG ACTCGGTGCT CGAGCGGCTC GAGCGGTCGA AGGTGCAGGG TGATTGCGGT
CCGAAGCTGA ACAAGAAAGA GACCGCCGAG TACTGGTACG AGAAGTCCGC CAAGGACGGC
AACATCGCTC CGCAGCGCAA GCTGGCGAAC GAGAAGCCGA AGGGTGAGAC CGTCGATTAC
GACACCCTGA TCAAGTCCTG GCCCGCCTCG CCGCCGAAGC GCGCGGAGGC GAAGTAA
 
Protein sequence
MHSFGNRTLS SRLRLMTMAS AAALVAASMT LAAPAWAADD AVLKKWIDEE FQPSTLSKDD 
QLKELQWFAK AAEPFKGMDI NVVSETITTH EYEAKTLAKA FSEITGIKLK HDLIQEGDVV
EKLQTQMQSG KNVYDGWIND SDLIGTHFRY GQTIALSDYM TGEGKDVTDP MLDIDDFIGK
SFTTAPDKKM YQLPDQQFAN LYWFRYDWFT NPDYKAKFKA KYGYELGVPV NWSAYEDIAE
FFTNDIKEIN GVKVYGHMDY GKKDPSLGWR FTDAWLSMAG NGDKGLPNGL PVDEWGIRME
GCRPVGSSME RGGDTNGPAA VYSIVKYLDW MKKYAPPQAQ GMTFSESGPV PAQGNVAQQM
FWYTAFTADM VKPGLPVVNA DGTPKWRMAP SPKGAYWKDG MKLGYQDVGS GTLLKSTPPD
RRKAAWLYLQ FITSKTVSLK KSHVGLTFIR ESDIWDKSFT ERAPKLGGLI EFYRSPARVQ
WSPTGNNIPD YPKLAQLWWQ NIGDASSGAK TPQAAMDSLA AAQDSVLERL ERSKVQGDCG
PKLNKKETAE YWYEKSAKDG NIAPQRKLAN EKPKGETVDY DTLIKSWPAS PPKRAEAK