Gene RPD_3151 details

Gene Information

Locus tag: RPD_3151
Symbol:
ID: 4023656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3503196
End bp: 3504476
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 64%
IMG OID: 637963352
Product: extracellular solute-binding protein
Protein accession: YP_570278
Protein GI: 91977619
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
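
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the stop codon while the protein length does not.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3503196, 3504476)
print(length, protein_length_aa(length))  # prints: 1281 426
```

Under those assumptions the result agrees with the Gene Length (1281 bp) and Protein Length (426 aa) fields of this record.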


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.345086
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTACGAA ACTGGATGGT GGCGGCGGCC TTCTCGCTCG CCGCCGGCGT TGCCCATGCG 
CAGACGCAGA CCGAAGTCGT GCTGCAATAT CCCTATCCCG AGCTGTTCAC CGAGACCCAC
AAGCAGATCG CCGCCGAATT CGCCAAGGTG CGCCCGGAGA TCAAGGTCAC GCTGCGCGCG
CCCTATGAAT CCTACGAGGA AGGCACCCAG AAGATTCTGC GCGAGTCCGT CACCAATCAG
CTCCCCGACG TCACCTTCCA GGGCCTGAAC CGCGTTCGCG TGCTGGTCGA CAAGAACATT
CCGGCCGAGC TCGACGGCTA CATCGCCGCC GAAAAGGACT TCGACAAGCA GGGCTTCCAT
CAGGCGATGT ACGACATCGG CACCGCCAGC GGAAAGGTCT ACGCGCTGCC GTTCGCGATC
TCGCTGCCGA TCGTCTACGT CAACCTCGAT CTGGTGAAAC AGGCCGGCGG CGATCCGAGC
AATCTGCCGA CGAGCTGGGA CGGCCTGATC GATCTGGCCA AGAAGATCAA GGCGCTGGGT
CCGGAAACCA ATGGCATCAC CTATGCCTGG GACATCACCG GCAACTGGCT GTGGCAGGCG
CCGGTGTTCT CCCGCGGCGG CAGCATGCTG AACGCCGACG AGACCAAGGT GGCGTTCGAC
GGCCCGGAAG GCCAGTTCGC GATGAAACAG ATCGCTCGCC TGGTCACCGA GGGCGGCATG
CCGAACCTCG ACCAGCCGTC GATGCGCGCG ACCTTCGCGG CAGGCAAGAC CGGCATCCAC
ATCACCTCGA CCTCGGACCT CAACAAGACC ACGCAGATGA TCGCCGGCAA GTTCGCGCTG
AAGACCCACA CCTTCCCGGA CGTGCTGAAA CCGAACGGCC GGCTGCCGGC CGGCGGCAAC
GTGGTGCTGA TCACCGCCAA GGACAAGGCC AAGCGTGACG CCGCCTGGGA GGTCGTGAAG
TTCTGGACCG GGCCGAAGGG CGCCGCGATC ATGGCGGAGA CCACCGGCTA CATGCCGCCG
AACAAGCTCG CCAACGACGT CTATCTGAAG GACTTCTACG CGAAGAATCC GAACAACTAC
ACCGCGGTCA GCCAACTCGC CCTGCTGACC AAATGGTACG CGTTCCCCGG CGACAACGGC
CTGAAGATCA CCGACGTGAT CAAGGACCAC CTCAACTCGA TCGTCAACGG CGCGCGGGCC
AAGGAGCCCG AGGCGGTGCT CGCCGACATG ACGAAGGACG TCCAGAAACT GCTGCCGAAA
TCGGTCGGCG CCGCGCGCTG A
 
Protein sequence
MLRNWMVAAA FSLAAGVAHA QTQTEVVLQY PYPELFTETH KQIAAEFAKV RPEIKVTLRA 
PYESYEEGTQ KILRESVTNQ LPDVTFQGLN RVRVLVDKNI PAELDGYIAA EKDFDKQGFH
QAMYDIGTAS GKVYALPFAI SLPIVYVNLD LVKQAGGDPS NLPTSWDGLI DLAKKIKALG
PETNGITYAW DITGNWLWQA PVFSRGGSML NADETKVAFD GPEGQFAMKQ IARLVTEGGM
PNLDQPSMRA TFAAGKTGIH ITSTSDLNKT TQMIAGKFAL KTHTFPDVLK PNGRLPAGGN
VVLITAKDKA KRDAAWEVVK FWTGPKGAAI MAETTGYMPP NKLANDVYLK DFYAKNPNNY
TAVSQLALLT KWYAFPGDNG LKITDVIKDH LNSIVNGARA KEPEAVLADM TKDVQKLLPK
SVGAAR
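
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal Python sketch of both steps, not the pipeline that produced this record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It assumes the sequence contains only A, C, G and T, and it ignores the alternative start codons of table 11, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. The standard code and
# translation table 11 share these assignments; they differ in start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    bases = clean(seq)
    protein = []
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from this record.
first_line = "ATGCTACGAA ACTGGATGGT GGCGGCGGCC TTCTCGCTCG CCGCCGGCGT TGCCCATGCG"
print(translate(first_line))  # prints: MLRNWMVAAAFSLAAGVAHA
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.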