Gene RPD_1032 details


Gene Information

Locus tag: RPD_1032
Symbol:
ID: 4021508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1182373
End bp: 1183398
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 63%
IMG OID: 637961224
Product: extracellular solute-binding protein
Protein accession: YP_568171
Protein GI: 91975512
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
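
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that Gene Length counts the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1182373, 1183398)
print(length)                     # 1026
print(protein_length_aa(length))  # 341
```

Both results agree with the Gene Length and Protein Length fields of this record.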


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.284122
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCGCA AAATCATATC AGGCGTCGTG GCGCTGGCGG TTGTGGGCAC CGCGTCGGCC 
CTCGTCGCGC AACCGGCGTC GATCACCATC GTCAATCCGG GCGGCCCCTA TCTCGAGGCG
ACGCTGGAGG CCTGGGGCAA GACCTTCACC GAGAAGACCA ACATCAAGGT GAAGGGCGAC
TCCCCGCAGA GCCTGCCGAA GATCCAGCAG ATGGTCGGCG CCAAGAACGT GTCGTGGGAT
GTCGTCGAAG TGCCGCCGGT GTTCACGATG CGCCATTGCG GCACGCTGTT CGAAAAGCTC
ACGCCCGGTC TCGTCGACGC GGCGCGCGTC AATGCGGGAT TCGGCAATGA ATGCGGCGTG
CCGGATGCGG GCTACGCCAA CATCATGCTC TACAACAAGA CCAAGTTCGC AAAGGGCGGC
CCGCAGAACT GGGCCGACTT CTTCGACGTC AAGAAGTTTC CCGGCAAGCG CGGCCTGTGG
GACGGCGCCG AGGGCGTGAA TCTCGAAATC GCGCTGCTGG CCGACGGCGT TGCTCCGGAG
AACCTCTATC CGCTCGACCT GGATCGCGCG TTCCGCAAGC TCAGCGAACT CAAGCCGCAT
ATCGTGTTCT GGCGAACCGG CGCGCAATCC ACCCAGATGA TGGAAAGCGG CGAAGTCGAC
ATGATCATGG CGTGGTCGTC GCGCGCCTAT CCGGCGCTGA AGAACGGCGC GCCGTTCGAG
CCGGTGTGGA ACCAGCACAT CATCTACAAC AACGTTCTGG CGATTCCGAT GGGCGCGCCG
AACAAGGCCG CCAGCGAAGC CTATATTCGC CATGCGCTCG AGGACAAACA GCAGGCCCGC
ATCACCGAGC TCTATCCGGT CACGCCCGCG CTGATCGGCG CCGCTCCGAA GCTCGACGAG
GCAGGCATGA AAGTCTTCGC CGGCACGCCT GAGCGGGCCA AGACCGCGAT CCGCCTCAAT
CTGAAATGGG TCGCCGACAA TTCCGAAGTG ATCCAGAAGC GCTGGATCGA GTGGTTGAAC
TCCTGA
 
Protein sequence
MSRKIISGVV ALAVVGTASA LVAQPASITI VNPGGPYLEA TLEAWGKTFT EKTNIKVKGD 
SPQSLPKIQQ MVGAKNVSWD VVEVPPVFTM RHCGTLFEKL TPGLVDAARV NAGFGNECGV
PDAGYANIML YNKTKFAKGG PQNWADFFDV KKFPGKRGLW DGAEGVNLEI ALLADGVAPE
NLYPLDLDRA FRKLSELKPH IVFWRTGAQS TQMMESGEVD MIMAWSSRAY PALKNGAPFE
PVWNQHIIYN NVLAIPMGAP NKAASEAYIR HALEDKQQAR ITELYPVTPA LIGAAPKLDE
AGMKVFAGTP ERAKTAIRLN LKWVADNSEV IQKRWIEWLN S