Gene RPD_2082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2082 
Symbol 
ID4022564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2331650 
End bp2332762 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content62% 
IMG OID637962275 
Productextracellular solute-binding protein 
Protein accessionYP_569218 
Protein GI91976559 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.472813 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.985512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGAAA ATTGCTACCG GCGGCTGCGC TTGGGAGGCG CGGCGATCGT GTTGATCGCG 
GGTCTGAACG CGCCGGCGCT GGCGCAGGAA CGCGTGGTCA ATTTCTACAA CTGGTCGAAC
TATGTCGCCC CGGGCGTGCT GGAAGAGTTC ACGCGCGAGA CCGGAATCAA GGTGGTCTAC
GACACCTTCG ACGGCAACGA GACGCTGGAA GCCAAGCTTC TTGCCGGAAA GTCCGGCTAC
GACGTCGTGG TGCCGACCGC CTATTTCCTG CAACGCCAGA TCGGCGCGAA GGTGTTCCAG
AAGCTGGATG CGTCCAAGCT GCCGAACTTG AAGAACGCCT GGGACGTGGT GACGAAGAAG
CTCGCGCTGT ACGATCCCGG CAATCAATAC GCCGCGAACT ACATGTGGGG CACTACCGGG
ATCGGCTACA ACGTCGCGGC GGTGAAGAAG ATTTTCGGTC CCGATGCGGT GATCGACAGC
TGGGACATCG TTTTCAAGCC TGAGAATCTG GCGAAGCTCA AGGATTGCGG CGTCCAGATG
CTGGACTCGG CGGACGACAT TCTGCCGGCG GCGCTGACCC ATCTCGGCCT CGACCCCAAC
TCGACCAAGC AGCCCGATCT GGAGAAGGCC GCCGACGTCG TCGCCAAGGT GCGGCCGTCA
GTCCGCAAGT TTCACTCGTC CGAATACCTC AACGCGCTCG CCACCGGCGA GATTTGCCTC
GTGGTCGGCT GGTCCGGCGA CATCAAGCAG GCGCAGTCGC GTGCGGCGGA GGCCAAGAAC
GGTGTTGATA TCCGCTATGC GATCCCGAAG GAGGGCGCGC AGATGTTCTT CGACAATCTG
GTGATCCCGG CCGACGCCAA GAACGTTGCT GAGGCGCACG AGCTGATCAA CTTCCTGTAT
CGCCCGGACA TCGCTGCGCG CAATTCCGAC TTCCTGTCCT ACGCTAACGG CAACAAGGCC
AGCCAGGAAT TCGTCAATGC CCGCGTGCTG AGCGACAAGA CGATCTATCC TGACGAGGCG
ATGCAGGCGC GGCTGTTCGT GATCACGGCG CGCGATCCGG CAATCCAGCG ATCGATCAAC
CGGCTGTGGA CGCGGGTGAA GACGGGACGG TGA
 
Protein sequence
MRENCYRRLR LGGAAIVLIA GLNAPALAQE RVVNFYNWSN YVAPGVLEEF TRETGIKVVY 
DTFDGNETLE AKLLAGKSGY DVVVPTAYFL QRQIGAKVFQ KLDASKLPNL KNAWDVVTKK
LALYDPGNQY AANYMWGTTG IGYNVAAVKK IFGPDAVIDS WDIVFKPENL AKLKDCGVQM
LDSADDILPA ALTHLGLDPN STKQPDLEKA ADVVAKVRPS VRKFHSSEYL NALATGEICL
VVGWSGDIKQ AQSRAAEAKN GVDIRYAIPK EGAQMFFDNL VIPADAKNVA EAHELINFLY
RPDIAARNSD FLSYANGNKA SQEFVNARVL SDKTIYPDEA MQARLFVITA RDPAIQRSIN
RLWTRVKTGR