Gene RPC_4923 details

Gene Information

Locus tag: RPC_4923
Symbol:
ID: 3973806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 5492862
End bp: 5493872
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 63%
IMG OID: 637928036
Product: periplasmic phosphate binding protein
Protein accession: YP_534764
Protein GI: 90426394
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0226] ABC-type phosphate transport system, periplasmic component
TIGRFAM ID: [TIGR00975] phosphate ABC transporter, phosphate-binding protein
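
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon. The function names are illustrative only and do not come from the database.

```python
# Coordinates copied from the record above.
start_bp = 5492862
end_bp = 5493872


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1011 336
```

Under these assumptions the result matches the listed gene length of 1011 bp and protein length of 336 aa.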


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.205155
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.509707
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTTCT TCAAGGCAAT CGTCGCCGCC GGCCTGGTTG CCGCGTCGAC GTCGGCGTTC 
GCCGCCGATA TTACCGGTGC CGGGGCCACG TTCCCGTTCC CGGTCTATTC GAAGTGGGCC
GACCTCTACA AAAAAGAGAC CGGCAACGGG CTGAACTATC AGTCGATCGG CTCCGGCGCC
GGCATCAAGC AGATCCAGGC CAAGACCGTG ACCTTCGGCG CCACCGACGC GCCGTTGAAG
GCCGAGCAGC TCGAGAAGGA CGGCCTGGTG CAATGGCCGC AGGTGATGGG CGCGATCGTG
CCGGTGGTGA ACCTCGAAGG CATCAAGCCG GGTGAACTGG TGTTCGATGG CGAGACCCTG
GCCAATATCT ATCTCGGCAA GATCACCAAG TGGAACGATC CGGCGATCGC CAAGCTCAAT
CCGAAGCTGA AGCTGCCGAC CGACGCCATC ACCGTGGTGC GCCGCTCCGA CGGTTCGGGC
ACCACCTTCA ACTTCACCGA CTATCTGTCC AAGGCCAGCG CCGACTGGAA GACAAAGGTC
GGATCCGGCA CCGCGGTCGA ATGGCCGGTC GGCGTCGGCG CCAAGGGCAA TGAAGGCGTT
GCCGGCAACA TCAGCCAGAC CAAGAATTCG ATCGGCTATG TCGAATACGC CTATGCCAAG
CAGAACAAGC TGACCTACGC CGGGCTGATC AACAAGGCCG GCAAGACCGT GCAGCCGACC
GTCGCCGCGT TCCAGGCCGC CGCCTCCAAT GCGGATTGGG CCAAGGCGCC CGGCTACTAC
CTGATCCTGA CCGACCAGCC CGGCGAAGCC TCCTGGCCGA TCACCGCGGC GACGTTCATC
TTGATGCACA AGGAGCCGGC CGACAAGGCG GCCTCCGCGG AAGCCATCAA GTTCTTCAAA
TGGGCGTTCG AGAAGGGCGA CAAGGCGGCG GAAGAGCTCG ACTACATCCC GATGCCCGCC
GCGGTCGTCA AGCAGATCGA GAAGACCTGG TCGGCCGACA TCAAGAGCTA A
 
Protein sequence
MNFFKAIVAA GLVAASTSAF AADITGAGAT FPFPVYSKWA DLYKKETGNG LNYQSIGSGA 
GIKQIQAKTV TFGATDAPLK AEQLEKDGLV QWPQVMGAIV PVVNLEGIKP GELVFDGETL
ANIYLGKITK WNDPAIAKLN PKLKLPTDAI TVVRRSDGSG TTFNFTDYLS KASADWKTKV
GSGTAVEWPV GVGAKGNEGV AGNISQTKNS IGYVEYAYAK QNKLTYAGLI NKAGKTVQPT
VAAFQAAASN ADWAKAPGYY LILTDQPGEA SWPITAATFI LMHKEPADKA ASAEAIKFFK
WAFEKGDKAA EELDYIPMPA AVVKQIEKTW SADIKS
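
The relationship between the gene sequence and the protein sequence, and the GC content figure, can be reproduced with a short script. This is a minimal sketch, not the database's own method. It builds the codon-to-amino-acid mapping in the usual NCBI base order (T, C, A, G), strips the spaces and line breaks used in the listing, and translates until the first stop codon. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. Translation table 11 also permits alternative start codons, which this sketch does not handle. That does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the listing.
prefix = "ATGAATTTCT TCAAGGCAAT CGTCGCCGCC"
print(translate(prefix))  # MNFFKAIVAA
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows the listed protein sequence and GC content to be checked.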