Gene RPB_4242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4242 
Symbol 
ID3912050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4819161 
End bp4820180 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content58% 
IMG OID637886144 
Productputative phosphate-binding periplasmic protein 
Protein accessionYP_487843 
Protein GI86751347 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.391821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGTCTC TCTTGAGAAT CGACATTATT TCGGTCGCGG TGGCCTGCGT CGCGTTCGCC 
GCCGCATCAG TCGGTCCGGC CGAGGCCCGC GATCAGCTAT GGATCGCCGC TTCACCCTCC
GACCAGAGCT TCGCCAAGGC TGTGTCCGTG CAATTCGGCA GAGCCGGCAG GTTCAAGACA
CCCATCGTGA AAGACGGAGG CCCGCCAGCC GGGTTAATGT CGTTCTGCCG CGGCGTCGGC
CCCGACAATT TCGATATCGC TTTCTCCTCG CGCCGGATCG CTTCTTCCGA AGTCGAGCTC
TGCAATAAGA ACGGCGTCAA AGACATTACT CAAGTGCAGT TTGGCTATGA TGCCCTTGTG
TTTGTCACCA ATAAGGCGAG CCAGACCCCG GCCCTCTCGC GCACGGCTGT CTACCTTGCG
ATCGCGCGGG ACGTTCCGGA CAAAGGCACG TTGGCGTCAA ATACGAGCAA GCCGGCCAAT
ACCCTCTACA TTCCGAGCGC TAACCACGGT GCGCGCGACG TGTTCGACGA AATGCTGATG
GTTTCGTCGT GCACGTCCAC CGGGGCCTAC GCGATTATCC AGAAAACCAA TCCAGACAAA
TCCAAAGTAG CGGCGCAATG TCGGGCAGCG CGGCAAGCCG CGAATGTGGT CAATATGGAT
AGCGACAGCG GCACACTCGC TCGTCTTCAA TCCGATCCCA AGGGCGTCGG CGTGGTCACC
TGGTCTTTCT ACACGAACAA CGACGACAAG TTGAAAGTTG TCGCTTTGGA CGGCGTCGTC
CCGTCGAAAG CGACGGTGGC TTCAGGCACA TTCCCGATAG CGTACCCTCT CTATTTGTAT
GTGAAAAAGG CTCAAATCGG CCAGATCCCG GGGATTAAGG AGTGGATCGC CGAATTCACG
AGTGAAAATG CGTTCGGCCC TGACGGCTAT CTTGGGGATA GCGGCCTCGT CTCAATGCCG
GATGCACAGA GGCGACAGTC GCGTGCTGAT GCCCAGGCCC TCGTATCATA TAAACCTTGA
 
Protein sequence
MGSLLRIDII SVAVACVAFA AASVGPAEAR DQLWIAASPS DQSFAKAVSV QFGRAGRFKT 
PIVKDGGPPA GLMSFCRGVG PDNFDIAFSS RRIASSEVEL CNKNGVKDIT QVQFGYDALV
FVTNKASQTP ALSRTAVYLA IARDVPDKGT LASNTSKPAN TLYIPSANHG ARDVFDEMLM
VSSCTSTGAY AIIQKTNPDK SKVAAQCRAA RQAANVVNMD SDSGTLARLQ SDPKGVGVVT
WSFYTNNDDK LKVVALDGVV PSKATVASGT FPIAYPLYLY VKKAQIGQIP GIKEWIAEFT
SENAFGPDGY LGDSGLVSMP DAQRRQSRAD AQALVSYKP