Gene RPB_4673 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4673 
Symbol 
ID3912491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5286022 
End bp5287047 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content64% 
IMG OID637886578 
Productextracellular solute-binding protein 
Protein accessionYP_488267 
Protein GI86751771 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID[TIGR01096] lysine-arginine-ornithine-binding periplasmic protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.562067 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATTCC GCAAGGGCCT GCTGATCGGC CTCGGCGTCG CCGCCGTGGT CGCCGCCGCC 
GCCGTCACCT ACGAGCGCTA CGACACCAAG ACGCTGAAGC GCACGATCCG CCGCGACGCC
GTGCTGTGCG GCGTCAACAA GGGCCTGCCG GGCTTCTCGA CGCCGGACGA AAAAGGCAAC
TGGACCGGTT TCGACGTCGA TTTCTGCCGC GCGGTGGCGG CGGCGATCTT CAACGATCCG
AACAAGGTGA AATTCGTGCC GCTCGACGCC AACGAGCGCT TCAAGGAATT GCAGAGCCGC
AAGGTCGACA TCCTGTCGCG CAATTCGACC TGGAGCATGT CGCGCGAGGC CGGCTACGAA
TTGTACTTCC CGGCGGTCGC CTATTACGAC GGCGAGGGCT TCATGGTGCC GTCGTCGCGC
AAGATCGAGA CCGCGCTCGA ACTCGACGGC AGCAAGGTCT GCGTCCAGGC CGGCACCACC
ACTTTGCTCA ACCTTGCCGA CTACTTCCGC GCCAACAACA TGAAGTATCA GGAGGTCAAG
TTCCCCAAGC TCGACGAGGT GGTCGCCGCC TACAACAGCG GACAATGCGA CACCTTCTCC
GCCGACGCCT CGCAGCTCTA TGCGCTGCGC CAGACGCTCG CCAAACCGGG CGATCACGTC
ATCCTGCCGG ACCTGATCTC CAAGGAGCCG CTGGCGCCGG TGGTCCGCCA GCGCGACGAC
GAATGGATGA TGATCGTGAA ATGGTCGCTC TACGCGATGA TCAACGCCGA GGAACTCGGC
GTCACCTCGG CGAATATCGA CGAGGCGCTG AAGTCGAAGA AGCCGGACGT GATGCGGCTG
GTCGGCACCG AGGGCGCCTA TGGCGAGGAG CTCGGCCTCA GCAAGGACTG GGCGGCGCGG
ATCATCCGCC ACGTCGGCAA TTACGGCGAG GTCTACGACC GTAATGTCGG CAAGCTCGGC
ATCCCGCGTG GCCTGAACCA GCTCTGGAGC GCCGGCGGCA TCCAATACGC GCCGCCGATC
AGGTAG
 
Protein sequence
MSFRKGLLIG LGVAAVVAAA AVTYERYDTK TLKRTIRRDA VLCGVNKGLP GFSTPDEKGN 
WTGFDVDFCR AVAAAIFNDP NKVKFVPLDA NERFKELQSR KVDILSRNST WSMSREAGYE
LYFPAVAYYD GEGFMVPSSR KIETALELDG SKVCVQAGTT TLLNLADYFR ANNMKYQEVK
FPKLDEVVAA YNSGQCDTFS ADASQLYALR QTLAKPGDHV ILPDLISKEP LAPVVRQRDD
EWMMIVKWSL YAMINAEELG VTSANIDEAL KSKKPDVMRL VGTEGAYGEE LGLSKDWAAR
IIRHVGNYGE VYDRNVGKLG IPRGLNQLWS AGGIQYAPPI R