Gene RPD_4382 details


Gene Information

Locus tag: RPD_4382
Symbol:
ID: 4024907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4847347
End bp: 4848372
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 63%
IMG OID: 637964592
Product: extracellular solute-binding protein
Protein accession: YP_571500
Protein GI: 91978841
COG category: [E] Amino acid transport and metabolism
              [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
TIGRFAM ID: [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein
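
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following minimal sketch shows the arithmetic, assuming inclusive 1-based coordinates and a complete CDS whose final codon is a stop codon that is not counted as a residue. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4847347, 4848372)
print(length_bp)                  # 1026
print(protein_length(length_bp))  # 341
```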


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.98504
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATTTC GAAAGGGCCT GCTGATCGGC CTCGGCTTCG CTGCCGTGAT TGCCGCCGCC 
GCCGTGACCT ATGAGCGCTA CGACACCAAG ACGCTGAAGC GCACGATCCG GCGCGACGCC
GTGCTGTGCG GCGTAAACAA GGGCCTGCCC GGCTTCTCGA CGCCGGACGA CAAGGGCAAT
TGGAGCGGCT TCGACGTCGA CTTCTGTCGC GCTGTGGCGG CTGCAATCTT CAACGATCCG
AACAAGGTCA AGTTCGTGCC GCTCGACGCC AACGAGCGCT TCAAGGAATT GCAGAGCCGC
AAGGTCGACA TCCTGTCGCG CAACTCGACC TGGAGCATGT CGCGCGAGAC CGGTTACGAA
CTCTATTTCC CGGCGGTCGC CTATTACGAC GGTCAGGGCT TCATGGCGCC GGCGGCGCGC
AAGGTCGAGA CCGCGCTCGA ACTCGACGGC AGCAAGGTCT GCGTCCAGGA GGGCACCACC
ACGCTGCTCA ACCTCGCCGA CTTCTTCCGC ACCAACAACA TGAAGTATCA GGAGGTCAAG
TTCGGCAAGC TCGACGAGGT GGTGAGCGCC TACAAGAACG GCCAGTGCGA CACCTTCACC
GCCGACGCCT CCCAGCTCTA TGCGCTGCGG CAGACGCTCG ACAAGCCGGG CGATCACGTC
ATCCTGCCGG ACCTGATCTC CAAGGAGCCG CTCGCGCCGG TGGTCCGCCA GCGCGACGAC
GACTGGATGA TGATCGTGAA ATGGACGCTG TACGCGATGA TCAACGCGGA AGAGCTCGGC
ATCACCTCGA TCAACATCGA CGAGGCGCTG AAGTCCAAGA AGCCCGACGT GATGCGGCTG
GTCGGCACCG AGGGCACCTA TGGCGAAGAA CTCGGCCTGC CCAAAGACTG GGCGGCGCGG
ATCATCCGCC ACGTCGGCAA TTACGGCGAG ATCTACGATC GCAATGTCGG CAAGCTCGGC
ATCCCGCGCG GCCTGAACCA GCTCTGGAAC GCCGGCGGCA TCCAATACGC GCCGCCGATC
AGGTAG
 
Protein sequence
MSFRKGLLIG LGFAAVIAAA AVTYERYDTK TLKRTIRRDA VLCGVNKGLP GFSTPDDKGN 
WSGFDVDFCR AVAAAIFNDP NKVKFVPLDA NERFKELQSR KVDILSRNST WSMSRETGYE
LYFPAVAYYD GQGFMAPAAR KVETALELDG SKVCVQEGTT TLLNLADFFR TNNMKYQEVK
FGKLDEVVSA YKNGQCDTFT ADASQLYALR QTLDKPGDHV ILPDLISKEP LAPVVRQRDD
DWMMIVKWTL YAMINAEELG ITSINIDEAL KSKKPDVMRL VGTEGTYGEE LGLPKDWAAR
IIRHVGNYGE IYDRNVGKLG IPRGLNQLWN AGGIQYAPPI R
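
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hand-rolled translator rather than the database's own tooling. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts). For brevity it translates only the first 60 bases and the last 18 bases of the gene sequence; `translate`, `first_60`, and `last_18` are illustrative names.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 60 bases of the gene sequence, copied with their spacing.
first_60 = "ATGTCATTTC GAAAGGGCCT GCTGATCGGC CTCGGCTTCG CTGCCGTGAT TGCCGCCGCC"
# Last 18 bases of the gene sequence (in frame), ending in the stop codon.
last_18 = "GCGCCGCCGATC AGGTAG"

print(translate(first_60))  # MSFRKGLLIGLGFAAVIAAA
print(translate(last_18))   # APPIR*
```

The two outputs match the first 20 residues and the last 5 residues of the protein sequence, with the trailing `*` marking the TAG stop codon.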