Gene RPC_3728 details


Gene Information

Locus tag: RPC_3728
Symbol:
ID: 3971473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4148081
End bp: 4149946
Gene Length: 1866 bp
Protein Length: 621 aa
Translation table: 11
GC content: 65%
IMG OID: 637926838
Product: extracellular solute-binding protein
Protein accession: YP_533582
Protein GI: 90425212
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
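
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4148081, 4149946)
print(length_bp)                  # 1866
print(protein_length(length_bp))  # 621
```

Both results agree with the Gene Length and Protein Length fields of this record.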


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAGC TCAATCGCCG GCATCTGCTC GGTCTCGGTG TCGGTGCGGT GGCCGCCGCC 
TCGTTGCGGC CGGCGCTTGC GGCCGAGGGC GGCGAGATCG AGGCGCACGG CATTTCGGCG
TTCGGCGATC TGAAATATCC GGCGGATTTC CATCATTTCG ACTACGTCAA TGTCGACGCG
CCGAAGGGCG GGGTGTTTTC GACCAATCCG TCCTACCGGT CCTTCAACCA GTCGTTCCTG
ACCTTCAACT CGCTCAACGC CTTCATCTTC AAGGGCGACG GCGCGCAAGG CATGGGCCTG
ACTTTTGCGC CGCTGATGGC GCGCGCCGGC GACGAGCCCG ACGCGATGTA CGGCCTGGTT
GCCAAATCGG TGAAGATCTC CGCCGATGGG CTCGGCTATC GCTTCACGCT GCGGCCGGAG
GCGCGGTTTC ATGATGGCTC GAAGCTCACC GCCCACGACG CGGCGTTTTC GCTGACCGTG
CTGAAGACCA AGGGGCATCC CTTGATCACC CAGCAGATGC GCGACGTGGT CTCGGCGGAA
GCGCTCGACG ACTTAACGCT GCTGGTGAGG TTTTCGGCCA AGCGCGGCCG CGACGTGCCG
CTGTTCGTGG CGGGGCTGCC GATCTTTTCG CAGGCCTATT ACGCCAAACA TCCGTTCGAT
GAGTCGACGC TGGAGGCGCC GCTCGGCTCT GGCCCCTACA AGGTCGGCAA GTTCGAAGTC
GGCCGCTACA TCGAATTCGA GCGGCTGCAG GACTGGTGGG GCGCCGAGCT GCCGGTCAAT
CGCGGAGCCA ACAATTTCGA CGTGGTGCGT TACGACTTCT ATCGCGACCG CGACGTCGCC
TTCGAGGGCT TCACCGGCCG CAGCTATCTG TATCGCGAGG AGTTCACCTC GCGGGTCTGG
AACACGCGCT ATGATTTTCC GGCGATCCTC GACGGCCGGG TGAAGCGCGA AACCCTGCCG
GATGAAACGC CCTCCGGGGC GCAGGGCTGG TTTCCCAACA CCCGCCGCGA CAAGTTCAAG
GACCCGCGGG TGCGCGAGGC GCTGGGCTGC GCGTTCGATT TCGAATGGAC CAACAAGACC
CTGATGTACG GCGCCTATCT CCGCACGGTA TCGCCGTTCC AGAACTCCGA CCTGATGGCC
AACGGTCCGC CGTCGCCGGA AGAAGTGGCA TTGCTAGAGC GCTTCCGCGG CCAGGTGCCG
GAGGAGGTGT TCGGCGCGCC CTATGTGCCG CCGGTGTCCG ATGGCTCCGG GCAGGACCGC
GCGCTGTTGA AGAAGGCGGT GCAACTGCTG CAGGACGCCG GCTGCGTGAT CAAGAACGGC
AAGCGGATGA CGCCGCAGGG CGAACCGTTC ACGATCGAGT TTCTGCTCGA CGAGCCGACC
TTTCAGCCGC ACCACATGCC GTTCATCAAG AATCTCGCCA CGCTCGGCAT CGAGGCGTCG
CTGCGCATGG TCGATGCCGT GCAGCATCGC GCCCGGCGCG ACGATTTCGA TTTCGACCTC
ATCATCGAGC GCTTCGGCTT CTCGACGGTG CCGGGCGACT CGCTGCGGCC GTTCTTCTCG
TCGCGCGCGG CGGCCACCAA GGGCTCGAGC AATCTCGCCG GGATCGCCGA TCCGGTGGTC
GATGCGCTGG TCGAAGACGT CATCGCCGCC GACACCAGGG TCAAGCTGGT GGTCGCCGCG
CGCGCGCTCG ACCGCGTGGT CCGCGCCGGC CGCTATTGGG TGCCGCAATG GTATTCGGGC
TCGCATCGGG TGGCCTATTG GGACGTGTTC GGCCATCCGG CGAAACTGCC GAAATATCTC
GGCGTCGCAG CACCCGATCT GTGGTGGTCG ACCGTGAAGT CCGCAGCGAC CGAACAGGCG
AAATAG
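
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any computation such as the GC content reported in the Gene Information section. The following is a minimal sketch; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sketch assumes the sequence contains only unambiguous bases (A, C, G, T).

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = clean_sequence(
    "ATGGCTGAGC TCAATCGCCG GCATCTGCTC GGTCTCGGTG TCGGTGCGGT GGCCGCCGCC"
)
print(len(first_row))  # 60
```

Passing the full cleaned gene sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.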
 
Protein sequence
MAELNRRHLL GLGVGAVAAA SLRPALAAEG GEIEAHGISA FGDLKYPADF HHFDYVNVDA 
PKGGVFSTNP SYRSFNQSFL TFNSLNAFIF KGDGAQGMGL TFAPLMARAG DEPDAMYGLV
AKSVKISADG LGYRFTLRPE ARFHDGSKLT AHDAAFSLTV LKTKGHPLIT QQMRDVVSAE
ALDDLTLLVR FSAKRGRDVP LFVAGLPIFS QAYYAKHPFD ESTLEAPLGS GPYKVGKFEV
GRYIEFERLQ DWWGAELPVN RGANNFDVVR YDFYRDRDVA FEGFTGRSYL YREEFTSRVW
NTRYDFPAIL DGRVKRETLP DETPSGAQGW FPNTRRDKFK DPRVREALGC AFDFEWTNKT
LMYGAYLRTV SPFQNSDLMA NGPPSPEEVA LLERFRGQVP EEVFGAPYVP PVSDGSGQDR
ALLKKAVQLL QDAGCVIKNG KRMTPQGEPF TIEFLLDEPT FQPHHMPFIK NLATLGIEAS
LRMVDAVQHR ARRDDFDFDL IIERFGFSTV PGDSLRPFFS SRAAATKGSS NLAGIADPVV
DALVEDVIAA DTRVKLVVAA RALDRVVRAG RYWVPQWYSG SHRVAYWDVF GHPAKLPKYL
GVAAPDLWWS TVKSAATEQA K
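
The protein sequence above can be related to the gene sequence by codon-wise translation. The following is a minimal sketch, not the tool used to generate this record. It relies on the fact that NCBI translation table 11 uses the same amino-acid assignments as the standard code and differs only in which codons are permitted as start codons; because this gene begins with ATG, the sketch does not handle alternative start codons. The name `translate_cds` is hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments; only the set of
# permitted start codons differs, which this sketch does not model.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate_cds("ATGGCTGAGC TCAATCGCCG GCATCTGCTC"))  # MAELNRRHLL
```

The output matches the first block of the protein sequence listed above.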