Gene RPD_3535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3535 
Symbol 
ID4024049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3925190 
End bp3927085 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content65% 
IMG OID637963739 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_570659 
Protein GI91978000 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.10202 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATTTGA CCCGACGACA TCTTCTGCAA GCCGGCGCAT GGGCGGCTGC CACCCCGGCC 
CTCGGTCTCG GCTCCGGCCT GCTGGCGGCA GGCCCCGCCG CAGCGCAGGG CGCGGCGGGT
GGGCCGGCCT GGCGGCACGG CCTGTCGCTG TTCGGCGAGG TCAAGTACCC GCCCGATTTC
AGGCGGTTCG ATTACGTCAA CCCGGAAGCG CCCAAAGGCG GCGCGGCGCG CCAAATCTCG
CTCGGCACCT TCGACAATTT CAACATTGCG GTCGCCGGGG TGAAGGGCAA CATCGCGCCG
GCGGTCGGCT ACCTCTACGA GACGCTGATG ACCGCCTCGC AGGACGAGGT CGGCGTCGTC
TACGGACTGC TCGCCGAAAG CGCGACCCAC CCCGACGATT TTTCCTGGGT GGCGTATCGG
CTGCGCGCCG GCGCCCGCTG GCACGACGGC AAGCCGGTGA CCGCTGATGA CGTGATCTTT
TCGTTCGACT CGCTGAAGAA ATTCAGCCCG CGCTACGCGT CGTATTACCG CCACGTCCTC
AAGGCTGAGA AGACCGGCGA GCGCGACATC CGCTTCACCT TCGATGGGCC CGGCAATCGC
GAATTGCCGA CCATTGTCGG CGAGCTGATG ATCCTGCCGA AGCATTGGTG GGAAGGCGTC
GACAGCGCCG GACGGACGCG CGACATTTCC GCCACCACGC TGGAAAAGCC GCTCGGCTCC
GGCCCGTACC GGATCAAGGA GTTCGTCGCC GGCCGCGCGA TCGTGCTGGA GCGCGTCAAG
GACTATTGGG GTGAGAAGCT GCCGGTGCGC ATCGGCGAGA ACAATTTCGA CGAGCTGCGC
TTCGAATTCT TTCGCGACAA CACCGTCGCG CTCGAAGCCT TCAAGGCCGA CCAGGCCGAC
TGGATCGCCG AGAACTCCGC CAAGCAATGG GCGACGGCCT ACGACTTCCC CGCGGTCACC
GACAAGCGGG TGGTCAAGGA GGAATTCCCG ATCAACGATT CCGGCCGGAT GCAGGGCTTC
GTCCTCAACC TCCGTCGCGA CATGTTCAAG GATGCCCGGA TCCGGCGCGC CTTCAACTAC
GCGTTCGACT TCGAGGAAAT GAACAAGCAG CTGTTCTACA GCCAGTACAA GCGGATCAAC
AGCTACTTCG AAGGCACCGA GCTCGCCTCC AGCGGGTTGC CGCAGGGCGA CGAACTCGCG
ATCCTCGAGA CCGTGCGCGA CAAGGTGCCG GCCGAGCTGT TCACCACCCC CTACAGCAAT
CCGGTCGGCG GCAATCCGGA ATCGGTTCGC GCCAATCTGC GCGAGGCGAT GAAGCTGGTG
AAGGAAGCCG GCTTCGACAT CAAGGATCGC AAGCTGGTCG ACCCGTCCGG CAAGCCGGTC
ACGGTCGAGA TGCTGGTGCA GGATCCGTCG GCCGAGCGCA TCACGCTGTT CTACAAGCCG
TCGCTGGAGC GGCTCGGCGT CACCGTCTCG ATCCGCGTCG TCGACGATGC TCAGTATCAG
AACCGCATCC GCGCCTTCGA TTTCGACATC ATCACCGATC TGTGGGGCCA GTCGCTGTCG
CCCGGCAACG AGCAGCGCGA TTATTGGGGT TCACAGGCTG CCGATCAGCC CGGCTCGCGC
AACACCATCG GCATCAAGAA CCCGGCGATC GACGCGCTGA TCGACAAGGT GATCTTCGCC
AAGGACCGCG CGACGCTGGT CGCCGCCACC CGCGCGCTCG ACCGCGTGCT GCTGTGGAAT
TTCTACGTCG TGCCGCAATT CACCTACGGC TTCATCCGCT ACGCCCGCTG GGACCGCTTC
AGCCACGCCG ATCTGCCTAA ATACGCGCGC TCCGGCCTGC CGATGCTGTG GTGGTACGAC
GCCGAGAAGG CCGCCAAGAT CGGCAGACGT TCTTGA
 
Protein sequence
MNLTRRHLLQ AGAWAAATPA LGLGSGLLAA GPAAAQGAAG GPAWRHGLSL FGEVKYPPDF 
RRFDYVNPEA PKGGAARQIS LGTFDNFNIA VAGVKGNIAP AVGYLYETLM TASQDEVGVV
YGLLAESATH PDDFSWVAYR LRAGARWHDG KPVTADDVIF SFDSLKKFSP RYASYYRHVL
KAEKTGERDI RFTFDGPGNR ELPTIVGELM ILPKHWWEGV DSAGRTRDIS ATTLEKPLGS
GPYRIKEFVA GRAIVLERVK DYWGEKLPVR IGENNFDELR FEFFRDNTVA LEAFKADQAD
WIAENSAKQW ATAYDFPAVT DKRVVKEEFP INDSGRMQGF VLNLRRDMFK DARIRRAFNY
AFDFEEMNKQ LFYSQYKRIN SYFEGTELAS SGLPQGDELA ILETVRDKVP AELFTTPYSN
PVGGNPESVR ANLREAMKLV KEAGFDIKDR KLVDPSGKPV TVEMLVQDPS AERITLFYKP
SLERLGVTVS IRVVDDAQYQ NRIRAFDFDI ITDLWGQSLS PGNEQRDYWG SQAADQPGSR
NTIGIKNPAI DALIDKVIFA KDRATLVAAT RALDRVLLWN FYVVPQFTYG FIRYARWDRF
SHADLPKYAR SGLPMLWWYD AEKAAKIGRR S