Gene RPD_1152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1152 
Symbol 
ID4021628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1309995 
End bp1311611 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content65% 
IMG OID637961344 
Productextracellular solute-binding protein 
Protein accessionYP_568291 
Protein GI91975632 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.296162 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTTGT CGCGCTTCGA GATCAACCGC CGGACCGTCC TGCTGACGTC GGCCGCCATC 
GCCGCCAATG TACTCAATCC GATGCGGGCG TTCGCGCAGG AGACGCCGCG CAAGGGCGGG
GTGTTCAACG TGCATTACGG CGCGGAGCAA CGCCAGCTCA ACCCCAGCTT GCAGGCATCG
ACCGGCGTGT ACATCATCGG CGGCAAGATC CAGGAGCCGC TGGTCGATCT CGACGCCGCC
GGCAATCCGG TCGGCGTGCT GGCGGAGAGC TGGGAATCGA CGCCGGACGG CAAGACGATC
ACTTTCAAGC TGCGCAAGGG CGTCGTCTGG CACGACGGCA AGCCGTTCAC CTCCGAGGAC
GTCGCCTTCA CCGCGCTGAA CATGTGGAAG AAGATCCTCA ACTACGGATC GACGCTGCAG
CTGTTCCTCA CCGCGGTCGA CACCCCCGAT CCGCAGACTG CGATCTTCCG TTACGAGCGG
CCGATGCCGC TCAATTTGCT GCTGCGCGCG CTGCCGGACC TCGGTTACGT CTCGCCCAAG
CACATCTACG AGACCGGCGA CATCCGCCAG AACCCGGTCA ATCTCGCGCC GATCGGCACC
GGCCCGTTCA AGTTCAACAA ATACGAGCGC GGCCAGTACA TCATCGCCGA CCGCAACGAC
AATTACTGGC GGCCGAATGC GCCCTATCTC GACCGCATCG TCTGGCGGGT AATCACCGAC
CGCGCCGCGG CGGCGGCGCA GCTCGAAGCC GGCAGCCTGC ATCTCAGCCC GTTCTCGGGC
CTGACGATTT CCGACATGGC GCGGCTCGGC AAGGACAAGC GCTTCATCGT CTCGACCAAG
GGCAACGAGG GCAACGCCCG CACCAACACG CTGGAGTTCA ACTTCCGCCG CAAGGAGCTG
TCGGACATCC GCGTCCGCCA GGCGATCGCG CACGCGATCA ACGTGCCGTT CTTCATCGAG
AACTTCCTTG GCGACTTCGC CAGGCTCGGC ACCGGGCCGA TCCCCTCGAC CTCGGCCGAT
TTCTATCCCG GCCCGAACAC GCCGCAATAC GCTTACGACA AGAAGAAGGC GATCGCGCTG
CTCGACGAGG CCGGGCTGAA GCCCGCCGGC GGCGGCACCC GCCTCTCGCT GCGGCTGTTG
CCGGCGCCGT GGGGCGAGGA CATCTCGCTG TGGGCGACCT TCATCCAGCA ATCCCTGTCG
GAGATCGGCG TCCAGGTCGA GATCGTGCGC AACGATGGCG GCGGCTTCCT CAAGCAGGTC
TATGACGAAC ACGCCTTCGA CCTCGCCACC GGCTGGCACC AGTATCGCAA CGATCCCGCG
GTCTCGACCA CGGTGTGGTA TCGCTCCGGC CAGCCCAAGG GCGCGCCGTG GACTAATCAG
TGGGGCTGGG AAGACGCCAC CACCGACAAG ATCATCGATA ACGCCGCCAC CGAGGTCGAT
CCCGTCAAGC GCAAGGCGCT GTATGCCGAT TTCGTCACCC GCGCCAACAC CGAACTGCCG
ATCTGGATGC CGATCGAGCA ATTGTTCGTC ACGGTGATCT CCGCCAAGGC GCGCAATCAC
TCCAACAATC CGCGCTGGGC GTCATCGACC TGGCATGATC TTTGGCTGGC CGAATAG
 
Protein sequence
MALSRFEINR RTVLLTSAAI AANVLNPMRA FAQETPRKGG VFNVHYGAEQ RQLNPSLQAS 
TGVYIIGGKI QEPLVDLDAA GNPVGVLAES WESTPDGKTI TFKLRKGVVW HDGKPFTSED
VAFTALNMWK KILNYGSTLQ LFLTAVDTPD PQTAIFRYER PMPLNLLLRA LPDLGYVSPK
HIYETGDIRQ NPVNLAPIGT GPFKFNKYER GQYIIADRND NYWRPNAPYL DRIVWRVITD
RAAAAAQLEA GSLHLSPFSG LTISDMARLG KDKRFIVSTK GNEGNARTNT LEFNFRRKEL
SDIRVRQAIA HAINVPFFIE NFLGDFARLG TGPIPSTSAD FYPGPNTPQY AYDKKKAIAL
LDEAGLKPAG GGTRLSLRLL PAPWGEDISL WATFIQQSLS EIGVQVEIVR NDGGGFLKQV
YDEHAFDLAT GWHQYRNDPA VSTTVWYRSG QPKGAPWTNQ WGWEDATTDK IIDNAATEVD
PVKRKALYAD FVTRANTELP IWMPIEQLFV TVISAKARNH SNNPRWASST WHDLWLAE