Gene RPC_3729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3729 
Symbol 
ID3971474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4149958 
End bp4151775 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content64% 
IMG OID637926839 
Productextracellular solute-binding protein 
Protein accessionYP_533583 
Protein GI90425213 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCGCGA TCGGCGTTAC GTCGACGCAG GCGGCGCCGG CCGAACCGGT CTGGCGTCAC 
GGCCTGTCGC TGTTCGGCGA CGTCAAATAT CCCGCCGACT TCAAGCGATT CGACTACGTC
GATGCCAACG CCCCGAAAGG CGGCGCGGCG CGGCAGATTT CGATCGGCAC CTTCGATAAT
TTCAATCTGG CGGTGGCCGG CGTGAAGGGC TCGATCGCGC CCGCGGTCGG GCTGATCTAC
GAAACCCTGA TGACGCAATC GCAGGACGAG GTCGGCGCCG AATACGGGCT GCTCGCGGAA
GCGGCGGCGC ATCCCGACGA TCACTCCTCG GTGACCTATC GGCTGCGCGC CAATGCGCGC
TGGCACGACG GCAAACCGGT GACCCCGGAG GATGTGATCT TCTCGCTGGA GGCGCTGAAG
AAATACAGCC CGCGTTATGC CTCGTATTAT CGCCACGTCG TGAAGGCCGA GAAGACCGGC
GACCGCGAGA TCAAATTCAG CTTCGACATG CCGGGCAACC GCGAATTGCC GACCATCGTC
GGCGAACTCG TCGTGTTGCC GAAGCATTGG TGGGAGGGCA GCGACGAGCA GGGCCGGCCG
CGCGACATTT CCGCGACCAC GCTGGAAAAG CCGCTCGGGT CGGGTCCGTA TCGCATCAAG
GATTTCGTCG CCGGCCGTTC GGTGACGCTG GAACGGGTCA AGGACTATTG GGGCGCCGCG
GTGCCGGCGC GGGTCGGCCA GAACAATCTC GACGAACTGC GCTACGAATT CTTCCGCGAC
AATCTGGTGG CGCTGGAAGC CTTCAAGGCC GACCAGGCCG ACTGGATCTT CGAGAATTCC
GCCAAGCAAT GGGCCACCGC CTATGACTTC CCGGCGGTGA CCGAGAAGCG CGTCGTCAAG
GAAGAATTCC CGATCAACGA TTCCGGGCGG ATGCAGGCCT TCATCTTCAA TCTGCGCCGC
GAGATGTTCC AGGATGCGAG GCTGCGCCGC GCCTTCAACT ACGCGTTCGA TTTCGAGGAG
ATGAACAAGC AACTGTTCTA CGGACAATAC AACCGGATCA ACAGCTACTT CGAAGGTACC
GAACTGGCCT CCAGCGGGCT GCCGCAGGGT GCCGAACTGG CGCTGCTGGA GCCGTTGCGC
GACAAGTTGC CCGCCGAGCT GTTCACCACG CCCTACGCCA ACCCGGTCGG CGGCAATTCG
GACGCGGTGC GCGGCAATCT GCGCGAGGCG ATGCGGCTGT TGAAGGAGGC GGGATTCGAA
GTGCGCGACC GCCGGCTGGT CGATGCCGCC GGCAAGCCGG TGCTGGTGGA GATCCTGGTG
CGGGATCCCT CCTCGGAGCG GATCGCGCTG TTCTACAAGC CGTCGCTGGA ACGGATCGGC
GTCACGGTGT CGATCCGCAC CGTCGACGAC GCGCAGTACG AGAACCGGGT TCGCGCGTAC
GATTTCGACA TGATCACCGA TCTGTGGGGC CAGTCGCTGT CGCCCGGCAA CGAGCAGCGC
GACTATTGGG GCTCGCAGGC CGCCGATCAG CCGGGCTCGC GCAACACCAT CGGCATCAAG
AATCCCGCGG TCGATGCGCT GATCGAGAAA GTGATCTTCG CCAAGGACCG CGCCTCGCTG
GTCGCCGCCA CCCGCGCGCT CGATCGCGTG TTGCTATGGA ATTTCTATCT GGTGCCGCAG
TTCACCTACG GCTATGCGCG CTACGCGCGC TGGGATCGCT TCAGCCACGC CGAGCTGCCG
AAATACGCCC GCGCCGGGTT GCCGTCGCTG TGGTGGTACG ACGCCGACAA GGCCGCCCGG
ATCGGCAAAC GCTCTTGA
 
Protein sequence
MSAIGVTSTQ AAPAEPVWRH GLSLFGDVKY PADFKRFDYV DANAPKGGAA RQISIGTFDN 
FNLAVAGVKG SIAPAVGLIY ETLMTQSQDE VGAEYGLLAE AAAHPDDHSS VTYRLRANAR
WHDGKPVTPE DVIFSLEALK KYSPRYASYY RHVVKAEKTG DREIKFSFDM PGNRELPTIV
GELVVLPKHW WEGSDEQGRP RDISATTLEK PLGSGPYRIK DFVAGRSVTL ERVKDYWGAA
VPARVGQNNL DELRYEFFRD NLVALEAFKA DQADWIFENS AKQWATAYDF PAVTEKRVVK
EEFPINDSGR MQAFIFNLRR EMFQDARLRR AFNYAFDFEE MNKQLFYGQY NRINSYFEGT
ELASSGLPQG AELALLEPLR DKLPAELFTT PYANPVGGNS DAVRGNLREA MRLLKEAGFE
VRDRRLVDAA GKPVLVEILV RDPSSERIAL FYKPSLERIG VTVSIRTVDD AQYENRVRAY
DFDMITDLWG QSLSPGNEQR DYWGSQAADQ PGSRNTIGIK NPAVDALIEK VIFAKDRASL
VAATRALDRV LLWNFYLVPQ FTYGYARYAR WDRFSHAELP KYARAGLPSL WWYDADKAAR
IGKRS