Gene RPC_2105 details


Gene Information

Locus tag: RPC_2105
Symbol:
ID: 3973661
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2306268
End bp: 2307383
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 62%
IMG OID: 637925213
Product: extracellular solute-binding protein
Protein accession: YP_531978
Protein GI: 90423608
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
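
The coordinate and length fields above are mutually consistent, which the short Python sketch below checks. It assumes the start and end positions are 1-based and inclusive (an inference from the arithmetic, not something the record states) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2306268
end_bp = 2307383
reported_gene_length = 1116     # bp
reported_protein_length = 371   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is taken to be
# the stop codon, which does not add a residue to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the 1116 bp span holds 372 codons, that is 371 amino acids plus one stop codon.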


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.681994
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTGTT CGACCGGCTT GCAAAAATTG CGCTTTGTCG CAGCCATGAT CGGTTGCGCG 
GTCCTACTGC TGTCGCCCGC CCGCGCCGAA CAGCGCGTCG TCAACTTCTA CAACTGGTCG
AACTACATCG CCCCCGGCGT GCTCGACGAG TTCAGCCGCG AGACCGGCAT CAAGGTGATC
TACGACACCT TCGACGGCAA CGAGACGCTG GAAACGCGGC TGTTGGCGGG AAAATCCGGC
TACGACGTGG TGGTTCCGAC CGCGTATTTC CTGCAGCGGC AGATCGCCGC CAATATTTTC
CAGAAGCTCG ACAAGGCGAA GCTGCCGAAC CTCGGCAACG CCTGGGACGT CGTGACCAAG
CGGCTGGCGA CCTACGATCC CGGCAACCGC TTTGCCGCGA ACTACATGTG GGGCACCACC
GGGATCGGCT ACAACGTCGC CGCAGTGCGC AAGATCCTCG GCGAGGGCGC TGTGATCGAC
AGCTGGGCCA CGGTGTTCAA GCCGGAGAAT CTGGCGAAAT TCACCGAGTG CGGCGTGCAC
ATGCTGGACT CGCCCGATGA TATTTTGCCG GCTGCGCTGA CCTATCTCGG CCTCGATCCG
AACTCCACCA AGCCGGCCGA TCTGGAAAAA GCCGCCGATC TGGTCGGCAA GATCCGACCC
TATGTCCGCA AGTTTCATTC CTCGGAATAT CTCAACGCGC TAGCGACCGG CGAAATCTGC
CTGGTGGTCG CCTGGTCGGG CGACATCATG CAGGCGCGCA GCCGCACCGC CGAGGCCAAC
AATGGCGTCG AGATCGGCTA TTCGATTCCG AAGGAAGGCG CGCAGATGTT CTTCGACAAT
CTGGCGATCC CGGCCGACGC CAAGAACGTC GCCGAGGCGC ACGAACTGAT CAACTATCTG
TACCGCCCCG ACGTCGCGGC GAAGAATTCC GGCTTCCTGT CCTACGCCAA CGGCAATCTG
GCCAGCCAGA AGCTGATCGA TCCGAAGGTG ATCGGCAACA AGATGGTGTT TCCGGATTCG
GCGACCGAGA AGCGGCTGTT CGTCATCACC GCGCGCGACG CCGCCACCCA GCGGGTGATC
AACCGGCTGT GGACCAAGGT GAAGACCGGG ATGTAG
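
A GC content figure like the one in the Gene Information section can be computed by counting G and C bases after removing the spaces that group the sequence into blocks of ten. The Python sketch below is a minimal illustration; `gc_percent` is a hypothetical helper name, and the example input is only the first 60 bases of the gene sequence above, so its result describes that fragment and not the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the ten-base grouping used in the listing) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above (60 bases), used as sample input.
first_line = (
    "ATGCGCTGTT CGACCGGCTT GCAAAAATTG "
    "CGCTTTGTCG CAGCCATGAT CGGTTGCGCG"
)
print(round(gc_percent(first_line), 1))
```

Pasting the full gene sequence into `first_line` would give the value for the whole gene under the same counting rule.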
 
Protein sequence
MRCSTGLQKL RFVAAMIGCA VLLLSPARAE QRVVNFYNWS NYIAPGVLDE FSRETGIKVI 
YDTFDGNETL ETRLLAGKSG YDVVVPTAYF LQRQIAANIF QKLDKAKLPN LGNAWDVVTK
RLATYDPGNR FAANYMWGTT GIGYNVAAVR KILGEGAVID SWATVFKPEN LAKFTECGVH
MLDSPDDILP AALTYLGLDP NSTKPADLEK AADLVGKIRP YVRKFHSSEY LNALATGEIC
LVVAWSGDIM QARSRTAEAN NGVEIGYSIP KEGAQMFFDN LAIPADAKNV AEAHELINYL
YRPDVAAKNS GFLSYANGNL ASQKLIDPKV IGNKMVFPDS ATEKRLFVIT ARDAATQRVI
NRLWTKVKTG M
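
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below illustrates this for the first and last blocks of the listing. It builds the standard codon table, whose amino acid assignments are shared by translation table 11 (the two tables differ only in which codons may act as start codons). The function name `translate` and the two sample fragments are choices made for this illustration.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G). Translation
# table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon."""
    bases = "".join(sequence.split()).upper()
    usable = len(bases) - len(bases) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE[bases[i:i + 3]] for i in range(0, usable, 3)
    )


# First 60 bases and final 36 bases of the gene sequence above.
first_block = (
    "ATGCGCTGTT CGACCGGCTT GCAAAAATTG "
    "CGCTTTGTCG CAGCCATGAT CGGTTGCGCG"
)
last_block = "AACCGGCTGT GGACCAAGGT GAAGACCGGG ATGTAG"

print(translate(first_block))  # MRCSTGLQKLRFVAAMIGCA
print(translate(last_block))   # NRLWTKVKTGM*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final 11 residues followed by the TAG stop codon.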