Gene RPC_4010 details

Gene Information

Locus tag: RPC_4010
Symbol:
ID: 3969200
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4458316
End bp: 4459305
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 62%
IMG OID: 637927114
Product: thiosulphate-binding protein
Protein accession: YP_533855
Protein GI: 90425485
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1613] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
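
The coordinates and lengths listed in this section are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields in this record.
start_bp = 4458316
end_bp = 4459305

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 990 329
```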


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.311837
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCCGTC GTTTTCTTAC CGTGATCGCG GGATTGATCT GGGCAGGCTC CGCGATGGCC 
GCTGACGTCA CGCTGCTCAA CGTGTCGTAC GATCCGACCC GCGAACTTTA TTCCGACTTC
AACAAGGCGT TCGCCGCCGC CTATCAGAAG GACGCCGGCA AGAGCGTCGA GATCAAGCAG
TCGCATGGCG GCTCCGGCGC GCAGGCGCGT TCGGTGATCG ACGGGCTGCA GGCCGACGTC
GTCACCTTGG CGCTCGCCTA CGACATCGAC GCCATCGCCA ACAAGGGGCT GATCGCCAAG
GATTGGCAGG CCAAGCTGCC GCAGAACTCA TCGCCCTACA CCTCGACCAT CGTGTTCCTG
GTGCGCAAGG GCAACCCGAA GGGCATCAAG GACTGGGATG ATCTGACCAA ATCCGGCGTC
AGCGTGATCA CGCCGAACCC GAAGACCTCG GGCGGCGCGC GCTGGAACTA TCTCGCCGCC
TGGGGCTACG CGCTGAAGAA GTCGGGTTCC GAACAGGGGG CGCGGGAGTT CGTCGCCAAC
ATCTATAAGA ACGTGCCGGT GCTGGATACC GGGGCGCGCG GCTCCACCGT CAGTTTCGTC
GAGCGCGGCG TCGGCGACGT GTTGCTGGCC TGGGAAAACG AGGCGTTTCT CGCGGTGAAG
GAATTCGGCA AGGACAGGTT CGAGATCGTG GCGCCATCGG TGTCGATCCT GGCGGAGCCG
CCGATCGCGG TGGTCGATGG CGTTGCCGAC AAGAAAGGCA CCCGCTCTGC CGCGGAAGCT
TATCTGAAAT ACTGGTACAC TCCAGAGGGC CAGGAAATCG CCGCGCGCAA CTTCTACCGC
CCTCGCGATG CCGGAATTGC CAAGAAGTAT GCCGACTCGT TCGCCAAGGT TGAGCTGTTC
ACCATCGACG ACGTGTTCGG CGGCTGGACC AAGGCGCAAA AGGAACACTT CAGCGACGGC
GGTGTGTTCG ATAAGATATA TAAGGACTAA
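
A GC percentage like the one reported under Gene Information can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as text in space-separated 10-base blocks, as it appears in this record. `gc_percent` is a hypothetical helper name, and only the first line of the sequence is used as sample input.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence from this record (60 bases).
first_line = "ATGGTCCGTC GTTTTCTTAC CGTGATCGCG GGATTGATCT GGGCAGGCTC CGCGATGGCC"
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```

Passing the full 990 bp sequence to the same function gives the gene-wide value to compare with the rounded figure in the record.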
 
Protein sequence
MVRRFLTVIA GLIWAGSAMA ADVTLLNVSY DPTRELYSDF NKAFAAAYQK DAGKSVEIKQ 
SHGGSGAQAR SVIDGLQADV VTLALAYDID AIANKGLIAK DWQAKLPQNS SPYTSTIVFL
VRKGNPKGIK DWDDLTKSGV SVITPNPKTS GGARWNYLAA WGYALKKSGS EQGAREFVAN
IYKNVPVLDT GARGSTVSFV ERGVGDVLLA WENEAFLAVK EFGKDRFEIV APSVSILAEP
PIAVVDGVAD KKGTRSAAEA YLKYWYTPEG QEIAARNFYR PRDAGIAKKY ADSFAKVELF
TIDDVFGGWT KAQKEHFSDG GVFDKIYKD
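
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that under stated assumptions, not the method the database itself uses. It relies on the fact that translation table 11 assigns codons to amino acids the same way as the standard genetic code (the two differ in which start codons are permitted), and it does not handle alternative start codons, which is acceptable here because this gene begins with ATG. `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGTCCGTC GTTTTCTTAC CGTGATCGCG GGATTGATCT GGGCAGGCTC CGCGATGGCC"
print(translate(first_line))  # MVRRFLTVIAGLIWAGSAMA
```

The printed residues match the first two blocks of the protein sequence in this record.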