Gene RPB_1045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1045 
Symbol 
ID3908897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1201886 
End bp1202869 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content63% 
IMG OID637882938 
Productthiosulphate-binding protein 
Protein accessionYP_484666 
Protein GI86748170 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCGCC GAATTGTTCC GCTGCTCGCC GGCCTGATGG TCGCGACCTC CGCGCAGGCC 
GCCGACGTGT CGCTGCTCAA CGTCTCGTAC GACCCGACCC GCGAACTCTA CAGCGAGTTC
AACAAATCCT TCGCCGCCGC GTATCAGAAG GAAACCGGCG ACACCGTCAC GATCAAGCAG
TCGCACGGCG GCTCCGGCTC GCAGGCGCGC TCGGTGATCG ACGGTCTGCA GGCCGACGTC
GTGACGCTGG CGCTGGCCTA CGACATCGAC GCGATCGCCA ACAAGGGGCT GCTGACCAAG
GACTGGCAGA AGCGTCTGCC GCAGAATGCG TCGCCCTACA CCTCGACCAT CGTGTTCCTG
GTGCGCAAGG GCAATCCGAA GGGCATCAAG GACTGGCACG ATCTGATCAG GCCCGGAATC
AGCGTGATCA CGCCGAACCC GAAGACCTCC GGCGGCGCGC GCTGGAATTA TCTGGCGGCC
TGGGGCTACG CGCTGAAGAC GGAGGGATCG GAGGACAAGG CGCGCGACTT CGTCGGGAAC
ATTTACAAGA ATGTGCCGGT GCTGGACACC GGCGCCCGCG GCGCGACCAT GACCTTCGTC
CAGCGTGGCG TCGGCGACGT GCTGCTGGCG TGGGAGAACG AGGCATTCCT GGCGGTCAAG
GAATTCGGCA AGGACAGATT CGAGATCGTG GTGCCGTCGA TCTCGATTCG CGCCGAGCCG
CCGGTGGCGC TGGTCGACAG CGTGGTCGAC AAGAAAGGTA CCCGGGCAGT GGCCGAAGCC
TATCTGCAGT ATTGGTACAC CAAGGAAGGT CAGGAAATCG CCGCACGGAA CTTCTATCGT
CCGCGCGATT CGGAGATTGC CAACAAGCAC GCCTTCGCGA AGGTCGAGTT GTTCACCATC
GACGAATTGT TCGGCGGCTG GACCAAGGCG CAGACGACGC ACTTCACCGA CGGTGGGGTG
TTCGACAAGA TCTACAAGAA CTGA
 
Protein sequence
MFRRIVPLLA GLMVATSAQA ADVSLLNVSY DPTRELYSEF NKSFAAAYQK ETGDTVTIKQ 
SHGGSGSQAR SVIDGLQADV VTLALAYDID AIANKGLLTK DWQKRLPQNA SPYTSTIVFL
VRKGNPKGIK DWHDLIRPGI SVITPNPKTS GGARWNYLAA WGYALKTEGS EDKARDFVGN
IYKNVPVLDT GARGATMTFV QRGVGDVLLA WENEAFLAVK EFGKDRFEIV VPSISIRAEP
PVALVDSVVD KKGTRAVAEA YLQYWYTKEG QEIAARNFYR PRDSEIANKH AFAKVELFTI
DELFGGWTKA QTTHFTDGGV FDKIYKN