Gene RPB_1487 details

Gene Information

Locus tag: RPB_1487
Symbol:
ID: 3908800
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1676719
End bp: 1677723
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 62%
IMG OID: 637883382
Product: extracellular solute-binding protein
Protein accession: YP_485108
Protein GI: 86748612
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4521] ABC-type taurine transport system, periplasmic component
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
            [TIGR01729] taurine ABC transporter, periplasmic binding protein
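
The coordinates and lengths listed in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record.
start_bp = 1676719
end_bp = 1677723
gene_length_bp = 1005
protein_length_aa = 334

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length, computed_protein_length)
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.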


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.143655
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAGAC TGATTGCTTC ACTCGCCGTC GCCGGTTTGG CGCTGACCGC GGCGCAGGCC 
GCCGACAAGC CCGCCAAGAT CACGGTCGGC TATCTCAATC TGGTCAACGC CCAGTTGGTG
ACCAAGAACC TCGGCCTGTT GGCCAAAGAG ATGCCGGGTG TCGAGATCAA ATACGTCAAA
TTCGGCGGCG GCGGCGACAT GCTGCGCGGC ATCGCCGGCA ACGACGTCGA TTTCGGCGGG
CTCGGCAATC CGCCGACCGC GATCGGCATC ACCCGCGGGC TGCCGATCAA GGGCATCCTG
GTGCTCAACA TGCTCGGCGA CGTCGAGTCG ATGGTGGTGC GCACCTCGAA GAACATCAAG
TCGCTGAAGG ATCTGAAGGG CAAGACGGTG GCGGCGCCGT TCGGCTCCAC CACGCACTAC
TTGTTGCTGC AGGCGCTGGC CGACGAGGGC GTCGAGCCGT CGTCGATGAA GATCCTCGAT
CTGCCGCCCT CCGACATCGC AACCGCCTGG ATCCGCGGTG ATCTCGACGC CGCCTGGCTG
TGGGAGCCCA ATCTGGACAA GGCGGTGAAG AACGGCGGCC ACATCTACAT GTCGTCCGGG
CTGATGGAGA AGCGCGGCTA CCCGACCTGG GACATCGGCG TGGTGATGAA CGGATTCGCG
GAGAAGTACC CCGACTATGT CGAGAAATTC GTCAAGGCGG AATGCGCCGG CATCGACTTC
TGGATCAAGA ACCCGGACAA GACCGCGGCG ATCATCGCCG AGGAGCTGTC GCTGCCGCCG
GAAGACGCGA TGCGGATGAT GAACGGCACC GCCATGGTGC CTTGCGACAA GCAGCTGACC
GCAACCTATC TCGGCACCAC GGCCAAGAAA GGCCAGTTCG TCGACACGCT GCTGGCCACC
GGCGACTTTC TGGTGAAGCA GGAGCGGCTC CCGAAACTGC TGCCGCGCAA GGATTTCGAA
GCCTTCCTGG TGCCTGGCTA TATCGAAAAA GTAGTCGGCA AGTAA
 
Protein sequence
MKRLIASLAV AGLALTAAQA ADKPAKITVG YLNLVNAQLV TKNLGLLAKE MPGVEIKYVK 
FGGGGDMLRG IAGNDVDFGG LGNPPTAIGI TRGLPIKGIL VLNMLGDVES MVVRTSKNIK
SLKDLKGKTV AAPFGSTTHY LLLQALADEG VEPSSMKILD LPPSDIATAW IRGDLDAAWL
WEPNLDKAVK NGGHIYMSSG LMEKRGYPTW DIGVVMNGFA EKYPDYVEKF VKAECAGIDF
WIKNPDKTAA IIAEELSLPP EDAMRMMNGT AMVPCDKQLT ATYLGTTAKK GQFVDTLLAT
GDFLVKQERL PKLLPRKDFE AFLVPGYIEK VVGK
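
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The sketch below is a minimal, hedged example in plain Python: it assumes the standard codon assignments (which translation table 11 shares for elongation codons) and that the gene begins with ATG, so alternative start codons are not handled. The helper names `clean` and `translate` are hypothetical, and only the first 30 bases of the record are used.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a cleaned DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as shown in the record.
gene_start = clean("ATGAAGAGAC TGATTGCTTC ACTCGCCGTC")
print(translate(gene_start))
```

The translated fragment can be compared with the first ten residues of the protein sequence. Passing the full gene sequence through `clean` and `translate` would extend the same comparison to the whole protein.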