Gene RPB_4046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4046 
Symbol 
ID3911853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4615940 
End bp4616947 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content63% 
IMG OID637885950 
Productputative periplasmic solute-binding protein 
Protein accessionYP_487650 
Protein GI86751154 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0699034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATTG CTCGTTTCGC TGCGGCGCTG ATCGCGCTGT CGCTGTTCGC CGCCGCGCCG 
GCTGCAAAGG CCGCGGATGT GATCTGCTAC AATTGTCCGC CGCAATGGGC CGACTGGGCG
TCGATGCTCA AGGCGATCAA GACGGACCTC GGCTACGACA TCCCGTTCGA CAACAAGAAC
TCCGGCCAGG CGCTGTCGCA ATTGCTGGCC GAGAAGAGTA ATCCGGTTGC GGATATCGGT
TATTTCGGCG TCAATTTCGG CATGAAGGCC AAGGCGCAGG GCGTCACCCA GCCCTACAAG
CCGCAGCACT GGAACGAGGT GCCGGCCGGT CTCAAGGACG CCGATGGCGA ATGGACCGCG
ATCCATTCCG GGACGCTCGG CCTGTTCGTC AATGTCGACG CGCTCGGCGG CAAGCCGGTG
CCTGCGTGCT GGAAGGATCT GCTGAAGCCG GACTACAAGG GCATGGTCGG CTACCTCGAT
CCGCCTTCGG CAGCGGTCGG TTATGTCGGC GCGGTCGCGG TCAATCTCGC GCTCGGCGGC
AGCGACGCCG ACTTCTCGCC GGCGATCGGA TTCTTCAAGG CGCTGCACGG CAACGACGCC
ATCGTGCCGA AGCAGACGTC CTACGCACGC GTCGTGTCGG GCGAGATCCC GATCCTGTTC
GACTATGATT TCAACGCCTA CCGGGCCAAG TACACCGAGA AGGGCAAATT CGCCTTCGTC
ATCCCGTGCG AGGGGTCGGT GGTGTTTCCC TATGTGGTCA GCCTGACCAA GGGCGCGCCG
AACGCCGAGA AGGCGAAGAA GGTGATCGAC TATCTGTTGT CCGACAAGGG CCAGGCGATC
TGGACCAACG CCTATCTGCG GCCGGCGCGA CCGATCGAAC TGCCCGAGGC GGTGAAGTCG
AAATTCCTGC CGGACGCCGA CTACGCCCGC GCCAAGAGTG TCGACTGGGC CAAGATGGAA
GCGGGCCAGA AGGCGTTCAC TGATCGCTAT CTTGCTGAGG TTCGCTGA
 
Protein sequence
MTIARFAAAL IALSLFAAAP AAKAADVICY NCPPQWADWA SMLKAIKTDL GYDIPFDNKN 
SGQALSQLLA EKSNPVADIG YFGVNFGMKA KAQGVTQPYK PQHWNEVPAG LKDADGEWTA
IHSGTLGLFV NVDALGGKPV PACWKDLLKP DYKGMVGYLD PPSAAVGYVG AVAVNLALGG
SDADFSPAIG FFKALHGNDA IVPKQTSYAR VVSGEIPILF DYDFNAYRAK YTEKGKFAFV
IPCEGSVVFP YVVSLTKGAP NAEKAKKVID YLLSDKGQAI WTNAYLRPAR PIELPEAVKS
KFLPDADYAR AKSVDWAKME AGQKAFTDRY LAEVR