Gene RPB_3333 details

Gene Information

Locus tag: RPB_3333
Symbol:
ID: 3911135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3813540
End bp: 3814565
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 65%
IMG OID: 637885236
Product: extracellular solute-binding protein
Protein accession: YP_486940
Protein GI: 86750444
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.686225
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.192968
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACGA TATTCGCCAA GATCGCCGCG GTGCTGTCGG CGCTGCTGCT GACCACCACC 
TTCGCGACCG CGCAGAGCAA GGTGACGATC GCGATCGGCG GCGGGGCGTG TCTGTGCTAC
CTGCCGACGG TGCTGGCCAA GCAACTCGGC GAATACGACA AGGCGGGGCT CAGCGTCGAA
CTGGTCGATC TCAAGGGCGG TTCCGATGCG CTGAAGGCGG TGCTCGGCGG CAGTGCCGAC
GTCGTCTCCG GCTATTTCGA CCACACCGTC AATCTCGCCG CCAAGAAGCA GGAGATGCAG
TCCTTCGTGG TCTACGACCG CTATCCCGGG CTGGTCCTGG TGGTGTCGCC GGGGCATACC
GCGAAGATCG CATCGGTCAA GGACCTCGCC GGCAAGAAGG TCGGCGTCAG CGCGCCGGGC
TCGTCGACCG ATTTCTTCCT GAAATATCTC CTGAAGAAGA ACGGCGTCGA TCCGAACGAC
GTGGCGGTGA TCGGCGTCGG CCTCGGCGCC ACCGCGGTGG CGGCGATGCA GCAGGGCCAG
ATCGAGGCCG CGGTGATGCT CGATCCGGCG GTGACGATCC TGCAGGCGGC GCACGCCGAT
CTGCGCATCC TCAGCGACAC GCGCACCGAA CACGACACCC GCGAAGTGTT CGGCGGCGAC
TATCCGGGCG GCGCGCTGTA TTCGACGGTG GCCTGGATCA AGGCGCATCC GAAGGAGGCG
CAGGGCCTGA CCAACGCCAT CCTGAACACG CTGAGCTGGA TCCACGCGCA TTCGGCCGAG
GAGATCGCCG ACAAGATGCC GCCGAACATC GTCGGCAAGG ACAAGGCGCA ATATGTCGCC
GCGCTGAAGA ATACGATCCC GATGTATTCG ACCACCGGGC TGATGGACCC GAAGGGCGCG
GAGGCGGTTC TGGCGGTGTT CAGCACCAGC TCGCCGGATG TGGCGAAAGC CAATATCGAC
GTCACCAGGA CCTACACCAA CGCCTTCGTC GAGCAGGCGG CGAAGACGTC GGGCGCGGCG
AAGTAG
 
Protein sequence
MKTIFAKIAA VLSALLLTTT FATAQSKVTI AIGGGACLCY LPTVLAKQLG EYDKAGLSVE 
LVDLKGGSDA LKAVLGGSAD VVSGYFDHTV NLAAKKQEMQ SFVVYDRYPG LVLVVSPGHT
AKIASVKDLA GKKVGVSAPG SSTDFFLKYL LKKNGVDPND VAVIGVGLGA TAVAAMQQGQ
IEAAVMLDPA VTILQAAHAD LRILSDTRTE HDTREVFGGD YPGGALYSTV AWIKAHPKEA
QGLTNAILNT LSWIHAHSAE EIADKMPPNI VGKDKAQYVA ALKNTIPMYS TTGLMDPKGA
EAVLAVFSTS SPDVAKANID VTRTYTNAFV EQAAKTSGAA K
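
The coordinates and lengths listed under Gene Information are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon (the gene sequence above ends in TAG). The function names gene_length, protein_length, and gc_percent are hypothetical helpers introduced only for this illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected protein length in aa for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and includes exactly
    one stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of seq.

    Whitespace and any other characters (such as the spaces that
    separate the 10-base groups above) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(3813540, 3814565)
    print(length)                  # 1026
    print(protein_length(length))  # 341
```

Passing the full gene sequence to gc_percent would give a value to compare against the GC content field, bearing in mind that the record reports a rounded percentage.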