Gene RPB_2478 details

Gene Information

Locus tag: RPB_2478
Symbol:
ID: 3910267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2834360
End bp: 2836861
Gene Length: 2502 bp
Protein Length: 833 aa
Translation table: 11
GC content: 62%
IMG OID: 637884377
Product: organic solvent tolerance protein
Protein accession: YP_486094
Protein GI: 86749598
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1452] Organic solvent tolerance protein OstA
TIGRFAM ID:
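
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2834360, 2836861)
print(length, protein_length(length))  # 2502 833
```

Under these assumptions the values agree with the Gene Length (2502 bp) and Protein Length (833 aa) fields.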


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGCCG CCCTCCGAGA CGAGCTGTTC GCCTTGCCGC GCCGCACCAT CGTGCGCAGC 
AAGCGCTCCC TCATGGCCAG TTGTGCGATT CCGTTGGCTG CCCTGCTGCT GGCAGGCGCG
GCCGACCTCG CGTCGATCTC GTCGGCGGAA GCGCAGAGCT ACACCTATAA TCCGCGCCCG
GCCCGGCCGC GTCCGCCCCA GACGGGGACG GACGGGCAGA TGCTCGTCCA GGCGACAGAG
GTCAACTACG ACTACAACAA CCAGCGGGTG TCCGCCGTCG GCAACGTCCA GATGTTCTTC
AACGGAACCA GTGTCGAAGC CGACCGCCTG GTCTACGACC AGAACACCAA GCGTCTGCGC
GCCGAAGGCA ACGTCCGAAT GACGGACGCG ACGGGCAAGA TCACCTACGC CAATATGCTC
GATCTGAGCG ACGACTACCG CGACGGTTTC GTCGATTCGC TGCGCGTCGA CACCGCCGAA
GACACCCGTA TCGCAGCGTC GCGCGCCGAT CGCACCGACG GCGACTACAA CGTGTTTCAG
AACGGCGTGT ACACCGCCTG CGCGCCGTGC CGGGACGATC CGAAGAAGCC GCCGCTGTGG
CAGGTCAAGG GCGCCCGGAT CATCCACGAT CAAGTCGACA AAATGCTGTA TTTCGAGAAC
GCCCAGCTCG AATTCTTCGG CGTTCCGATG GCCTATCTGC CGTATTTCTC GACGCCCGAT
CCGACGGTGA AGCGTAAGAC CGGCTTCCTG ATGCCGTTCT ACACCACCAA CACGACCTTC
GGGATGGGCT TCGAAATCCC GTTCTATTGG GCGCTCGCAC CGGATTACGA CGTCACGCTG
ACGCCCCGGA TCACGACCAA GCAGGGCGTG CTGATGCAAG GCGAGTTCCG GCAGCGATTG
ATGGACGGCG CCTATCAGAT CCGCGCCTAT GGCATCAGCC AGTCCGACCC CGCGGCGTTC
GGAAGCGCGC CGGGCAATCG CTCGTTGCGT GGCGGCGTCG ACACCAAAGG CGAATTCGCG
CTGAACGACA AATGGGTGTT CGGCTGGGAA GGCGTGTTGC TGTCCGACCG TGCGTTCTTC
CTCGACTACA GGCTCGCGCA GTATCGCGAC AGTTGGGGCA GCTTCCTCAA CCAGAGCACC
GAAGCGACCT CGCAGATCTA TCTGAGCGGC GTCGGCAACC GCTCCTATTT CGATTTGCGG
GCGATTCACT ATCTCGGCTG GGCATCGGCT GACATTCAGG GCCAGATTCC GGTCATCCAC
CCGGTGCTGG ACTATTCGAA GACTCTGGAT CGCAACATCT TCGGCGGCGA AGTCAGCTTC
AAGACCAACT TCACCAGCCT CTCCCGCCAG ACGGCGCAGT TCGACCCGAT CACGACGATC
GCGAATACGA CGAGCCTTTG TCTGAACACA TCGGCCGATC CGGCTGCGCG CATGCCGTCG
AGCTGCCTGT TGCGCGGCAT TCCGGGCACC TACACGCGTG CAACGGCCGA GGCGCAATGG
CGCCGGTCGT TCACCGACCC CTACGGCCAG ATCTGGACCC CATTCGCATC GATCCGCATG
GACGCGATCG ATTCCTCGGT CTCGAATCAG CCCGGCGTGT CGAACTATCT CCCTGTCGGA
GACACCCAGG CGTTCCGGGT GATGCCGACC ATCGGTCTCG AATATCGCTA CCCCTTCATC
AACGTGCAGC CCTGGGGCAC CACGACGATC GAACCGATCG CGCAGGTCAT CGTTCGTCCG
AACGAAACCT ACGCGGGCAA ACTGCCGAAC GAGGACGCGC AGAGCATGGT GTTCGACACC
AGCAATCTGT TCAGCGTCGA CAAATTCTCC GGCTACGATC GCGTCGAAGG CGGCGGCCGC
GCCAATGTCG GCGTCCAGGC GACCACGCAA TTCGATCGCG GTGGCGCGAT CAACGTGCTG
TTCGGCCAGT CCTATCAGTT GTTTGGGCAG AACTCCTACG CGGTGAGGGA CACCACCAAT
ACCGGTCTCG ATTCGGGGCT CGCGACAGCC CGTTCCGACT ATGTCGGCCG GGTCTCATAT
TCGCCGAATT CGACCTACAA GTTCACCACA CGTGCGCGAC TCGACGAGGC GACGCTCGAC
GTCAACCGCT TCGAGGCCGA AGCCAGCGCG TCGTTCAATC GCTGGTCCGT CAGCGTGATC
TACGGGAACT ACGCGGCGCA ACCGGACCTC GGATTTCTGA CGCGACGTCA GGGCATCCTC
ACCACCGGTT CCATCAAGGT GGCGTCGAAT TGGGTCGTCT CGGGTGGCGC CCGCTGGGAC
CTTGAGGCAA ACCGCATCAA TCAATATATT GTTGGTGCCG GCTACGTGGA CGACTGTTTC
GTGATGGCCG TGAACTATGT CACCGGCTAT TCGTACGCCA ACTACGGAAC GACGCCGACG
CTGAATCATT CGGTCATGCT GCAGATCGGA CTTCGGACCA TCGGCATGGG CGCGATGCAA
CAGAGCGTTT CGGGCGCGTC GAGCGGCGTT TTCGGCCAAT AG
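
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined into one string before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database itself computes or rounds GC content is not known.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGATCGCCG CCCTCCGAGA CGAGCTGTTC GCCTTGCCGC GCCGCACCAT CGTGCGCAGC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full sequence text to `clean_sequence` instead of a single line gives the whole gene as one string.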
 
Protein sequence
MIAALRDELF ALPRRTIVRS KRSLMASCAI PLAALLLAGA ADLASISSAE AQSYTYNPRP 
ARPRPPQTGT DGQMLVQATE VNYDYNNQRV SAVGNVQMFF NGTSVEADRL VYDQNTKRLR
AEGNVRMTDA TGKITYANML DLSDDYRDGF VDSLRVDTAE DTRIAASRAD RTDGDYNVFQ
NGVYTACAPC RDDPKKPPLW QVKGARIIHD QVDKMLYFEN AQLEFFGVPM AYLPYFSTPD
PTVKRKTGFL MPFYTTNTTF GMGFEIPFYW ALAPDYDVTL TPRITTKQGV LMQGEFRQRL
MDGAYQIRAY GISQSDPAAF GSAPGNRSLR GGVDTKGEFA LNDKWVFGWE GVLLSDRAFF
LDYRLAQYRD SWGSFLNQST EATSQIYLSG VGNRSYFDLR AIHYLGWASA DIQGQIPVIH
PVLDYSKTLD RNIFGGEVSF KTNFTSLSRQ TAQFDPITTI ANTTSLCLNT SADPAARMPS
SCLLRGIPGT YTRATAEAQW RRSFTDPYGQ IWTPFASIRM DAIDSSVSNQ PGVSNYLPVG
DTQAFRVMPT IGLEYRYPFI NVQPWGTTTI EPIAQVIVRP NETYAGKLPN EDAQSMVFDT
SNLFSVDKFS GYDRVEGGGR ANVGVQATTQ FDRGGAINVL FGQSYQLFGQ NSYAVRDTTN
TGLDSGLATA RSDYVGRVSY SPNSTYKFTT RARLDEATLD VNRFEAEASA SFNRWSVSVI
YGNYAAQPDL GFLTRRQGIL TTGSIKVASN WVVSGGARWD LEANRINQYI VGAGYVDDCF
VMAVNYVTGY SYANYGTTPT LNHSVMLQIG LRTIGMGAMQ QSVSGASSGV FGQ
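
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # remove block spacing and newlines
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGATCGCCG CCCTCCGAGA CGAGCTGTTC"))  # MIAALRDELF
```

The output matches the first block of the protein sequence, and translating the final codons `TTC GGC CAA TAG` gives `FGQ` followed by a stop, which matches its last three residues.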