Gene RPB_2541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2541 
Symbol 
ID3910330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2908938 
End bp2911190 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content63% 
IMG OID637884439 
Producthypothetical protein 
Protein accessionYP_486156 
Protein GI86749660 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTCC GGAAATGGCG CGGGGGGCAG ACCGACCCTG CCGCTAGAGC TTTGCAGCCC 
ACCATGACGG TCGAGATCAG GCCAGAAATC CGGATCGAAG GCTTGCGGCC TGGTTCTTCA
AGTCTGCGTC CGGACGCGCG CTGGATCGGC GCAGGGGATA CGACTGAGGT CCGTGGCGCT
AAGATCCGGG CTGGTCTGTT CTACCTCGGT AGTGCTGTGG CGCTGAGGGA TGGTCGCTCG
ACGGACCAAT ACGTCGTCAA TCCCAAGCTG CCGGTGGCGG CGCCGCCAGA CGTGAGCGGT
TCCTCGATGC CGTATTGGCC ATCCTACGCG GACATCCCGC CGCGGGCACG GCGAGCGTAC
ATCGACTGGC TGGCTGGCGG ACGACGCGAC CCTACCTACG GCGTAGGTTA CGTTTTCCTG
TTCTTCTATG GTCTTGAGCA TCGACTGTTC ATCGATCGGG ACAGGGCATC GACGCCCGCG
GTGGTCCAGG AGGTGGAAGA GCTCCTGTCG GTCTACGGCC ATAGCGGATC GCTGCGCCGT
TATGCGGCCG AATTTTTGAA CTGCGCCCGC ATCGCCGCGG GCATTCCCCT GGCGCTTCCC
ATACCGAAGC CGGAGCATGA AAATTCCTCG GAGATGAACG CTGCGGTTCG GCTACACCTC
GGGAAAGTGC TGTCCCAGTC GAAGGCGGTC GGCTCAGAGG ATGCATTGCT GTGGGTGCTG
GCACTGCCTG ATGTGTATCT GAGGACGCCG GCTGTCCGAT GCTTCACCGA GTTCGTCCGA
CTTTGGAAAG TGTGGTTCGC GGCGAGGTAC CCGAACGGCA TCTCCGCCAA GCCGGCGGCC
AACATTGTTA TCCACTACAA GGCGGCGAGC GGCGCGTTCG AAGTACCGAT TTCCGGACCT
CATGAGCAAC TTCCGGACCC GATCCAGTCT CCTCGATCCG GAGCCGAACT CAAAAAGTTG
GTCGAGGAAG TCACCGCCGA ACTCGAGCCG TTCAGCCGCT ACGTCGGCCG CAAGCCCGAC
GAGCGCAACT CGATGCGCGC GGCGTTGCTG CTACCGAAGG AGCTTCAGAC CGATCCGCAG
GATGGCCTGG TCGCGAAGTT CGGCCACCGG ATGGCGGAGA TCATGGGCGA GCACAAGCGC
GCCAGCACGA AGATGAGCAA GATGTTGGCC ATTGCCGGAC TTGAGCTTCC TGTCGGCAAG
ATCACGCCTG GGATGGCGGA CCAGTTGGGT GGGCTGCTGG ACCAGGTCGA CATCGCGATC
GAGCCGGATC GTCGTTACGG GAGCGGCGTA CCGCAGCTCG ACGACCAGGT GATCGTCTTC
AAGGCGCCGA AAGGCGGCCC CGTCGATCAC CAAAGGCCGG CTTATCGCGC TATGAAGGCC
CAGATCGAGG TTGCGGTTCT GGCAGCGGCG GTCGACGGCG AGTCATCGCA CGACGAACTG
CAGCGGATCG TCGACGGCAT TCGGGCTGAG AAAGACCTTG GCGGGATCGA ACAGGCTCGG
CTGATCGGAT ACGCCGTCAC CGTCTTCAAC AGCCCGCCCA AGCAGGCGCG GGTGATGCGG
AAGTTGGCGG AGCACAGTCC CGCCGAGCGC GAGGCGATCG CGAGAGCGGC CCTGACGGTG
GTCGGAGGCA ACGAACACGT CGGGCCAGAA GAAGTGAAAT TCCTCGAGCG GCTGCACAAG
GCGTTGGGGT TGCCGAAGGA GCGGGTCTAC AGCGAACTTC ATCGGGCTGC GGCCACAGCG
TCACCTTCGG ACGAGCCGGT GGCGCTGACC GAAGAGAAAC GGGTTGCCGG CATTCCCATT
CCGCCCCAGG CGGAAGTGTC TTCGATCCCC GAGCCTCCGG CGCCGATCGC CGCCGAAACG
TCTGGGGGTA TCCAGATCGA CGCGGAGAAG TTGGCCAGGA CGCAGCGAGA AACGCAGGCG
GTGGCCAAAC TGCTCGCCGA CATCTTCACC GACGAAGCCC CAGAGGCAGA GCCTGTGTCA
CAGGCCTCTG CGACTGGGCG GTCGGCTTTC GAAGGGCTGG ATACCGCCCA TGCCGAGCTC
GTCGAGATGC TAGAGATCAA GGGGACGGTG CCGCGCGCCG AGTTCGACCA GCGCGCGAAA
GAGATGAAGC TTCTGCCCGA GGGGGCCATC GAGCGCATCA ACGAGTGGTC CTTCGACCTG
TTCGATGAAG CGCTGATCGA GGTTGGCCAT GACGTTACGA TAGCGCCGGA CCTGCGCGAG
CGGTTGGCCG AATTGAGAGA GAATGCGGAA TGA
 
Protein sequence
MAVRKWRGGQ TDPAARALQP TMTVEIRPEI RIEGLRPGSS SLRPDARWIG AGDTTEVRGA 
KIRAGLFYLG SAVALRDGRS TDQYVVNPKL PVAAPPDVSG SSMPYWPSYA DIPPRARRAY
IDWLAGGRRD PTYGVGYVFL FFYGLEHRLF IDRDRASTPA VVQEVEELLS VYGHSGSLRR
YAAEFLNCAR IAAGIPLALP IPKPEHENSS EMNAAVRLHL GKVLSQSKAV GSEDALLWVL
ALPDVYLRTP AVRCFTEFVR LWKVWFAARY PNGISAKPAA NIVIHYKAAS GAFEVPISGP
HEQLPDPIQS PRSGAELKKL VEEVTAELEP FSRYVGRKPD ERNSMRAALL LPKELQTDPQ
DGLVAKFGHR MAEIMGEHKR ASTKMSKMLA IAGLELPVGK ITPGMADQLG GLLDQVDIAI
EPDRRYGSGV PQLDDQVIVF KAPKGGPVDH QRPAYRAMKA QIEVAVLAAA VDGESSHDEL
QRIVDGIRAE KDLGGIEQAR LIGYAVTVFN SPPKQARVMR KLAEHSPAER EAIARAALTV
VGGNEHVGPE EVKFLERLHK ALGLPKERVY SELHRAAATA SPSDEPVALT EEKRVAGIPI
PPQAEVSSIP EPPAPIAAET SGGIQIDAEK LARTQRETQA VAKLLADIFT DEAPEAEPVS
QASATGRSAF EGLDTAHAEL VEMLEIKGTV PRAEFDQRAK EMKLLPEGAI ERINEWSFDL
FDEALIEVGH DVTIAPDLRE RLAELRENAE