Gene RPB_2749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2749 
Symbol 
ID3910542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3132596 
End bp3134113 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content67% 
IMG OID637884649 
Productproline-rich region 
Protein accessionYP_486362 
Protein GI86749866 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.174945 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.312279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATC GATATCAGGA CCGACCCTAC CTGGCGGAAG GCGAGGCCCA CGCAGCTTAC 
GCCAAGCGCA ATCCCGAGAA TGACCCGCTC GCTGAACTCG CACGATTGAT CGGTCAGACC
GATCCGTTCG GGCAGGAAGC ACCGCCATCG GGCCGGCCGT CGTCGCGTTC GACCGACTTT
CGTCTGAGCA GCCCGCCGCC GATCGAAGAC GAGGTGCCGC CGCCGACCCC CTCATGGTTG
CAGCATCGTC GTGCTGCCGA TCCAGCACCG TTGTCGCCGC CCGAACCCGA GCCTGACTTC
AGCAGGCCGC CGTCCTTCGT CACAGCCGCA TCGCCCCGGC TGGCCGACCC CGTCTATGAT
CCAGCGCCGT TCGATCGGCA ATCGTTCGAT CAGGTCGCCG AAGAGCCGAA TAATTACGAC
CCTCACTATG CTGTGCAGCA GCCGTTGCCG TTGGAGCCGC CGCAATTCGT ACCCGGCCGC
TACGACGATG CGCTGTATGG CCAGCTCGAT CCCGCCGACG TTCGTCCGGA TCCGAACTAT
CCCGAGGCGC CTTATGGCTA CGATGACGGA TATGCCGACG AGCCGGACAG TCGGGCGTAC
AAGCCCCGTC GCAACAACAT GATGACTGTG GCCGCTGTGT TGGCACTCGC CGTTGTCGGC
ACCGGCGGCG CGTTCGCCTA TCGCAGCTTC ACCAGCGGCC CCCGCACCGG CGAACCGCCG
GTGATCAAGG CCGACGCCAG CCCGACCAAG GTGATGGCTG CGCCGTCGGC CTCCGCAGAT
GCTGCAGGCA AACCGATTCA GGACCGGCTC GCCGCCGGCA ACAACATCGA AGCACTGGTG
TCCCGCGAAG AGCAGCCCGC CGACCCGTCG CGTGCTGGGC AGGGTACGCG CGTGGTGCTG
CCGCAACTCA ACCAGAATCC GAATCCGCCC GCGGTGTCCG CCGTTGCGCC CGGGCCGAAG
CCCAACCTTC CGCCGCCCAA CAACGGCACG ATTGCCGGCG AGGAGCCTCG GCGGATCAAG
ACCTTCAGCA TCCGTCCCGA CCAGGGCGAT CCGGGCGCGG CGCCGGTCAA TGTGGCTGCT
CCTGCGACCC GGCAGGCGTC CCGCGCGCCG GCTCCGACGG CGCCTGCGCA GCGCCCGGCA
GCGCGCCAGC TTGAAGATGC CAATGCTTCC GCGGGCAATA CGCCGCTGTC GCTGGCGCCG
AATTCAGGCG GGTCGCCAGC TGCCAACCAG CGCGTCGCAG CGCTGCCGCC CACGGAATCG
GCCGGCGCGG GAGGCTATGT GGTGCAGGTG TCGTCGCAGC GCAGCGAGGC CGACGCGAAA
TCGTCGTATC GGACGCTGCA GGGTAAGTTC CCGTCCGTGC TCGGCCAGCG CGCGCCGTTG
ATCAAGCGCG CCGATCTCGG CAGCAAGGGC GTGTATTATC GCGCCATGGT CGGCCCATTC
GGCAGTTCCG AAGAGGCGTC GAGACTCTGC GGCAACCTGA AAAGTGCCGG CGGACAGTGC
GTCGTCCAGA GGAATTAA
 
Protein sequence
MTDRYQDRPY LAEGEAHAAY AKRNPENDPL AELARLIGQT DPFGQEAPPS GRPSSRSTDF 
RLSSPPPIED EVPPPTPSWL QHRRAADPAP LSPPEPEPDF SRPPSFVTAA SPRLADPVYD
PAPFDRQSFD QVAEEPNNYD PHYAVQQPLP LEPPQFVPGR YDDALYGQLD PADVRPDPNY
PEAPYGYDDG YADEPDSRAY KPRRNNMMTV AAVLALAVVG TGGAFAYRSF TSGPRTGEPP
VIKADASPTK VMAAPSASAD AAGKPIQDRL AAGNNIEALV SREEQPADPS RAGQGTRVVL
PQLNQNPNPP AVSAVAPGPK PNLPPPNNGT IAGEEPRRIK TFSIRPDQGD PGAAPVNVAA
PATRQASRAP APTAPAQRPA ARQLEDANAS AGNTPLSLAP NSGGSPAANQ RVAALPPTES
AGAGGYVVQV SSQRSEADAK SSYRTLQGKF PSVLGQRAPL IKRADLGSKG VYYRAMVGPF
GSSEEASRLC GNLKSAGGQC VVQRN