Gene RPB_2531 details

Gene Information

Locus tag: RPB_2531
Symbol:
ID: 3910320
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2890103
End bp: 2892334
Gene Length: 2232 bp
Protein Length: 743 aa
Translation table: 11
GC content: 61%
IMG OID: 637884430
Product: hypothetical protein
Protein accession: YP_486147
Protein GI: 86749651
COG category:
COG ID:
TIGRFAM ID:
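
The coordinates and lengths listed above are mutually consistent if Start bp and End bp are taken as 1-based, inclusive positions and the final codon is a stop codon. Both points are assumptions, not statements from the record. A minimal sketch of the arithmetic, with illustrative variable names:

```python
# Values copied from the record above.
start_bp = 2890103
end_bp = 2892334

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp;", protein_length, "aa")
```

Under those assumptions the script reproduces the listed Gene Length (2232 bp) and Protein Length (743 aa).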


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCCA AACCACCATC CATGCAGCGA TCGCGCAGTC AACTCGCCAC GGCCTATGCG 
CCTCATAGCC TCTTCACGTT CGAAGGCGGC TCTGGGGCCT GCATGGCGCT TCCTAGCCCA
GGGAACCGGG CTGCCGGAGA CGAGCTGAGT CCGAACACAC AGAAGATGAT CTCCGAACAG
ATCCAAGAAT ACTTCGAGGC GTGGGCATTA AGGGCGGCAC GCGGGATCGG ACTTCGGCAT
CCGGTCCCCT TGAACCTGGC CGTCGACGGC CGGGTCGTCA GCGACGAGCG GGTGCGGGTG
CGCTCTGGAG AATTCGCATT CCAAATTCCA GATCATGTCG GCTACGTCCC GTTCCCCCTT
TCGTTCGTCT GCACCCGTTG TAATCTGCAT ATATCGGTAG ATCGGGTGGA ACGTCTGACC
GACGAAGCAT CGCGCTTCCG GACCGCATGC CCGTCGGGTG CGGATAGCTG TGCGGACAAT
TGGCAGCAGC TCGACGTCGT TTTTGCTCAC TGGTCGGGAA GCGTAGAGCC GATCACGCCG
AGCCATCGAC GGTGGCAGAA CGGTGTCGTG CAGGACTACG ATCGATGTGA AAATTGCAAT
GAGGATCGCT TCTACCTGCA TCGTCCACCC GGACGACTCG CGAACTGGGT GTTTCAATGC
GTCGAGTGCA AACTTCCGAG GCCTGTCCAA CAGCGGGATC TGACGACGCT TGAGGAGCTC
GGCCCCCTGA TCGCACAAAA CCAGGCACTG CCCGCCGAAA TAAACATGGA GCCGGTCTCC
TACAGAGCCT CGGCTGCCTA TTACGTTCAT GGCGACCGGC TGCTGGTCTT CGATCAGGAA
CGGTACATCC AGCTGCTGAA CTCAACCAAT ACGGGTCCGC TCGTAGCGCT TCTTTCCAGC
GAATACGGCT ATCCGCCCAC CGCGATCGAT GATGCGGAAA AAGAACGTCT CCTGCTGGCG
GCCGGTCAGG GCGATAAGTG GAACCGCTAT CGGCAATTTC GAAGCTTCCT TCCGACGCTT
GAGGCGATGA CCCCCTCTCC GGTCGACATG ATCAACGAGC AGCGAGCGTA TCTCGCCGAA
ATGGACGCCG ACTGGAACAG GACGGTTTTC GCGGGCCATC AGCAGGCCGT GGATGGAATC
GAGCGTGCCG GCCGCGAGCG CACGGAATAT GTCCGCCGTT TCGATCCTGT GCGCATGGCG
CTCGAACACA AGACCCTCAC CGAAGAGCGT CTGCGAGGGC GGCAGATGCC TGACGGCAAG
GATGTGTCGG TAGATCTTAC GATCCTCGAC GACTTCCTGT TTCCCGACGA CCTTGGGGCT
GACGAGCGCT CGCGACTCCT CGATCAAGTC CGTATACGCC GAGACCTGCT GGGGATGGCC
GAGTTGCGCC TCGTCCGAGA CGTCAGGGTG TGCGAATATA CGTTCGGCTA CACGCGGACA
AGTTCGCTGC CGACGGTCCA ACGCGACAAG TCTGGCACAG CTGAATTGCC GGTACGCCTG
CGGCTGTTCG ACCGAGTGCA GGTCGGTGAT CGCGGCGTCC ATCCAATACT CTGCCTCGTA
CAGTCCAACG AGGGCTTCTA CATCCGCCTC GACGAGGAAT GCGTTCTGGA GTGGTTGGAG
GCGAACGGCA TCCGCCCTGC CCCCGCCGCT CCAGGCGTCC GGCTGGGCGG ACGCCTGATC
GAGGAATACG CGCAGATGCA GGCAAACGAA GACGTCCGCT TCTCGCGCTT TCTTGACGAA
TATCGGCGCG AGCGCAGCGT GCCGCGCCTC GCCTATCCGT ACGTCTACAC GCTTCTCCAC
ACGATGGCTC ACCACCTCCT CGGGACATCG TCCTCGATGT CGGGCCTCGA TCTCGGCTCG
TTCGGCGAGC ACATCTTCGT GCCCGACCTA GCGTTCCTAG TCTATCGGCG GGGCATGACC
ATGGATCTCG GCAACCTGTC GTCTATGTGG CGCGACCGCG GAGATCCATG GTTCGGCAAC
GAGGTCCTCG AACGCATGGT CGATCCAGCC AGTCTTCGGT GCGGCTCCGA GAGCGTGTGC
AACCATCGCG GTGGCGCATG TCCAGACTGC CTGCTGATCC CCGAGAGCGC ATGTCTGACC
CGGAACGAGT TGCTGTCGCG CTCAGTATTG ATAGGCCGCG GTCAGCCGCG CTGGGACGCC
GAAGGCCGCG CGTTGATCGG ATACTACGAC GTTGCCCGTA GACGAGCGGC AGCGGCCCCG
GTCGATCCAT GA
 
Protein sequence
MTAKPPSMQR SRSQLATAYA PHSLFTFEGG SGACMALPSP GNRAAGDELS PNTQKMISEQ 
IQEYFEAWAL RAARGIGLRH PVPLNLAVDG RVVSDERVRV RSGEFAFQIP DHVGYVPFPL
SFVCTRCNLH ISVDRVERLT DEASRFRTAC PSGADSCADN WQQLDVVFAH WSGSVEPITP
SHRRWQNGVV QDYDRCENCN EDRFYLHRPP GRLANWVFQC VECKLPRPVQ QRDLTTLEEL
GPLIAQNQAL PAEINMEPVS YRASAAYYVH GDRLLVFDQE RYIQLLNSTN TGPLVALLSS
EYGYPPTAID DAEKERLLLA AGQGDKWNRY RQFRSFLPTL EAMTPSPVDM INEQRAYLAE
MDADWNRTVF AGHQQAVDGI ERAGRERTEY VRRFDPVRMA LEHKTLTEER LRGRQMPDGK
DVSVDLTILD DFLFPDDLGA DERSRLLDQV RIRRDLLGMA ELRLVRDVRV CEYTFGYTRT
SSLPTVQRDK SGTAELPVRL RLFDRVQVGD RGVHPILCLV QSNEGFYIRL DEECVLEWLE
ANGIRPAPAA PGVRLGGRLI EEYAQMQANE DVRFSRFLDE YRRERSVPRL AYPYVYTLLH
TMAHHLLGTS SSMSGLDLGS FGEHIFVPDL AFLVYRRGMT MDLGNLSSMW RDRGDPWFGN
EVLERMVDPA SLRCGSESVC NHRGGACPDC LLIPESACLT RNELLSRSVL IGRGQPRWDA
EGRALIGYYD VARRRAAAAP VDP
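
As a sanity check, the first and last codons of the gene sequence can be translated and compared with the protein sequence. The sketch below assumes the standard amino acid assignments, which translation table 11 shares with table 1 (the two tables differ in which codons may act as starts). The `translate` helper is hypothetical, written only for illustration, and handles just the unambiguous bases A, C, G and T.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
# Assumption: table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# First line of the gene sequence, copied with its spacing.
first_line = "ATGACTGCCA AACCACCATC CATGCAGCGA TCGCGCAGTC AACTCGCCAC GGCCTATGCG"
# Last line of the gene sequence.
last_line = "GTCGATCCAT GA"

print(translate(first_line))  # compare with the start of the protein sequence
print(translate(last_line))   # compare with the end; '*' is the stop codon
```

The first call yields the same 20 residues that open the protein sequence (MTAKPPSMQR SRSQLATAYA), and the second yields VDP followed by a stop, matching its end.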