Gene RPB_0721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0721 
Symbol 
ID3910017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp808768 
End bp810885 
Gene Length2118 bp 
Protein Length705 aa 
Translation table11 
GC content64% 
IMG OID637882613 
Productcatalase 
Protein accessionYP_484343 
Protein GI86747847 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0753] Catalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACCA AGAAGTCCTC CCCCAAGAAT TCCGATACGA CCTCCGCGAC CATCGCCGAT 
GCCAGTATCC AGCGCGGCCA GGGCGGCGAG ACCCATCAGG TCGCCGGCGG CAAGACGCCG
GTCCTGACCA CCCGCCAGGG TGTTCCGGTC AGCGACGATC AGAACAGTCT GAAGCTCGGC
CTGCGCGGGC CGACGCTGAT GGAGGATTCT CACTTCCGCG AGAAGATCTT CCACTTCGAT
CACGAGCGGA TTCCGGAGCG CGTGGTGCAT GCCCGCGGCT TCGGCGCGCA CGGCTTCTTC
GAGGCCTATG AATCATTGGC CGATATCACC CGCGCCGACA TCTTCCAGCG CGCCGGCGAG
AAGACGCCGG TGTTCGTGCG GTTCTCGACG GTCGCCGGCA ACAAGGGCTC GGCGGATCTG
GCCCGCGACG TGCGCGGCTT CGCGGTCAAG TTCTACACCA AGGAAGGCAA TTGGGACCTG
GTCGGCAACA ACATCCCGGT GTTCTTCATC CAGGATGCGA TCAAGTTTCC GGACCTGATC
CACGCCGCCA AGCAGGAGCC CGACCGCGGC TTTCCGCAGG CGCAGACCGC GCACGACAAT
TTCTGGGACT TCATCTCGCT GATGCCGGAA TCGATCCACA TGGCGCTGTG GATCATGTCC
GATCGCACCA TCCCGCGTTC GTTCCGCTTC ATGGAAGGTT TCGGCGTGCA CAGCTTCCGG
CTGATCAATG CCGACGGCAA GTCCACCTAC GTCAAGTTCC ACTGGAAGCC CAAGCTCGGC
CTCCAATCGG TCGCCTGGAA CGAGGCGGTG AAGCTCAACG GCGCCGATCC GGATTTCCAT
CGCCGGGATC TGTGGGACGC GATCCAGACC GGCAACTACC CGGAATGGGA GCTCGGCCTG
CAACTGTTCG ACGACGCCTT CGCGGACGAA TTCGACTTCG ACATTCTCGA CGCCACCAAG
ATCATCCCGG AAGAGCTGGT GCCGATCCGT CGCGTCGGCC GGCTGGTGCT GGACCGCACC
GTCGACAATT TCTTCAACGA GACCGAGCAA GTCGCGTTCC AAACCGGCAA CATCGTCCCC
GGCATCGACT TCACCAACGA TCCGCTGCTG CAGGGCCGCA ACTTCTCCTA TCTCGACACG
CAATTGAAGC GGCTCGGCAG CCCGAACTTC AACGACCTGC CGATCAACGC GCCGAAGTGC
CCCTTCCACA CCTTCCAGCA GGACGGGCAT ATGACGCTGA ACAATCCCAA GGGCCGCGCC
AACTACGAGC CGAATTCGTG GGGCGCCGAG GGCGGACCGC GCGAGACGCC GAAGGGCTTC
CAGACTTTTC CGGAAGAAAT CACCGGCGAG AAGCGGCGCG CGCGCGCCGA ATTATTCGCC
GATCACTACA GCCAGGCGCG GCAGTTCTAC ATCAGCCAGA CCGACATCGA GCAGACCCAC
ATCAAGGATG CATTTGTCTT CGAACTGAGC AAGGTCGAGC GGCCCGACAT CCGCACGCGC
GTGCTGTCGC ATCTGGTCAA TGTCGATGAA GGTCTCGCCG CAATGGTCGC CGACGGACTC
GGCATGGACG TGCCGGACGC TGCCGAACCG GCACGCGAGA TCATCGCCGA TCTCGAGCCG
TCGCCCGCGC TCAGCATCCT GATGAACGGA CCGAAGGACT TCTCGGGCCG CAAGATCGGC
GTGCTGGTCA GCGACGGTAC CGATGCCAAG CTGCTCGCGG CGCTGCAGGC CGCAGCCGCG
AAGCACGATG TGCTGATCGA ACTCGTGGCG CCGAAGGTCG GCGGATTCGA AACTTCAGAT
GGCGAGCGGA TGCCCGCCAA GCAGAAGATC AATGGCGGGC CGTCCGTCCT GTATGACGCG
GTCGCGATCC TCGTCTCCGA AGAGGGCGCG GCGCTGCTGC AGGGCGAGGC GACGGCACGC
GACTTCGTCG CCGACGCCTT CGCGCACGCC AAGTTCATCG CCTATGTCGA CACCGCGCAG
CCGCTGCTCT ACAAGGCCGG CGTCGAACCG GACGGAGGCT TCCTCGCCTT GAGCAAGCCG
GCGGATGCCG AGCGCTTCAT CAAGCTCTGC GGCAAATTGC GGTTCTGGGA GCGCGAAGCG
TCCGTTCACG CGGTGTAG
 
Protein sequence
MATKKSSPKN SDTTSATIAD ASIQRGQGGE THQVAGGKTP VLTTRQGVPV SDDQNSLKLG 
LRGPTLMEDS HFREKIFHFD HERIPERVVH ARGFGAHGFF EAYESLADIT RADIFQRAGE
KTPVFVRFST VAGNKGSADL ARDVRGFAVK FYTKEGNWDL VGNNIPVFFI QDAIKFPDLI
HAAKQEPDRG FPQAQTAHDN FWDFISLMPE SIHMALWIMS DRTIPRSFRF MEGFGVHSFR
LINADGKSTY VKFHWKPKLG LQSVAWNEAV KLNGADPDFH RRDLWDAIQT GNYPEWELGL
QLFDDAFADE FDFDILDATK IIPEELVPIR RVGRLVLDRT VDNFFNETEQ VAFQTGNIVP
GIDFTNDPLL QGRNFSYLDT QLKRLGSPNF NDLPINAPKC PFHTFQQDGH MTLNNPKGRA
NYEPNSWGAE GGPRETPKGF QTFPEEITGE KRRARAELFA DHYSQARQFY ISQTDIEQTH
IKDAFVFELS KVERPDIRTR VLSHLVNVDE GLAAMVADGL GMDVPDAAEP AREIIADLEP
SPALSILMNG PKDFSGRKIG VLVSDGTDAK LLAALQAAAA KHDVLIELVA PKVGGFETSD
GERMPAKQKI NGGPSVLYDA VAILVSEEGA ALLQGEATAR DFVADAFAHA KFIAYVDTAQ
PLLYKAGVEP DGGFLALSKP ADAERFIKLC GKLRFWEREA SVHAV