Gene RPD_0223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0223 
Symbol 
ID4020681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp255039 
End bp257237 
Gene Length2199 bp 
Protein Length732 aa 
Translation table11 
GC content67% 
IMG OID637960402 
Productcatalase/peroxidase HPI 
Protein accessionYP_567364 
Protein GI91974705 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0376] Catalase (peroxidase I) 
TIGRFAM ID[TIGR00198] catalase/peroxidase HPI 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.125237 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCAA AGACAGACGA TCAGGGCGGC AAGTGCCCGT TTCCGCACGG CGGCGGCTCG 
CGGGGGCACA GGAATCGCGA CTGGTGGCCG GAGCAGCTCG ACATCAACAT GCTCCATCGT
AATTCGACCC TGTCCGACCC GCTGGGCAAG GCGTTCGACT ACGCCAAGGA ATTCGAAAGC
CTCGATCTCG ACGCGGTGAT CAAGGATCTG CACGCGCTGA TGACCGATTC GCAGGACTGG
TGGCCGGCGG ATTTCGGCCA CTACGGCGGT CTGATGATCC GCATGGCCTG GCATTCGGCC
GGCACCTATC GCACCACCGA CGGCCGCGGC GGCGCCGGCG CCGGACAGCA GCGCTTCGCG
CCGCTGAACT CGTGGCCGGA CAACGCCAAT CTCGACAAGG CGCGCCGGCT GCTGTGGCCG
ATCAAGCAGA AATACGGCAA CAAGATCTCG TGGGCCGATC TCTACGTGCT GACCGGCAAC
GTCGCGCTGG AATCGATGGG CTTCAAGACC TTCGGCTTCG CCGGCGGCCG TGCCGACACC
TGGGAGCCCG AAGAGCTGTT CTGGGGTCCG GAAGGCTCCT GGCTCGGCGA TGAGCGCTAT
TCCGGCGAGC GCCAGCTGCA CGACGCGCTC GGCGCGGTGC AGATGGGCCT GATCTACGTC
AACCCGGAAG GCCCGAACGG CAATCCTGAT CCGGTCGCCG CCGCCAAGGA CATCCGCGAG
ACCTTCGCCC GGATGGCGAT GAACGACGAA GAGACCGTGG CGCTGATCGC CGGCGGCCAC
ACCTTCGGCA AGACCCACGG CGCCGGCGAT CCGTCGCTGA TCGGCGCGGA GCCGGAGGGC
GGGGCACTCG AGGATCAGGG CCTCGGCTGG AAGAGCAAGT TCGGCACCGG CTTCGGCGCC
GACGCCATCA CCGGCGGGCC GGAAGTGATC TGGACGCAGA CGCCGACGCA GTGGAGCAAC
TTCTTCTTCG AAAACCTGTT CGGCTTCGAA TGGGAACTCG ACAAGAGCCC GGCCGGCGCC
AAGCAGTGGA AGGCCAAGGG CGCCGAGGCG ACCGTGCCGG ATCCGTTCGA TCCGACCAAG
AAGCGCGTGC CGACGATGCT GACGACCGAC CTGTCGCTGC GCTTCGACCC CGCCTACGAG
AAGATCTCGC GCCGCTTTTT CGAGAACCCG GATCAGTTCG CCGACGCCTT CGCCCGCGCC
TGGTTCAAGC TGACCCACCG CGACATGGGC CCGAAGGTGC GCTATCGCGG CAAGCTGGTG
CCGAAGGAAG ATCTGATCTG GCAGGACCCG ATCCCGCCGG TCGACCACGA GCTGGTCAGC
GCCAAGGACA TCGCCGATCT CAAGGCGCGG ATCCTCGCCT CCGGCCTGTC GGTGTCGCAG
CTGGTCTCGA CCGCGTTCGC CTCGGCCTCG ACCTATCGCC ATTCCGACAA GCGCGGCGGC
GCCAACGGCG CGCGCATCCG CTTCGCGCCG CAGAAGGATT GGGAGGTCAA CCAGCCGGCC
GATCTCGCCC AGGTGCTCGG CAAGCTCGAA GCGATCCAGA AGGCGTTCAA CGACGCGCAG
TCGGGCGGCA AGAAGGTCTC GCTCGCCGAC CTGATCGTGC TCGGCGGCTC AGCCGCGGTC
GAGAAGGCCG CCAAGGATGC CGGCACCGAG GTCGAGGTGC CGTTCACCCC GGGCCGGATG
GACGCGCTGG AAGAGCAGAC CGACGGCGAT TCGTTCAAGG TGCTGGAGCC GCGGGCCGAC
GGCTTCCGCA ACTTCATCGG CAAGCGGCAT CAGTTCATGC AGCCCGAAGA AGCGCTGGTC
GATCGCGCGC AGCTCCTCAA CCTGACTGCG CCGGAAATGA CGGTGCTGCT CGGCGGCCTG
CGCGTGCTCG GCGGCAATGT CGGCCACGAC AGCCACGGCG TCTTCACCGA CCGGCCGGAG
AAGCTGACCA ACGACTTCTT CGTCAACCTC TTGGACATGA AGACCGCCTG GTCGCTCTCG
GCCACCGCCG AAGGCGTCTA TGAAGGCCGC GACCGCAAGA CCGGCGACCT GCGCTGGACC
GGCACCCGCG TCGATCTGAT CTTCGGCTCG CACTCGCAGC TGCGCGCGCT CGCCGAAGTC
TACGGTCAGT CCGACGCCCA GACGAAGTTC GCCCAGGACT TCGTCGCCGC CTGGACCAAG
GTGATGAACG CGGATCGGTT CGACCTCGCG GCGAAGTAA
 
Protein sequence
MDAKTDDQGG KCPFPHGGGS RGHRNRDWWP EQLDINMLHR NSTLSDPLGK AFDYAKEFES 
LDLDAVIKDL HALMTDSQDW WPADFGHYGG LMIRMAWHSA GTYRTTDGRG GAGAGQQRFA
PLNSWPDNAN LDKARRLLWP IKQKYGNKIS WADLYVLTGN VALESMGFKT FGFAGGRADT
WEPEELFWGP EGSWLGDERY SGERQLHDAL GAVQMGLIYV NPEGPNGNPD PVAAAKDIRE
TFARMAMNDE ETVALIAGGH TFGKTHGAGD PSLIGAEPEG GALEDQGLGW KSKFGTGFGA
DAITGGPEVI WTQTPTQWSN FFFENLFGFE WELDKSPAGA KQWKAKGAEA TVPDPFDPTK
KRVPTMLTTD LSLRFDPAYE KISRRFFENP DQFADAFARA WFKLTHRDMG PKVRYRGKLV
PKEDLIWQDP IPPVDHELVS AKDIADLKAR ILASGLSVSQ LVSTAFASAS TYRHSDKRGG
ANGARIRFAP QKDWEVNQPA DLAQVLGKLE AIQKAFNDAQ SGGKKVSLAD LIVLGGSAAV
EKAAKDAGTE VEVPFTPGRM DALEEQTDGD SFKVLEPRAD GFRNFIGKRH QFMQPEEALV
DRAQLLNLTA PEMTVLLGGL RVLGGNVGHD SHGVFTDRPE KLTNDFFVNL LDMKTAWSLS
ATAEGVYEGR DRKTGDLRWT GTRVDLIFGS HSQLRALAEV YGQSDAQTKF AQDFVAAWTK
VMNADRFDLA AK