Gene RPC_0046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0046 
Symbol 
ID3971433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp52957 
End bp54102 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content70% 
IMG OID637923160 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_529944 
Protein GI90421574 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACGCC GGGCAAGGAA AACGCCGTAT TGGCGGCAAA CGCAAGGGCT CAATTTGGCT 
ACCGATACAG CGTCTCTGGT GCTGGGGATC GAGACCACTT GCGACGAAAC CGCGGCCGCC
GTGGTCGAGC GCCGCAGCGA TGGCAGCGGC CGCATCCTGT CCAACATCGT GCACTCGCAG
ATCGAGGATC ACGCGCCGTT CGGCGGCGTG GTCCCCGAGA TCGCGGCGCG GGCGCATGTC
GACCTGCTCG ACGGCATCAT CGCGCGTGCG ATGCAGCAGG CCGGCCTCGG CTTCAAGGAT
CTTTCGGGCG TCGCCGCCGC CGCCGGGCCC GGCCTGATCG GCGGCGTCAT CGTCGGCCTC
ACCACCGGCA AGGCGATCGC GCTGGTGCAC GATACGCCGT TGATCGCGGT CAACCATCTG
GAAGCCCACG CGCTGACGCC GCGGCTGACC GACGCGCTGC AATTCCCCTA TTGCCTGTTT
CTCGCCTCCG GCGGCCACAC CCAGATCGTC GCGGTGCTCG GCGTCGGCAA CTACGTCCGG
CTCGGCACCA CCGTCGACGA CGCGATGGGC GAGGCCTTCG ACAAGGTCGC CAAGATGCTC
GGGCTGCCCT ATCCGGGCGG GCCGCAGGTC GAGCGCGCCG CGGCGGCCGG CGACGCTGCG
CGCTTTGCGT TTCCGCGGCC GATGCTGGGC CGCGCCGACG CCAATTTTTC GCTGTCCGGT
CTGAAGACCG CGGTGCGCAA CGAGGCCAGC CGGCTATCGC CGCTTGAGCC GCAGGACGTC
AACGATCTGT GCGCTGGATT CCAGGCCGCG GCGCTGGAAT CCACCGCCGA CCGGCTGCAT
GTCGGCCTTC GGATATTTCG CGAGCGGTTC GGCGCGCCGC ACGCGCTGGT CGCCGCCGGT
GGCGTCGCCG CCAATCAGGC GATCCGCGGC GCGTTGCAGC AGGTGGCCTT GGCCGCCGGC
ACTCAATTCA TGATCCCCCC GCCGGCGCTA TGCACCGACA ACGGGGCGAT GATCGCCTGG
GCCGGCGCCG AACGGCTGGC GCTGGGGTTG ACCGACAGCC TCGAATTCGC GCCGCGGGCG
CGCTGGCTGC TCGACGCCAA CGTCATCACG CCGGCGCAAT TCGCCAACAC CCGCGCGGGC
TTCTAG
 
Protein sequence
MLRRARKTPY WRQTQGLNLA TDTASLVLGI ETTCDETAAA VVERRSDGSG RILSNIVHSQ 
IEDHAPFGGV VPEIAARAHV DLLDGIIARA MQQAGLGFKD LSGVAAAAGP GLIGGVIVGL
TTGKAIALVH DTPLIAVNHL EAHALTPRLT DALQFPYCLF LASGGHTQIV AVLGVGNYVR
LGTTVDDAMG EAFDKVAKML GLPYPGGPQV ERAAAAGDAA RFAFPRPMLG RADANFSLSG
LKTAVRNEAS RLSPLEPQDV NDLCAGFQAA ALESTADRLH VGLRIFRERF GAPHALVAAG
GVAANQAIRG ALQQVALAAG TQFMIPPPAL CTDNGAMIAW AGAERLALGL TDSLEFAPRA
RWLLDANVIT PAQFANTRAG F