Gene RPD_0470 details

Gene Information

Locus tag: RPD_0470
Symbol:
ID: 4020938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 542016
End bp: 543185
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 70%
IMG OID: 637960657
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_567609
Protein GI: 91974950
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
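
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
gene_len = span_length(542016, 543185)
print(gene_len, protein_length(gene_len))  # prints: 1170 389
```

Both results agree with the Gene Length and Protein Length fields in the record.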


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.153158
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGCAT TCCGCGACCC CGCAATGCTA CGCGAAGCAA GGAGAACGCT GGTTCGGCCG 
CAAACGCAAG GGCTCAATTT GACTAGTGAA CAAACCTTGC TGGTGCTCGG AATCGAGACA
ACCTGCGATG AAACCGCCGC CGCCGTGGTC GAGCGCCGCG CCGACGGCAG CGGCCGGATT
CTCTCCAACA TCGTGCGCTC GCAAATCGAC GAACACGCGC CGTTCGGTGG CGTTGTACCG
GAGATCGCCG CGCGGGCGCA TGTCGATCTG CTCGACGGCA TCGTCGCAAA TGCCATGCGG
GAAGCGGGAA CCGGTTTTGC GCAGCTGTCG GGCGTCGCGG CGGCGGCAGG GCCGGGGCTG
ATCGGCGGCG TCATCGTCGG GCTGACCACC GCGAAGGCGA TCGCGCTGGT GCACAACACG
CCGTTGATCG CGGTCAATCA CCTCGAGGCG CACGCGCTGA CGCCGCGGCT GACGGATGCG
ACCGAGTTTC CTTACTGCCT GTTCCTCGCC TCCGGCGGCC ACACCCAGAT TGTCGCCGTG
CTCGGCGTCG GCGACTACGT CCGGCTCGGC ACCACGGTGG ACGATGCGAT CGGCGAGGCG
TTCGACAAGA TCGCCAAGAT GCTGGGGCTG CCTTATCCGG GCGGGCCGCA GGTCGAGCGC
GCAGCGGCGT CCGGCGACGC CGCGCGGTTC GCGTTCCCGC GGCCGATGCT GGGGCGGCCC
GACGCCAATT TCTCGCTGTC CGGCCTCAAG ACCGCGGTGC GCAACGAGGC CAGCCGGCTG
ACGCCGCTGG AGCCGCAGGA CATCAACGAT CTGTGCGCCG GCTTCCAGGC CGCGGTGCTG
GACTCGATGG CCGACCGGCT GGGCGCCGGG CTGCGGTTGT TCCGCGAGCG CTTCGGTGCG
CCGAAGGCGC TGGTCGCGGC CGGCGGGGTC GCCGCAAATC AGGCGATCCG CCGGTCTTTG
CGGGAGGTCG CCGCGAAGGC GCAGACCACG CTGATGGTGC CGCCGCCGGC GCTGTGCACC
GACAACGGCG CGATGATCGC CTGGGCCGGC GCCGAGCGTC TCGCGCTCGG CCTGACCGAC
ACCATGGACG CGGCGCCCCG CGCCCGCTGG CTGCTCGATG CCAACGCGAC CGCGCCGGGA
AAATTCGCCA ATACGCGCGC GGGATTCTGA
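
The gene sequence above is laid out in blocks of 10 bases, six blocks per row. As a minimal sketch (the helper names are hypothetical), the blocks can be joined into one string and the GC fraction computed for comparison with the reported GC content. Only the first row is embedded here to keep the example short; the remaining rows would be passed in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGATTGCAT TCCGCGACCC CGCAATGCTA CGCGAAGCAA GGAGAACGCT GGTTCGGCCG"
row = clean_sequence(first_row)
print(len(row), round(gc_fraction(row), 3))
```

Feeding all rows through `clean_sequence` should give a string whose length matches the Gene Length field.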
 
Protein sequence
MIAFRDPAML REARRTLVRP QTQGLNLTSE QTLLVLGIET TCDETAAAVV ERRADGSGRI 
LSNIVRSQID EHAPFGGVVP EIAARAHVDL LDGIVANAMR EAGTGFAQLS GVAAAAGPGL
IGGVIVGLTT AKAIALVHNT PLIAVNHLEA HALTPRLTDA TEFPYCLFLA SGGHTQIVAV
LGVGDYVRLG TTVDDAIGEA FDKIAKMLGL PYPGGPQVER AAASGDAARF AFPRPMLGRP
DANFSLSGLK TAVRNEASRL TPLEPQDIND LCAGFQAAVL DSMADRLGAG LRLFRERFGA
PKALVAAGGV AANQAIRRSL REVAAKAQTT LMVPPPALCT DNGAMIAWAG AERLALGLTD
TMDAAPRARW LLDANATAPG KFANTRAGF
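
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below builds the standard codon table, which to the best of our knowledge carries the same codon-to-amino-acid assignments as translation table 11 (the two differ in which codons may act as starts); the `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
print(translate("ATGATTGCAT TCCGCGACCC CGCAATGCTA"))  # prints: MIAFRDPAML
```

The output matches the first block of the protein sequence listed in the record.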