Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_0470 |
Symbol | |
ID | 4020938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 542016 |
End bp | 543185 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637960657 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_567609 |
Protein GI | 91974950 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.153158 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGCAT TCCGCGACCC CGCAATGCTA CGCGAAGCAA GGAGAACGCT GGTTCGGCCG CAAACGCAAG GGCTCAATTT GACTAGTGAA CAAACCTTGC TGGTGCTCGG AATCGAGACA ACCTGCGATG AAACCGCCGC CGCCGTGGTC GAGCGCCGCG CCGACGGCAG CGGCCGGATT CTCTCCAACA TCGTGCGCTC GCAAATCGAC GAACACGCGC CGTTCGGTGG CGTTGTACCG GAGATCGCCG CGCGGGCGCA TGTCGATCTG CTCGACGGCA TCGTCGCAAA TGCCATGCGG GAAGCGGGAA CCGGTTTTGC GCAGCTGTCG GGCGTCGCGG CGGCGGCAGG GCCGGGGCTG ATCGGCGGCG TCATCGTCGG GCTGACCACC GCGAAGGCGA TCGCGCTGGT GCACAACACG CCGTTGATCG CGGTCAATCA CCTCGAGGCG CACGCGCTGA CGCCGCGGCT GACGGATGCG ACCGAGTTTC CTTACTGCCT GTTCCTCGCC TCCGGCGGCC ACACCCAGAT TGTCGCCGTG CTCGGCGTCG GCGACTACGT CCGGCTCGGC ACCACGGTGG ACGATGCGAT CGGCGAGGCG TTCGACAAGA TCGCCAAGAT GCTGGGGCTG CCTTATCCGG GCGGGCCGCA GGTCGAGCGC GCAGCGGCGT CCGGCGACGC CGCGCGGTTC GCGTTCCCGC GGCCGATGCT GGGGCGGCCC GACGCCAATT TCTCGCTGTC CGGCCTCAAG ACCGCGGTGC GCAACGAGGC CAGCCGGCTG ACGCCGCTGG AGCCGCAGGA CATCAACGAT CTGTGCGCCG GCTTCCAGGC CGCGGTGCTG GACTCGATGG CCGACCGGCT GGGCGCCGGG CTGCGGTTGT TCCGCGAGCG CTTCGGTGCG CCGAAGGCGC TGGTCGCGGC CGGCGGGGTC GCCGCAAATC AGGCGATCCG CCGGTCTTTG CGGGAGGTCG CCGCGAAGGC GCAGACCACG CTGATGGTGC CGCCGCCGGC GCTGTGCACC GACAACGGCG CGATGATCGC CTGGGCCGGC GCCGAGCGTC TCGCGCTCGG CCTGACCGAC ACCATGGACG CGGCGCCCCG CGCCCGCTGG CTGCTCGATG CCAACGCGAC CGCGCCGGGA AAATTCGCCA ATACGCGCGC GGGATTCTGA
|
Protein sequence | MIAFRDPAML REARRTLVRP QTQGLNLTSE QTLLVLGIET TCDETAAAVV ERRADGSGRI LSNIVRSQID EHAPFGGVVP EIAARAHVDL LDGIVANAMR EAGTGFAQLS GVAAAAGPGL IGGVIVGLTT AKAIALVHNT PLIAVNHLEA HALTPRLTDA TEFPYCLFLA SGGHTQIVAV LGVGDYVRLG TTVDDAIGEA FDKIAKMLGL PYPGGPQVER AAASGDAARF AFPRPMLGRP DANFSLSGLK TAVRNEASRL TPLEPQDIND LCAGFQAAVL DSMADRLGAG LRLFRERFGA PKALVAAGGV AANQAIRRSL REVAAKAQTT LMVPPPALCT DNGAMIAWAG AERLALGLTD TMDAAPRARW LLDANATAPG KFANTRAGF
|
| |