Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0315 |
Symbol | |
ID | 3908696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 358534 |
End bp | 359601 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637882201 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_483937 |
Protein GI | 86747441 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.508404 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTTGGAA TCGAGACCAC CTGCGATGAA ACCGCAGCCG CGGTGGTCGA GCGCCGCGCC GATGGCAGCG GCCGGATTCT CTCCAATATC GTACGCTCGC AGACCGACGA ACACGCGCCG TTCGGCGGCG TGGTCCCGGA AATCGCCGCG CGCGCCCATG TCGATCTGCT CGACGGCATC GTCGCGCGGG CGATGCGGGA GGCGGGAACC GGCTTCCCGG AATTGTCGGG CGTCGCGGCG GCGGCAGGGC CGGGGCTGAT CGGCGGCGTC ATCGTCGGGC TGACCACCGC CAAGGCGATC GCGCTGGTGC ACAACACGCC GCTGATCGCG GTCAATCATC TGGAAGCGCA CGCGCTGACG CCGCGGCTGA CGGATGCGAC CGAGTTTCCC TATTGCCTGT TTCTCGCCTC CGGCGGCCAC ACCCAGATCG TCGCGGTGCG CGGCGTCGGC GACTATGTCC GGCTCGGCAC CACGGTCGAC GACGCGATCG GCGAGGCGTT CGACAAGATC GCCAAGATGC TCGGCCTGCC CTATCCGGGC GGGCCGCAGG TCGAGCGCGC CGCAGCCTCC GGCGATGCGG TGCGGTTCGC GTTTCCGCGG CCGATGCTGG GGCGGCCCGA CGCCAATTTC TCATTGTCCG GTCTCAAGAC AGCGGTGCGC AACGAGGCCA GCCGGCTGAC GCCGCTGGAG CCACAGGACA TCAATGATCT GTGCGCCGGC TTCCAGGCCG CGGTGCTGGA CTCGATGGCC GACCGGCTGA CCTCCGGACT GCGGCTGTTC CGCGAGCGCT TCGGCGCACC GAAGGCGCTG GTCGCGGCCG GCGGCGTCGC CGCCAATCAG GCGATCCGCC GCGCCTTGCG CGAGGTCGCC GCCAAGGCCC AGACCACGCT GATCGTGCCG CCGCCGGCGC TGTGCACCGA CAACGGCGCG ATGATCGCCT GGGCCGGCGC CGAGCGCCTG GCGCTCGGCC TCACCGACAG CATGGACGCC GCCCCCCGCG CCCGCTGGCT GCTCGACGCC AACGCGACGG CGCCGGGCAA ATTCGCCAAT ACCCGCGCGG GATTTTAA
|
Protein sequence | MLGIETTCDE TAAAVVERRA DGSGRILSNI VRSQTDEHAP FGGVVPEIAA RAHVDLLDGI VARAMREAGT GFPELSGVAA AAGPGLIGGV IVGLTTAKAI ALVHNTPLIA VNHLEAHALT PRLTDATEFP YCLFLASGGH TQIVAVRGVG DYVRLGTTVD DAIGEAFDKI AKMLGLPYPG GPQVERAAAS GDAVRFAFPR PMLGRPDANF SLSGLKTAVR NEASRLTPLE PQDINDLCAG FQAAVLDSMA DRLTSGLRLF RERFGAPKAL VAAGGVAANQ AIRRALREVA AKAQTTLIVP PPALCTDNGA MIAWAGAERL ALGLTDSMDA APRARWLLDA NATAPGKFAN TRAGF
|
| |