Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5052 |
Symbol | |
ID | 6412746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5434744 |
End bp | 5436306 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642714937 |
Product | protease Do |
Protein accession | YP_001994016 |
Protein GI | 192293411 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.610685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGGCG ACCATTTCGA CCACGTATCA GCTCAAACCG AGGCTCACAT CCGCAAGGTG TTGAGGCCGC GGCGTCTCTC GCTGTTGGCT TCCGCTGCCG GCTTGAGCAT GGTGGTGGCG CTCGGCGGCG CCGGATTTCT CACCCGCGAG ATGCCGTCGC TGACGTCGCC GGCCTATGCG GCTGAAAACG GCAAGCCGGC GCCCGCCGGG TTCGGCGATC TGGTCGACAA GGTCAAGCCG GCAGTGATTT CGGTGCGGGT CAAGATCGAC GACGATCGCC AGACCACGCC GCTGGTGCGT GACGATGGTG ACGGCGATCA GATGCAGACG CCGCGGGGCC TCGCGCCGTT CCAGCAGTTC GAGCGTCAGT TCGGCTTCCG CGGTCCTGAA GGCATGCCGA AGCGGCACCA GATGATCACC GGCGAAGGCT CCGGCTTCTT CATCACCGCC GACGGCTACG CGGTCACCAA TAATCACGTG GTCGATCACG CCAAGTCGGT GCAGGTGACG ACCGACGACG GCACTATCTA CACCGCCAAG GTCGTCGGCA CCGACGACAA GACCGATCTG GCGCTGATCA AGGTCGACGG CAAAACCGAT TTTCCGCACG TCAACTTCGC CGATGCGCCG GCGCGGGTCG GCGATTGGGT GATCGCTGTC GGCAATCCGT TCGGCCTCGG CGGCACGGTG ACGGCGGGCA TCGTCTCGGC GCGCGGTCGT GACATCGGTT CGGGCCCCTA TGACGACTAC GTGCAGATCG ACGCGCCTAT CAACAAGGGC AACTCCGGCG GTCCGGCGTT CGACACCAAT GGCAATGTGA TCGGCGTCAA CACCGCGATC TATTCGCCAT CGGGCGGCTC GGTCGGCATC GGCTTCGATA TTCCGGCGGC GACCGCGAAG CTGGTGGTGT CGCAGCTCAA GGACAAGGGC TACGTCACCC GCGGCTGGCT CGGCGTGCAG GTGCAGCCGG TCACGGCGGA GATCGCCGAC AGTCTCGGCA TGAAGCAGGC CCGCGGCGCG CTGGTCGATA GTCCGCAGGA CGGCAGCCCG GCCGCGAAGG CGGGCATCAA GGCCGGCGAT GTGATCACCG CGGTCGACGG CAAGGAGGTC AAGGACTCCC GCGCGCTCGC CCGCACCATC AGCACGCTGG CACCGGGCTC CTCGGTGAAG CTCGACGTGC TGCACAACGG CCAGTCCAAG ACGATGGATC TGACGCTCGC CGAAATGCCC GGTGATCATC AGAAGGTCGC CGACAGCAGC GGCGATCGCG ACGCTACCCG TCCGTATCTC GGCCTGCGCG TGGCACCGGC CAGCGAAGTC GACGGTGCCG GCAAGAACGG GGTGGTCGTT ACCGGTGTCG ATCCGGACGG GCCGGCCGCC GACAAGGGCC TGCGCACGGG TGATGTCATC CTCGACGTCG GCGGCAAGGC GGTGACCAAC ACCGGCGATG TCCGCAACGC GCTCACACAG GCCGGCAAGG ACGGCAAGAA GACCGTGCTG ATGCGGGTGA AGACGGCGGA TTCGGCGGCG CGCTTTGTCG CGGTGCCGAT CGCGAAGGGC TGA
|
Protein sequence | MEGDHFDHVS AQTEAHIRKV LRPRRLSLLA SAAGLSMVVA LGGAGFLTRE MPSLTSPAYA AENGKPAPAG FGDLVDKVKP AVISVRVKID DDRQTTPLVR DDGDGDQMQT PRGLAPFQQF ERQFGFRGPE GMPKRHQMIT GEGSGFFITA DGYAVTNNHV VDHAKSVQVT TDDGTIYTAK VVGTDDKTDL ALIKVDGKTD FPHVNFADAP ARVGDWVIAV GNPFGLGGTV TAGIVSARGR DIGSGPYDDY VQIDAPINKG NSGGPAFDTN GNVIGVNTAI YSPSGGSVGI GFDIPAATAK LVVSQLKDKG YVTRGWLGVQ VQPVTAEIAD SLGMKQARGA LVDSPQDGSP AAKAGIKAGD VITAVDGKEV KDSRALARTI STLAPGSSVK LDVLHNGQSK TMDLTLAEMP GDHQKVADSS GDRDATRPYL GLRVAPASEV DGAGKNGVVV TGVDPDGPAA DKGLRTGDVI LDVGGKAVTN TGDVRNALTQ AGKDGKKTVL MRVKTADSAA RFVAVPIAKG
|
| |