Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0593 |
Symbol | |
ID | 6408243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 646592 |
End bp | 647932 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 642710506 |
Product | carboxyl-terminal protease |
Protein accession | YP_001989628 |
Protein GI | 192289023 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAAAA TCGTCGTCCG CTCGCTTGCC GTCCTCGGCG TCGTCGCGGT TCCCGTCGTC TCCGCGCTCG CCCTGCAGCG CGAGCAGACC GCCCCGGCCG ATCTCGACCT GATCGGCGGC GTGATCCAGC TGGTGCAGCG CGCCTACGTC CATCCGATCC CCTCGGACGA GCTGACCAAG GATGCCCTGA AGGGCCTGCT GAACCGGCTC GATCCGCATT CGGACTACAT GGACGAGAAG GAATTCAAGG ACATGCAGAC CGACATGTCG GGCCAATTCG GCGGCCTCGG CATGCAGATC ACCGCTCAGG GCGGGATTCC GCGAATCGTG TCGCCGATCG ACGGCACCCC GGCGGCGCGC GCCAAGCTCG AGCCGGGCGA CCTGATCATC AAGGTCGGCA CCGCCACCAC TCAGGGCATG AGCCTGCGTA ACGTCGTCGA CATGCTGCGC GGCGCGCCCG GCACCAAGGT CACCATCACG GTGCTGCGCG GCAAGGACGA GCCGTTCGAC GTTACCCTGA CCCGTGAAAT CATCCGCGTC GCTTCGGTGA AGTCGGAGAT CAAGCCGGAC GGCGTCGGCT ACATCCGCAT CAGCCAGTTC GGCGCCGACA CCGCCGACGG CTTCACCGCC GCGCTCACCA AGCTGAAAAG CGACGCCAAG GACGGCGGCC TCAAGGGGCT GGTGATCGAC CTGCGCAACG ATCCGGGCGG GCTGCTCAAC GCCGCCGTCT CGGTGGCGGG CGACCTGCTC GACGGCGGCA CCGTGGTGTC GATCCGCGGC CGCCAGGCCG ACGACCAGCG AGTGTTCTCC GCGCCGTCGA AGGGCGACAA ACTGCCGGGC GTGCCGATCG TGGTGCTGAT CAACGGCGCC TCGGCGTCGG CGTCGGAGAT TGTTGCCGGC GCGCTGCAGG ACCGCAAGCG CGCCACCGTG ATGGGCACCA CCAGCTTCGG CAAGGGCTCG GTGCAGACTG TGATTCCGAT CAAGGGTCAC GGCGCGGTGC GGCTGACCAC GGCGCTGTAC TACACGCCGG CCGGCCGCTC GATCCAGGAC GAGGGCATCG TGCCCGACGT GATCGAAGAG GCGCCGAAGG ATCAGCAGAT CAGCGGCGGC CCGCTGATCC GCGAAAGCGC GCTGCACGGC GCGATCGCCA ATCCGGGCCA ACTCGGCAAG TCGGACGCCA GCGCCACCAA GGCGACGCCG AAGACCGGCG AGCCGACCGA CGACAAGACC AAGTCCGCGA CCTCGGCGCC GATCAAGGCG GATCTGATCG GCAAGCCCGA AGACGCCCAG CTCAACGCTG CGCTGGCGCT GGTCCTGAAG AAGGACGCGG CACCGAAGTA A
|
Protein sequence | MRKIVVRSLA VLGVVAVPVV SALALQREQT APADLDLIGG VIQLVQRAYV HPIPSDELTK DALKGLLNRL DPHSDYMDEK EFKDMQTDMS GQFGGLGMQI TAQGGIPRIV SPIDGTPAAR AKLEPGDLII KVGTATTQGM SLRNVVDMLR GAPGTKVTIT VLRGKDEPFD VTLTREIIRV ASVKSEIKPD GVGYIRISQF GADTADGFTA ALTKLKSDAK DGGLKGLVID LRNDPGGLLN AAVSVAGDLL DGGTVVSIRG RQADDQRVFS APSKGDKLPG VPIVVLINGA SASASEIVAG ALQDRKRATV MGTTSFGKGS VQTVIPIKGH GAVRLTTALY YTPAGRSIQD EGIVPDVIEE APKDQQISGG PLIRESALHG AIANPGQLGK SDASATKATP KTGEPTDDKT KSATSAPIKA DLIGKPEDAQ LNAALALVLK KDAAPK
|
| |