Gene Information |
Locus tag | RPB_1019 |
Symbol | |
ID | 3909143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1167546 |
End bp | 1169123 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637882912 |
Product | peptidase S1C, Do |
Protein accession | YP_484640 |
Protein GI | 86748144 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
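The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1167546
end_bp = 1169123

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1578
print(protein_length)  # 525
```

Both results match the Gene Length (1578 bp) and Protein Length (525 aa) fields.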
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCAGA ACAAGACGGA TCGCAACGAC ATCGACGACA CCCCGAAATC CGAGGCGCAA CTACGCCGGA TCATGAGGCC GCGGCGCTTT GCTCTGCTGG CGTCCGCCGC AGCGCTGAGC GCCGCGCTGG TCACCGGCGG ATATATGACG CACGAGTTTC CGTCGTTCGC GTCGCCCGCG CTCGCGGCCG AAGCGGCTGC GCCGACCGGC ATGCCTTCCG GCTTCGGCGA TCTCGTCAGC AAGGTGAAGC CGGCGGTGAT CTCGGTGCGC GTCAAGATCG ACGAGGACAG CCGCACCACG CCGCTGATCC GCAACGACGA CAGCGACGAC AATGCCAATC CGGGGATGGG CGACGATCAG TTGCGTCCGT TCCTCAAGCA GTTCGGCTTC CGCGGCGAAG GCGCCATGCC GAAGCGGCAC GAGATGATCA CCGGCGAGGG CTCCGGCTTC TTCATCTCGC CGGACGGCTA CGCGGTCACC AACTATCACG TCGTCGATCA CGCCAAGTCG GTGCAGGTCA CGACCGATGA CGGCGCCGTC CACACCGCCA AGGTCGTCGG CACCGACAAG AAGACCGATC TGGCGCTGAT CAAGGTCGAC GGCAAGAGCG ACTTCACTTA CGTGAAATTC GCCGATCAGC CGGCGCGGGT CGGCGACTGG GTGGTGGCGG TCGGCAATCC GTTCGGCCTC GGCGGCACCG TCACGGCCGG CATCGTCTCG GCGCGCGGCC GCGACATCGG CTCCGGTCCC TATGACGACT ACGTCCAGAT CGACGCGCCG ATCAACAAGG GCAATTCCGG CGGCCCGGCG TTCGACACCA GCGGCAACGT CATCGGCGTC AACACCGCGA TCTATTCGCC CTCCGGCGGC TCGGTCGGCA TCGGCTTCGA CATTCCGGCG GCGACCGCCA AGCTCGTCAT CGCGCAGTTG AAGGACAAGG GCGTGGTGAC GCGCGGTTGG CTCGGCGTGC AGGTGCAGCC GGTGACCGCC GAGATCGCCG ACAGCATGGG GCTGAAGCAG GCCCGCGGCG CGCTGGTCGA CAGCCCGCAG GACGGCAGCC CGGCCGCCAA GGCGGGGATC GCCGCCGGCG ACGTCATCAC CGCGGTCAAT GGCGCGGAGA TCAAGGACTC GCGCGCGCTG GCGCGGACCA TCAGCATGAT GGCGCCGGGC AGCGCGGTGA AGCTCGACGT GCTGCACAAG GGTGACAGCA AGACTGTGAC GCTCAGCCTG GCGGAGATGC CGAACGAGAG CGCGAAAGTC GCCGACAGCG GCGATGCCGG GCGCGAGTCC GGCCGGCCCT ATCTCGGCCT GCGCGTCGCG CCGGCCAGCG AGGTTGGCGA CTCCGGCCAG AAGGGCGTCG TCGTCACCGG CGTCGACCCC GAAGGCCCGG CGGCCGATCG CGGGCTGCGC AGCGGCGACG TCATCCTCGA CGTCGGCGGC AAGCCGGTCG CCAATTCCGG CGACGTCCGC GACGCGCTGA AGCAGGCCAG CGAGCACGGC AAGAAGACCG TGCTGATGCG GGTCAAGACC GCGGATTCCG CCGCGCGCTT CGTCGCGGTG CCGATCGCCA AAGGCTGA
|
Protein sequence | MNQNKTDRND IDDTPKSEAQ LRRIMRPRRF ALLASAAALS AALVTGGYMT HEFPSFASPA LAAEAAAPTG MPSGFGDLVS KVKPAVISVR VKIDEDSRTT PLIRNDDSDD NANPGMGDDQ LRPFLKQFGF RGEGAMPKRH EMITGEGSGF FISPDGYAVT NYHVVDHAKS VQVTTDDGAV HTAKVVGTDK KTDLALIKVD GKSDFTYVKF ADQPARVGDW VVAVGNPFGL GGTVTAGIVS ARGRDIGSGP YDDYVQIDAP INKGNSGGPA FDTSGNVIGV NTAIYSPSGG SVGIGFDIPA ATAKLVIAQL KDKGVVTRGW LGVQVQPVTA EIADSMGLKQ ARGALVDSPQ DGSPAAKAGI AAGDVITAVN GAEIKDSRAL ARTISMMAPG SAVKLDVLHK GDSKTVTLSL AEMPNESAKV ADSGDAGRES GRPYLGLRVA PASEVGDSGQ KGVVVTGVDP EGPAADRGLR SGDVILDVGG KPVANSGDVR DALKQASEHG KKTVLMRVKT ADSAARFVAV PIAKG
|
| |
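The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `clean`, `gc_fraction`, and `translate` are hypothetical helper names, and the example runs only on the first 30 bases of the gene sequence rather than the full record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGAATCAGA ACAAGACGGA TCGCAACGAC"
print(translate(prefix))  # MNQNKTDRND
print(f"{gc_fraction(prefix):.0%}")
```

The translated prefix matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.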