Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2323 |
Symbol | |
ID | 3908954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2671433 |
End bp | 2672824 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637884220 |
Product | peptidase S1C, Do |
Protein accession | YP_485939 |
Protein GI | 86749443 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.052989 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.164438 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCTG TCCGCATCTT CACGGCCGTC CTGCTCTCGC TCGCGGCGTC GGCCCCGGCG ACCGCGCAGG AGCGGCGGCT GCCGGCGTCG CAGGCCGAGA TCAAGCTCAG CTACGCGCCG ATCGTGCAGC ACGCGCAGCC GGCGGTGGTG AACGTCTACG CCGCCAAGGT GGTGCAGAAC CGCAATCCGC TCTTGGAAGA CCCGATCTTC CGCCGCTTCT TCGGCGGCGG CCCGCAGCCC GAGCAGATCC AGCGCTCGCT CGGAAGCGGC GTGATGGTCG ATCCGTCGGG CCTCGTCGTC ACCAACAATC ACGTCATCGA CGGCGCCGAT CAGGTCAAGG TCGCGCTCGC CGACAAGCGC GAGTTCGAGG CCGAGATCGT GCTGAAGGAC AGCCGCACCG ATCTCGCGGT GCTGCGCCTC AAGGATACCA GGGAGAAATT CGCCACCCTC GAACTGGCCA ATTCGGACGA GCTGCTGGTC GGCGATCTGG TGCTGGCGTT GGGCAATCCG TTCGGCGTCG GCCAGACCGT GACGCACGGC ATCGTCTCGG CGCTGGCGCG CACCCAGGTC GGCATCACCG ACTATCAGTT CTTCATCCAG ACCGACGCGG CGATCAATCC CGGCAATTCC GGCGGCGCGC TGGTCGACAT GACCGGCAAG CTGGTCGGCA TCAACACCGC GATCTTTTCG CGCTCCGGCG GCTCGCAGGG CATCGGCTTC GCGATCCCGA CCAACATGGT GCGGGTGGTG ATCGCCTCGG CCAAGAGCGG CGGCAAGGCG GTGAAGCGGC CGTGGCTCGG CGCGCGGCTG CAGGCGGTGA CGCCGGAGAT CGCCGAGACG CTCGGTCTGA AGCGGCCCAG CGGCGCGCTG GTGGCGAGCG TCACCAAGGG CAGCCCGTCC GACAAGGCGG GGCTGAGACT GTCCGATCTG ATCGTCGGGG TCGACGGCTT TGCGATCGAT GATCCCAACG CGTTCGACTA TCGCTTCGCC ACCCGCCCGC TCGGCGGCAC CGCGCAGATC GACGTGCAGC GCGGCGGCAA GCCGCTCAAG CTCAGCATCA CGCTGGAGAC CGCGCCGGAC ACCGGCCGCG ACGAGATCGT GCTGACCGCG CGCTCGCCGT TCCAGGGTGC CAGGATCGCC AACATCTCGC CGGCGATCGC CGACGACCTG CGGCTCGACC CCAGCGTCGA GGGCGTGGTC GTGACCGATC TCGCCGACGG CGCCACCGCG GCGAGCGTCG GTTTCCAGAA GGGCGACATC ATCGTCGCGG TCAACAACAA GCGCATCGGC AAGACCAGCG ACCTCGAACG CATCACCAAC GAGTCGTTCC GCCTGTGGCG CATCACCGTC GTTCGCGGCG GCCAGCAGAT CAACGTCACG CTCGGCGGAT GA
|
Protein sequence | MSAVRIFTAV LLSLAASAPA TAQERRLPAS QAEIKLSYAP IVQHAQPAVV NVYAAKVVQN RNPLLEDPIF RRFFGGGPQP EQIQRSLGSG VMVDPSGLVV TNNHVIDGAD QVKVALADKR EFEAEIVLKD SRTDLAVLRL KDTREKFATL ELANSDELLV GDLVLALGNP FGVGQTVTHG IVSALARTQV GITDYQFFIQ TDAAINPGNS GGALVDMTGK LVGINTAIFS RSGGSQGIGF AIPTNMVRVV IASAKSGGKA VKRPWLGARL QAVTPEIAET LGLKRPSGAL VASVTKGSPS DKAGLRLSDL IVGVDGFAID DPNAFDYRFA TRPLGGTAQI DVQRGGKPLK LSITLETAPD TGRDEIVLTA RSPFQGARIA NISPAIADDL RLDPSVEGVV VTDLADGATA ASVGFQKGDI IVAVNNKRIG KTSDLERITN ESFRLWRITV VRGGQQINVT LGG
|
| |