Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3443 |
Symbol | |
ID | 3911245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3948654 |
End bp | 3950240 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885346 |
Product | peptidase S1C, Do |
Protein accession | YP_487050 |
Protein GI | 86750554 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.203092 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0980299 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACC GTTCTTCCGA TCTCACCTCG CAGCCGACCA GCCCGGTGCG CCGTTCGCTG TTCTCGGCGC GCAAATTCGC GCTGATGGCG TCGGTGGTGG CCGGCCTCGG CGCGGGCGCG TTCGGCCTGT CGCAGGTCCC GGCGTCGGGC GATCTGTTCA GCACGGCGGC GCAGGCGCAG GTCGGCAACG AGGTCCACAA GGCGCAGCAG CCCGTCGGCT TCGCCGACAT CGTCGAGAAG GTGAAGCCCT CGGTGATCTC GGTGAAGGTC AACATCGCCG AAAAAGTTGC GAAGACCGAC GAGCGCAGCG AGCGCAGCGA AGAATCGCCG TTCGCGCCCG GCTCGCCGAT GGAGCGCTTC TTCCGCCGCT TCGGCGGTGA AATGCCGCCC GGCATGCGCG GCCATCGCGG CGGCGGCATG ATGACGGGGC AGGGCTCGGG CTTCTTCATC ACCGCCGACG GCTACGCCGT CACCAACAAT CACGTCGTCG ACGGCGCCGA CAAGGTCGAA GTCACCACCG ACGACGGCAA GACCTACAAG GCCAAGGTGA TCGGCACCGA CCAGCGCACC GATCTGGCGC TGATCAAAGC CGAAGGCCGC ACCGACTTCC CGTTCGCCAA GCTGTCCGAA GGCAAGCCGC GGATCGGCGA CTGGGTGCTC GCGGTCGGCA ATCCGTTCGG CCTCGGCGGC ACCGTCACCG CCGGCATCGT CTCGGCCTCC GGCCGCGACC TCGGCAACGG TCCGTATGAC GATTTCATCC AGATCGACGC GCCGGTGAAC AAAGGCAATT CCGGCGGTCC GGCGTTCGAC GTCAATGGCG AAGTGATGGG TGTCAACACG GCGATCTACT CGCCGTCGGG CGGCAGCGTC GGCATCGCGT TCTCGATCCC GGCCTCGACC GTCAAGGCGG TGGTGCAGCA GCTCAAGGAC AAGGGCTCCG TCAGCCGTGG CTGGATCGGC GTCCAGATCC AGCCGGTGAC GCCGGAGATC GCCGACAGCC TCGGGCTGAA GAAGCCGGAC GGCGCGTTGG TGGCCGAGCC GCAGCCCAAC GGCCCGGCCG CCAAGGCCGG CATCGAATCC GGCGACGTCA TCACCGCGGT CAACGGCGCG CCGGTGAAGG ACGCGCGCGA GCTCGCCCGC ACCATCGGCG GCTTCGCGCC GGGCAATACG GTGAAGCTCA CCGTGTTCCA CAAGGGCGCG GATCGGGAAC TCAGCCTGAC GCTCGGCCAA TTGCCGAACC AGGTCGAGGC CAAGGCCAAT CTCGACGGCG ACAACGGTCG CCAATCCAGC CGCGGCACCG AAGTGCCGAG GCTCGGCCTG ACGGTCGCGC CGGCCAGTTC GGTCGCCGGT GCCGGCAAGG ATGGCGTGGT GGTCACCGAC GTCGATCCGA AGAGCGCCGC AGCCGATCGC GGCTTCAAGG AAGGCGACGT GATCCTCGAG GTCGCGGGCA AGAACGTGGC GAGCCCGGGT GACGTCCGCG ACGCCATCAA CACCGCCAAG AACGACAACA AGAACAGCGT GCTGATCCGG GTCCGCTCGG GTGGTTCGTC ACGTTTCGTC GCGGTGCCGA TCTCGGCCAA GGGCTGA
|
Protein sequence | MTDRSSDLTS QPTSPVRRSL FSARKFALMA SVVAGLGAGA FGLSQVPASG DLFSTAAQAQ VGNEVHKAQQ PVGFADIVEK VKPSVISVKV NIAEKVAKTD ERSERSEESP FAPGSPMERF FRRFGGEMPP GMRGHRGGGM MTGQGSGFFI TADGYAVTNN HVVDGADKVE VTTDDGKTYK AKVIGTDQRT DLALIKAEGR TDFPFAKLSE GKPRIGDWVL AVGNPFGLGG TVTAGIVSAS GRDLGNGPYD DFIQIDAPVN KGNSGGPAFD VNGEVMGVNT AIYSPSGGSV GIAFSIPAST VKAVVQQLKD KGSVSRGWIG VQIQPVTPEI ADSLGLKKPD GALVAEPQPN GPAAKAGIES GDVITAVNGA PVKDARELAR TIGGFAPGNT VKLTVFHKGA DRELSLTLGQ LPNQVEAKAN LDGDNGRQSS RGTEVPRLGL TVAPASSVAG AGKDGVVVTD VDPKSAAADR GFKEGDVILE VAGKNVASPG DVRDAINTAK NDNKNSVLIR VRSGGSSRFV AVPISAKG
|
| |