Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1127 |
Symbol | |
ID | 3969522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 1226719 |
End bp | 1227819 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637924238 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_531010 |
Protein GI | 90422640 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0664983 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGATC GTTTCTCCCG CGTTGCGACC GTCATTCTGA TCGCGCTGCT CGCCGCGCTG GTCGGGCAGC CCTATATCGA CCGGCTGCTG TTCGCGGCAA CATCGCCCAG GGCGGTCGCT GCACGAAGCT ATCTGGCGGA ATCCGAGCGG GCGACCATCA ACCTGTTCGA GCGGGTCTCG CCCTCGGTGG TTCAGGTGGT CGGCTCAGCC GCCGGCAGCG GCCCAACCGA CTTCGAAGGC GAGCAGCCTC GGGAGCAGAG CGGCACCGGC ATGATCTGGG ACGCCGCAGG TCACGTGGTG ACCAACAACC ACGTGGTGAA CGGGACCGCT CACGTCGCCG TTCGTCTCGC CAGCGGCGAT GTCGTTCCCG GCACGATCGT CGGCACCGCT CCGAATTACG ATTTGGCGGT GGTTCGGCTG CAGAACCCTC GCCGTCTGCC TGCGCCGATT ACGGTGGGCA GCTCGGCCGA TTTGAAAGTC GGACAGGCCG CGTTCGTGAT CGGCAACCCG TTCGGTCTCG ACCAATCGTT GTCGACCGGC GTAATCAGCG CCTTGAAGCG GCGCTTGCCG ACCGGTTCAG GGCGGGAAAT CGGCAACGTC GTCCAGACCG ACGCCGCCGT TAATCCTGGA AACTCCGGAG GTCCGCTACT GGATTCCGCG GGACGACTGA TCGGCGTGAC CACCGCGATT ATCTCGCCCT CGGGCTCGAA CGCCGGGATC GGCTTTGCGA TTCCTGTGGA TACGGTGAAT CGGGTGGTCC CCGAACTGAT CAAATACGGA CGGGTGCCGA CGCCCGGGAT CGGCATCGTC GCCGCCAACG AAGCGGTCGC GACCCGGCTC GGAATCGAAG GCGTCATCAT TGTCCGTGCG CTGCCGGGAT CGCCCGCCGC CAAATCCGGA CTGCGCGGCA TCGATCAGGC GGCCGGCGAA ATCGGCGACG TGATCGTCAG CGCCAACGGC CAACCGACGA GACGCCTGTC GGATCTCACC GACCAGTTAG AGGCGGTCGG AGTCGGACAG GAGATCGAGC TATCGATCAG GCGCAACAAC CGGTCGAGCA CGGTTCGCGT CAGGGTGCAG GACATCAGTC AGCCTTCTTG A
|
Protein sequence | MRDRFSRVAT VILIALLAAL VGQPYIDRLL FAATSPRAVA ARSYLAESER ATINLFERVS PSVVQVVGSA AGSGPTDFEG EQPREQSGTG MIWDAAGHVV TNNHVVNGTA HVAVRLASGD VVPGTIVGTA PNYDLAVVRL QNPRRLPAPI TVGSSADLKV GQAAFVIGNP FGLDQSLSTG VISALKRRLP TGSGREIGNV VQTDAAVNPG NSGGPLLDSA GRLIGVTTAI ISPSGSNAGI GFAIPVDTVN RVVPELIKYG RVPTPGIGIV AANEAVATRL GIEGVIIVRA LPGSPAAKSG LRGIDQAAGE IGDVIVSANG QPTRRLSDLT DQLEAVGVGQ EIELSIRRNN RSSTVRVRVQ DISQPS
|
| |