Gene Information |
Locus tag | RoseRS_0798 |
Symbol | |
ID | 5207741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 990700 |
End bp | 991590 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640594414 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_001275162 |
Protein GI | 148654957 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
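The coordinate and length fields above can be cross-checked against one another. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above (RoseRS_0798).
length_bp = gene_length(990700, 991590)
print(length_bp)                  # 891
print(protein_length(length_bp))  # 296
```

Under these assumptions, the computed values agree with the Gene Length (891 bp) and Protein Length (296 aa) fields.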
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.408859 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCCA TGACTCAGGT GTTCTATCGT GCACCGGATA TTGTTGCCGG TTTCGCTGAT CGTGCCGAAC GCGCGCGGCG GAGCGTCGTG CATCTGCACG GAGCAGGCAG AGGCGTTGGT ACCGGCGTGG TCTGGCAATC TGGCGGCGTT GTTCTGACCA ACGACCACGT CGTTGCTGGT ACGGGCGGCA GGTTGCGCAT GCAAACGCTC GATGGGCGCG ATCTGCCGGT GAGGGTTGCA GCGCGTAACC CGGACCTCGA CCTGGCGCTG CTCCACACGT CGGCAGACGA CCTTCCACCG GCATCGGTTG GCGATTCGTC GCGTTTGCGG GTTGGTGAAC TGGTCTTTGC GATTGGTCAT CCGTGGGGAC AGCCATGGGT GGTGACTGCC GGTATCGTCA GTGGGCTGGG GGAGGCGGAG ACGCGCAACG GACAGCGTGT CGCGTTCATC CGTTCCGATG TACGGCTCGC GCCCGGCAAC TCGGGCGGAC CGCTGCTCGA CGCACGCGGC AGGGTGATCG GGATCAACGC AATGGTGTTT GGCGGCGATC TGGCGGTTGC CATCGCCAGT CACGTCGTCG AACGGTGGTT GAACGATAGC CAGGGGCGAC GCGCACGTCT TGGCGTCGGT GTGCAATCAC TGCCATTACC GGAAGGCGTG CTGAACGGAC GATCGAGAGG CTTGCTGGTA GTCAGTCTCG AGCCAGGCAG TCCGGCGGAG CAGGCGGGGA TTCTGGTGGG CGACCTGCTG CTCCATGCCG ATACGGTATC ATTGGAGCAG CCGGAGGACC TGCACAGAGC GCTTCAGCAC ACGGATGGAA CGCTCCGTCT CCGGTTGCTG CGCGCTGGCG ACATTCGCGT GATCGATGTG ACGCCCGGCG CGGGATCATA G
|
Protein sequence | MTSMTQVFYR APDIVAGFAD RAERARRSVV HLHGAGRGVG TGVVWQSGGV VLTNDHVVAG TGGRLRMQTL DGRDLPVRVA ARNPDLDLAL LHTSADDLPP ASVGDSSRLR VGELVFAIGH PWGQPWVVTA GIVSGLGEAE TRNGQRVAFI RSDVRLAPGN SGGPLLDARG RVIGINAMVF GGDLAVAIAS HVVERWLNDS QGRRARLGVG VQSLPLPEGV LNGRSRGLLV VSLEPGSPAE QAGILVGDLL LHADTVSLEQ PEDLHRALQH TDGTLRLRLL RAGDIRVIDV TPGAGS
|
| |
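The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be cleaned and checked follows. It assumes the blocks contain only sequence letters and whitespace. The helper names `clean_sequence`, `gc_percent`, and `looks_like_complete_cds` are hypothetical, and the stop codons listed are those of translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3 and a terminal stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Usage with the first and last blocks of the Gene sequence field only;
# paste the full field in place of these fragments to check the whole gene.
head = clean_sequence("ATGACTTCCA TGACTCAGGT")
tail = clean_sequence("CGGGATCATA G")
print(head)      # ATGACTTCCATGACTCAGGT
print(tail[-3:]) # TAG
```

Applying `gc_percent` to the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.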