Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_2829 |
Symbol | |
ID | 5209798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 3529865 |
End bp | 3532864 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640596426 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001277148 |
Protein GI | 148656943 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0476517 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGCTC GCGCTGCTGT TTCGCGCCTC CTCATTATTG TCCTGGTTCT CCTTTCTTCG TGTGTGGGTG ATTCTGGCAG AATCGCTGTG CATGCATCAG GAGTTCTTGC CGCTTCTCCT GAGTCAATCG GTGAAACGCT GGCGTCGGGT CAGCAGGTCA CGCGACCGCT GGTTATCACG AATACCGGGA CAACCCCGAT CACCGCCTTG CTGTACGAAG CGTATGCGCA ACCATCGCTG GCAATGGCGC ATGCCTGGGG TCCTGCCAGC GTACCATTGC CACAGCAGGA ACAATCGCTC GACCCGCGCC TGGCAACGCA ACTGGACGAA CCCTCCGAGC GTGGTGCGTT CATTGTCTAT TTGCACGACC AGGCAGACCT GAGCGCGGCG TATGGCATAA CCGACTGGGG GGAACGCGGG TGGTTCGTTT ACCGCACGCT GGTCGAACAC GCGGAGCGCA CCCAGCGTGC TCTGCGCGCT GAACTGTCGG CGCGCGGTCT GGCATACCGA CCATTCTGGA TTGTCAATGC GGTCTATGTC GAAGGAACGC TCACCGATGC GCAGGCGCTG GAACGACGCG CCGACGTTGC GCTGGTTCGC ACCGATGCGC GTGTGGCGGT TGCGCCGCAG GTGTCGGCGC CAGCGAGTCT CGATGAGCGT TGCAGTTCGG ACGGTAATCC GGTGTGCTGG AATATCCGGG CGATTGGCGC CGACCGGGTC TGGAATGAGT TTGGCATCAC CGGTCAGGGA GTCACCGTTG CGTCAATCGA CACCGGCGTG TTGGGGATCC ATCCGGCGCT GCGCGACCGG TATCGCGGCG CGCTTGGCGG CGGAATGTAC GATCACAACT ACAACTGGTA CGATCCGCAG GGTGTGTTTC CGATGCCGGT CGATCAGAAC GGGCACGGCA CCCACACCAC CGGCACGATC GTCGGCAGTC GACCGGGCGG TGAGCGGTTC GGCGTTGCGC CGGGCGCACG GTGGATTGCT GCGCAGGGGT GCGATGGGTC ATTCTGTAGC GAAAGCGATC TGTTTGCCGC CGCGCAGTGG ATCCTGGCGC CGACCGATCT CAACGACCGC AATCCGCGAC CGGACCTTCG CCCGATGATT GTCAACAACT CATGGGCAGG CGGCAGCAAC GACCCATGGT ATGCCGGCTA TACCGCCGCC TGGCGCGCAG CGGGCATCTT TCCGGTCTTC GCGGCGGGCA ACGGCGTCGG CGCCTGCCGC ACAATTGCGT CTCCAGGCGA TTATGCCGAT GTGGTGGCAG TCGGCGCCAC CAATCGCAGC GGTTCGATTG CATCATTCAG TCTGCGCGGT CCTGCCGCCG ATGGGCGGAT GAAGCCGGAC TTCGTCGCTC CCGGTGATGG CGGCATTTAC TCTGCATCGC TCAATGACGG GTACACAACA CTGCGCGGCA CATCAATGGC GACTCCGCAT GTCGCCGGGG TGGCTGCGCT GCTTTACGCT GCCAATCCGG CGCTGATCGG CGATTACGAC GCAACCTATG CCATCCTGCG CGACACGGCG CGGCGGCGAG ATGATCCGCA GTGTGGCGTC GTAGCAGGGG GCGGGAACAA TGTCTACGGG TGGGGGTTGA TCGACGCACA CGCGGCTGTT GCCCGCGCGC GCGTTGATGT TCCCTGGTTG CGCCTCCCGA CGACGACCCT GAACCTTGAT CCCGGTCAAA GCGCATCCGT CAACGTGACC CTCGACGCCA GCGGTGTTGC AACACCCGGA ATCTACACGG CGCGCATCCA GATCTATGCC GGTGATCTGA CCCTTCCGCC GGTAACAGTG ACGGTCACAA TGGTCGTGAC CGGTGCTGGC GGTACACTGG TGACCGGCAT TGTGCGTGAT GCAGAGACCG GTACGCCGCT TGCGGCAACA GTCCGCACTG AGAACGGCGC GAGCACAACG ACTGCGGCTG ACGGAACATT CGCACTGGTG CTGACCACGG GTGTCCATAC CCTGACCGCT TCCGCTCGCT CGTATGCACC CCGGCAACGC ACGATCACCG TGCCGACCGA CGGATCGGTC GATTTTGGGT TGCTGCTCGA TGCGCCGCAG GTTGCATTAT CGACCGATTA CGTGACAGCC ACACTCGATT TCAACACCAG TGTCACGCGC ACGGTAACGA TCACGAACAC CGGCACACGT CCATTGACGT TCGAAGCGAA GGTCGGGTAT GCGCCATTCG GCATCTATCG CAGTGACGAG CCGGGGGGAC CGGTGTACCG GTGGATCGAC CTGCCACCTG ATGCACCAAC CCTGGAACTG ACGAACACAA CCCGGATCGA TGGCGTGCCA TTGGGACTGA CATTCCCGCT CTATACCTAC ACGGTTACCG AGACCTCCAT CACGTCGGAT GGGACGCTGA TGTTTGACTG GCCCTATCCG TACACCGGTC TGCTGGAGCG TTGTCTGCCG GCGACTGAAG CATTCTTCAA CCTGCTTGCG CCATTCCGCA CCGACCTTGA CCCGTCGCGC GGCGGAATTG TGCGTTATGG AACGGTCAAC AACGGCACAA CGTTTGTGGT CAGTTTCGAG GATGTACCGA CCGCCGCAGG TCCGCCGGAT CACACCTTCA CGTTCCAGAC GTTGCTTCAC CAGGATGGCC GGATTGTCTA CCAGTACCGC GATCTGGGAG CGTTGCCGGA GCGTCTCAGT GTTGGGATTC AGAAAAGCCT GAACCAGGTG CAACGGATCG GTTGCGGCGC CGATACGCCG ATCGCTCCGG GACTGGCCAT CGAGTTCAGA CCACAGTTCC GTCCGGAGGG TTGGGTGATC GTGAACCCGG AGAAAGGGTC GCTTGAACCG GGCGCCAGCA CAAACCTGAC GTTCACCTAC TTCTGGCAGG CGCCGCCGCA GGGGAACCAT CTCCGCACAA CCATTGTCAT CTCCAGCAGC GATCCGCGTC GCCGCACAGC AACGATTATG ACTGAGGCTG CGATGCGTCC GGCGCCGCAT GTGGTATGGC TGGGCATCGT GGCGCGGTAA
|
Protein sequence | MSARAAVSRL LIIVLVLLSS CVGDSGRIAV HASGVLAASP ESIGETLASG QQVTRPLVIT NTGTTPITAL LYEAYAQPSL AMAHAWGPAS VPLPQQEQSL DPRLATQLDE PSERGAFIVY LHDQADLSAA YGITDWGERG WFVYRTLVEH AERTQRALRA ELSARGLAYR PFWIVNAVYV EGTLTDAQAL ERRADVALVR TDARVAVAPQ VSAPASLDER CSSDGNPVCW NIRAIGADRV WNEFGITGQG VTVASIDTGV LGIHPALRDR YRGALGGGMY DHNYNWYDPQ GVFPMPVDQN GHGTHTTGTI VGSRPGGERF GVAPGARWIA AQGCDGSFCS ESDLFAAAQW ILAPTDLNDR NPRPDLRPMI VNNSWAGGSN DPWYAGYTAA WRAAGIFPVF AAGNGVGACR TIASPGDYAD VVAVGATNRS GSIASFSLRG PAADGRMKPD FVAPGDGGIY SASLNDGYTT LRGTSMATPH VAGVAALLYA ANPALIGDYD ATYAILRDTA RRRDDPQCGV VAGGGNNVYG WGLIDAHAAV ARARVDVPWL RLPTTTLNLD PGQSASVNVT LDASGVATPG IYTARIQIYA GDLTLPPVTV TVTMVVTGAG GTLVTGIVRD AETGTPLAAT VRTENGASTT TAADGTFALV LTTGVHTLTA SARSYAPRQR TITVPTDGSV DFGLLLDAPQ VALSTDYVTA TLDFNTSVTR TVTITNTGTR PLTFEAKVGY APFGIYRSDE PGGPVYRWID LPPDAPTLEL TNTTRIDGVP LGLTFPLYTY TVTETSITSD GTLMFDWPYP YTGLLERCLP ATEAFFNLLA PFRTDLDPSR GGIVRYGTVN NGTTFVVSFE DVPTAAGPPD HTFTFQTLLH QDGRIVYQYR DLGALPERLS VGIQKSLNQV QRIGCGADTP IAPGLAIEFR PQFRPEGWVI VNPEKGSLEP GASTNLTFTY FWQAPPQGNH LRTTIVISSS DPRRRTATIM TEAAMRPAPH VVWLGIVAR
|
| |