Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2773 |
Symbol | |
ID | 5540259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3585820 |
End bp | 3588813 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640894899 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001432862 |
Protein GI | 156742733 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0362893 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGGTC GCGCTGCCAG TCTACGCCTG TTCATCGCTG CGTTTCTGTT GATTGCAGCG CTCATCGTTC CGACTGACAG ATTCACCATT CAGGCAACAG GCATTCTCAC AGCCTCTCCG ATCACGATCA GCGAAACGTT GCCATTGGGT CAACAGGTCA CGCGACCACT GACGATCACC AACCTCGGTT CGACGACTGT CACAGCGTTG CTCTATGAAG CGCGCGCGCA GCCGTTGTTG GCGATGGCGC ATGCAATCGG TCCCGCAAGC GTCCCTCTGC CGCAGCAAGA TCACACGCTC GATCCGCGTC TGGCGGCGCA ACTCGACGAA CCCGCAGCGC AGGGTTCGTT CATTATCTAT TTGCGCGATC AGGCGGACCT GAGCAGCGCA TATGGCATCA CCGACTGGTC AGAACGTGGG CGCTTCGTCT ACCGAACGCT GGTAGAACAC GCTGAACGCA CACAACGCAC CCTGCGCGCC GAGTTGACGG CGCGCGGTCT GACCTATCGA CCGTTCTGGG TCGTCAACGC CATTCAGGTG GAAGGTGCGC TCGCCGATGC GCAGGCGCTG GAGCAGCGCG CTGACGTTGC TCTGGTGCGC GCCGACGCAA GTATCATGGT CGCGCTGCAA ACGCTGCCGT CCAGCCTCGA TACGCGCTGT AGCGCAGACG GCAATCCGAT GTGCTGGAAT ATTCGCGCCA TCCGCGCAGA TCGTGTGTGG AACGAGTTTG GCATCACCGG TCAGGGCGTG ACGGTCGCCT CCATCGATAC CGGCGGGCTT TTCAGTCATC CGGCGCTGCG CGATCAGTAT CGCGGTGCGC TCGGCAACGG CGCATACGAC CACAACTACA ACTGGTACGA TCCGCAAGGG GCATTCCCTG CGCCGAACGA TCAGAGCGGT CATGGAACGC ATACCATCGG TATCATGGTC GGCAGGCGCA TCGGAGGCGA GCGGTTTGGC GTTGCTCCCG GCGCGCGCTG GATTGCTGCG CAGGGGTGTG AAGGATCATT CTGCAACGAA AGTGACCTGA TCGCCGCCGC GCAGTGGGTC CTGGCGCCGA CTGACCTCCA CGACCGCAAT CCGCGCCCCG ATCTGCGCCC GTTGATCGTC AACAACTCGT GGGCGGGCGG CGGCAACGAC CCGTGGTATG CCGGATATAC CGCCGCCTGG CGCGCGGCCG GCATCTTTCC CGTTTTTGCG GCAGGCAACG GCATGGGCGT CTGTCGCTCA ATCGCATCCC CCGGCGACTA TGCCGATGTC GTCGCTGTTG GCGCCACCGA CCGCAACGAC GCCATCGCCC CGTTCAGCCT GCGTGGTCCG ACTGCCGATG GTCGCATGAA GCCGGACTTC GTCGCTCCGG GGGAGGGCGG CATCTACTCA ACACACCTCA GTGACGGGTA TGCCACTCTG CGCGGCACAT CGATGGCAGC CCCCCACGTT GCCGGGGTTG TGGCACTGCT CTATTCGGCA AACCCGGCTC TGATCGGCGA TTTTGAGTCG ACCTACGCCA TCCTGCGCGA CACAGCGCGC AGAATAGCCG ATGAACAGTG TGGCGTCGTA TCCGGCGGCG GCAATCATGT GTATGGTTGG GGCTTGATCG ACGCCCATGC GGCGGTTGCG CGAGCGCGCG TCGATGTGCC GTGGCTGCGC CTTTCACCGA CGACGGTAAC GCTCAATCCT GGTCAGAATG CAACGCTGGA CGTGACGTTC GACTCCAACG GCGTTGCTGC GCCGGGGACC TATACTGCGC GCATTCAGAT ATATGCCGGT GATCTGACCC AACCGCCAGC GACCGTTGAG GTGACCATGA ATGTGATCGC TTCGGGAACC ATCGTCGGCG GCATTGTGCG CGATGCCGAG ACCGGCGAAG CGCTGCGCGC GACAGTCAGC GTCAGCGGCG GCGCCAGCAC GCCTACTTCC AATGACGGAT CATACGCTCT CATCCTGCCG TCGGGCGTTT ACACGTTGAC GGCGTCCGCA CTCTCGTATG CGCCGCAGCA GCGTGTGATT ACTGTACCGG TGAGCGGATC GGTCGATTTT GGATTATTGC TCGATGCGCC GCATTTGACC CTTTCGACCG ACCATGTGAC GGCTACGCTC GATTTCAACA CCACCGTCGA GCAAACCGTA ACGATAACCA ATACCGGTAC CCGTCCTTTG ACTTTTGAAG CCAGCGTCGG ATATGCGCCA TTCGGGGTCT ATCGCAGTGA TGAGCCAGGC GGACCGGTCT ATCAGTGGAT CGACCTGCCC GTTGATGCGC CGACGCTCGA ATTGACCGAT ACAACCCGGA TCGACAACAT CCCCCTGGGC TTCGACTTTC CACTCTACAC CCTCACCGTC ACTGAAACGT CGGTCACATC GGATGGGACG CTTTCGTTTG GTTGGCCCTC CTCATATACC GGTCTGGTCG AACGTTGTTT GCCGGGGAGC GAAGCATTCT TCTACCTGCT GGCGCCCTTC CGCGCCGACC TTGATCCCGC GCGTGGCGGG CAGGTGCGGT ACGGAACCGT CAACAGCAAC GCAACATTCG TCGTCAGTTT CGAGGATGTG CCATTGGCGC AGGGTCCGCC GGATCAGAGA TACACGTTTC AGGCGCTTCT CCATCGTGAT GGACGGATTG TGTTTCAGTA CGCCGACCTC AGCGCGCTCC CGGAGCGCTT GAGCGTTGGC GTTCAGAAGA CCATGAATCA GGTTCAGCGG ATCGGATGTG GCGCCGATAC CCCTGTCACA CCAGGGCTTG CCATCGAGTT CCGACCGCAG TTCAGCCCGG AGGGGTGGCT GGAAGTGGCG CCGGATCGGG GAACCGTAGC GCCGGGCGAC AGCGCCACGC TCAGGCTCGC CTATCGCTGG CAGGGTCCAC CGCAGGGCGC GCGCCTGCGC ACCACGGTCA CAGTGATCAG CAGTGATCCT CGCCGCAGAA ACGCCACAAT TATGGCGGAG GCGGCCATGC GTCCGGCGCC GTATGCCGTC TGGTTGGGGA TCGTGGCTCG GTAA
|
Protein sequence | MSGRAASLRL FIAAFLLIAA LIVPTDRFTI QATGILTASP ITISETLPLG QQVTRPLTIT NLGSTTVTAL LYEARAQPLL AMAHAIGPAS VPLPQQDHTL DPRLAAQLDE PAAQGSFIIY LRDQADLSSA YGITDWSERG RFVYRTLVEH AERTQRTLRA ELTARGLTYR PFWVVNAIQV EGALADAQAL EQRADVALVR ADASIMVALQ TLPSSLDTRC SADGNPMCWN IRAIRADRVW NEFGITGQGV TVASIDTGGL FSHPALRDQY RGALGNGAYD HNYNWYDPQG AFPAPNDQSG HGTHTIGIMV GRRIGGERFG VAPGARWIAA QGCEGSFCNE SDLIAAAQWV LAPTDLHDRN PRPDLRPLIV NNSWAGGGND PWYAGYTAAW RAAGIFPVFA AGNGMGVCRS IASPGDYADV VAVGATDRND AIAPFSLRGP TADGRMKPDF VAPGEGGIYS THLSDGYATL RGTSMAAPHV AGVVALLYSA NPALIGDFES TYAILRDTAR RIADEQCGVV SGGGNHVYGW GLIDAHAAVA RARVDVPWLR LSPTTVTLNP GQNATLDVTF DSNGVAAPGT YTARIQIYAG DLTQPPATVE VTMNVIASGT IVGGIVRDAE TGEALRATVS VSGGASTPTS NDGSYALILP SGVYTLTASA LSYAPQQRVI TVPVSGSVDF GLLLDAPHLT LSTDHVTATL DFNTTVEQTV TITNTGTRPL TFEASVGYAP FGVYRSDEPG GPVYQWIDLP VDAPTLELTD TTRIDNIPLG FDFPLYTLTV TETSVTSDGT LSFGWPSSYT GLVERCLPGS EAFFYLLAPF RADLDPARGG QVRYGTVNSN ATFVVSFEDV PLAQGPPDQR YTFQALLHRD GRIVFQYADL SALPERLSVG VQKTMNQVQR IGCGADTPVT PGLAIEFRPQ FSPEGWLEVA PDRGTVAPGD SATLRLAYRW QGPPQGARLR TTVTVISSDP RRRNATIMAE AAMRPAPYAV WLGIVAR
|
| |