Gene Information |
Locus tag | Amir_4194 |
Symbol | |
ID | 8328387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 4938033 |
End bp | 4941230 |
Gene Length | 3198 bp |
Protein Length | 1065 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 644944658 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_003101895 |
Protein GI | 256378235 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCTCC GGAAACAGCT CTCCACGGGG GTGGTGGCGG TGGTCGCGGC GGCGGCGCTC ACCGCCACCA CCGCTGCGGC CGAACCCGCG GGCCCCGACC CCGAACGGCA GGGCGCGGCA GGCCCCGCCG TGCGCGGCAC GGTCACCCTG CTGACCGGCG ACACCGTCAC CGTCCGGGGT GACGACGTCC AGGTCAAGCC CGGCGCGGGC CGGGAGAAGA TCGCGTTCCA GCGCTCCGCC GACCGCGACG GGCTGCACGT CGTCCCCTCC GACGTGGTGG CGGACGTCGC GTCCGGCAGG CTCGACCGCA GGCTGTTCGA CGTCGCGGGC CTGATCGCCC AGGGCTATGA CGACGCCCGC ACCGACCACC TGCCGCTGAT CGTCACCGGC GCCCCCGGCG CGGCCCGCGT GGCCGGTGAC GGCGTCCGCG AGCTGCCCAG CGTCGACGGC TACGCGCTCA AGGCCCCCAA GTCCCGGCCC GTGCTGGCGG CGGGCGTGGA GAGCGCGGGC GGCCGGATCT GGTTGGACGG CAAGGTCTCC GCGACCCTGG ACCGCAGCAC CGCGCAGATC GGCGCGCCCG CCGCGTGGGC CGCCGGCCTG ACCGGCGCGG GCGCGAAGGT CGCGGTCCTG GACACCGGCG TCGACGCCAC CCACCCGGAC CTCGCGGGCG CGGTCGCCGA GTCGGCCGAC TTCACCGACG CCGCCGACGC CGACGACCAC CTCGGCCACG GCACGCACGT CGCCGCCACG GTCACCGGCG CGGGCAAGTA CCGGGGCGTC GCGCCCGACG CCGAGGTGCT CAACGGCAAG GTGCTCGACG ACACCGGTGG CGGCTACGAC TCCTGGATCA TCGCCGGGAT GGAGTGGGCC GCGGCCCGCG CCGACGTGGT CAGCATGAGC CTGGGCGGCC CCGCCACCGA CGGCGCCGAC CCCATGTCGC TCGCCGTCGA CCGGCTCACC GCCGAGACCG GCGCGCTGTT CGTGATCGCC GCGGGCAACT CGGGCGGGGC CTCCACGGTC GGCAGCCCCG GATCGGCGGC CTCCGCGCTG ACCGTGGGCG CGGTCGACCG GGACGACTCG CTCGCCCCGT TCTCCTCGCG CGGCCCCAGG ACCGGCGACT ACGCGATCAA GCCGGAGATC ACCGCGCCGG GCGTGAACAT CGTCGCCGCC AAGGCGAAGA ACGGCGTCAT CGGCACCCCG GTGGACGACG CGCACGTCGC CATGTCCGGG ACCTCGATGG CCGCCCCGCA CGTCGCGGGC GCCGCCGCGA TCCTGGCGCA GCAGCACCCG GACTGGCGGG CCCCGCAGCT CAAGGCCGCC CTGATGGGCG CCGCCGTCGA CCCCAAGGGC GCCACCGTCT ACGAGCAGGG CGCCGGGCGC GTCGACCTGG CCCGCGCCAC CACGATCCCG GTGCAGGCCG ACCCGGCCGC GCTCGACCTG GACACCCTGC GCTTCCCGCA CGACGACGGC GAGCAGCCGT CCCCGCGCAC CGTCACCTAC CGCAACACCG GGGACCAGCC CGTGGAGCTC GCCCTCAGCG GCGTGCTGCG CGACCCCTCC GGCGCCGAGA TCCCCGGCGC GGTCTCGATG TCGCCGTCCT CGGTCACCGT CCCGGCGGGC GGCTCCGCCG AGGTCGTCGT GACCACCACC CTGCCCGCCG ACTCGCCGAT CGGCGCGTAC AGCGGCGTGC TGCTGGCCGG GGACGCGGTG CGCGTCCCGA TCGGGCTGAC CCGCGAGCGG GAGAGCTACG ACGTCACCGT CACCGCGACC GACCACGCGG GCGCGCCCGC GTCGGACTAC 
GGGTACGCGC TGCTGAACCT GGAGACCGGC GAGCGCTTCG GCAGGTTCGA CCCGTCCGGG CGGGTCACCG TCCGGGTGCC CAGGGGCAGG TACGCGGTGC AGGGCATGGT CTACAGCGGG GAGCGCGCGA CGCTGTTCGT CGAACCGGCC TTCGAGGTCG GCGGCCCGTC GGCGCTGGAG CTGGACGCCC GGCGCGGCCG GCAGCTCAAG CCGAAGGTCG AGGCGCGCGG CGCGCAGGTC GGGTACGTGC AGGCGCTGAC CCTCATCCCC CTCGGCGACT CGTGGGTCAG CGCGGGCGGG AGCGCCGGGA GCGCCGAGGC GCTGCTGCTC GCGCCGTCGC GGACCCGCGA CGAGGACGCC AACACCGCCC TGCACGCGAC GCTGGCCAAG GCGGACGGCG CGGGCAGGTT CACCGGCAGC CCGTACCAGT ACCGGCTGGC CTGGGAGAAC GCGGGCGGCA TCCCGGAGGA CGTCGGCCGG GTCAGGGCGG TGCGCGACCG GGAGCTCGGC CGCGTCGACG CCACCGCCGC CGCGGTCGCG GACGGGAGCT GGGTGGTGTA CCCGGAGAAC GCGGTCGTGG CCGCGCCGAG CACGACCCGG CTGCACTACA CGCCGGGCGT GGAGTGGAGC CAGAGCGCGT TCCTGCTGGA CTCCCCGGAC GCCCGCGCCA ACCGGGCGTA CCAGGGGAGG GGGATGCCGA AGGCGCTGCG CGCGGGCGAG GTCGTCCGCG AGTCGTGGTA CCGGGGCGTG CTCGGCCCTG CCTTCCCCCT GACGCCCGGA GGGGCGCTGT TCAGCGCCGG GCGCACCGAC GACACGGTGA TCTACTTCCC GGACCTGTTC AGCGACCAGG ACCCCAACCA CTACGGGGGC CGCTTCGACG TCACCGGGAA GATCGCGCTC AGCCGGGACG GCCAGCCGGT GGCGGAGGCC CCGGTGTCGG ACTACCTGAT CGCGGACGTG CCCGCCGAGG CGGGCGCGTA CGTGCTGGAG GCGAACGCCA CCGGCGGCGG TTACGCCGTG TCGACCGAGG TGAGCGCCCG CTGGTCGTTC CGCTCGGAGC ACGCCCAGGA GCCCGCCTTC CTGCCGCTGC TGGCGGTGCG CTTCGCGCCG GACGTGGACG AGCGCAACCG GGCGTCGCGG GGCCGGTCGA CGATCCCGGT CTCGGTGCAG CGCAACGGGA GCGCGGAGGC GTCGGACGTG CGCAGGCCGA GCGTGGAGGT CTCGTACGAC GACGGCAGGA CCTGGCGGGC GGCCCCGGTG AGCGGGCGGA ACGGGAAGTG GTCGGTGACC ACGGTCAGCC CAGCGGGGGC GACGCACGCG TCACTGCGGG CGTCCACTTC GGACTCGTCC GGGAACTCGG TGCAGCAGAC CGTGATCCGG GCGTACGCGT TGCGCTGA
|
Protein sequence | MPLRKQLSTG VVAVVAAAAL TATTAAAEPA GPDPERQGAA GPAVRGTVTL LTGDTVTVRG DDVQVKPGAG REKIAFQRSA DRDGLHVVPS DVVADVASGR LDRRLFDVAG LIAQGYDDAR TDHLPLIVTG APGAARVAGD GVRELPSVDG YALKAPKSRP VLAAGVESAG GRIWLDGKVS ATLDRSTAQI GAPAAWAAGL TGAGAKVAVL DTGVDATHPD LAGAVAESAD FTDAADADDH LGHGTHVAAT VTGAGKYRGV APDAEVLNGK VLDDTGGGYD SWIIAGMEWA AARADVVSMS LGGPATDGAD PMSLAVDRLT AETGALFVIA AGNSGGASTV GSPGSAASAL TVGAVDRDDS LAPFSSRGPR TGDYAIKPEI TAPGVNIVAA KAKNGVIGTP VDDAHVAMSG TSMAAPHVAG AAAILAQQHP DWRAPQLKAA LMGAAVDPKG ATVYEQGAGR VDLARATTIP VQADPAALDL DTLRFPHDDG EQPSPRTVTY RNTGDQPVEL ALSGVLRDPS GAEIPGAVSM SPSSVTVPAG GSAEVVVTTT LPADSPIGAY SGVLLAGDAV RVPIGLTRER ESYDVTVTAT DHAGAPASDY GYALLNLETG ERFGRFDPSG RVTVRVPRGR YAVQGMVYSG ERATLFVEPA FEVGGPSALE LDARRGRQLK PKVEARGAQV GYVQALTLIP LGDSWVSAGG SAGSAEALLL APSRTRDEDA NTALHATLAK ADGAGRFTGS PYQYRLAWEN AGGIPEDVGR VRAVRDRELG RVDATAAAVA DGSWVVYPEN AVVAAPSTTR LHYTPGVEWS QSAFLLDSPD ARANRAYQGR GMPKALRAGE VVRESWYRGV LGPAFPLTPG GALFSAGRTD DTVIYFPDLF SDQDPNHYGG RFDVTGKIAL SRDGQPVAEA PVSDYLIADV PAEAGAYVLE ANATGGGYAV STEVSARWSF RSEHAQEPAF LPLLAVRFAP DVDERNRASR GRSTIPVSVQ RNGSAEASDV RRPSVEVSYD DGRTWRAAPV SGRNGKWSVT TVSPAGATHA SLRASTSDSS GNSVQQTVIR AYALR
|