Gene Amir_4194 details


Gene Information

Locus tag: Amir_4194
Symbol:
ID: 8328387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 4938033
End bp: 4941230
Gene Length: 3198 bp
Protein Length: 1065 aa
Translation table: 11
GC content: 76%
IMG OID: 644944658
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003101895
Protein GI: 256378235
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
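
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 4938033
END_BP = 4941230
GENE_LENGTH_BP = 3198
PROTEIN_LENGTH_AA = 1065


def span_length(start: int, end: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 3198
print(expected_protein_length(GENE_LENGTH_BP))  # 1065
```

Under these assumptions both derived values agree with the listed gene length and protein length.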


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTCTCC GGAAACAGCT CTCCACGGGG GTGGTGGCGG TGGTCGCGGC GGCGGCGCTC 
ACCGCCACCA CCGCTGCGGC CGAACCCGCG GGCCCCGACC CCGAACGGCA GGGCGCGGCA
GGCCCCGCCG TGCGCGGCAC GGTCACCCTG CTGACCGGCG ACACCGTCAC CGTCCGGGGT
GACGACGTCC AGGTCAAGCC CGGCGCGGGC CGGGAGAAGA TCGCGTTCCA GCGCTCCGCC
GACCGCGACG GGCTGCACGT CGTCCCCTCC GACGTGGTGG CGGACGTCGC GTCCGGCAGG
CTCGACCGCA GGCTGTTCGA CGTCGCGGGC CTGATCGCCC AGGGCTATGA CGACGCCCGC
ACCGACCACC TGCCGCTGAT CGTCACCGGC GCCCCCGGCG CGGCCCGCGT GGCCGGTGAC
GGCGTCCGCG AGCTGCCCAG CGTCGACGGC TACGCGCTCA AGGCCCCCAA GTCCCGGCCC
GTGCTGGCGG CGGGCGTGGA GAGCGCGGGC GGCCGGATCT GGTTGGACGG CAAGGTCTCC
GCGACCCTGG ACCGCAGCAC CGCGCAGATC GGCGCGCCCG CCGCGTGGGC CGCCGGCCTG
ACCGGCGCGG GCGCGAAGGT CGCGGTCCTG GACACCGGCG TCGACGCCAC CCACCCGGAC
CTCGCGGGCG CGGTCGCCGA GTCGGCCGAC TTCACCGACG CCGCCGACGC CGACGACCAC
CTCGGCCACG GCACGCACGT CGCCGCCACG GTCACCGGCG CGGGCAAGTA CCGGGGCGTC
GCGCCCGACG CCGAGGTGCT CAACGGCAAG GTGCTCGACG ACACCGGTGG CGGCTACGAC
TCCTGGATCA TCGCCGGGAT GGAGTGGGCC GCGGCCCGCG CCGACGTGGT CAGCATGAGC
CTGGGCGGCC CCGCCACCGA CGGCGCCGAC CCCATGTCGC TCGCCGTCGA CCGGCTCACC
GCCGAGACCG GCGCGCTGTT CGTGATCGCC GCGGGCAACT CGGGCGGGGC CTCCACGGTC
GGCAGCCCCG GATCGGCGGC CTCCGCGCTG ACCGTGGGCG CGGTCGACCG GGACGACTCG
CTCGCCCCGT TCTCCTCGCG CGGCCCCAGG ACCGGCGACT ACGCGATCAA GCCGGAGATC
ACCGCGCCGG GCGTGAACAT CGTCGCCGCC AAGGCGAAGA ACGGCGTCAT CGGCACCCCG
GTGGACGACG CGCACGTCGC CATGTCCGGG ACCTCGATGG CCGCCCCGCA CGTCGCGGGC
GCCGCCGCGA TCCTGGCGCA GCAGCACCCG GACTGGCGGG CCCCGCAGCT CAAGGCCGCC
CTGATGGGCG CCGCCGTCGA CCCCAAGGGC GCCACCGTCT ACGAGCAGGG CGCCGGGCGC
GTCGACCTGG CCCGCGCCAC CACGATCCCG GTGCAGGCCG ACCCGGCCGC GCTCGACCTG
GACACCCTGC GCTTCCCGCA CGACGACGGC GAGCAGCCGT CCCCGCGCAC CGTCACCTAC
CGCAACACCG GGGACCAGCC CGTGGAGCTC GCCCTCAGCG GCGTGCTGCG CGACCCCTCC
GGCGCCGAGA TCCCCGGCGC GGTCTCGATG TCGCCGTCCT CGGTCACCGT CCCGGCGGGC
GGCTCCGCCG AGGTCGTCGT GACCACCACC CTGCCCGCCG ACTCGCCGAT CGGCGCGTAC
AGCGGCGTGC TGCTGGCCGG GGACGCGGTG CGCGTCCCGA TCGGGCTGAC CCGCGAGCGG
GAGAGCTACG ACGTCACCGT CACCGCGACC GACCACGCGG GCGCGCCCGC GTCGGACTAC
GGGTACGCGC TGCTGAACCT GGAGACCGGC GAGCGCTTCG GCAGGTTCGA CCCGTCCGGG
CGGGTCACCG TCCGGGTGCC CAGGGGCAGG TACGCGGTGC AGGGCATGGT CTACAGCGGG
GAGCGCGCGA CGCTGTTCGT CGAACCGGCC TTCGAGGTCG GCGGCCCGTC GGCGCTGGAG
CTGGACGCCC GGCGCGGCCG GCAGCTCAAG CCGAAGGTCG AGGCGCGCGG CGCGCAGGTC
GGGTACGTGC AGGCGCTGAC CCTCATCCCC CTCGGCGACT CGTGGGTCAG CGCGGGCGGG
AGCGCCGGGA GCGCCGAGGC GCTGCTGCTC GCGCCGTCGC GGACCCGCGA CGAGGACGCC
AACACCGCCC TGCACGCGAC GCTGGCCAAG GCGGACGGCG CGGGCAGGTT CACCGGCAGC
CCGTACCAGT ACCGGCTGGC CTGGGAGAAC GCGGGCGGCA TCCCGGAGGA CGTCGGCCGG
GTCAGGGCGG TGCGCGACCG GGAGCTCGGC CGCGTCGACG CCACCGCCGC CGCGGTCGCG
GACGGGAGCT GGGTGGTGTA CCCGGAGAAC GCGGTCGTGG CCGCGCCGAG CACGACCCGG
CTGCACTACA CGCCGGGCGT GGAGTGGAGC CAGAGCGCGT TCCTGCTGGA CTCCCCGGAC
GCCCGCGCCA ACCGGGCGTA CCAGGGGAGG GGGATGCCGA AGGCGCTGCG CGCGGGCGAG
GTCGTCCGCG AGTCGTGGTA CCGGGGCGTG CTCGGCCCTG CCTTCCCCCT GACGCCCGGA
GGGGCGCTGT TCAGCGCCGG GCGCACCGAC GACACGGTGA TCTACTTCCC GGACCTGTTC
AGCGACCAGG ACCCCAACCA CTACGGGGGC CGCTTCGACG TCACCGGGAA GATCGCGCTC
AGCCGGGACG GCCAGCCGGT GGCGGAGGCC CCGGTGTCGG ACTACCTGAT CGCGGACGTG
CCCGCCGAGG CGGGCGCGTA CGTGCTGGAG GCGAACGCCA CCGGCGGCGG TTACGCCGTG
TCGACCGAGG TGAGCGCCCG CTGGTCGTTC CGCTCGGAGC ACGCCCAGGA GCCCGCCTTC
CTGCCGCTGC TGGCGGTGCG CTTCGCGCCG GACGTGGACG AGCGCAACCG GGCGTCGCGG
GGCCGGTCGA CGATCCCGGT CTCGGTGCAG CGCAACGGGA GCGCGGAGGC GTCGGACGTG
CGCAGGCCGA GCGTGGAGGT CTCGTACGAC GACGGCAGGA CCTGGCGGGC GGCCCCGGTG
AGCGGGCGGA ACGGGAAGTG GTCGGTGACC ACGGTCAGCC CAGCGGGGGC GACGCACGCG
TCACTGCGGG CGTCCACTTC GGACTCGTCC GGGAACTCGG TGCAGCAGAC CGTGATCCGG
GCGTACGCGT TGCGCTGA
 
Protein sequence
MPLRKQLSTG VVAVVAAAAL TATTAAAEPA GPDPERQGAA GPAVRGTVTL LTGDTVTVRG 
DDVQVKPGAG REKIAFQRSA DRDGLHVVPS DVVADVASGR LDRRLFDVAG LIAQGYDDAR
TDHLPLIVTG APGAARVAGD GVRELPSVDG YALKAPKSRP VLAAGVESAG GRIWLDGKVS
ATLDRSTAQI GAPAAWAAGL TGAGAKVAVL DTGVDATHPD LAGAVAESAD FTDAADADDH
LGHGTHVAAT VTGAGKYRGV APDAEVLNGK VLDDTGGGYD SWIIAGMEWA AARADVVSMS
LGGPATDGAD PMSLAVDRLT AETGALFVIA AGNSGGASTV GSPGSAASAL TVGAVDRDDS
LAPFSSRGPR TGDYAIKPEI TAPGVNIVAA KAKNGVIGTP VDDAHVAMSG TSMAAPHVAG
AAAILAQQHP DWRAPQLKAA LMGAAVDPKG ATVYEQGAGR VDLARATTIP VQADPAALDL
DTLRFPHDDG EQPSPRTVTY RNTGDQPVEL ALSGVLRDPS GAEIPGAVSM SPSSVTVPAG
GSAEVVVTTT LPADSPIGAY SGVLLAGDAV RVPIGLTRER ESYDVTVTAT DHAGAPASDY
GYALLNLETG ERFGRFDPSG RVTVRVPRGR YAVQGMVYSG ERATLFVEPA FEVGGPSALE
LDARRGRQLK PKVEARGAQV GYVQALTLIP LGDSWVSAGG SAGSAEALLL APSRTRDEDA
NTALHATLAK ADGAGRFTGS PYQYRLAWEN AGGIPEDVGR VRAVRDRELG RVDATAAAVA
DGSWVVYPEN AVVAAPSTTR LHYTPGVEWS QSAFLLDSPD ARANRAYQGR GMPKALRAGE
VVRESWYRGV LGPAFPLTPG GALFSAGRTD DTVIYFPDLF SDQDPNHYGG RFDVTGKIAL
SRDGQPVAEA PVSDYLIADV PAEAGAYVLE ANATGGGYAV STEVSARWSF RSEHAQEPAF
LPLLAVRFAP DVDERNRASR GRSTIPVSVQ RNGSAEASDV RRPSVEVSYD DGRTWRAAPV
SGRNGKWSVT TVSPAGATHA SLRASTSDSS GNSVQQTVIR AYALR
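
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a rough sketch of how such a block-formatted sequence could be cleaned, translated, and checked for GC content. It assumes translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in their permitted start codons), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first line of the gene sequence is used here to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Amino acid
# assignments for table 11 are assumed to match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First line of the gene sequence, copied from the record above.
FIRST_LINE = "ATGCCTCTCC GGAAACAGCT CTCCACGGGG GTGGTGGCGG TGGTCGCGGC GGCGGCGCTC"


def clean(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop.

    Alternative start codons are not converted to M; the sequence in this
    record starts with ATG, so that case does not arise here.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate(FIRST_LINE))  # MPLRKQLSTGVVAVVAAAAL
```

The printed peptide matches the first twenty residues of the protein sequence listed above. Running `gc_fraction` on the full gene sequence would give a value to compare against the listed GC content.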