Gene Amir_4195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4195 
Symbol 
ID8328388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp4941518 
End bp4944778 
Gene Length3261 bp 
Protein Length1086 aa 
Translation table11 
GC content75% 
IMG OID644944659 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003101896 
Protein GI256378236 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0130517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCTCC GGAAAGCGCT CTCCGCCGGG GTGGTGGCGG CGGTCGTGGC GGCGGCGCTC 
ACCGCGACCG TCCCCACGGC CGCGGCGGGC CCCGAGACCG GCCGGGACGC GCCAGGCGCG
GCCGGGACCG GCACTGCCAG CGCCTCCACG ACCGGCGCCG CCACGACCGA CGTCTCTACG
ACCAGCGCCT CCACGACCGG CGCCGCCACG ACCGGCGGCC AACGGCCGCG CGGCGTGGTG
ACCCTCCTGA CCGGCGACGT CGTCGCCGTC GGCGGCGACG ACGTCCGGGT CACCCCCGGC
GCGGGCCGGG AGAAGATCGT GTTCCACCGC TCCGGCGGCC CGGACGTCCT GCGCGTGATC
CCCTCCGACG TGGCGGCGGA CGTCGCCTCG GGCAGGCTGG ACCGCGCCCT GTTCGACGTC
GCGGGCCTGA TCGCCCAGGG CTACGACGAC GCCCGCACCG ACCACCTGCC GCTGATCGTC
ACCGGCGCGC CCGACGCCGT CCGCCCCGCC GGGGACGCCG TCCGCGAGCT GCCCAGCGTC
GACGGCTACG CGCTCGACGT CCCCAAGTCC CGCCCGCTGC TCGCCTCCGC CGCCGAGCAG
GTCCCCGGCC GGATCTGGTT GGACGCCAAG GCCCGCACCA CCCTGGACCG CACCGCCGCG
CAGATCGGCG CGCCCGCCGC GTGGGCCGCC GGCCTGACCG GCGCGGGCGC GAAGGTCGCG
GTCCTGGACA CCGGCGTCGA CGCGGCCCAC CCGGACCTGG CAGGCGCGGT CGTGGAGTCC
GCGAACTTCA GCGACAGCGC CGACGCGGGC GACCGCGACG GCCACGGCAC GCACGTCGCC
TCCACCATCA CCGGCTCCGG CCGGTACCGG GGGATCGCGC CGGACGCGGT GATCCTCAAC
GGCAAGGTCC TGGACGACCG CGGCGGCGGC GCCTACTCGT GGATCATCGC CGGGATGGAG
TGGGCCGCGC CCCGCGCCGA CGTGGTCACC ATGAGCCTGG GCGCCCCCGC GAGCGAGGAC
GACCCGCTGA CCCTCGCGCT CGACCGGCTC ACCGCCGAGA CCGGCGCGCT GTTCGTGGTC
GCGGCGGGCA ACTCCGGTCC GCGCGCCTCC ACGGTCGGCA GCCCCGGATC GGCGGCGTCC
GCGCTGACCG TGGGCGCGGT CGACCGGGAC GACGTGCCGG CCCCGTTCTC CTCGCGCGGC
CCCGGCCCCG ACGAGCGCGT GCTCAAGCCG GACGTCACCG CGCCCGGCGT CGGCGTCGTG
GCGGCCGAGG CGGGCTCGCC GGACGGGCAC GTCGCCATGT CCGGGACCTC GATGGCCGCC
CCGCACGTCG CGGGCGCCGC CGCGATCCTG GCGCAGCAGC ACCCGGACTG GCTGGCCCCG
CAGCTCAAGG CCGCCCTGAT GGGCACGGCC GTCGACCCGA AGGGCGCGAC CGTCTACGAG
CAGGGCGCGG GCCGCGTCGA CCTGGCCCGC GCCACCACCA CCCCGCTGCA GGCCGACCCG
CCGTCGCTGG GGCTGGGGAC CCTGCGCTTC CCGCACGACG ACGGCGAGCA GCCGTCCCCG
CGCACCGTCA CCTACCGCAA CACCGGGGAC CAGCCGGAGG AGGTCGCGCT GACCGCCGTC
CTGCGCGACC CCTCCGGCGC CGAGATCCCC GGCGCGGTCT CGGTGTCGCC GTCCTCGGTC
ACCGTCCCGG CGGGCGGCTC CGCCGAGGTC GTCGTGACCA CCACCCTGCC CGCCGGCTCG
CCGATCGGCG CGTACAGCGG CGTGCTGCTG GCCGGGGACG CGGTGCGCGT CCCGATCGGG
CTGACCCGCG AGGGGGAGAT GCGCGACCTG CCGGTCCGGG TCCTCGACCA CGAGGGCGGG
CCCGCGTCGA TGTACACGTT CTGGCTGCTC AACACCGCCA CCGGCGAGGA GCACAGGATG
TTCGGCCCGT CCGGCTCCAC CACCGTCCGG CTGCCCGTCG GCGACTACCT GATGCACGCG
GTGATCGTCC TGGGTGAGAA GGCCACCACG TTCGTCGAAC CCGCCCTGCG GATCGACGGC
TCCTCGGTGC TGGAGCTGGA CGCCCGGCGC GGCGTGCCGA TGCGGGTCGC GGTGGACGAG
CCCGGCGCGG TCCCCGCCGC GGTCGGCGTG GCCCTCACCA TGGACGTGCT GGGCAGGCCA
GCGCACGCGA GCTCCTTCAG CTACGCCCCC GAGACGATGC TCTTCGTGCC GTCGAGCACC
ACCTCCGAGG CGGCCAAGAC CACCCTGGAG CACGCGCTGG TCCCGCCCGA GGGGTTCACC
GGCCTGCCCT ACCAGTACGC GCTGAAGTGG GCGGTCGAGG GTGGTGTTCC GCACGACCTG
AGCCGGGAGT TTCGCAAGCG CGACCTGGCG CACGTGAACG CCGCCGTCGC CTCCGGGGGC
AGCGGGAACG TGGTGCAGCA CGACCTCCTC ACCACGCTGG CCACACCGCA CGCGATCCTG
CTGCACTACA GCCCGGACGT GCCGTGGCGG CACCGGACTG ACCTCTGGTC CGAGCAGGGC
GGCAGCGGCA AGGGCACCCA GGAGCACGTG AAGGACGTGG TCTACCGCAA GGGCCAGGTG
CTGAGCGAGT CCTGGTACCG GGGTGTGCTC GGCCCGGCAT TCCCGGAGAT CGGCGAGGGG
GACGTGCCCC ACGCCTCGCG CCGGCGTGAC CAGTTCCTCT ACTGGGTTCC GCTGTTCACC
GACCAGGCCG CCAACCACAA GGGCGGCCGG GACTACAGCA CCGAGCACGT CGTGCTCACC
CGCGACGGTG AGGTGCTGCT GGACGAGCTG CGGCGGGGCC AGCACCTGCT GGCGCGGATG
CCCCAGGACC CGGGGGACTA CGAGCTGAGC GTCGACGCGT CCAGCGGCGT CGGGTTCGAC
CTGTCCACGC GGGTGAGCGC GAAGTGGCGC TTCCACTCCG AGCAGACCCA GGCGGAGGAG
GCGGTGCCGC TGCTGGCGGT GCGGTTCGCG CCGGACCTCG ACCAGCGCAA CCGGGCGCCG
CGCGGCCGGG TCACGATCCC GGTCTCGGTG CAGCGCAACG GGAGCGCGGA CGTGTCGGAC
GTGCGCAGGC CGAGCGTGGA GGTCTCGTAC GACGACGGCA GGACCTGGCG GGCGGCCCCG
GTGAGCGGCC GGGACGGCGA GTGGTCGGTG ACGACCGCGA GCCCGCCGGG GGCGGTGTTC
GCGTCACTGC GATCGTCCAC TTCGGACTCG TCCGGGAACT CGTTGGTGCA GACCATTATC
CGGGCGTACG CGCTGCGCTG A
 
Protein sequence
MPLRKALSAG VVAAVVAAAL TATVPTAAAG PETGRDAPGA AGTGTASAST TGAATTDVST 
TSASTTGAAT TGGQRPRGVV TLLTGDVVAV GGDDVRVTPG AGREKIVFHR SGGPDVLRVI
PSDVAADVAS GRLDRALFDV AGLIAQGYDD ARTDHLPLIV TGAPDAVRPA GDAVRELPSV
DGYALDVPKS RPLLASAAEQ VPGRIWLDAK ARTTLDRTAA QIGAPAAWAA GLTGAGAKVA
VLDTGVDAAH PDLAGAVVES ANFSDSADAG DRDGHGTHVA STITGSGRYR GIAPDAVILN
GKVLDDRGGG AYSWIIAGME WAAPRADVVT MSLGAPASED DPLTLALDRL TAETGALFVV
AAGNSGPRAS TVGSPGSAAS ALTVGAVDRD DVPAPFSSRG PGPDERVLKP DVTAPGVGVV
AAEAGSPDGH VAMSGTSMAA PHVAGAAAIL AQQHPDWLAP QLKAALMGTA VDPKGATVYE
QGAGRVDLAR ATTTPLQADP PSLGLGTLRF PHDDGEQPSP RTVTYRNTGD QPEEVALTAV
LRDPSGAEIP GAVSVSPSSV TVPAGGSAEV VVTTTLPAGS PIGAYSGVLL AGDAVRVPIG
LTREGEMRDL PVRVLDHEGG PASMYTFWLL NTATGEEHRM FGPSGSTTVR LPVGDYLMHA
VIVLGEKATT FVEPALRIDG SSVLELDARR GVPMRVAVDE PGAVPAAVGV ALTMDVLGRP
AHASSFSYAP ETMLFVPSST TSEAAKTTLE HALVPPEGFT GLPYQYALKW AVEGGVPHDL
SREFRKRDLA HVNAAVASGG SGNVVQHDLL TTLATPHAIL LHYSPDVPWR HRTDLWSEQG
GSGKGTQEHV KDVVYRKGQV LSESWYRGVL GPAFPEIGEG DVPHASRRRD QFLYWVPLFT
DQAANHKGGR DYSTEHVVLT RDGEVLLDEL RRGQHLLARM PQDPGDYELS VDASSGVGFD
LSTRVSAKWR FHSEQTQAEE AVPLLAVRFA PDLDQRNRAP RGRVTIPVSV QRNGSADVSD
VRRPSVEVSY DDGRTWRAAP VSGRDGEWSV TTASPPGAVF ASLRSSTSDS SGNSLVQTII
RAYALR