Gene Amir_1039 details

Gene Information

Locus tag: Amir_1039
Symbol:
ID: 8325211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 1152123
End bp: 1153850
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 73%
IMG OID: 644941583
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003098841
Protein GI: 256375181
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
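
The start, end, gene length, and protein length fields above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields.
START_BP = 1152123
END_BP = 1153850


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming the CDS includes exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp")                  # 1728 bp
    print(protein_length(length), "aa")  # 575 aa
```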


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGATCAC GCAGAATCGC CCACCCCGCC CTGGTGCTGG TGGGAGCCGC CGCACTCGTC 
GCCGCCGCGA CGACGCCCGC CGCCGCCGAG CAGGCCACCA CCGAGTACAC CGTGCTGGTC
GAGGATGGGG CCAGCCGGGA CGCCGCGGTC GCGGCGGTGC GCGCTGCGGG CGGCCACCTG
GTCCGCGAGA ACAGCGCGGT CGGGATGCTG GTCGTGCGGG CGCCGGAGTC CGGTTTCGCC
GGGCGCGTCT CCGCCTCGCC GTCGGTGCTG GGCGCGGCCA CGGCCAAGCC GATCGGCCGC
ACGCCCGGTT CGGTGGGCAA CCGCGAGTGG TCCGACGTGG AGAAGGAGAA CACCGCGCTG
GGCGCGGCGA AGGCCCCCTC CGCGTCGCGC AAGGCCGCTG CCGCGCAGGC GGGCCTCGAC
CCGCTGGACA GCGACCTGTG GGGCCTGCGC GCGGTGCGCT CCGACATCGC CCGCGCCAAG
CAGCCCGGCG ACAAGCGGGT CAAGGTCGGC GTGATCGACA CCGGCGTCGA CGGCAACCAC
CCGGACATCG CGCCGAACTT CGACCGCGAC CTGTCCCGCA ACTTCACCGT CGACCTGCCC
TACGACGCGG ACGGCGGCGA GTTCGACGGC CCGTGCGAGT TCCGGGGCTG CGTCGACCCG
GCGGACCACG ACGACGGCGG CCACGGCACG CACGTCGCGG GCACCATCGG CGCGGCGGCC
AACGGCTCCG GCGTCTCCGG CGTCGCCCCG AACGTCACCC TGGTGAACGT GCGGGCGGGC
CAGGACTCGG GCACGTTCTT CCTGCAGCCG GTCGTCGACG CGCTCACCTA CAGCGCGGAC
GCGGGCCTGG ACGTGGTGAA CATGTCCTTC TACGTCGACC CGTGGTACAT GAACTGCGGC
AACGACCCCA CGGCCACCGC CGAGGAGCAG CTGGAGCAGC GCACCACCAT CACCGCGGTG
CAGCGCGCCC TGAACTACGC GCACGGCAAG GGCGTCACGC TGATCGGCGC GGGCGGCAAC
AACCACGAGG ACCTCGGCAG CCCGAGGACG GACGTGGTGA GCCCGAACTA CCCGCCCGGC
CACGCCCGGC CGCGCCCGGT GGACTCCAGC TGCCTGAACC TGCCGACCGA GGGCGACCAC
GTCATCTCGG TGTCGGCGCT CGGCCCGTCG CTGACGAAGG CCGACTTCTC GAACTACGGC
ACCGAGCAGA CCGAGCTGTC CGCGCCCGGC GGGTACTTCC GCGACGGCCT GGGCACCGAC
TGGTACCGCA CGAACGAGAA CCTGGTCCTG TCGACCTACC CGCGCAACGT GGCCCTGGCC
GACGGCGCGA TCGACGCGGA CGGCAACCTC ACCCCGGCGG GCGAGTCGGC GGGCGTGAGG
AAGGACTGCT CGACCGGGAC CTGCGCGTAC TACCAGTTCC TGCAGGGCAC CTCGATGGCC
GCCCCGCACG CGTCCGGCGT GGCCGCGCTG GTGGTCAGCC AGTACGGCAA GAACGACAAG
AAGCACCCCG GCACGCTGAC CATGGCGCCG GACAAGGTGA AGACCGTGCT GACCGGGACC
GCGACCAAGC GGCCGTGCCC GGTGCCCAGG ACGGTGTCGT ACGTGAACGT CGGCCGCTCG
GCCGAGTTCG ACGCGACCTG CGTGGGCGAC GCGAAGTTCA ACGGCTTCTA CGGCCACGGC
ATCGTCGACG CGTACGGCGC GGTGACCAGG GGCGGCGGGC TGATCTAG
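
The gene sequence above is laid out in space-separated blocks of 10 bases. A GC percentage such as the one reported in the Gene Information section can be recomputed from that layout. This is a minimal sketch, assuming the sequence contains only A, C, G, and T once whitespace is removed. The function name `gc_percent` is hypothetical.

```python
def gc_percent(blocked_dna: str) -> float:
    """Return the percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(blocked_dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence only; paste the full
    # sequence to recompute the gene-wide value.
    print(gc_percent("GTGAGATCAC GCAGAATCGC"))
```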
 
Protein sequence
MRSRRIAHPA LVLVGAAALV AAATTPAAAE QATTEYTVLV EDGASRDAAV AAVRAAGGHL 
VRENSAVGML VVRAPESGFA GRVSASPSVL GAATAKPIGR TPGSVGNREW SDVEKENTAL
GAAKAPSASR KAAAAQAGLD PLDSDLWGLR AVRSDIARAK QPGDKRVKVG VIDTGVDGNH
PDIAPNFDRD LSRNFTVDLP YDADGGEFDG PCEFRGCVDP ADHDDGGHGT HVAGTIGAAA
NGSGVSGVAP NVTLVNVRAG QDSGTFFLQP VVDALTYSAD AGLDVVNMSF YVDPWYMNCG
NDPTATAEEQ LEQRTTITAV QRALNYAHGK GVTLIGAGGN NHEDLGSPRT DVVSPNYPPG
HARPRPVDSS CLNLPTEGDH VISVSALGPS LTKADFSNYG TEQTELSAPG GYFRDGLGTD
WYRTNENLVL STYPRNVALA DGAIDADGNL TPAGESAGVR KDCSTGTCAY YQFLQGTSMA
APHASGVAAL VVSQYGKNDK KHPGTLTMAP DKVKTVLTGT ATKRPCPVPR TVSYVNVGRS
AEFDATCVGD AKFNGFYGHG IVDAYGAVTR GGGLI
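
The gene sequence begins with GTG, yet the protein sequence begins with M. That is expected under translation table 11, where GTG can act as a start codon and is then read as methionine. The sketch below illustrates this. It assumes the standard codon assignments (which table 11 shares with table 1) and, as a simplification, treats only ATG, GTG, and TTG as start codons. The names `translate` and `START_CODONS` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(blocked_dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks of the gene sequence; the trailing "GC" is an
    # incomplete codon and is ignored.
    print(translate("GTGAGATCAC GCAGAATCGC"))  # MRSRRI
```

The output matches the first six residues of the protein sequence (MRSRRI).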