Gene Amir_2202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_2202 
Symbol 
ID8326391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2437727 
End bp2439520 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content72% 
IMG OID644942749 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003099990 
Protein GI256376330 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4935] Regulatory P domain of the subtilisin-like proprotein convertases and other proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.176046 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGACG CTTACCGGAG GCTGGCCGCG CTCGGCCTGA CCGCCGTCAC AGCGACCGCG 
CTGGCCGTCA CGACGGCGGG CAGCGCGTGG GCCACGGGCG AGGTCGTGGG CGCGAGCAGC
GCCGACGCCG TGCCCGGCTC GTACGTGGTG ACCCTGCGCG ACGACGCCTC TCCGCGCGCC
GCCGCCACCA GCGCCGCCAC CTCGCTGACC TCGCGCTACG GCGGCGAGGT CAAGCGCACC
TACTCGCACG CCCTGAACGG CTTCCACGCC ACCATGTCCG CCGAGCAGGC CGCGAAGCTC
GCCGCCGACC CCAAGGTCGC GATGGTGCAG GCGGACCTGC GGATCACCGT CGACGCGGTC
CAGCCGAACC CGCCGTCGTG GGGCCTCGAC CGGATCGACC AGCGCGACCT GCCGCTGGAC
AGCTCCTACT CGTACGAGAC GGGCGCGTCG AACGTGACCG CCTACATCAT CGACACCGGC
ATCCGCACCA CGCACAGCAC GTTCGGCGGC CGGGCGAGCT GGGGCGCCAA CACGATCGAC
ACCAACAACA CCGACTGCCA GGGCCACGGC ACGCACGTCG CGGGCACCGT CGGCGGCGCG
GAGTACGGCG TGGCCAAGGA GGTGAAGCTG GTCGCGGTCA AGGTGCTCAA CTGCGCGGGC
AGCGGCACCA CGGCCAGCGT CGTCGGCGGC ATCGACTGGG TGACCGCCAA CGCGGTGAAG
CCCGCCGTCG CGAACATGAG CCTGGGCGGC GGCGCGGACG CCACCCTGGA CGCGGCGGTG
CGCACCTCGG TCGCCTCGGG CGTCACGCAC GTGGTGGCCT CGGGCAACAG CAGCGCGAAC
GCCTGCAGCT ACTCCCCCGC CCGCGTGGCC GAGGCGATCA GCGTGAACGC CTCGACCAGG
ACGGACGCGC GCGCCTCGTT CTCCAACTTC GGGACGTGCA CGGACCTCTT CGCGCCGGGT
GAGGGCATCA CCTCGTCGTG GAACACCAAC GACACCGCCA CCAACACCAT CAGCGGCACG
TCGATGGCGT CCCCGCACGT GGCGGGCGGC GCGGCGCTGT ACCTGGCGGG GAACCCGACC
GCCGCGCCCG CGACCGTGGA GGCCGCGCTG CTCTCGGCCG CCAGCTCGGA CAAGATCGGC
AACGCGGGCG CCGGGTCGCC GAACAAGCTG CTGTTCACCG GCAGCACGAC GACCCCGTCG
ATCACCAACC CCGGTGCCAA GGCGAACCTG GCGGGCGACT CGGTCAGCTT CCAGCTGTCG
GTGTTCGGCG GCACCGCGCC GCACGTGTTC ACCGCGACCG GGCTGCCGGA CGGCGTGACC
ATCAGCGACG GCGGCCTGGT CTCCGGGTCG CCGACGACGG CGGGCACGTA CTCGGTGACC
GTGACGGCCA CGGACTCGCT GGGCGTGTCC GGGTCGACCA CGTTCAGCTG GCTGGTGGTG
GAGCCGGGCG CCGAGTGCCC GACGACCACC AACAGCACCG CCTACCCGAT CGCCGACAAC
TCGACGGTGA GCAGCCCGAT CACCCTGGCG TGCGGGGCGC TGGCCTCGGC GACGACCACG
GTGACCGTGG ACATCACGCA CACCTACATC GGTGACCTGG TCGTGGACCT GGTCGCGCCG
GACGGGTCGG TGTACAACCT GCACAACCGG ACCGGGGGCA GCGCGGACGA CATCAAGCGC
AGCTTCACGG TGGACGCGTC GAGCGAGGTC GTGGTCGGCA CCTGGACGCT GCGGGTGCAG
GACGCGGCGT CGCTGGACAC CGGTCGGCTG AACTCGTGGT CGCTGGACGT GTGA
 
Protein sequence
MGDAYRRLAA LGLTAVTATA LAVTTAGSAW ATGEVVGASS ADAVPGSYVV TLRDDASPRA 
AATSAATSLT SRYGGEVKRT YSHALNGFHA TMSAEQAAKL AADPKVAMVQ ADLRITVDAV
QPNPPSWGLD RIDQRDLPLD SSYSYETGAS NVTAYIIDTG IRTTHSTFGG RASWGANTID
TNNTDCQGHG THVAGTVGGA EYGVAKEVKL VAVKVLNCAG SGTTASVVGG IDWVTANAVK
PAVANMSLGG GADATLDAAV RTSVASGVTH VVASGNSSAN ACSYSPARVA EAISVNASTR
TDARASFSNF GTCTDLFAPG EGITSSWNTN DTATNTISGT SMASPHVAGG AALYLAGNPT
AAPATVEAAL LSAASSDKIG NAGAGSPNKL LFTGSTTTPS ITNPGAKANL AGDSVSFQLS
VFGGTAPHVF TATGLPDGVT ISDGGLVSGS PTTAGTYSVT VTATDSLGVS GSTTFSWLVV
EPGAECPTTT NSTAYPIADN STVSSPITLA CGALASATTT VTVDITHTYI GDLVVDLVAP
DGSVYNLHNR TGGSADDIKR SFTVDASSEV VVGTWTLRVQ DAASLDTGRL NSWSLDV