Gene Amir_2223 details

Gene Information

Locus tag: Amir_2223
Symbol:
ID: 8326412
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 2458685
End bp: 2459815
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 78%
IMG OID: 644942769
Product: peptidase M50
Protein accession: YP_003100010
Protein GI: 256376350
COG category: [R] General function prediction only
COG ID: [COG1994] Zn-dependent proteases
TIGRFAM ID:
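
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence ends in TAG). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2458685
end_bp = 2459815
gene_length_bp = 1131
protein_length_aa = 376

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop
# codon and does not contribute an amino acid.
codons = span // 3
coding_codons = codons - 1

print(span == gene_length_bp)              # coordinate span matches gene length
print(span % 3 == 0)                       # length is a multiple of three
print(coding_codons == protein_length_aa)  # codons minus stop match protein length
```

Under these assumptions the span is 1131 bp, which is 377 codons, or 376 amino acids plus one stop codon, in agreement with the listed lengths.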


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGACTG CGGAGGGCTG GCGGGCGAGG GCGGGCCGGG AGGGCGGTCT GCCGCTGTTC 
CGCGCCGCGG GCATCCCGGT GCTGCTGGCG CCCTCGTGGT GGTTGGGCTC GGCGGTCATC
GTCGTGCTGT ACGCGCCGCT CGCGAGCCGG ATCAGCCCGG ACGCAGGCGG CTTCACCGGC
CTGGCGCTCG CCGCCGCGTT CGCGCTGTTC CTTGGCCTGT CCGTGCTGGC CCACGAGCTG
GGCCACAGCC TGGTCGCGCT GCGCCTGGGC CTGCCGGTGC GCAGGCTGCG GCTGTTCCTG
CTCGGCGGGG TCTCCGAGGT GGCCAGGGCC CCCGGCACCC CGCGCCACGA GGGCCTGGTC
GCGGCGGCGG GACCGCTGGT GTCCGTGCTG CTCGCGGGCG TGTTCGCGCT CGGCGCCCAC
GCCATCCCGA CCACCGACGC GGTGTGGCTG CTGGTCGCGC AGACCTCGTT CGCCAACGCC
GCCGTCGCCG TGTTCAACCT CCTGCCGGGC CTGCCGCTGG ACGGCGGGCG CATCCTGCGC
GCGGGCGTCT GGGCCATCAC CGGCAAGCGC GCCACCGGCA CCAGGGCCGC CGTCATCGGC
GGTGGGCTGG TGGCCGCGCT CCTGGTGGTC TGGGCGGTGC TCGGGCTGCT CGACGGCGCG
CCGGACCGCT GGCTGCGCTT CGGCGTGTGC CTGCTCACCG CCTGGTTCGT GGTCGCGGGC
GCGCGCGGCG AGTCGGCGGC CGAGCGGGCC AGGGCCTGGC CGGAGGGGCT CACCCTGCAG
CAGCTCGTGC GCCCGGTGCT CCAGCTGCCC GCCGAGAGCC CGGTGTCCGG CGCGCTGTCG
GCCGCCGCCG GGCGCGGGGT GGTGCTGGTG CGCGCCGACG GGGTCGCCGC CGGGCTGCTG
GACCGGACCC TGGCCGAGCG CCTGGCCAGC ACGTCCCCGC ACGCGCCCGC CGAGCAGGCC
GCCGTGCCGA TCCGGCCGGA GACCGTGCTG CTCGCCGACG AGGCCGGGGA CGACGTGGTC
GAGCGGGTCC AGGGGACGGC GGCGCGCGAG TACCTGGTGG TCGACCTGGA GGGCAGGCCC
GCCGGGGTGC TGCGCCGAGA GGACCTCAAG GCCGCGCTGG AGAGCCGCTA G
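
The GC content reported for this gene can be recomputed from the sequence block. The helper below is a minimal sketch: `gc_fraction` is a hypothetical name, and it assumes the sequence is pasted as plain text in the 10-base groups shown here, so all whitespace is stripped before counting.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Example using only the first two 10-base groups of the gene sequence;
# paste the full sequence block to check the reported whole-gene value.
example = "ATGGCGACTG CGGAGGGCTG"
print(f"{gc_fraction(example):.2f}")
```

Multiplying the result for the full sequence by 100 and rounding gives a percentage that can be compared with the GC content field.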
 
Protein sequence
MATAEGWRAR AGREGGLPLF RAAGIPVLLA PSWWLGSAVI VVLYAPLASR ISPDAGGFTG 
LALAAAFALF LGLSVLAHEL GHSLVALRLG LPVRRLRLFL LGGVSEVARA PGTPRHEGLV
AAAGPLVSVL LAGVFALGAH AIPTTDAVWL LVAQTSFANA AVAVFNLLPG LPLDGGRILR
AGVWAITGKR ATGTRAAVIG GGLVAALLVV WAVLGLLDGA PDRWLRFGVC LLTAWFVVAG
ARGESAAERA RAWPEGLTLQ QLVRPVLQLP AESPVSGALS AAAGRGVVLV RADGVAAGLL
DRTLAERLAS TSPHAPAEQA AVPIRPETVL LADEAGDDVV ERVQGTAARE YLVVDLEGRP
AGVLRREDLK AALESR