Gene Information |
Locus tag | Amir_5141 |
Symbol | |
ID | 8329339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 6120837 |
End bp | 6123653 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644945576 |
Product | Proprotein convertase P |
Protein accession | YP_003102808 |
Protein GI | 256379148 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1361] S-layer domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.488081 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCTCAC CGAAACGCCG GTCCCGAAGA AGCGACACCC TGATCATCGC GGGCTCCGCG GCGTTAGCGC TGCTGCTGGG TTCGACGGCT CCGGCAGCGG CGGACCCGCC CAAGGCGAGC AGGGGTGTCG AGGACACGAC GGCCGCGCAG ATCGCGGCCC TGCAGGAGAT CAAGAAGTCC GCCTCCCCGG CGGAGTCCAA AGTGGACAGC TCCCTGGTGG TCGAGCAGCG ACGACGCGCC GACGCCGGGA CGCGGGCCAA GCTCCCCTCG GTGCAGACGG GGGTAGAGGT TCAGAACGGG TCCGCCGTGC TGGTGGACAT CCGCGCCCAC AAGGTCAGCG ACGACCTCGT CAACGCCGTG AAGTCCGCCG GCGGGGCGGT CCGGTCGGTC TCGACCGAGG GCGCCACGAT CCGCGCGCAG CTGCCGCTGG CGTCGCTCAC CACCATCTCG GGCCGCTCCG ACGTGCGGCG CGTGGAGACC GCGGCGGACG CGAAGACCTT CCACCAGCAG GACGCGAAGT CCAAGCGGGA CAAGGCTTCC GAGGAGTCGA AGCAGTCGAA GGAGGAGCGG GGCGCCGAGG TCGAGCGGCG CACCCGCGAG GCGCTGGAGC GGGTCGGCGC GGACGCGGTC GTCACCTCCG AGGGCGACCG GGCGCAGGCC ACCGACACCG CCCGCCAGGA GCACCGGGTC ACCGGCACCG CCGTGAAGCT GTGCGCGCTG TCGGACGGCG TGCGCTCGCT CGCGGTGTCG CAGGCGGCCG GTGAGCTGCC CGCCGTGGAC GTGCTGCCGG GCCAGGAGGG CTCCGGCGAC GAGGGCACGG CGATGCTGGA GATCCTGCAC GACGTGGCGC CCAACGCGGA GCTCGGCTTC GCCACGGCGT TCACCAGCGA CGCCGGCTTC GCCGACAACA TCAGGGCCCT GCGCTTCCAG GCGGGCTGCG ACGTGATCGT CGACGACGTC CTGTACTTCA ACGAGTCGCC GTTCCAGGAC GGCATCATCG CGCAGGCGGT GGACGCGGTC GCCGCCGACG GCGCGGTGTA CTTCTCCTCG GCGGGCAACG AGGGCAGCGT CGCGGCGGGC ACCTCCGGCC ACTGGGAGGG CCAGTTCGTC GACTCGGGCG TGGGCATCGG GAAGTTCGCG GGCACCGCGC ACGACTTCGA CCCGGCCCCG AACGCCAAGC AGGTGCTCAA CCCGCTGTCG GCGTACTCGA CCGGCGTGCC GGTCACGCTG TTCTGGGCTG ACCCGCTGAA CCGGTCGTTC AACGACTACG ACCTGTACCT GGTGAACTCG GCGGGCGCGG TCGTCTCGTT CAGCCAGAAC ATCCAGGACG GCACGCAGAA CCCGTACGAG CGGATCGACA CGCCCGCCTC GGGTTCGGGG CTGCGCCTGG CGGTCGTGAA GTTCCGGGGT GACGACAAGT ACCTGGCGCT GAGCGCGCTC GGCGGCCGGT TCAAGGACTC GGCGGACGGC CTCAAGGGCT TCGCCACGCC GGGCGTCTCG CGCGGCCACT CGGCGGCCAA GGGCGCGATC AGCGTCGCGG CTGCCCCGGC GGCGGGCGCG CTGTCGTTCG ACCTGGAGCC GGGCGACCCG GCCAACCCGA CCGGCCCGTT CCCCGGCTCG TTCACCTCGG CGCAGCAGCC GGAGCGCTTC ACCTCCGACG GTCCGCGCCG GGTGTTCTTC GCGCCGGACG GCTCGCCCGC GTCCGAGGTG CGGCAGAAGC CGGACATCAC CGCGGCGGAC GGCGTGAACA CGTCGGTGGC CGGGTTCAAG CCGTTCTTCG GCACGTCGGC GGCGGCGCCG 
AACGCGGCGG GCATCGCGGG CCTGGTGCTG TCGGGCAACC CGACGCTGAC GCCCGCCGAG GTGCGGGCCG CGCTGGTCGG CACCGCGATC GACATCGCGG CGCCGGGCGT GGACAACCAG ACCGGCGCGG GCGTCGTGCG CGGCGACCTG GCGCTGGACT ACACGGGCGC GAGCCCGCAG CCGCTGGCGA AGGCGGCCAC GCCGACCATC GCCAACGACA ACGACGGCAG CCGGTACCTC AAGCCGGGCA CGACCGCGAC GGTGACGCTG CCGGTGGCCA ACACCGGTGA CGGCACGGCG GCGTCGACCA GCGTGGTGCT GACCTCGTCG ACGCCGGGCG TGACCATCGC GCCCCGGTCC AAGTACTACG GCAACGTCGA CCCCGGCCAG ACGCTCACCG GCACGTTCAA GGTGACCGTC CCCGCCTCGC AGACCATCGG GTCCGTGGTG AAGCTGGACG CGCGCGTGAC CTACGCGGGC GCCACCTCGC CGACCACCTC GGTGTTCCAG CTGCCGGTGG GCGAGCCGTC CAGCGAGGTC AAGACGTTCG GGTTCACCGG CGTCACGGCC ATCCCGGACA GCAGCACCGT GGGCGTCTCG GTGCAGATCC CGGTGACGGG CGTGGGCGTG CTGTCGTCGC TGAAGTTCGT GATCGGGGGC GAGACCTGCA CGACGGCGGA GAAGGCCACG ACCGTCGGCG TCGACCACAG CTACGTCGGC GACCTGGTCG GCACGCTGAC CTCGCCGTCG GGCGCGAGCG CGACGCTGTT CCAGCGCCAC GGCAGCTCGG GCAACAACCT GTGCCAGGTC GTGTTCGACG ACTCGGCGAC CGCCGCGTTC TCCTCGGTGA CGAGCGCGCA GGCGCCGTTC ACCGGGTCGT GGCGGCCGGT GACGCCGCTG GCGCCGCTGC TGAACGGCTC GGCGGACGGC ACGTGGACGT TCAAGGTCGT CGACTCGGCT GCGTCGGACA CCGGCTCGGT GCGCGACGTG GCGCTGAAGG TGACCGGGTA CGTGTGA
|
Protein sequence | MRSPKRRSRR SDTLIIAGSA ALALLLGSTA PAAADPPKAS RGVEDTTAAQ IAALQEIKKS ASPAESKVDS SLVVEQRRRA DAGTRAKLPS VQTGVEVQNG SAVLVDIRAH KVSDDLVNAV KSAGGAVRSV STEGATIRAQ LPLASLTTIS GRSDVRRVET AADAKTFHQQ DAKSKRDKAS EESKQSKEER GAEVERRTRE ALERVGADAV VTSEGDRAQA TDTARQEHRV TGTAVKLCAL SDGVRSLAVS QAAGELPAVD VLPGQEGSGD EGTAMLEILH DVAPNAELGF ATAFTSDAGF ADNIRALRFQ AGCDVIVDDV LYFNESPFQD GIIAQAVDAV AADGAVYFSS AGNEGSVAAG TSGHWEGQFV DSGVGIGKFA GTAHDFDPAP NAKQVLNPLS AYSTGVPVTL FWADPLNRSF NDYDLYLVNS AGAVVSFSQN IQDGTQNPYE RIDTPASGSG LRLAVVKFRG DDKYLALSAL GGRFKDSADG LKGFATPGVS RGHSAAKGAI SVAAAPAAGA LSFDLEPGDP ANPTGPFPGS FTSAQQPERF TSDGPRRVFF APDGSPASEV RQKPDITAAD GVNTSVAGFK PFFGTSAAAP NAAGIAGLVL SGNPTLTPAE VRAALVGTAI DIAAPGVDNQ TGAGVVRGDL ALDYTGASPQ PLAKAATPTI ANDNDGSRYL KPGTTATVTL PVANTGDGTA ASTSVVLTSS TPGVTIAPRS KYYGNVDPGQ TLTGTFKVTV PASQTIGSVV KLDARVTYAG ATSPTTSVFQ LPVGEPSSEV KTFGFTGVTA IPDSSTVGVS VQIPVTGVGV LSSLKFVIGG ETCTTAEKAT TVGVDHSYVG DLVGTLTSPS GASATLFQRH GSSGNNLCQV VFDDSATAAF SSVTSAQAPF TGSWRPVTPL APLLNGSADG TWTFKVVDSA ASDTGSVRDV ALKVTGYV
|
| |
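The length fields in this record can be cross-checked against its coordinates. The sketch below is a minimal illustration, not the database's own method. It assumes that Start bp and End bp are 1-based and inclusive, that the CDS ends in a single stop codon (the record lists the gene as unspliced), and that GC content is the fraction of G and C among all bases. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 6120837
END_BP = 6123653


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100 * gc / len(cleaned)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 2817, matching the Gene Length field
    print(protein_length(length))  # 938, matching the Protein Length field
    # Paste the full Gene sequence field here to compare with the GC content field.
    print(gc_percent("GTGCGCTCAC CGAAACGCCG"))
```

The gene starts with GTG while the protein starts with M. This is consistent with translation table 11, where GTG is an accepted alternative start codon.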