Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_0958 |
Symbol | |
ID | 8325129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 1066453 |
End bp | 1069656 |
Gene Length | 3204 bp |
Protein Length | 1067 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644941504 |
Product | protein of unknown function DUF450 |
Protein accession | YP_003098763 |
Protein GI | 256375103 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCCCG TCCACCACGA ATCCGCGTTC GGTGACGCGA TCGTCGCCTC GATGATCGAC TCCGGCTGGG AGCTTGGCAA GAGCGCCGAC TACCGGGCCG ATCTAGGCCT GGACACCGGC GAGCTGTTCA CCTTCATCGG CGCCACCCAG GCCGGTGAGT GGGAGGATCT TCTCGACCTC TACGGCGGCG ACGCCAACGC TGCGCAGGCC GGGTTCACCA AGCGGTTGGA CCAGGCGATC GCCACCGACG GTGTGCTGCA CGTGCTGCGC AAGGGCGTGA AGGACAGGGG CGTCCGCATC CGCGTCGCAT ACTTCAAGCC ATCACTCGTC GCCTCCGACG CAGTGCTCAA CGACTACCGC GCCAACCGGC TGACCGTCGT CCGCGAGCTG GAGTACGCGG CCAAGCAGGC GAACAAGGGG CACAGGCTCG ACCTCACCCT CTTCCTCAAC GGTATCCCGC TGGCCACGGC AGAGTTGAAG AACCCGCTCA CCGGGCAGGG CGTGGAACAC GCGAAGGAGC AGTACCGCCA CGACCGCGAC GCAACCGAGC TGATTTTCTC ACGACGCGTG ATCGCGAACT TCGCCGTCGA CCCAGACTTG GTCTTCGTAG CCACTGAACT GCGTGGCGCG AAGACCAGGT TCCTACCCTT CAACACCGGC TCTAACGGCC CCGGACAGCC CGGTGGCGCG GGCAACCCGC CCTCCACCTC CTTCGGCAGG CACACCACCT CGTACCTGTG GGAGCAGGTC TGGCAATGGG ACAACTGGCT GGACCTGCTG GAGCGCTTTG TTCATCTGCA CAAGGAAAAG GGCGCTGACG GGCGCGCCAC GAAGAAGGTG ATCTTCCCTC GGTTCCACCA GTGGGACATC GTGAAGAAGC TGACCACGGA CGCCGCACGG CGCGGTGCGG GGCACAACTA CCTCTGCATG GCCTCGGCTG GCTCCGGCAA GTCGAACACC ATCGGCTGGC TCGCGCACCG CCTGTCGTCG CTGCACACGC CCACCGACCC TGCGGAGATC GACCCGGACG AGCTCTCCGC AGGGCTTGAG CCGGGCACCC CGGTATTCGA CAAGGTCGTC ATCATCACCG ACCGGCGCAA CCTCGACTCC CAGCTGCGCG AGACGGTCGG CAGCTTCGAG CAAACCGCCG GCCTGGTGGT GAAGATCGAC GACAAGCACG GAGCGAAGTC GGACCAGCTC GCCCGCGCGT TGTCACGCGA GCCCGGAAAG ATCATCACGG TCACGCTGCA CACCTTCCCG GCTCTGCTGG ACTACCTGCG TCGCAACCCC ATCGAGATCA AGGGCAGCAA TTTCGCCATC ATCGTGGACG AAGCGCACTC CTCCCAGTCC GGTGATGCCG CAACTGCTGT GCGTGCTGCG CTGCGCGACC TCGGGCTCGA TGCGGACTCC GACGACGCGG GTGCTGTCAC CGCCCAGGCA GACCAGAACG CCGCCACCGA CGCGAAGCTG CGGAGGAAGG CGCACCAGCG CGGCCAGGCC GCGAACCTGT CGTACTTCGC GTTCACGGCC ACGCCCAAGG CCAAGACCCT GGAACTCTTC GGCACGCTGG GCGACACCGG CCCGGACGGC AAGCCCACGT ACCGACCGTT CCACACCTAC TCGATGCGCC AGGCCATCGA GGAGGGCTTC ATCCTCGACC CGCTGCGCAA CTACGTCACC TACAACACGT ACTGGAAGCT CGTCAACGAG AACCCGGACG AACGCGAGGT CGACCCGACC AAGACCAATT CCCTGCTCGC CCGGTACGCG TTCACCCACG AGTCCACCGT GGCGCAGCAG GCCCAGGTGA TCGTGGAGCA CTTCCGGGCC CACACCTCTG GCCGCCTGGG CGGCCGTGCG AAGGCGATGG TGGTCACCGC GTCCCGGCAC AGCGCCGTGC AGATGGCACG AGCGATCAAG AAGTACCTCG GTGACCGCGA GTACGCCGAC CCCGGCGTGC TGGTGGCGTT CTCCGGCACC CTCACCTACG ACGGTGAGGA GATCACCGAG CCAGGGGAGA ACGGCGGCCT GTCCGAGAAC GCCCTGCCCA AGGCGTTCGC GTACACGCGC AAGGACGACA AGGTGCTACG TGGCGGTGGG ACCTCCGCGC AACGGGAGTA CCGCATCCTG GTCGTCGCCG ACAAGTACCA GACCGGCTTC GACCAGCCGC TGCTGACGAC CATGTACGTG AACAAGAAGC TGATGGGCAT TTCCGCCGTG CAGACGCTGT CCCGGCTCAA CCGCACCGCC GAGCGCAAGG CGCAGTCAGA TCTGGCCGTG CTGGACTTCG TCAACGAAGC CGAGCAGATC AAGGACTCAT TCCAGCCGTA CTTCGAAGAC GCGATGACGC TTCCGTCGGA CCCCAACCTG CTCTACACCG CGCAGAGCAG GGTCATGGCG CCGGATGTTC TCATCGAGCA GGAGATGCGC GAGTTCGTTG CGGCGTACCT CAGTGCGGAG GAGAAAGCGG CGGGGTCAGT AGCCAGGTGG GAGAAGCTGC ACGCGGAGCT CTACCGCTAC CTCGGACCGG CAGTCGCCAG GTTCACCGAG CTCTTGGACC GGGCGGACGA CGACGGCGAA GACCTGGAGG TCGCGGAGAC CTTCCGCGCG GACCTCAACG ACTACGTGCG GAAGTACGGC TTCCTCGCGC AGATCATCCC CTACACGGAT CCGGAGCTGG AAAGGCTCTA CCTCTACGGG CGGCACCTGC TCAACCGGCT GCCGCGGCGC GGTGATGGAG GAGTGGACAT CGGCGAGATC GACCTCAGCC ACATGCGGGT GGAGAAGACG GGAGAGCACG ACCTCGGACT GACGCCCGAG GGAACGGGCG AACTGCCGGG CTTCGGCGAC GGCCCCAGCG GGGCGCAGGA GCCAGAGAAG TCGCCGCTGT CGGAACTGAT CGAACGGTTC AACGCGAAGT ACGGCACCGA GTTCACCGAG CAGGATGTCG TCACACCGTT CACCGAGGCC GTGGCCGACC CGAAAGTGCG GTTGGCCGCC GTCAACGACG AGGACAACTT CGGTCACGTC TTCGACGACG TCTTCGAGGA CAAGATGGCC GAACACGTCG ACACCATCGC CGATCTCGGA CGGCAGTACT TCAGCCACGA CGACAGCTTC AAGCTTTCGC TGAACCGAAG TGCGCGACGA GCCGCTTGGC GGATGATCCG ACGCGAGGAA GGCGTGAACG ACGACTTGGC GTGA
|
Protein sequence | MSPVHHESAF GDAIVASMID SGWELGKSAD YRADLGLDTG ELFTFIGATQ AGEWEDLLDL YGGDANAAQA GFTKRLDQAI ATDGVLHVLR KGVKDRGVRI RVAYFKPSLV ASDAVLNDYR ANRLTVVREL EYAAKQANKG HRLDLTLFLN GIPLATAELK NPLTGQGVEH AKEQYRHDRD ATELIFSRRV IANFAVDPDL VFVATELRGA KTRFLPFNTG SNGPGQPGGA GNPPSTSFGR HTTSYLWEQV WQWDNWLDLL ERFVHLHKEK GADGRATKKV IFPRFHQWDI VKKLTTDAAR RGAGHNYLCM ASAGSGKSNT IGWLAHRLSS LHTPTDPAEI DPDELSAGLE PGTPVFDKVV IITDRRNLDS QLRETVGSFE QTAGLVVKID DKHGAKSDQL ARALSREPGK IITVTLHTFP ALLDYLRRNP IEIKGSNFAI IVDEAHSSQS GDAATAVRAA LRDLGLDADS DDAGAVTAQA DQNAATDAKL RRKAHQRGQA ANLSYFAFTA TPKAKTLELF GTLGDTGPDG KPTYRPFHTY SMRQAIEEGF ILDPLRNYVT YNTYWKLVNE NPDEREVDPT KTNSLLARYA FTHESTVAQQ AQVIVEHFRA HTSGRLGGRA KAMVVTASRH SAVQMARAIK KYLGDREYAD PGVLVAFSGT LTYDGEEITE PGENGGLSEN ALPKAFAYTR KDDKVLRGGG TSAQREYRIL VVADKYQTGF DQPLLTTMYV NKKLMGISAV QTLSRLNRTA ERKAQSDLAV LDFVNEAEQI KDSFQPYFED AMTLPSDPNL LYTAQSRVMA PDVLIEQEMR EFVAAYLSAE EKAAGSVARW EKLHAELYRY LGPAVARFTE LLDRADDDGE DLEVAETFRA DLNDYVRKYG FLAQIIPYTD PELERLYLYG RHLLNRLPRR GDGGVDIGEI DLSHMRVEKT GEHDLGLTPE GTGELPGFGD GPSGAQEPEK SPLSELIERF NAKYGTEFTE QDVVTPFTEA VADPKVRLAA VNDEDNFGHV FDDVFEDKMA EHVDTIADLG RQYFSHDDSF KLSLNRSARR AAWRMIRREE GVNDDLA
|
| |