Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_0561 |
Symbol | |
ID | 8324720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 607655 |
End bp | 611689 |
Gene Length | 4035 bp |
Protein Length | 1344 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 644941105 |
Product | WD-40 repeat protein |
Protein accession | YP_003098374 |
Protein GI | 256374714 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0971599 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGGGA ACGGTCACGG TTCGGGTCCT CGCACCGCTT TCGCCGAACG CTTCGCGCTG CTCTACGCGG AGGCGGGCGA CCCCCCGCTC AAGCGCGTGA CCGCCTCGGT GGCCAGGTCC CGCCGGGTCG ACGAGCAGGG CCGACCGGTG CGGGTGACCG CGCAGCGGGT GAGCGACTGG CGCCGGGGGC GCAACGTCCC GGCCCGGTTC TCGGCACTCT CGGTAGTGCT GGAGGTGCTC ATCGGTGAAG CCAGGAAGCA GCGGCCGACC CCGCCGATCC CCGAGCTGCA CGACCTGGAG GCGTGGCGCG AGCTGTGGGA GCAGGCCCTC GCCAGCCCCG TCGCGGCCGT CGAGAGCGGG CCCGCGGAAA TGCCCGAGGA AAGCGGCGTG TGCCCCTACC GGGGATTGGC CGCCTTCCAG CCCGAGGATT CCTCCTGGTT CTTCGGCCGG GAACGCAGCA CGGCCGCCCT CGTCACCCGG CTGGACAATT CCACGGAATA CGGCGGAATC GTTGTCCTGG TCGGCGCCTC GGGCGCCGGG AAATCCTCGC TGGTGCGCGC CGGCCTCATT CCCTCCATCC GCGAGGGGGC GTTAGCCGAC CCGAGCTCCG CGACCTGGCC CTCGGCCGTC ATGACACCGG GCGCGAAACC GCTCGCCGAA TTGGTCGAGC GGGTTCCCGA GCTGTCCGCG GTCCTGCCCC CCGGCGGGCT CCCCGACGAC CCGCCGACCG AGCGCGGCGC GGGCCCGAGG TCGCTGGGCG TGATCGACTT CGCGGAACGG GTCAGGGCCG CTGTCGCCGA GCACGCGCGG GCGCGCGGCG GCGAGCGCCT GGTGCTGGTG GTCGACCAGT TCGAGGAGGC CTTCACCCTC TGCGGCGACG ACGCGCAGGT CCAGCTGTTC GTCCAGGCGC TGGCGGCGGC CTGCACCCCG GAGCGGCCGG GCGGGCACGC GCCCGCGCTG GTCGTGCTGG GCGTGCGCGC CGACTTCTAC GGGCGCTGCC TGGCGATCCC GGAGCTGGCC GACGCGCTCC AGGAGCGGCA GATGGTCCTG GGCCCGATGA CCTCGGCCGA GCTGCGCGAG GCGGTGGCCA GGCCCGCGCG CGCGGCCGGG CTCCAGCTGG AGGCCGGCCT GGTCGAGCTG ATGCTGCGCG ACCTGGGCGT GCGGGCCGGT CGGACGCCGG TGCACGGCGC GCGGGGCGCG TACGACGCGG GCGCCCTGCC GCTGCTCTCG CACGCCCTGC TCGCCACCTG GCAGCGCAGG CAGGCGGGCA GGCTCACCAT CGCGGGGTAC CGGGCGGCGG GCGGCATCCA GGGCGCGGTG GCCGCCACGG CCGAGCGCGC CTGGGCCGAC CTGCCCCCGG CCGCGCAGGG CGCCGCGCGC CCGCTGCTGC TGCGCCTGGT GCGGATCGGC GCGGACACCC ACGACACCCG CCGCCGCTCC ACCCGGCAGG AGCTGGTGGG CCAGGCGGGC GACCCGGCCG CCGCCGAGGA GGCCCTGGAG GTGCTGGCCC GCGCCCGGCT GGTCACCCTC GACGCGGGCT CGGTGGAGAT CACCCACGAG GCGCTGATCC AGGCGTGGCC GAGGCTGCGC GGCTGGATCG ACCAGGACCG GGAGGGCGAG CTGCTGCGGC AGCGGCTGGA GGAGGACGCG GCGGCCTGGG CCGAGCAGGG CCGGGACTCC TCGCTGCTGT ACCGGGGCGC GCGGCTGGAG GCGGCCAGGC ACTGGGCGAC CAGGCACCGG GCGGACGGCG CGGGCCCGAC CGGCGCCACC GGCGACTTCC TCGCGGTGTC CGGCCAGCAC CACCGGCGCG GGGTGTGGGC GCTGCGCGGC GGGGCCGCGC TGGTGGTGGT GCTCGCGCTG ATCGCCGGGG TCGCGGCCGT GGTGGCGGTG CGCCAGCGCG ACGACGCGGT GTTCCGGCAG GTCGTGGCCG AGGCCGACAA GCTGCTCGAC CGCGACCCGT CGCTGTCCGC GCAGCTGGCG CTGGTCGCGC GCGGGATGCG GCCGTCCGCG CTCGACCTGG GCACCCGGCT GATCTCCTCC GGCGGCGCGC CGCTGGCCAC CCCGCTGACC GGGCACACCG GCGCGGTCTA CCTGACCACG TTCTCCCCGG ACGGCCGCAC CCTGGCCACC GCCAGCTACG ACCGCACCGT GCGGCTGTGG GACGTCACCG ACCGGGACGA CCCGAAACCG CTGGGCGAGC CGCTGACCGG GCACGGCGAC TGGGTCAGCT CCGCCGTGTT CTCCCCCGAC GGCCGCACGC TCGCGTCGGC GGGCAAGGAC GGCTCGGTCC GGCTGTGGGA CGTCGCCGAC CGGGCGCGCC CCAGGCAGCT GAGCACCGCG GAGTCCCCCG GCAGGGACAC CGTCTACCTG GTCGCGTTCT CCCCGGACGG CCGCACGCTC GCGTCCGCGC ACGCCGACCG CGCGGTGCGG CTGTGGGACG TCACCGACCC GAGCGCCCCG AAGCAGGTCG CGGAGCTGGC CGGGCACGGG CAGCAGGTGC GCACCGTGGC CTTCTCCCCC ACCGGGCTGC TGGCCTCCGG CTCGGACGAC GCGACCGTGC GGCTGTGGGA CGTGGCGGAC CCGAGCGCGC CGAGGCAGGC CGGGGAGCCG CTGGGCGGGT TCGACAGCAC GGTGCACTCG GTCGCGTTCT CCCCCGACGG CCGCACCCTC GCCGCGGGCA GCGAGGACCG CAGCATCCGG CTGTGGGACG TCACCGACCC GGCCGCCCCC GAGGCGCGCG GCAGGCCGCT GGCGCTGCAC CTGGCCCCGG TGTGGTCGGT CGCGTTCTCC CCGGACGGGC GGGTGCTGGC CTCGGGCGCG GCGGACAGCA CGGCGCGGCT GTGGAACGTC ACCGACCCGG CGCGGGTCCA GCCGCTGGGC AAGCCGCTGG CGGGCCGCAG CGGCACGGTG TTCGCGGTCG GGTTCTCCCC CGACGGCCGC GCGCTGGCCA CCGGCAGCCT GGACCCGGTG GTGCGGATGT GGTCGCTGCC GTCGACGGTG CTGGTCGGGC ACGCCGCCCG CACGGTCGGC CCCCGGTTCG CCCCGGACGG GCGCGCGCTG CTGACCGGCA GCGAGGACGG CACGGTGCGC GCGTGGGACC TGGCCGGTCC CGGCGGCCCG GACGCGCCCA CCCCGATCGG CGACCGGCGG GACGCGGGCG AGCCGGTGCG GGCGGTGGTG CTGGGCGAGG ACGGCCGCAC GATGGTCACG GCGGGCCCGA AGGCGGTGCG GCTGTGGGAC GTGCGGCCGG GCGCGGAGCC GGAACCGCTG GGCGAGCCGC TGCCGCTGCG CACCCGGTTC AGCTCGCCGC TGGCGCTGCG CGGGAACCTG CTGGTCACGG CGGACGAGGA CGACACCGTC CTGCTGTGGG ACCTGTCCGA CCGGGCGCGC CCCAGGCAGC TGGGCGAGCC GCTGACCGGG CACGACGGGT ACGTGAACGT CGCGCTGCTC ACCCCGGACG GCCGGTTCCT GGTGACCGGC AGCGCCGACA GCGCGCTGCG GGTGTGGGAC GTGTCGGACC CGGCGCGCCC GAGGGCGGCG GGCCGGTTCC CCGGCCACGA CGGCCCGGTG CGGGCGGGCG CGCTGTCCCC GGACGGGCGG GTGCTGGCGA CGGCGGGCGA CGACAAGCTG GTGCGGCTGT GGGACTTCTC GGACCCGACC GCGCCGAGGG CGCTGGGAGC GCCGCTGGCC GGGCACGAGG AGGCCGTGGT GGCGGTGGTG TTCACCCCCG ACGGGCGCAC CCTGGCCAGC GGCGGCGAGG ACGCGCGGCT GCGGCTGTGG GACACCTCGG CCCCCGCGCT GGCCCGGCCG ATCGGCGAGG GCGTGGTCGG GCACGACTCG GCGCTGCGGG ACATCTCGGT GAGCCCGGAC GGCTCGGTGG TGGCGACCAG CTCGGCCGAC GGCACGGTGC GGCTGTGGCA CCTGGACGCG GACTGGGCCC GCAGGCGGAT CTGCGCGCGG ACCGGGGGCG TGCTGACCGA GGACGCCTGG CGGGAGCACG TCCCGCAGCT GGACTACGCG CCCCCGTGCA GGTAG
|
Protein sequence | MVGNGHGSGP RTAFAERFAL LYAEAGDPPL KRVTASVARS RRVDEQGRPV RVTAQRVSDW RRGRNVPARF SALSVVLEVL IGEARKQRPT PPIPELHDLE AWRELWEQAL ASPVAAVESG PAEMPEESGV CPYRGLAAFQ PEDSSWFFGR ERSTAALVTR LDNSTEYGGI VVLVGASGAG KSSLVRAGLI PSIREGALAD PSSATWPSAV MTPGAKPLAE LVERVPELSA VLPPGGLPDD PPTERGAGPR SLGVIDFAER VRAAVAEHAR ARGGERLVLV VDQFEEAFTL CGDDAQVQLF VQALAAACTP ERPGGHAPAL VVLGVRADFY GRCLAIPELA DALQERQMVL GPMTSAELRE AVARPARAAG LQLEAGLVEL MLRDLGVRAG RTPVHGARGA YDAGALPLLS HALLATWQRR QAGRLTIAGY RAAGGIQGAV AATAERAWAD LPPAAQGAAR PLLLRLVRIG ADTHDTRRRS TRQELVGQAG DPAAAEEALE VLARARLVTL DAGSVEITHE ALIQAWPRLR GWIDQDREGE LLRQRLEEDA AAWAEQGRDS SLLYRGARLE AARHWATRHR ADGAGPTGAT GDFLAVSGQH HRRGVWALRG GAALVVVLAL IAGVAAVVAV RQRDDAVFRQ VVAEADKLLD RDPSLSAQLA LVARGMRPSA LDLGTRLISS GGAPLATPLT GHTGAVYLTT FSPDGRTLAT ASYDRTVRLW DVTDRDDPKP LGEPLTGHGD WVSSAVFSPD GRTLASAGKD GSVRLWDVAD RARPRQLSTA ESPGRDTVYL VAFSPDGRTL ASAHADRAVR LWDVTDPSAP KQVAELAGHG QQVRTVAFSP TGLLASGSDD ATVRLWDVAD PSAPRQAGEP LGGFDSTVHS VAFSPDGRTL AAGSEDRSIR LWDVTDPAAP EARGRPLALH LAPVWSVAFS PDGRVLASGA ADSTARLWNV TDPARVQPLG KPLAGRSGTV FAVGFSPDGR ALATGSLDPV VRMWSLPSTV LVGHAARTVG PRFAPDGRAL LTGSEDGTVR AWDLAGPGGP DAPTPIGDRR DAGEPVRAVV LGEDGRTMVT AGPKAVRLWD VRPGAEPEPL GEPLPLRTRF SSPLALRGNL LVTADEDDTV LLWDLSDRAR PRQLGEPLTG HDGYVNVALL TPDGRFLVTG SADSALRVWD VSDPARPRAA GRFPGHDGPV RAGALSPDGR VLATAGDDKL VRLWDFSDPT APRALGAPLA GHEEAVVAVV FTPDGRTLAS GGEDARLRLW DTSAPALARP IGEGVVGHDS ALRDISVSPD GSVVATSSAD GTVRLWHLDA DWARRRICAR TGGVLTEDAW REHVPQLDYA PPCR
|
| |