Gene Information |
Locus tag | Amir_3021 |
Symbol | |
ID | 8327211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 3489781 |
End bp | 3493725 |
Gene Length | 3945 bp |
Protein Length | 1314 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 644943546 |
Product | WGR domain protein |
Protein accession | YP_003100786 |
Protein GI | 256377126 |
COG category | [S] Function unknown |
COG ID | [COG3831] Uncharacterized conserved protein |
TIGRFAM ID | |
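The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3489781
end_bp = 3493725
gene_length_bp = 3945
protein_length_aa = 1314


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the derived values with the values reported in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons hold for this entry: the coordinate span matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.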
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0786299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGCGGT TGGAGTACGT CGGCGGCACG TCGCGCAAGT TCTGGGAGGG CGCGCGCGAC GGGCTGTCGG TGACGGTGCG GTGGGGCCGC ATCGGGACCT CGGGGCAGTC GAAGACCAAG GAGTTCGCGT CCGAGCAGTC GGCGCGCGAC CACCTGGCCA AGCTGATCGC CGAGAAGCGC GCCAAGGGGT ACTCCGACGG CGCGCCCACC GCTCCCATGC CACTGCGGGA CCCGGCCGAC GCCACTTCGG GGAGGGGCGT CGGCCCACCC CGGTCCGGGG CCACCGCGGA CGTCGCGCCC GACGTCCCCT CGGACCGCTC GGACGGGGTT GGTGGTGGTG CGGGGCCGGG TGCGTCGGCC CCCGGCACCG CCGCCCCGTC GGAACCCGCC GGAGCCGGGG TGATCGTGCC CGGCGGGCTC GACCACTTCG CCTCGGTGGG AACGCCGCCC GGCGGGCTCG ACCACTTCGC CGGCGCGACC GTGCCGGTCA CCACGCCCGC GCCGACCGCT GTGGCCGAGC GCGAGCCTGA CGAGACCGCG TCCCCGGCAG GCCGGATCGT GCCCGGTGGG TTCGACCACT TCGCCGCGCC GTCGACCGCC CCGGCCAGGA CCGAGCCCGT CCCGGAGCCC GCCCCCGCGT GGGACCCGGC CGCCGAGGAC CGCTGGCCCG ACGACGCCCG CCCGCACCGC CTCGCCCTGC ACCGCAGGGG CGACGGGCGG TCCAAGCGCA AGCTCGTCCC CGGAGCCGCC GCCGACATGC TCGCGCTCGC CCGGACCCAC GACGAGCAGT TGCGGCACAG GTCGTGGAAG GCCCGTCTCA AGAGCGACGC CGACCTCGTC GACGCCCTCA CCGCCCACCT CGACGGCCAC GCCAGCCCGG TCGGCGCGGG CGTGCTGCAC AACATGCTCG CGCGGCACAC CTCCTCCTAC GAGCTGGACA AGCTCAACGC GCACGCCGAC GCCTGGCTCG CCGAGCACGG CGTCGTGTTC GCGGCCGAGG CGGTCCTGGC CAGCGTGACC ACCGTCGTCA ACGACGGCAT CCGCCACCAG TCCACGCGCA AGACGGTCAG CCGCTCCTGG CACGGCATCG AGCTGGTCGG ACGGGTGCGC GCCGCGCTCG CCACCGCGAC CGACGACGAG CACGCCTCCG CGCTCGACCG CGTCGGCGCC TACCTGGACC GCCGCTCGGC CCGCGCGATC GCCGCCGTCC TCATGCCGAC CGAGGCCGAC CTGGTCGAGC TGATGTGCCA GGACGTGCGG ACCGCGCACG ACGAGGAGCA GCGCGCCCAC CTCGTGCTCA CCGCGATCAG CACCCCCGAG CAGCTCGACC TGGTCAAGCC CGCCGTGCCG CTGTGGGACG TCGACCACTG GAACCGCACC CTCCCCACCC TGGTCACCGC GCTCGGCGCC GAGGCGCTGC CCGCGCTCGT CGACTGGTTC CGCGCCGACG GCGCCGACCC GTCCCTGCGC GCCTCGCTGG CCCAGCACAT CGCCGCGACC CCGACCGACG AGGCGCTGCG GTTCCTGGTC GACCTCGCCC CCGACCGCAC CACCTCGGGC CTGCTGCTCG CCGCGCTGCG CCGCCAGCCC GGCCGGGCCA CGCGGGTGCT CGCCGACGCC GTGCTGGCCG CGCCGAACCA GTCCGCAGCC GGTCGGCCCG CGGCCCGCGA CCTGCTGGAG GCGCACGTGC TGGCCTGCCC GGCCGCCGTC GCCGCCGTCC CGCTGACCGC CGAGGCCAGG GCGCTGGTCG ACGGCGTCGC CGAGGGCTTC GCGGACCGCG TCCCCGAGGC CGACCCCGCC 
GACCTGCCCC GCCCGCTGGT GGACGCCCCG TGGGAACGCG CCACCGCCGC CGCGCCACCG GTGGTGGTCC GCGGCCTGGA ACCACTCGGC GACGCCACCG AGAAGTGGCT GCCCGGCGAG CGCGACGAGT GGCTGGCGGG CATCCCCGAC TACTTCCTGC CCTGGGAGGA GGTCCGCGAC GCGGTCCTCA CCGACGAGGG CAACTACCTG GAGCGCGAGT CCGCCCTGCT GCGCGGCCCG GTCGACCAGG TGCGGCACCT GCTGCGCAGG CTGGGGCCGT ACGGGATCTA CAACGCGGAG ACCGGTCGCG CGCTGGTCGC CCGGTTCGGG ACCGACCTGC TGGAGGTGCT GCCCGACCTG CACGTGTACC CCTCGGTCGC CGCCGAGGTC CTGCTGCCGC TGCGCTCGCC GTGGGTGGCG GCGCGGATGG CCGAGTCGCT GGGCAAGCGG CTGTACCGGC CTCGGGCCGT GGCCTGGTTC GAGCGGCACG GGGTGGACGG CGCGCGCCCG CTGGTGGCCG CCGCCGTCGG CCCGACCGGC CGGGCGCGCG CCGCCGCCGA GTCCGCCCTG CGCCTGGTCG CAGACCGGGT CGGCCCCGAG GCCGTGCTCG CCGTCGCGGC CGAGCACGGC GACAAGGCGC ACCGGGCGGT GCGGCACCTG CTGGAGGTCG ACCCGGTCGA CGTGGTGCCC GCCAAGCTCC CGTCGCTGGA GGGCGTCGAC CCGGCCGTGC TGCCGCAGGT GCTGCTGCGC GACCGGAGCC GCGCCGTGCC GCTGTCGGCC GTGCCGAACC TGCTGCTCAG CCTGGCGATC GACCGCGCCG ACAGCCGGTA CGCGGGCGCG CTCCTCGTGC GCGAGCTGTG CGACCCGGAG TCGCTGGCCG CGTTCGGGCG GGCGCTGCTG AGCGCGTGGC GGCTCGTCGG GATGCCGTCC GAGCAGGCCT GGGTGCTCAC CGCGCAGGGC GCCGTCGGCG ACGACGAGAC CGTGCGCGTG CTGTCCCCGC TGATCCGGGT GTGGCCGGGC GAGGGCGGGC ACCGGCGCGC GGTGACCGGG CTCGACGTGC TCGCCCAGAT CGGCAGCGAC GTGGCGCTCA TGCACCTGGA CGCCATCTCG CGGAAGGTGA AGTTCAAGGG GCTCAAGGCG AGCGCGCGGG AGAAGCTGGA GCAGATCGCC GTCGACCGCG ACCTCAGCGC CGACCAGCTG GCCGACCGGC TCGTGCCGGA CTTCGGGCTG GACGCGGACG GCACGCTCGT CCTGGACTAC GGCCCGCGCC GGTTCACGGT CGGGTTCGAC GAGCAGCTGA AGCCGTACGT GCTCGACGGG GACGGCAACC GCCGCAAGGA GCTGCCCAAG CCGGGTGCGA AGGACGACCC CGAGCTGGCC CCGGCGGCGC ACCGCAGGTT CGCGGCGCTG AAGAAGGACG TGCGCACCTC GGCGGCCGAC CAGATCGTCC GCCTGGAGAA CGCGATGGTG GTGGGCAAGC GCTGGACGAC GGCCGAGTTC ACCACCCTGC TGGCCGGGCA CCCGCTGCTG CGCCACGTGG TGCGCCGCCT GGTGTGGCTG GCGGAGCGGC CCGCCGAGGA CGGCGCCGGT CGGGCGCTCA GCTTCCGCCT GGCCGAGGAC AACAGCCTGG CCGACTCCGC CGACGACCCG GTGCTGCTGC CCGGTGACGC CGAGATCCGC ATCGCCCACC CGCTGCTGCT GGCCGGGGAG CTGGACGCGT GGGCGGAGGT GTTCGCGGAC TACGAGCTGC TCCAGCCGTT CCCGCAGCTG GGCAGACCGG TGCACGCCCC GGAGGACGCA GACGGGGCGG 
TGAAGGCGCT GCTGGAGGGC GAGGTCGAGA CGGGCAGGCT GCTGTCGCTG GTGCGCCGGG GCTGGGAGCG CGCCGAACCG CAGGACGCAG GCGTGGAGCC GTGGATGGTC AAGCGCCTGG GCCCGACCAG GGTCCTGGTG CTCGACCTGG ACCCCGGCCT GTACGTGGGC GCGGTGTCGC TGATCTCGGA CAAGCAGCGC CTGGCCGCGC TGACCCTGGA ACGCCCGGCG AGCTACTACT GGAACCCGCA GGGCAAGGAG ACCCCGTCCC TGTCCACGCT GGACCCGGTG GTGCTGTCGG AGCTGATCGC GGACCTGGAG TCGCTGCGGG TGTGA
|
Protein sequence | MERLEYVGGT SRKFWEGARD GLSVTVRWGR IGTSGQSKTK EFASEQSARD HLAKLIAEKR AKGYSDGAPT APMPLRDPAD ATSGRGVGPP RSGATADVAP DVPSDRSDGV GGGAGPGASA PGTAAPSEPA GAGVIVPGGL DHFASVGTPP GGLDHFAGAT VPVTTPAPTA VAEREPDETA SPAGRIVPGG FDHFAAPSTA PARTEPVPEP APAWDPAAED RWPDDARPHR LALHRRGDGR SKRKLVPGAA ADMLALARTH DEQLRHRSWK ARLKSDADLV DALTAHLDGH ASPVGAGVLH NMLARHTSSY ELDKLNAHAD AWLAEHGVVF AAEAVLASVT TVVNDGIRHQ STRKTVSRSW HGIELVGRVR AALATATDDE HASALDRVGA YLDRRSARAI AAVLMPTEAD LVELMCQDVR TAHDEEQRAH LVLTAISTPE QLDLVKPAVP LWDVDHWNRT LPTLVTALGA EALPALVDWF RADGADPSLR ASLAQHIAAT PTDEALRFLV DLAPDRTTSG LLLAALRRQP GRATRVLADA VLAAPNQSAA GRPAARDLLE AHVLACPAAV AAVPLTAEAR ALVDGVAEGF ADRVPEADPA DLPRPLVDAP WERATAAAPP VVVRGLEPLG DATEKWLPGE RDEWLAGIPD YFLPWEEVRD AVLTDEGNYL ERESALLRGP VDQVRHLLRR LGPYGIYNAE TGRALVARFG TDLLEVLPDL HVYPSVAAEV LLPLRSPWVA ARMAESLGKR LYRPRAVAWF ERHGVDGARP LVAAAVGPTG RARAAAESAL RLVADRVGPE AVLAVAAEHG DKAHRAVRHL LEVDPVDVVP AKLPSLEGVD PAVLPQVLLR DRSRAVPLSA VPNLLLSLAI DRADSRYAGA LLVRELCDPE SLAAFGRALL SAWRLVGMPS EQAWVLTAQG AVGDDETVRV LSPLIRVWPG EGGHRRAVTG LDVLAQIGSD VALMHLDAIS RKVKFKGLKA SAREKLEQIA VDRDLSADQL ADRLVPDFGL DADGTLVLDY GPRRFTVGFD EQLKPYVLDG DGNRRKELPK PGAKDDPELA PAAHRRFAAL KKDVRTSAAD QIVRLENAMV VGKRWTTAEF TTLLAGHPLL RHVVRRLVWL AERPAEDGAG RALSFRLAED NSLADSADDP VLLPGDAEIR IAHPLLLAGE LDAWAEVFAD YELLQPFPQL GRPVHAPEDA DGAVKALLEG EVETGRLLSL VRRGWERAEP QDAGVEPWMV KRLGPTRVLV LDLDPGLYVG AVSLISDKQR LAALTLERPA SYYWNPQGKE TPSLSTLDPV VLSELIADLE SLRV
|
| |