Gene Information |
Locus tag | Amir_3571 |
Symbol | |
ID | 8327761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 4147372 |
End bp | 4150236 |
Gene Length | 2865 bp |
Protein Length | 954 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 644944067 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003101307 |
Protein GI | 256377647 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| |
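The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4147372
end_bp = 4150236
gene_length_bp = 2865
protein_length_aa = 954

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is a stop codon and is not counted in the protein length.
coding_codons = codons - 1

print(span, codons, remainder, coding_codons)  # prints: 2865 955 0 954
```

Under these assumptions the numbers agree: 2865 bp is 955 codons, and dropping the stop codon leaves the 954 residues reported as the protein length.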
Plasmid Coverage Information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGACGAGG TGGTGGAACG GGGCGGGCGC CTGGCGTTCC GGGTGCTGGG ACCGATCGAG GTCACCGGCT CCGGGGGGCC GGTGCGCATC CCGCCGGGAC GGCAGCAGGT CATCCTGGCC TGCCTGCTCG TGGAGGCGAA CAAGGTGGTC AGCACGGACC ACCTGGTGGA CGCGCTGTGG GAGGTCAACC CGCCGGACAC CGCGCGCACC CAGGTGCAGA TCTGCGTGTC GCGGCTGCGC AAGACGCTGG CCGACGCGGG CGTGGACGTC TCCATCGTGA CCCGCCCGCC GGGCTACCAG CTGCGCCTGC CGGACGCCTC GCTGGACGTG CACGAGTTCA CCAGGGGCGT CACCGAGGGC CGCGCGGCGG CCAGGCGCGG CGAGCTGTCC GAGGCGTCCG AGCTGCTGCG GGCCTCGGTG GGGCTGTGGC GCGGGGAGTG CCTGAGCGGG CTGACGAGCG CGCCGCTGCG CACCAGGGCG CTGCGCCTGG AGGAGGACCG GCTCAACGCG GTGGAGACCT GCCTGGGCAT CGACCTGGAG CTGGGCCTGC ACCACGAGCT GGTCGGCGAG ATCGGCAGGC TGGTGCGCGA GCACCCGCTG CGGGAGCGGC CGAGGGCGCT GCTGATGCTG GCGCTGTACC GGTCGGGCCG CAAGGCCGAG GCCCTGGAGG TGTACCGGGA GGGCCGCGAC CTGCTGGTGG AGGAGCTGGG TCTGGAGCCC GGCGAGGAGC TGCGGGAGCT GGAGCGGGCG ATCCACGCCG GGGACGCCTC GCTGCTGCGC GGCCCCGAGC CCGCGCGCGA GCGAAGACCC GCTGAGCCCG CCGACGTCGC GCGCGCGGCG GTGCCCAGGC AGCTGCCCGC CGACACCGCC GACTTCATCG GCGGCGAGGA GCTGATCACC GCCGCCGAGG AGGTGCTCAC CGGGGGCGCG GGGCGGCGCG CGGTCGGCGT CGTGGTGGTC ATCGGCAGGC CGGGCGTGGG CAAGTCGACG CTGGCCGCGC ACCTGGGGCA CCGGGTCGCC GAGGAGCACT TCCCCGACGG GCAGCTGTAC TGCGACCTGC GCGGCGGCTA CGGCGACGCC GGCGGGTCCG CCGACGTGCT CGGCCGGTTC CTGCAGGCGC TCGGCATCCC CGGCGCGATG ATCCCGGTGG AGCACACCGC GCGCACCGAG ATGTACCGGA CGCTGCTGGC GGACCGGCGG GTGCTGGTGG TGCTGGACAA CGCGGTCAGC GAGCGCCAGG TGCTGCCGCT GCTGCCCGGC GGCGGGCGCT GCGCGGTGGT GGTGACCAGC CGGGCGCGGC TGACCGGGCT GCCGGGCGCG CGGCAGCTGG AGCTGGACGT GCTGGACCGG GAGCAGTCGC TGGAGCTGCT CGGCCGGGTC GTGGGCGAGC GGCGGGTGGC GGGCGAGCCG GAGGCCGCGG AGGCGCTGGT GCGCACCGTC GGCGGGCTGC CGCTGGCGCT GCGGATCGTC GCGGCGCGGC TGGCGGCCCG GCCGCACTGG TCGCTGGCGT CGATGGTGCA CCGGCTGGCC AGCGAGCGGC ACCGCCTCGA CGAGCTGGCG CACGGCGAGA TGACGATCCG GGCGAGCCTG TCGCTGACCC ACGACGGGCT GGACCAGCCG ACGCGGCGGC TGTTCGGGCT GCTCAGCCTG GCCGAGGGCC CGTCGCTGCC CGGCTGGGTG GCGGGCGCGG CGCTGGACGA CGGCAGGCCG TACGCGTCGG ACCTGATCGA GCCGCTGGTG GACGTGCAGA TGCTCGACGT GGTCTCGGTC GACGGCACCG GCGAGTTCCG CTACCGCTTC 
CACGACATCA TCCGGCTGTT CGCCCGCGAG CAGCTGGCGT CGGTGGACGA GCGGGAGCAG CGGGAGGTGC AGGAACGGGT GCTGGGCGGC TGGCTGTCGC TGGCCGAGCA GGCGCACCGG GGCGTGTTCG GCGGCGACTT CACCGCCCTG CACGGGAGCG CGCCGCGCTG GCACCCGCAC CCCGTGCACG CCGAGCGGCT GCTGGAGAGC CCACTGGAGT GGCTGGAGGG CGAGCTGCCG AACCTGCGGG CGGCCGTGGC GCAGGCGGCG CGGCTCGGGC TGGACGAGCT GTGCTGGGAC CTGGCGGTGA CCACGACGAC GCTGTTCGAG GCGCGCGGCC ACCTGGACGA CTGGCGGCAC ACCCACGACG AGGCGCTGCG GGCCACCAGG GCCGCGGGCA ACGCGCGCGG CACGGCGGCG CTGCTGGCCT CGCTCGGCAC CCTGCACATC AACCGGGGGC GCGCCGAGGA GTCCGGGGCG GTCCTGGTGG AGGCGCTGGC GGCGTTCACC GAGCTGGGCG ACGTGCGCGG GCAGGCGCTG TGCAGGCGCG ACCTGGGGCT GCTCACCCGG CAGGCCGGGG ACGACGCGGG CGCGCTGGCG CTGTACGGGC TGGCGCTGGC CGGGTTCGAG GAGGTCGGCG ACGTCGTCGG GCGGGCGATC GTGCTGACCC AGCGGGCGCA CGTGCTCATG CGCACCGGGC GGGACGACGA GGCGCTCGCG CAGCTCGCGG AGGCGATGGC CACCTGCCGG GAGGTCGGGT ACACCGGCGG GGTGGCGACC ACGATGCGGC GCATCGGGCA GGTGCAGCTG CACCGGGGTG AGCACGAGCT CGCGGAGCGG ACGCTGACCG AGGTGCTGGA GATGGTGCGG GCCAGCCGGG ACGTGATCGG CGAGGGGCAC CTGCTGCACA ACCTGGGCGA GGTGAACGCG GCGGCGGGGC GCGTCGAGGC GGCTCGGGAG TGCTTCGAGC GGTCGCTGGC GGTGCGGGAG CGGATGATGG ACCACGGCGG GGTGGCGGTG GTGCGGCGGG AGCTGGCGCT GCTGGAGGGG AAGGTCCCCG CGTAG
|
Protein sequence | MDEVVERGGR LAFRVLGPIE VTGSGGPVRI PPGRQQVILA CLLVEANKVV STDHLVDALW EVNPPDTART QVQICVSRLR KTLADAGVDV SIVTRPPGYQ LRLPDASLDV HEFTRGVTEG RAAARRGELS EASELLRASV GLWRGECLSG LTSAPLRTRA LRLEEDRLNA VETCLGIDLE LGLHHELVGE IGRLVREHPL RERPRALLML ALYRSGRKAE ALEVYREGRD LLVEELGLEP GEELRELERA IHAGDASLLR GPEPARERRP AEPADVARAA VPRQLPADTA DFIGGEELIT AAEEVLTGGA GRRAVGVVVV IGRPGVGKST LAAHLGHRVA EEHFPDGQLY CDLRGGYGDA GGSADVLGRF LQALGIPGAM IPVEHTARTE MYRTLLADRR VLVVLDNAVS ERQVLPLLPG GGRCAVVVTS RARLTGLPGA RQLELDVLDR EQSLELLGRV VGERRVAGEP EAAEALVRTV GGLPLALRIV AARLAARPHW SLASMVHRLA SERHRLDELA HGEMTIRASL SLTHDGLDQP TRRLFGLLSL AEGPSLPGWV AGAALDDGRP YASDLIEPLV DVQMLDVVSV DGTGEFRYRF HDIIRLFARE QLASVDEREQ REVQERVLGG WLSLAEQAHR GVFGGDFTAL HGSAPRWHPH PVHAERLLES PLEWLEGELP NLRAAVAQAA RLGLDELCWD LAVTTTTLFE ARGHLDDWRH THDEALRATR AAGNARGTAA LLASLGTLHI NRGRAEESGA VLVEALAAFT ELGDVRGQAL CRRDLGLLTR QAGDDAGALA LYGLALAGFE EVGDVVGRAI VLTQRAHVLM RTGRDDEALA QLAEAMATCR EVGYTGGVAT TMRRIGQVQL HRGEHELAER TLTEVLEMVR ASRDVIGEGH LLHNLGEVNA AAGRVEAARE CFERSLAVRE RMMDHGGVAV VRRELALLEG KVPA
|
| |
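The gene sequence begins with GTG rather than ATG, while the protein sequence begins with M. That is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The following minimal sketch checks the first ten codons and the last five codons of the gene against the corresponding ends of the protein. The codon dictionary is deliberately partial (only the codons needed here), the start-codon set is a subset of those allowed by table 11, and the function name `translate_fragment` is hypothetical.

```python
# Ends of the sequences copied from the record above (spaces removed).
gene_head = "GTGGACGAGGTGGTGGAACGGGGCGGGCGC"   # first 30 nt
gene_tail = "AAGGTCCCCGCGTAG"                  # last 15 nt
protein_head = "MDEVVERGGR"                    # first 10 residues
protein_tail = "KVPA"                          # last 4 residues

# Partial codon table: only the codons that occur in the two fragments.
CODON_TABLE = {
    "GTG": "V", "GAC": "D", "GAG": "E", "GAA": "E",
    "CGG": "R", "GGC": "G", "GGG": "G", "CGC": "R",
    "AAG": "K", "GTC": "V", "CCC": "P", "GCG": "A",
    "TAG": "*",  # stop codon
}

# Subset of the initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_fragment(dna: str, at_start: bool = False) -> str:
    """Translate an in-frame DNA fragment codon by codon.

    If at_start is True, a recognised initiation codon in the first
    position is read as methionine (M) instead of its usual amino acid.
    """
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if at_start and i == 0 and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODON_TABLE[codon])
    return "".join(residues)


head = translate_fragment(gene_head, at_start=True)
tail = translate_fragment(gene_tail)
print(head)  # prints: MDEVVERGGR
print(tail)  # prints: KVPA*
```

The trailing `*` marks the TAG stop codon, which is part of the 2865 bp gene but not of the 954 aa protein. For a full translation, a complete codon table (for example from an established bioinformatics library) would be needed in place of the partial dictionary used here.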