Gene Information |
Locus tag | Amir_5275 |
Symbol | |
ID | 8329477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 6273904 |
End bp | 6276873 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 644945714 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003102942 |
Protein GI | 256379282 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTGGG ACTTGGGGAT CGGGTTTGGC GACGACGAGG TCGGGGTGCT CGGGCCGCTG CTCGTGCGGC GGGGTGGGGT GGTCGTCGCC GTCACCGCGC CCAAGCAGCG GATCGTGCTC GCCTCGCTGG TCATGGCCGC CAACCGGGCC GTGTCGTCGG CCGAGCTGGT CCGGGCCGTC TGGGGTGACC GGGCGCCCGC CCGTGCGGCG CACACGCTCG CCGTCTACGT CATGCGGCTG CGGCGGGCGC TCGGGGAGCC GCAGCTCGTC CACACCACGC CCTCCGGTTA CCTGATCAGC CTGCCGCACG GGGCCGTCGA CCTGCACCGG TTCGCCGACC ACGCCGCGCA CGGGGCGCTC GCCGCCTCCG CCAAGGACTT CGGCTCCGCC GTCGAGCACT ACCAGCGGGC CCTCGACTGC TGGCGCGGCC CGGCGCTCGC GGACGTGCCG TCCGAGGCCC TGCACGCCGA CGAGGTCCCG GCGCTGGTCG AGGAGCAGCT GCGGGTCACG GCCGAGCTGG TCGACGCCCG GCTGCGGCTC GGGCACGGGG CCGAGCTGGT GCCGGACCTG CGGCGGCTGA CCGCGCGCCA CCCGCTGCGC GAGCGGTTCT GGTCGCAGCT CATGGTCGCG CTCCACCGCG CGGACCGCCA GGCGGACGCG CTCGACGCCT ACCGCCAGGC CGGCACCGCC CTCGCCCGCG AGCTCGGCGT CGACCCCAGC GAGTCGCTCA GGGCCGTGCA CCACTCGATC CTCACCGGCG ACCCCGCGCT CCGGGGGCCG CTGCCCTCCG GGTGGGCCCC GGTGTCGCAG CTGCCCGCGC CGGTCGGGAA CTTCGTCGGG CGCGCCGACG AGCTGGACCG GGTCACCGCC CTGCTCGCCG GGCCCGCCGC CGTCGTCGCC GTGTGCGGGC CGCCAGGCGT GGGGAAGACC GCCTTCGCGG TGACGGTCGG TCACTCGGTG CGTGAGCGCT ACCACCACGG GCAGCTGTAC GCGGACCTGC GCGGCCACTC CACGTCACCG CCGCTCAGCA CCACCACCGT GCTCGGCCGG TTCCTCCGGG CCCTCGGCGC GCGCCCCGAC AGCATCCCCG CCGACGAGGC CGAGCTCGTG CGCGCCTACC GCGACCGGCT GCGCGGGCGC CGGGTGCTGA TCACGCTCGA CAACGCGGCC TCCGCCGCCC AGGTGCTGCC GCTGCTGCCG GACGTCCCCG AGTGCTCCGT GGTGATCACC AGCCGGAACG AGCTGGCCGG GGACGTCGGG GCCGCGGCGG TCCGGCTGGA CGTGCTGCGC GGCGACGAGG CGTGGATGCT GCTGACCCGC TCGCTCGCCC CCGAGGCCGC CGACGAGCAG GGCGACGCGC TGCGCGAGCT GGCCCGGCTG TGCGGCTACC TGCCGCTGGC GCTGCGGATC GCGCTCGGCA ACCTCGTCGG CGCGCACACC ACCGACATCC GGTCCTATGT GGACGACCTG CGCGGCGGGG ACCGCCTGTC CGCCTTGGCG GTCCCGGACG ACGACAGCGC GGCGGTGCGC CGGGCGTTCG ACCTGTCGCA CGCCGCGCTG CGGCCGGACG CCGCGAAGTT GTTCCGCCTC ACCGGGTTGC TGCCCGGCCC GGACTTCAGC GCGTTCGGCG CCGCCGCCCT GCTGGGCGCG GACGAGGCCA CCGCCCGCGC GCTCGCCGAG GAGCTGGCCT CGGCGAACCT GGTGCAGCGC GTGGGCGACG AGCGGTTCGC GGTGCACGAC CTGCTGCGCG AGTACGCGGC CGAACGGGCC CGCGCCGCCG GTGACGACCT GGAGGCCGCG 
CGCGGCAGGC TGTTCGACTG GTACCTGAGC ACCGCCTCCG ACGTGGGCGA CGTGCTGTTC CCCGAGGTCC GGCCCGCGCG CGGGCGGGCA GACCTGCCGG ACGCCCGCAC GGCGCGCGCC TGGTTGGAGG CGGAGCGGCC GAGCCTGCTG GCGGCCACCG AGCGGTGCGC GCGCCTGGGG CCGCTGCCGA TGGCCTGGTC GCTGGTCGAG GCGGTCGGCG GGTTCCTGGG CTCGCACGGG CACCACGGCG GGTTCCTGAA CGCGGTGCGC GCGGCGGGCG ACGCGGCGCG CGCGGCCGGG GACACCGAGG CCGGGGGCGT GGTCCTGGCG CACCTGGTGG CGGCGCACCG GAACCTCGGC GACCTGCGGG CCGCCCGCGA CGCAGCCCGC TCGGGGAACC CGGTGGGGCG CGCGGTGTGG CTGCTCGCCG GGGTCGCGGG CGTGGTCGCG CTGGACGTGG GCGAGCTGGC CGAGGCCGAG GACCGGTTCC GGGAAGTCGT TGGCGCGTCA GGGGAACTGG CGCACGCCCG GTGCGCCGGG CTGATCGGGC TCGGCGCGGT CCGCCTGGCG CGCGGGGAGC TCGACGCCGC CGAGGCGCTG CTGCGCGAGG GCCACGGGCT GGCGCTGCGG GTGGGCGCGG TGAACCTGGC GGCGTCCGGC GCGGACCTGC GCGGGCGGTG CCGGGGCGCG CGGGGCGACC ACCCCGGCGC GGTCGACCTG CTGCGGGAGG CGCGGGACGG GTGGGCCCGC ACCGGGGCCC GGCCGCCGCA CGCGGAGACC ACCGCGCACC TGGCCGCCGC GCTGTGCCTG GCCGGGGAGC ACGGCGAGGC GCTGCGGACC GCGCAGCGGG CGCTGGCCCT GGTGCAGGAG CTGGGCGGCA GCCCCCGGAT CGAGGCGGAC GTGCACAACG CGCTGGGCCT GGTGCAGCGG CACCTGGTGA ACCCCGAGGC GGCGGTCGCG GCGCACCTGC GGGCGCTGGA GCTGTCCGGC GGGGTCGGCT ACCGGCACGG GGTCGTGCAG GCCCACGTGC TGCTGGCGCC CGCGCTGCTG GCCGCCGGGC GGCGCGGGGA CGCGGTGCGG CACGCGCGGA TCGGCATCGG GCTGGCCGGG GAGACCGGGT ACGGCGGGCT GCGGCAGGCC GCCGAGTCGC TGCTGGCCGC GTTGGGCTGA
|
Protein sequence | MGWDLGIGFG DDEVGVLGPL LVRRGGVVVA VTAPKQRIVL ASLVMAANRA VSSAELVRAV WGDRAPARAA HTLAVYVMRL RRALGEPQLV HTTPSGYLIS LPHGAVDLHR FADHAAHGAL AASAKDFGSA VEHYQRALDC WRGPALADVP SEALHADEVP ALVEEQLRVT AELVDARLRL GHGAELVPDL RRLTARHPLR ERFWSQLMVA LHRADRQADA LDAYRQAGTA LARELGVDPS ESLRAVHHSI LTGDPALRGP LPSGWAPVSQ LPAPVGNFVG RADELDRVTA LLAGPAAVVA VCGPPGVGKT AFAVTVGHSV RERYHHGQLY ADLRGHSTSP PLSTTTVLGR FLRALGARPD SIPADEAELV RAYRDRLRGR RVLITLDNAA SAAQVLPLLP DVPECSVVIT SRNELAGDVG AAAVRLDVLR GDEAWMLLTR SLAPEAADEQ GDALRELARL CGYLPLALRI ALGNLVGAHT TDIRSYVDDL RGGDRLSALA VPDDDSAAVR RAFDLSHAAL RPDAAKLFRL TGLLPGPDFS AFGAAALLGA DEATARALAE ELASANLVQR VGDERFAVHD LLREYAAERA RAAGDDLEAA RGRLFDWYLS TASDVGDVLF PEVRPARGRA DLPDARTARA WLEAERPSLL AATERCARLG PLPMAWSLVE AVGGFLGSHG HHGGFLNAVR AAGDAARAAG DTEAGGVVLA HLVAAHRNLG DLRAARDAAR SGNPVGRAVW LLAGVAGVVA LDVGELAEAE DRFREVVGAS GELAHARCAG LIGLGAVRLA RGELDAAEAL LREGHGLALR VGAVNLAASG ADLRGRCRGA RGDHPGAVDL LREARDGWAR TGARPPHAET TAHLAAALCL AGEHGEALRT AQRALALVQE LGGSPRIEAD VHNALGLVQR HLVNPEAAVA AHLRALELSG GVGYRHGVVQ AHVLLAPALL AAGRRGDAVR HARIGIGLAG ETGYGGLRQA AESLLAALG
|
| |
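
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative and not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length (the gene sequence above ends in TGA, a stop codon under translation table 11). The function names `span_length`, `expected_protein_length`, and `gc_fraction` are hypothetical helpers introduced only for this example.

```python
# Values copied from the record above.
START_BP = 6273904
END_BP = 6276873
GENE_LENGTH_BP = 2970
PROTEIN_LENGTH_AA = 989


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Spaces and line breaks (as used in the record's 10-base blocks) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full Gene sequence field to `gc_fraction` should give a value comparable to the GC content field, provided that field is computed over the same sequence and rounded to a whole percent; that is an assumption, since the record does not state how the value was derived.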