Gene Amir_3571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_3571 
Symbol 
ID8327761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp4147372 
End bp4150236 
Gene Length2865 bp 
Protein Length954 aa 
Translation table11 
GC content76% 
IMG OID644944067 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003101307 
Protein GI256377647 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACGAGG TGGTGGAACG GGGCGGGCGC CTGGCGTTCC GGGTGCTGGG ACCGATCGAG 
GTCACCGGCT CCGGGGGGCC GGTGCGCATC CCGCCGGGAC GGCAGCAGGT CATCCTGGCC
TGCCTGCTCG TGGAGGCGAA CAAGGTGGTC AGCACGGACC ACCTGGTGGA CGCGCTGTGG
GAGGTCAACC CGCCGGACAC CGCGCGCACC CAGGTGCAGA TCTGCGTGTC GCGGCTGCGC
AAGACGCTGG CCGACGCGGG CGTGGACGTC TCCATCGTGA CCCGCCCGCC GGGCTACCAG
CTGCGCCTGC CGGACGCCTC GCTGGACGTG CACGAGTTCA CCAGGGGCGT CACCGAGGGC
CGCGCGGCGG CCAGGCGCGG CGAGCTGTCC GAGGCGTCCG AGCTGCTGCG GGCCTCGGTG
GGGCTGTGGC GCGGGGAGTG CCTGAGCGGG CTGACGAGCG CGCCGCTGCG CACCAGGGCG
CTGCGCCTGG AGGAGGACCG GCTCAACGCG GTGGAGACCT GCCTGGGCAT CGACCTGGAG
CTGGGCCTGC ACCACGAGCT GGTCGGCGAG ATCGGCAGGC TGGTGCGCGA GCACCCGCTG
CGGGAGCGGC CGAGGGCGCT GCTGATGCTG GCGCTGTACC GGTCGGGCCG CAAGGCCGAG
GCCCTGGAGG TGTACCGGGA GGGCCGCGAC CTGCTGGTGG AGGAGCTGGG TCTGGAGCCC
GGCGAGGAGC TGCGGGAGCT GGAGCGGGCG ATCCACGCCG GGGACGCCTC GCTGCTGCGC
GGCCCCGAGC CCGCGCGCGA GCGAAGACCC GCTGAGCCCG CCGACGTCGC GCGCGCGGCG
GTGCCCAGGC AGCTGCCCGC CGACACCGCC GACTTCATCG GCGGCGAGGA GCTGATCACC
GCCGCCGAGG AGGTGCTCAC CGGGGGCGCG GGGCGGCGCG CGGTCGGCGT CGTGGTGGTC
ATCGGCAGGC CGGGCGTGGG CAAGTCGACG CTGGCCGCGC ACCTGGGGCA CCGGGTCGCC
GAGGAGCACT TCCCCGACGG GCAGCTGTAC TGCGACCTGC GCGGCGGCTA CGGCGACGCC
GGCGGGTCCG CCGACGTGCT CGGCCGGTTC CTGCAGGCGC TCGGCATCCC CGGCGCGATG
ATCCCGGTGG AGCACACCGC GCGCACCGAG ATGTACCGGA CGCTGCTGGC GGACCGGCGG
GTGCTGGTGG TGCTGGACAA CGCGGTCAGC GAGCGCCAGG TGCTGCCGCT GCTGCCCGGC
GGCGGGCGCT GCGCGGTGGT GGTGACCAGC CGGGCGCGGC TGACCGGGCT GCCGGGCGCG
CGGCAGCTGG AGCTGGACGT GCTGGACCGG GAGCAGTCGC TGGAGCTGCT CGGCCGGGTC
GTGGGCGAGC GGCGGGTGGC GGGCGAGCCG GAGGCCGCGG AGGCGCTGGT GCGCACCGTC
GGCGGGCTGC CGCTGGCGCT GCGGATCGTC GCGGCGCGGC TGGCGGCCCG GCCGCACTGG
TCGCTGGCGT CGATGGTGCA CCGGCTGGCC AGCGAGCGGC ACCGCCTCGA CGAGCTGGCG
CACGGCGAGA TGACGATCCG GGCGAGCCTG TCGCTGACCC ACGACGGGCT GGACCAGCCG
ACGCGGCGGC TGTTCGGGCT GCTCAGCCTG GCCGAGGGCC CGTCGCTGCC CGGCTGGGTG
GCGGGCGCGG CGCTGGACGA CGGCAGGCCG TACGCGTCGG ACCTGATCGA GCCGCTGGTG
GACGTGCAGA TGCTCGACGT GGTCTCGGTC GACGGCACCG GCGAGTTCCG CTACCGCTTC
CACGACATCA TCCGGCTGTT CGCCCGCGAG CAGCTGGCGT CGGTGGACGA GCGGGAGCAG
CGGGAGGTGC AGGAACGGGT GCTGGGCGGC TGGCTGTCGC TGGCCGAGCA GGCGCACCGG
GGCGTGTTCG GCGGCGACTT CACCGCCCTG CACGGGAGCG CGCCGCGCTG GCACCCGCAC
CCCGTGCACG CCGAGCGGCT GCTGGAGAGC CCACTGGAGT GGCTGGAGGG CGAGCTGCCG
AACCTGCGGG CGGCCGTGGC GCAGGCGGCG CGGCTCGGGC TGGACGAGCT GTGCTGGGAC
CTGGCGGTGA CCACGACGAC GCTGTTCGAG GCGCGCGGCC ACCTGGACGA CTGGCGGCAC
ACCCACGACG AGGCGCTGCG GGCCACCAGG GCCGCGGGCA ACGCGCGCGG CACGGCGGCG
CTGCTGGCCT CGCTCGGCAC CCTGCACATC AACCGGGGGC GCGCCGAGGA GTCCGGGGCG
GTCCTGGTGG AGGCGCTGGC GGCGTTCACC GAGCTGGGCG ACGTGCGCGG GCAGGCGCTG
TGCAGGCGCG ACCTGGGGCT GCTCACCCGG CAGGCCGGGG ACGACGCGGG CGCGCTGGCG
CTGTACGGGC TGGCGCTGGC CGGGTTCGAG GAGGTCGGCG ACGTCGTCGG GCGGGCGATC
GTGCTGACCC AGCGGGCGCA CGTGCTCATG CGCACCGGGC GGGACGACGA GGCGCTCGCG
CAGCTCGCGG AGGCGATGGC CACCTGCCGG GAGGTCGGGT ACACCGGCGG GGTGGCGACC
ACGATGCGGC GCATCGGGCA GGTGCAGCTG CACCGGGGTG AGCACGAGCT CGCGGAGCGG
ACGCTGACCG AGGTGCTGGA GATGGTGCGG GCCAGCCGGG ACGTGATCGG CGAGGGGCAC
CTGCTGCACA ACCTGGGCGA GGTGAACGCG GCGGCGGGGC GCGTCGAGGC GGCTCGGGAG
TGCTTCGAGC GGTCGCTGGC GGTGCGGGAG CGGATGATGG ACCACGGCGG GGTGGCGGTG
GTGCGGCGGG AGCTGGCGCT GCTGGAGGGG AAGGTCCCCG CGTAG
 
Protein sequence
MDEVVERGGR LAFRVLGPIE VTGSGGPVRI PPGRQQVILA CLLVEANKVV STDHLVDALW 
EVNPPDTART QVQICVSRLR KTLADAGVDV SIVTRPPGYQ LRLPDASLDV HEFTRGVTEG
RAAARRGELS EASELLRASV GLWRGECLSG LTSAPLRTRA LRLEEDRLNA VETCLGIDLE
LGLHHELVGE IGRLVREHPL RERPRALLML ALYRSGRKAE ALEVYREGRD LLVEELGLEP
GEELRELERA IHAGDASLLR GPEPARERRP AEPADVARAA VPRQLPADTA DFIGGEELIT
AAEEVLTGGA GRRAVGVVVV IGRPGVGKST LAAHLGHRVA EEHFPDGQLY CDLRGGYGDA
GGSADVLGRF LQALGIPGAM IPVEHTARTE MYRTLLADRR VLVVLDNAVS ERQVLPLLPG
GGRCAVVVTS RARLTGLPGA RQLELDVLDR EQSLELLGRV VGERRVAGEP EAAEALVRTV
GGLPLALRIV AARLAARPHW SLASMVHRLA SERHRLDELA HGEMTIRASL SLTHDGLDQP
TRRLFGLLSL AEGPSLPGWV AGAALDDGRP YASDLIEPLV DVQMLDVVSV DGTGEFRYRF
HDIIRLFARE QLASVDEREQ REVQERVLGG WLSLAEQAHR GVFGGDFTAL HGSAPRWHPH
PVHAERLLES PLEWLEGELP NLRAAVAQAA RLGLDELCWD LAVTTTTLFE ARGHLDDWRH
THDEALRATR AAGNARGTAA LLASLGTLHI NRGRAEESGA VLVEALAAFT ELGDVRGQAL
CRRDLGLLTR QAGDDAGALA LYGLALAGFE EVGDVVGRAI VLTQRAHVLM RTGRDDEALA
QLAEAMATCR EVGYTGGVAT TMRRIGQVQL HRGEHELAER TLTEVLEMVR ASRDVIGEGH
LLHNLGEVNA AAGRVEAARE CFERSLAVRE RMMDHGGVAV VRRELALLEG KVPA