Gene Amir_5323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_5323 
Symbol 
ID8329525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp6329566 
End bp6332643 
Gene Length3078 bp 
Protein Length1025 aa 
Translation table11 
GC content79% 
IMG OID644945761 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003102989 
Protein GI256379329 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATCC GGCTGCTGGG GCCGGTCGCG GTCGGGACGG ACGCCGTCCA GGCCGCGCTC 
GGCGGACCCA AGCCGAAGGC CCTGCTGGTG GCGCTCGCGC GCCAGGGCGG GCACGTCGTC
GGCATCGACC GCCTGGTCGA CCTGCTGTGG GGCGAGACAC CACCCGCGTC CGGCACGGCG
CTCGTGCACA CCTACGTCTC CCAGCTGCGC CGCGCGCTCG CCAAGGTCGG CCTGCCCGAC
GCCCTGCGCA CCAGCTCGCC CGGCTACCGC CTCGTCGCCG AGCCCGGCGA CTGCGACGTG
GACGTGTTCA CCGCCGAGCA CACCGCCGCC CGCGAGGCCG AGCGCTCCGG CGACCACGCC
GCCGCGCACG CCGCGTACGG CCGCGCGCTG GCCCTGTGGC GCGGCCCGGC GCTCGACGGC
GTCGACGCGG ACTTCGCGCT GGCGTGGGCG CAGGAGCTGG CCGAGCACCG GCTGGCCGCG
CGCGACGGCC TCGCGCGGAC GGCGCTGGCG CTGGGCCGCC CGCTGAGCGC GGCCGACGAG
CTGCGCGGCC TGGTCGCCGA GCACCCGCTG CGCGAGGAGA GCCGGGCGCT GCTGATGCGC
GCGCTGGCCG AGTCGGGCAG GCAGGCCGAC GCGCTGGAGG TCTACCGGGA CGGCAGGCGG
CACCTGCTGG ACGAGCTGGG CATCGAACCC GGCCGGGGCC TGCGCGAGCT GCACACCGCG
ATCCTGGACG GCTCCCTGAC CACCCCGGCC GCCGACGTGC CGCGCCCCGC CGCAGCCGCA
GCCGCAGTCA CGGCCACCGC GACCGCCGCC CCGGCCCGCC CGCTCACCCG CACCCTGCCG
CCCGACATCA ACGACTTCAC CGGCCGCGCC GACGAGCTGG CCGCGATCCT CGCCCTCGGC
GCCACCGGCC CCGACCGCCC CGCCGCGCCC GTCGTGGTGG TGTCCGGCGC GGGCGGCACC
GGCAAGTCCG CGCTGGCCGT GCACGCCGCG CACCTGCTGG CCGAGCAGTA CCCGGACGGC
CAGCTGTTCA CCGACCTGCG CGGCCACGGC GCCCCACCCA GCGCCTCCAC CGTGCTGGCC
CGCTTCCTCG GCGCGCTCGG CGTGCCCGTG GAGGACCTGC CACCCGGCCT GGACGACCGC
ATCGCCCTCT ACCGGCGGCA CCTGACCGGC CGCCGCCTGG TGATCGTGCT CGACAACGCC
CGCACCGAGC AGCAGGTGCG CCCGCTGCTG CCCACCGAAC CCGGCTGCCT GGTCCTGGTC
ACCAGCCGCG CCCGCCTCGC GGGCCTGGGC TCCGCCGTGG ACCTGGAGGT GTTCGACGCC
GGGTCGGCGG TGGAGATGCT GGGCCGGATC ATCGGCTCGG ACCGGGTCGC CTCCGCGCCC
GACGCCGCCC GCCGCATCGC GACCCTGTGC GCCGGGGTGC CGCTGGCGAT CCGGGCGGCG
GGCGCGAAGC TGCTGGCCCG CCCGCACTGG CCGCTCAAGT CGCTGGCGAC CCGGCTGTCC
GACGAGCGGC GGCGGTTGGA CGAGCTGACC GTGGGCGACC TGGCGATCCG CTCGTGCCTG
GGCCTGAACT ACGCCGAGCT GGACGAGCGC GCCAAGCACG CGTTCCACCT GCTGTGCCTG
CTGGACCTGC CGGACTTCGG CTGGTGGGTG GCCGCCCCGC TGCTGGGCGT GGACACCGCG
ACCGCCGAGG ACCTGGTGGA GAACCTGGTC GACCTGCGGC TGCTGGACGT GGCGGGCATC
GACCCGATCG GCCGGGTCCG CTACCGCTTC CACGACCTGG TGCAGCTCTT CGGCGCGGAG
CTCGCCGCCC AGCACGAGCC GCCCGGCGCC GCCACCGACG CGGTCGCCGC GTGCCTGGCG
GCGTGGGCCG ACCTGGCCGA GACCGGTGTG AAGGGCCTGC CGCGCGTCAC GCTCAGCCCC
CGCCTGCCCG CCGCCCCGGC CGGGGCCCCG AGCACCGATC CCGAGCTGGT CGCCGAGGTC
GAGGCCGACC CGGCGGGCTG GCTGGACGCC GAGACCGCCG CCGCCGTGCG CGCCGTCCAC
CGCGCCCACG AGCTGCGCGT CGACGCGGTG ACCACGCTGC GCGTCGCGGT GCTGCTGTCC
TCGGCGTTCG CCGCCCGCAA CGAGTTCGAG GCCTGGCAGC GCACCCTGCG GGTCGCGCTG
GCGCTGGCCG AGGAGAGCGG CGACCCGCGC GCCGGGGCCG TGGTCCTGGC CGGGCTGGGG
CAGCTGCACG CCGAGCTGGA CGAGTACGAC GAGGCCCTCG CGCACTTCGA GCGGGCCCTG
GAGCGCGCCG ACGCGGCCGG GGACGACGCG GTGCGCGCGG TGTGCCTGGC GGGCACCGGC
ACCGCGCACC GCGAGCGCGG CAACCCCGAG CTGGCCGCCA CCGCCCTGAC CGCCGCCGCC
GAGCTGGGCG CCGCGCTGGC CGACGACGCC GTCGTGGCCG CCGCCGAGTA CGGCCTGGGG
GCCCTGCGCC GCGACCTGGG CGACCTGGCG GGCGCGGCGG ACCGGTTCGA CCGGGCCGTG
CGCCGCTACG CCGCGGCGGG CGACCGGCGC GGCGAGGCGC TGGCCCTGCG CGGGGTGGCG
CTGTGCCACC GGGCGCGGGG TGAGCACCGG GTGTCCGCCG AGCTGTGCGA ACGGGCGGCG
GTGGCGCTCG ACGAGGTCGG TGACGCCCTC GGCGCGGCCT ACGCCCGCCA GGCGTGGGCC
AAGGCCGCCC TGCGCCTGCC CGGTTCCCCG ACCGCCGAGC TGGCGGCCTG CCTGGACGCG
TGCCTCGCGG TGTGCGAGCG CGGCAACGAC CGGTTCGGCG TCGCGCTGGT CACCCGCACC
ACCGGCGAGC TGCACCTGTC CCGAGGCGAC CTGGCCACGG CCCGCGAGCA CCTCCGCGCG
GCGCTGGCGG GCTGGACCGA GCTCGGCCTG GACACCTGGC GCGCCCGCAC CCTGCGCGAC
CTGGCCGCCG CGGACCCCGA GCACCAGGAC GAGCACTGGG CGCTGGCCCG CGAACTGCTC
AGCGGCACCG GGGCCCGCGA GGAGCGCGAA CTCGCCGAGA CCACCCCCGT GGGCTGGCGC
GCCGCCGTCG TCCGCTGA
 
Protein sequence
MRIRLLGPVA VGTDAVQAAL GGPKPKALLV ALARQGGHVV GIDRLVDLLW GETPPASGTA 
LVHTYVSQLR RALAKVGLPD ALRTSSPGYR LVAEPGDCDV DVFTAEHTAA REAERSGDHA
AAHAAYGRAL ALWRGPALDG VDADFALAWA QELAEHRLAA RDGLARTALA LGRPLSAADE
LRGLVAEHPL REESRALLMR ALAESGRQAD ALEVYRDGRR HLLDELGIEP GRGLRELHTA
ILDGSLTTPA ADVPRPAAAA AAVTATATAA PARPLTRTLP PDINDFTGRA DELAAILALG
ATGPDRPAAP VVVVSGAGGT GKSALAVHAA HLLAEQYPDG QLFTDLRGHG APPSASTVLA
RFLGALGVPV EDLPPGLDDR IALYRRHLTG RRLVIVLDNA RTEQQVRPLL PTEPGCLVLV
TSRARLAGLG SAVDLEVFDA GSAVEMLGRI IGSDRVASAP DAARRIATLC AGVPLAIRAA
GAKLLARPHW PLKSLATRLS DERRRLDELT VGDLAIRSCL GLNYAELDER AKHAFHLLCL
LDLPDFGWWV AAPLLGVDTA TAEDLVENLV DLRLLDVAGI DPIGRVRYRF HDLVQLFGAE
LAAQHEPPGA ATDAVAACLA AWADLAETGV KGLPRVTLSP RLPAAPAGAP STDPELVAEV
EADPAGWLDA ETAAAVRAVH RAHELRVDAV TTLRVAVLLS SAFAARNEFE AWQRTLRVAL
ALAEESGDPR AGAVVLAGLG QLHAELDEYD EALAHFERAL ERADAAGDDA VRAVCLAGTG
TAHRERGNPE LAATALTAAA ELGAALADDA VVAAAEYGLG ALRRDLGDLA GAADRFDRAV
RRYAAAGDRR GEALALRGVA LCHRARGEHR VSAELCERAA VALDEVGDAL GAAYARQAWA
KAALRLPGSP TAELAACLDA CLAVCERGND RFGVALVTRT TGELHLSRGD LATAREHLRA
ALAGWTELGL DTWRARTLRD LAAADPEHQD EHWALARELL SGTGAREERE LAETTPVGWR
AAVVR