Gene Amir_6166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_6166 
Symbol 
ID8330377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp7228008 
End bp7231118 
Gene Length3111 bp 
Protein Length1036 aa 
Translation table11 
GC content72% 
IMG OID644946600 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003103819 
Protein GI256380159 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000170258 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAGCT TCGCCGTGCT GGGCCCCCTG CGGGCAGAAC TGGACCGCGG ACCCGCGGAC 
CTCAAGGGCC CCCGCCACCG CGCGGTGCTG GCCAGGCTGC TGGTCGCGCG CGGCCGAACC
GTCCCCCTGG ACACCCTGGT CGCCGACCTC TGGGACGACG CCCCACCCCC GAGCGCGCGC
GGCGCCGTCC AGACCTTCGT CGGCGACCTG CGCAAGGCCC TGGAACCGGA CCGCCCACCC
CGCACCCCAC CCCACCTGCT GGTCACGGTC GCGAACGGCT ACGCGCTGCG CACCGACAAC
ACCGACGCCC ACCACTTCGA GTCCGCCGTC CACCAGGCCA AGTCCGCCCC ACCACCCCGA
GCCCGAACCC TGCTCACCAG CGCCTTGGCC CTCTGGCGCG GCCCCGCGTA CGCCGAGTTC
GCCGACCACC CCTGGGCACT GGCCGAGTCG ACCGCCCTGG AAGAGCTCCG CCTCCTGGCC
GTGGAACGCC TGGCCGAGAC CGCCCTGTCC CTGAACGCCC CCGCCGACGC CATCCCACCC
CTCACCCGGC ACGCCGCGTC CCACCCGCAC CGCGAGCACG CCGCCCACCT CCTGGCGCTG
GCCCTCTACC GCACCGGCCG CCAGGGCGAG GCCCTGGAAA CCCTGCGCCG CACCAGATCC
GCACTCCGCG CCGACCTGGG CGTGGACCCC GGCGAACCCC TGCGCGCCCT GGAAGCCGAC
ATCCTCGCCC AGTCCCGAAC CCTGCTCCCC CCGCCGCGCC CCGCCACCCC GCCGCGCCCC
GCCATCCCGC CCCGAACCGC CACCCCGCCC CGAACCGCCA CCCCACCCCT GTTCGGCCGC
GAGGAAGAAC TGGCCACCCT CACCCGAGCC GCCACCGAGG CCGTCAGAAC CCACCGCCTC
CACCACGTCC TCGTCTCAGG CGCAGCGGGT TCCGGCAAAT CCGCCCTGAC CGAAGCACTG
GCCACCCACC TGCGCGCCGA GGGCTGGTCC ACCGCCACCA CCACCTGCCC CGACCTCCCC
GGAACCCCGG CAGCCTGGCC GTGGACCGCA CTCCGCACCC ACCTGGGCCT CCCCCCGGAA
CCCGACCGCA CCCCCCGCTT CACCACCCTC CGCACCCTGT CCGCCCACCT GTCCAAGTCC
TCGCCCACGC TCCTGGTCCT GGACGACCTG CACCAGGCGG ACGAGGACAC CCTCGCCCTG
CTCACCGCCC TCCCGCCCGA GGCGGGCCCA ACCCTGGTCG TCGGCACCCA CCGGGCCACC
GACATCCCGC CCACCCTCAC CGCCGCGCTG GCCCGCCTGG CCCGAACCAC CCCGACCCGC
CTCTACCTCT CCGGCCTGGA CGAGCAGGCG GTGGCGAACC TGATCGCCAC CCACCGCCCC
CCGACGCGAC GAGCGACCAG GTCCATCCAC ACCAGGAGCG CGGGCAACCC GTTCCTGGCC
CACGAACTGG CCAAGCTGTG GGCGACCGAG GGCGACGAGG CCCTGCGCAC CGTCCCCGCA
GGCGTCCGAG ACGTCCTCCT GCACCGCCTC TCCGCCCTCC CGGAACCCGC GGGAACCCAC
CTCCGCCAAG CCGCCGTCCT GGGCCGCGAG GTCGACCTCG CGATCCTCGC CGAGCTGGCG
GGCGAAGACG TCCTGGACTC GATCGAGTCC GCGATCACCG CCGGTTTCCT GACAGAGCAC
GACGCAGACC ACGTCCACTT CACCCACGAC CTGGTCCACG AGACCGTCCG AGCCGACACC
ACGGCCCCCA GACGCGCCCG CTGGCACGTG GCCGCAGCAG AAGCCGTAGA GCGCGCCACC
CCGGACGAAC ACGAACGCAT CGCCCACCAC CTGCTGGAGG CAGCCACAAG AACAACGGCC
GCCAAAGCCG CTCACCACGC GTCCCTGGCC GCAACCAGGG CCGAACACCG CTCGGCCCCG
CACGAGGCGG CAAGGCTCTG GCGCGCCACG ATCAACTCCC TGGACCACTC CCCAACCCCA
CACCCCCAAG CCCGCCTGAC CGCCGTGATG GGCCTGGTCC GAGCCCTGGC CGTGACCGGC
GACCTGGCCG CAGCCCGCGA ACACCGAGCC GAAGCAGTCC ACCAAGCGGA GTCCCTCCCC
GACCCAGTCC ACAGGGCACG CGTAGTCGGC TCCTTCGACG TCCCAGCCCT GTGGACAACC
CCCGATGACG ACGCCCTCTC CGCGACTCTG GCCGCGTCAG CCAACCGAGC CCTGACCTCC
CTCCCCGCCA CCTACCAGGC CGAACGAGCC CGCCTCCTGG TGACCATCGC GATGGAACGA
AGGGCAGACC CAACCCACGA GGCGTCCCAA GCCGCGCAGG AGGCCGAACG CATAGCCAGA
TCCCTGGCCG ACCCCACCCT CCTAGCGCTG TCCCTGAACG CCCGCTACCT CCAGACCTTC
CACCGGGCAG GCCTGGCCCC AGCCCGCAAA TCCCTGGCAG AGGAACTCCT CACAGTCACC
TCCGCCCAAC CCGACCTGGT CGCCTTCGAG GTCCTGGCCC ACCTACTCCT GATCCAGTCC
TCAGCGGCCC TGGCAGACCT CCCCACCGCA GACCACCACG CCACCCGCGC AAACCACCTG
GCAACCCGGA ACGACCTCCC CCTGGTAACC CCGTTCACCG ACTGGTACCA GGCCCTCCGC
CTATCCCTGA CGGGCCACAA GACCAAGGCC GAAGCCGCCT ACCGGGCAGC CGCCCCGAAG
CTCACGGAAA CCCACCTCCC CGGCCTGGCC CCCGGCCTCC TCTCCCTGGC CCTGCACACC
CTGGGCGTCC CCCAACCCAC CCCGGACTGG GCCACGAACC AACCCTGGAC CGCCCCCACA
ACCCACCACA TCCCCAAGGC CCCACACGAC CACCTCTACG AACTGAGAAC CTGCCTGCAC
GCACAAGCAG CCCTGTCCAG ACCCACCAAG AACCACGAGG AACTGACCGA ACTCCACAAC
GCCCTGGCCC CAGCGGAAGA CGAACTGGCA GGCGCCACAA CGGGCCTGGT CTCCCTGGGC
CCAGTAGCCG CCTACCTGGC AGACCTCTCC CAAGCCCTGG GCGACCTCAA AGAGGCAACG
CGCCTACGCG CCAAAGCAAC CACCCTCACC ACCCGCCTGC GCACCGCCTG A
 
Protein sequence
MVSFAVLGPL RAELDRGPAD LKGPRHRAVL ARLLVARGRT VPLDTLVADL WDDAPPPSAR 
GAVQTFVGDL RKALEPDRPP RTPPHLLVTV ANGYALRTDN TDAHHFESAV HQAKSAPPPR
ARTLLTSALA LWRGPAYAEF ADHPWALAES TALEELRLLA VERLAETALS LNAPADAIPP
LTRHAASHPH REHAAHLLAL ALYRTGRQGE ALETLRRTRS ALRADLGVDP GEPLRALEAD
ILAQSRTLLP PPRPATPPRP AIPPRTATPP RTATPPLFGR EEELATLTRA ATEAVRTHRL
HHVLVSGAAG SGKSALTEAL ATHLRAEGWS TATTTCPDLP GTPAAWPWTA LRTHLGLPPE
PDRTPRFTTL RTLSAHLSKS SPTLLVLDDL HQADEDTLAL LTALPPEAGP TLVVGTHRAT
DIPPTLTAAL ARLARTTPTR LYLSGLDEQA VANLIATHRP PTRRATRSIH TRSAGNPFLA
HELAKLWATE GDEALRTVPA GVRDVLLHRL SALPEPAGTH LRQAAVLGRE VDLAILAELA
GEDVLDSIES AITAGFLTEH DADHVHFTHD LVHETVRADT TAPRRARWHV AAAEAVERAT
PDEHERIAHH LLEAATRTTA AKAAHHASLA ATRAEHRSAP HEAARLWRAT INSLDHSPTP
HPQARLTAVM GLVRALAVTG DLAAAREHRA EAVHQAESLP DPVHRARVVG SFDVPALWTT
PDDDALSATL AASANRALTS LPATYQAERA RLLVTIAMER RADPTHEASQ AAQEAERIAR
SLADPTLLAL SLNARYLQTF HRAGLAPARK SLAEELLTVT SAQPDLVAFE VLAHLLLIQS
SAALADLPTA DHHATRANHL ATRNDLPLVT PFTDWYQALR LSLTGHKTKA EAAYRAAAPK
LTETHLPGLA PGLLSLALHT LGVPQPTPDW ATNQPWTAPT THHIPKAPHD HLYELRTCLH
AQAALSRPTK NHEELTELHN ALAPAEDELA GATTGLVSLG PVAAYLADLS QALGDLKEAT
RLRAKATTLT TRLRTA