Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_6166 |
Symbol | |
ID | 8330377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 7228008 |
End bp | 7231118 |
Gene Length | 3111 bp |
Protein Length | 1036 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644946600 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003103819 |
Protein GI | 256380159 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000170258 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAGCT TCGCCGTGCT GGGCCCCCTG CGGGCAGAAC TGGACCGCGG ACCCGCGGAC CTCAAGGGCC CCCGCCACCG CGCGGTGCTG GCCAGGCTGC TGGTCGCGCG CGGCCGAACC GTCCCCCTGG ACACCCTGGT CGCCGACCTC TGGGACGACG CCCCACCCCC GAGCGCGCGC GGCGCCGTCC AGACCTTCGT CGGCGACCTG CGCAAGGCCC TGGAACCGGA CCGCCCACCC CGCACCCCAC CCCACCTGCT GGTCACGGTC GCGAACGGCT ACGCGCTGCG CACCGACAAC ACCGACGCCC ACCACTTCGA GTCCGCCGTC CACCAGGCCA AGTCCGCCCC ACCACCCCGA GCCCGAACCC TGCTCACCAG CGCCTTGGCC CTCTGGCGCG GCCCCGCGTA CGCCGAGTTC GCCGACCACC CCTGGGCACT GGCCGAGTCG ACCGCCCTGG AAGAGCTCCG CCTCCTGGCC GTGGAACGCC TGGCCGAGAC CGCCCTGTCC CTGAACGCCC CCGCCGACGC CATCCCACCC CTCACCCGGC ACGCCGCGTC CCACCCGCAC CGCGAGCACG CCGCCCACCT CCTGGCGCTG GCCCTCTACC GCACCGGCCG CCAGGGCGAG GCCCTGGAAA CCCTGCGCCG CACCAGATCC GCACTCCGCG CCGACCTGGG CGTGGACCCC GGCGAACCCC TGCGCGCCCT GGAAGCCGAC ATCCTCGCCC AGTCCCGAAC CCTGCTCCCC CCGCCGCGCC CCGCCACCCC GCCGCGCCCC GCCATCCCGC CCCGAACCGC CACCCCGCCC CGAACCGCCA CCCCACCCCT GTTCGGCCGC GAGGAAGAAC TGGCCACCCT CACCCGAGCC GCCACCGAGG CCGTCAGAAC CCACCGCCTC CACCACGTCC TCGTCTCAGG CGCAGCGGGT TCCGGCAAAT CCGCCCTGAC CGAAGCACTG GCCACCCACC TGCGCGCCGA GGGCTGGTCC ACCGCCACCA CCACCTGCCC CGACCTCCCC GGAACCCCGG CAGCCTGGCC GTGGACCGCA CTCCGCACCC ACCTGGGCCT CCCCCCGGAA CCCGACCGCA CCCCCCGCTT CACCACCCTC CGCACCCTGT CCGCCCACCT GTCCAAGTCC TCGCCCACGC TCCTGGTCCT GGACGACCTG CACCAGGCGG ACGAGGACAC CCTCGCCCTG CTCACCGCCC TCCCGCCCGA GGCGGGCCCA ACCCTGGTCG TCGGCACCCA CCGGGCCACC GACATCCCGC CCACCCTCAC CGCCGCGCTG GCCCGCCTGG CCCGAACCAC CCCGACCCGC CTCTACCTCT CCGGCCTGGA CGAGCAGGCG GTGGCGAACC TGATCGCCAC CCACCGCCCC CCGACGCGAC GAGCGACCAG GTCCATCCAC ACCAGGAGCG CGGGCAACCC GTTCCTGGCC CACGAACTGG CCAAGCTGTG GGCGACCGAG GGCGACGAGG CCCTGCGCAC CGTCCCCGCA GGCGTCCGAG ACGTCCTCCT GCACCGCCTC TCCGCCCTCC CGGAACCCGC GGGAACCCAC CTCCGCCAAG CCGCCGTCCT GGGCCGCGAG GTCGACCTCG CGATCCTCGC CGAGCTGGCG GGCGAAGACG TCCTGGACTC GATCGAGTCC GCGATCACCG CCGGTTTCCT GACAGAGCAC GACGCAGACC ACGTCCACTT CACCCACGAC CTGGTCCACG AGACCGTCCG AGCCGACACC ACGGCCCCCA GACGCGCCCG CTGGCACGTG GCCGCAGCAG AAGCCGTAGA GCGCGCCACC CCGGACGAAC ACGAACGCAT CGCCCACCAC CTGCTGGAGG CAGCCACAAG AACAACGGCC GCCAAAGCCG CTCACCACGC GTCCCTGGCC GCAACCAGGG CCGAACACCG CTCGGCCCCG CACGAGGCGG CAAGGCTCTG GCGCGCCACG ATCAACTCCC TGGACCACTC CCCAACCCCA CACCCCCAAG CCCGCCTGAC CGCCGTGATG GGCCTGGTCC GAGCCCTGGC CGTGACCGGC GACCTGGCCG CAGCCCGCGA ACACCGAGCC GAAGCAGTCC ACCAAGCGGA GTCCCTCCCC GACCCAGTCC ACAGGGCACG CGTAGTCGGC TCCTTCGACG TCCCAGCCCT GTGGACAACC CCCGATGACG ACGCCCTCTC CGCGACTCTG GCCGCGTCAG CCAACCGAGC CCTGACCTCC CTCCCCGCCA CCTACCAGGC CGAACGAGCC CGCCTCCTGG TGACCATCGC GATGGAACGA AGGGCAGACC CAACCCACGA GGCGTCCCAA GCCGCGCAGG AGGCCGAACG CATAGCCAGA TCCCTGGCCG ACCCCACCCT CCTAGCGCTG TCCCTGAACG CCCGCTACCT CCAGACCTTC CACCGGGCAG GCCTGGCCCC AGCCCGCAAA TCCCTGGCAG AGGAACTCCT CACAGTCACC TCCGCCCAAC CCGACCTGGT CGCCTTCGAG GTCCTGGCCC ACCTACTCCT GATCCAGTCC TCAGCGGCCC TGGCAGACCT CCCCACCGCA GACCACCACG CCACCCGCGC AAACCACCTG GCAACCCGGA ACGACCTCCC CCTGGTAACC CCGTTCACCG ACTGGTACCA GGCCCTCCGC CTATCCCTGA CGGGCCACAA GACCAAGGCC GAAGCCGCCT ACCGGGCAGC CGCCCCGAAG CTCACGGAAA CCCACCTCCC CGGCCTGGCC CCCGGCCTCC TCTCCCTGGC CCTGCACACC CTGGGCGTCC CCCAACCCAC CCCGGACTGG GCCACGAACC AACCCTGGAC CGCCCCCACA ACCCACCACA TCCCCAAGGC CCCACACGAC CACCTCTACG AACTGAGAAC CTGCCTGCAC GCACAAGCAG CCCTGTCCAG ACCCACCAAG AACCACGAGG AACTGACCGA ACTCCACAAC GCCCTGGCCC CAGCGGAAGA CGAACTGGCA GGCGCCACAA CGGGCCTGGT CTCCCTGGGC CCAGTAGCCG CCTACCTGGC AGACCTCTCC CAAGCCCTGG GCGACCTCAA AGAGGCAACG CGCCTACGCG CCAAAGCAAC CACCCTCACC ACCCGCCTGC GCACCGCCTG A
|
Protein sequence | MVSFAVLGPL RAELDRGPAD LKGPRHRAVL ARLLVARGRT VPLDTLVADL WDDAPPPSAR GAVQTFVGDL RKALEPDRPP RTPPHLLVTV ANGYALRTDN TDAHHFESAV HQAKSAPPPR ARTLLTSALA LWRGPAYAEF ADHPWALAES TALEELRLLA VERLAETALS LNAPADAIPP LTRHAASHPH REHAAHLLAL ALYRTGRQGE ALETLRRTRS ALRADLGVDP GEPLRALEAD ILAQSRTLLP PPRPATPPRP AIPPRTATPP RTATPPLFGR EEELATLTRA ATEAVRTHRL HHVLVSGAAG SGKSALTEAL ATHLRAEGWS TATTTCPDLP GTPAAWPWTA LRTHLGLPPE PDRTPRFTTL RTLSAHLSKS SPTLLVLDDL HQADEDTLAL LTALPPEAGP TLVVGTHRAT DIPPTLTAAL ARLARTTPTR LYLSGLDEQA VANLIATHRP PTRRATRSIH TRSAGNPFLA HELAKLWATE GDEALRTVPA GVRDVLLHRL SALPEPAGTH LRQAAVLGRE VDLAILAELA GEDVLDSIES AITAGFLTEH DADHVHFTHD LVHETVRADT TAPRRARWHV AAAEAVERAT PDEHERIAHH LLEAATRTTA AKAAHHASLA ATRAEHRSAP HEAARLWRAT INSLDHSPTP HPQARLTAVM GLVRALAVTG DLAAAREHRA EAVHQAESLP DPVHRARVVG SFDVPALWTT PDDDALSATL AASANRALTS LPATYQAERA RLLVTIAMER RADPTHEASQ AAQEAERIAR SLADPTLLAL SLNARYLQTF HRAGLAPARK SLAEELLTVT SAQPDLVAFE VLAHLLLIQS SAALADLPTA DHHATRANHL ATRNDLPLVT PFTDWYQALR LSLTGHKTKA EAAYRAAAPK LTETHLPGLA PGLLSLALHT LGVPQPTPDW ATNQPWTAPT THHIPKAPHD HLYELRTCLH AQAALSRPTK NHEELTELHN ALAPAEDELA GATTGLVSLG PVAAYLADLS QALGDLKEAT RLRAKATTLT TRLRTA
|
| |