Gene Information |
Locus tag | Achl_4437 |
Symbol | |
ID | 7280005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011879 |
Strand | + |
Start bp | 377366 |
End bp | 380221 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643580391 |
Product | RNA polymerase, sigma 32 subunit, RpoH |
Protein accession | YP_002478205 |
Protein GI | 219883041 |
COG category | [K] Transcription; [L] Replication, recombination and repair |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32); [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
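The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The record does not state either point, but both are consistent with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the Gene Information table.
length = gene_length_bp(377366, 380221)
print(length)                     # 2856
print(protein_length_aa(length))  # 951
```

Under these assumptions, both results agree with the Gene Length (2856 bp) and Protein Length (951 aa) rows.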
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 81 |
Fosmid unclonability p-value | 0.118841 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCGGT GGCGTGCCGC CGGCCGCCGC GGGGTCATCG AGGCGGTCAC CGGGGCAGGC AAGACGCGGG TCGGCATCGC CTCCGCCTTC GAAGCAGTGC GGCGCGGATT CAAGGTCTTG GTTCTCGTCC CCACCGCAGA GCTGCAGATC CAGTGGCTCC AGGCCCTGGC CAGGGACCTC CCGGGGGCGC GGCGTGGGGC GTTGGGCGAC AACCGCCATG ACAGCCTGGA CACCGTGGAC GTCCTCGTCG CGATCGTTCA TTCCGCGGCA ACGCGCCAAA CCCTCCGGGA CCACAAGGCC GGGCTGATCA TCGCGGACGA ATGCCACCGA TACGCAGCCC CGATGTTCGC AGAAGCCCTG GAAACCGGCT ACACCTGGCG TCTGGGCCTG TCCGCCACCT ATGAACGCAC CGACGGTGAA CACCTCGTCC GGCTCGCACC TTTCTTCGGC GACATCGTCT TCAGGCTCTG GTACGACAGG GCGCTGGCAG AGAAGATCAT CGCCCCTTTC GACGTCGCCA TGGTCGGTGT TGAACTCACC CCGTCAGAGC GCGACGAATA CGATGCGTTT ACGGAGACGA TGGCCAAAGC CACCGGCCCG CTTGAGATCT ACCTGCACAT GCCTAGGAAC CCGTTCGCGC CGTTCATTGC CGCAGTCGCG CACCTGGCAG ACAGCAAAAT CGACTCCCCG GGCCCGGCCC TGGCCCGCAA ATACATGGCC GCGATGACCG GCCGGCAGAA CCTGCTGGCG GAGACACCGA CCAAGCGCAT GGCCCTGGCA GCCCTGAACC GTGCAATCAC CGCCGCCGGA CCGACCCTGG TGTTCACCCA GACGAAGAAA TCCGCTCTGT CTGCTGCGCA GGTCTGCGCT GCGATGGGCA ACCCCGCCAC CACGGTGATG TCGGGCATGA GCCGCGACCA GCGTGCCGCG GCGCTGGGCA GCTTCCGGGA CGGGAACGCC AAAGTCCTCA CCGCACCGCG GGTACTGGAC GAGGGCATCG ACGTTCCCGA GGCCGATCTG GGCATCATGG TCGCCGCCAG CAGCAGCCAG CGCCAACTCG TCCAGCGTCT CGGCCGGGTC ATCCGGAAGA AGATTGACGG CCGGGCCGGG CGCTTTGTCG TCCTCTACTC CAGGCACACT GTGGAGGACC CCGCGGTCCG GGGCGACGAG TACCTTGGCG CCGTCCTGCC CTTTGCCCGC AGGCAGGAAA CGTTCCGGAT TGAGACCGAT GTGGAGGCTG TTGAGGCGTT CCTCGCCTAC ACAGAACATG AAACTAAAGT GCCGGAGGTC CCTGAGGTGC CCCAAGCTCC TGCGGTTGAA CCGCCTGCCG CCCCGTTCGT CCTCGAACGC GACGACAGTG ACTGGTCCGA AGCGCCCGTC CTTGAAGGCA GCATCGGCGA CGACGTGCGG CTCTATCTCG AACAGGCCAG CTCCTTCGAC TTGTTGACCC ATGAGGAGGT CACAGATCTG GGGATGGCAA TCGAAGCCGG ACTGTACGCT GAGCACCTGC TGGGCACGGC ACGTCCCGAA GGGCGCCGGG CGATCCTCGA GTTGGAGTCC GTCGCGTCCG CGGGCCGCGA TGCGGTGCTC AAGCTCGTGA GCTGCAACCT GCGCCTGGTC GTGTCCATCG CCAAGCGCCA CACCGGCCGC GGCATGGATT TCCTCGACCT CATCCAGGAG GGCAACGCCG GGCTCTACCG CGCCGTCCAG AAGTTCGACT ACACGAAGGG CTTCAAGTTC TCCACATACG CAACCTGGTG GATCCGCCAG GCCATCACCC GCGGAATCGC CGACCAGTCA 
CGAACAATCC GGCTCCCCGT CCACTTCCAC GAGCAGGTCG TCAGCGTCTT CATCGCCGAA CGGAACTTCC TCCAAAGCGA AGGACGCGAC GGCACTGCGG AGGAGATCGC TGAGGCATGC GGCAAGACCG TCGACGAGAT CAACGCTGTC CGCCGCCACC GGACCCCGCC GGCCTCGCTG GACTGGGAAG TGCCCAACGG TAAGGGCGGC GTCGAGCTGC TGGGGGAGAC CCTCTACGAC CCCGACGAAC CCACCGCGTA CGACGCCGCC GTGCTGGCAC AGCTGCAGTC CGCGGTGCAC GGAGCCCTGG ACCGGCTCTC GGAACGGGAG GCCGGTGTCC TCGCCTACAG GTTCGGACTG GCCACCGGGG AACAACTGAC GCTTGATCAG ATCGGCACGA TCTACGGGGT GACCCGGGAA CGCATCCGTC AGATCGAAGC GAAAGCGTTC GCTGCACTCC GTGAGCCAGC CGCAGCAGCA CAGCTCAGGG ACTTCTTCGA CGGACGCAGC GATCTTGACG CCGCTGGCAG CGCAGATACA TCGAAAGCTA CGACCGCACC GGCCGCGAAG AAGCCGACCA AGCCCGCCAA GAAAAAGAGA AGGAAAGTGG AAGCGGGGCC TGTCGTGGAA ATGCGGAAAG CCAAATGGGA CCGTGCGCTG GCGAAACTCT CGGCGTTCGT AGCCCGGGAA GGCCACGCCC TGGTCCCCAC CAAACACGTC GAATCCGGCC ACAACCTGGG GGCTTGGATC AACACCCAGC GCAACGCGCT CCGCCTGGGC ACCATCAGCC CCGAACGGCT TGCTGAGATC GACGCGATCG ACATTTCGTG GCGCAAAGGC AAGATCGCCC CGTGGGAAAC ACAGCAGGCC GATACGGCGC GGCAGCCCGC CCCGGAACCT GAGCCGATCG AAATAGCGCC CGACAATGAG GACCCCTTCG CTGGCCCGCC GGTGGCTGCC ACCATCAGCC TCATCCTGGA CGAGGATCCT TTCGCTGAGT CTCCGGCAAC GCCCGGCAAA ACGTGCCTGT TGGACGAAGA GGACCCCTTC GCCTGA
|
Protein sequence | MDRWRAAGRR GVIEAVTGAG KTRVGIASAF EAVRRGFKVL VLVPTAELQI QWLQALARDL PGARRGALGD NRHDSLDTVD VLVAIVHSAA TRQTLRDHKA GLIIADECHR YAAPMFAEAL ETGYTWRLGL SATYERTDGE HLVRLAPFFG DIVFRLWYDR ALAEKIIAPF DVAMVGVELT PSERDEYDAF TETMAKATGP LEIYLHMPRN PFAPFIAAVA HLADSKIDSP GPALARKYMA AMTGRQNLLA ETPTKRMALA ALNRAITAAG PTLVFTQTKK SALSAAQVCA AMGNPATTVM SGMSRDQRAA ALGSFRDGNA KVLTAPRVLD EGIDVPEADL GIMVAASSSQ RQLVQRLGRV IRKKIDGRAG RFVVLYSRHT VEDPAVRGDE YLGAVLPFAR RQETFRIETD VEAVEAFLAY TEHETKVPEV PEVPQAPAVE PPAAPFVLER DDSDWSEAPV LEGSIGDDVR LYLEQASSFD LLTHEEVTDL GMAIEAGLYA EHLLGTARPE GRRAILELES VASAGRDAVL KLVSCNLRLV VSIAKRHTGR GMDFLDLIQE GNAGLYRAVQ KFDYTKGFKF STYATWWIRQ AITRGIADQS RTIRLPVHFH EQVVSVFIAE RNFLQSEGRD GTAEEIAEAC GKTVDEINAV RRHRTPPASL DWEVPNGKGG VELLGETLYD PDEPTAYDAA VLAQLQSAVH GALDRLSERE AGVLAYRFGL ATGEQLTLDQ IGTIYGVTRE RIRQIEAKAF AALREPAAAA QLRDFFDGRS DLDAAGSADT SKATTAPAAK KPTKPAKKKR RKVEAGPVVE MRKAKWDRAL AKLSAFVARE GHALVPTKHV ESGHNLGAWI NTQRNALRLG TISPERLAEI DAIDISWRKG KIAPWETQQA DTARQPAPEP EPIEIAPDNE DPFAGPPVAA TISLILDEDP FAESPATPGK TCLLDEEDPF A
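The gene and protein sequences above can be checked against each other, and against the GC content row, with a short script. This is a minimal standard-library sketch, not the database's own pipeline. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which start codons are allowed), and that the spaces every 10 characters are display formatting only. The helper names `clean`, `gc_content` and `translate` are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); assumed to
# apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence, copied from the record.
print(translate("ATGGACCGGT GGCGTGCCGC CGGCCGCCGC"))  # MDRWRAAGRR
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence checks the whole record in the same way. Rounding `gc_content` of the full sequence gives a value to compare with the GC content row.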