Gene Achl_4437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_4437 
Symbol 
ID7280005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011879 
Strand
Start bp377366 
End bp380221 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content66% 
IMG OID643580391 
ProductRNA polymerase, sigma 32 subunit, RpoH 
Protein accessionYP_002478205 
Protein GI219883041 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value0.118841 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCGGT GGCGTGCCGC CGGCCGCCGC GGGGTCATCG AGGCGGTCAC CGGGGCAGGC 
AAGACGCGGG TCGGCATCGC CTCCGCCTTC GAAGCAGTGC GGCGCGGATT CAAGGTCTTG
GTTCTCGTCC CCACCGCAGA GCTGCAGATC CAGTGGCTCC AGGCCCTGGC CAGGGACCTC
CCGGGGGCGC GGCGTGGGGC GTTGGGCGAC AACCGCCATG ACAGCCTGGA CACCGTGGAC
GTCCTCGTCG CGATCGTTCA TTCCGCGGCA ACGCGCCAAA CCCTCCGGGA CCACAAGGCC
GGGCTGATCA TCGCGGACGA ATGCCACCGA TACGCAGCCC CGATGTTCGC AGAAGCCCTG
GAAACCGGCT ACACCTGGCG TCTGGGCCTG TCCGCCACCT ATGAACGCAC CGACGGTGAA
CACCTCGTCC GGCTCGCACC TTTCTTCGGC GACATCGTCT TCAGGCTCTG GTACGACAGG
GCGCTGGCAG AGAAGATCAT CGCCCCTTTC GACGTCGCCA TGGTCGGTGT TGAACTCACC
CCGTCAGAGC GCGACGAATA CGATGCGTTT ACGGAGACGA TGGCCAAAGC CACCGGCCCG
CTTGAGATCT ACCTGCACAT GCCTAGGAAC CCGTTCGCGC CGTTCATTGC CGCAGTCGCG
CACCTGGCAG ACAGCAAAAT CGACTCCCCG GGCCCGGCCC TGGCCCGCAA ATACATGGCC
GCGATGACCG GCCGGCAGAA CCTGCTGGCG GAGACACCGA CCAAGCGCAT GGCCCTGGCA
GCCCTGAACC GTGCAATCAC CGCCGCCGGA CCGACCCTGG TGTTCACCCA GACGAAGAAA
TCCGCTCTGT CTGCTGCGCA GGTCTGCGCT GCGATGGGCA ACCCCGCCAC CACGGTGATG
TCGGGCATGA GCCGCGACCA GCGTGCCGCG GCGCTGGGCA GCTTCCGGGA CGGGAACGCC
AAAGTCCTCA CCGCACCGCG GGTACTGGAC GAGGGCATCG ACGTTCCCGA GGCCGATCTG
GGCATCATGG TCGCCGCCAG CAGCAGCCAG CGCCAACTCG TCCAGCGTCT CGGCCGGGTC
ATCCGGAAGA AGATTGACGG CCGGGCCGGG CGCTTTGTCG TCCTCTACTC CAGGCACACT
GTGGAGGACC CCGCGGTCCG GGGCGACGAG TACCTTGGCG CCGTCCTGCC CTTTGCCCGC
AGGCAGGAAA CGTTCCGGAT TGAGACCGAT GTGGAGGCTG TTGAGGCGTT CCTCGCCTAC
ACAGAACATG AAACTAAAGT GCCGGAGGTC CCTGAGGTGC CCCAAGCTCC TGCGGTTGAA
CCGCCTGCCG CCCCGTTCGT CCTCGAACGC GACGACAGTG ACTGGTCCGA AGCGCCCGTC
CTTGAAGGCA GCATCGGCGA CGACGTGCGG CTCTATCTCG AACAGGCCAG CTCCTTCGAC
TTGTTGACCC ATGAGGAGGT CACAGATCTG GGGATGGCAA TCGAAGCCGG ACTGTACGCT
GAGCACCTGC TGGGCACGGC ACGTCCCGAA GGGCGCCGGG CGATCCTCGA GTTGGAGTCC
GTCGCGTCCG CGGGCCGCGA TGCGGTGCTC AAGCTCGTGA GCTGCAACCT GCGCCTGGTC
GTGTCCATCG CCAAGCGCCA CACCGGCCGC GGCATGGATT TCCTCGACCT CATCCAGGAG
GGCAACGCCG GGCTCTACCG CGCCGTCCAG AAGTTCGACT ACACGAAGGG CTTCAAGTTC
TCCACATACG CAACCTGGTG GATCCGCCAG GCCATCACCC GCGGAATCGC CGACCAGTCA
CGAACAATCC GGCTCCCCGT CCACTTCCAC GAGCAGGTCG TCAGCGTCTT CATCGCCGAA
CGGAACTTCC TCCAAAGCGA AGGACGCGAC GGCACTGCGG AGGAGATCGC TGAGGCATGC
GGCAAGACCG TCGACGAGAT CAACGCTGTC CGCCGCCACC GGACCCCGCC GGCCTCGCTG
GACTGGGAAG TGCCCAACGG TAAGGGCGGC GTCGAGCTGC TGGGGGAGAC CCTCTACGAC
CCCGACGAAC CCACCGCGTA CGACGCCGCC GTGCTGGCAC AGCTGCAGTC CGCGGTGCAC
GGAGCCCTGG ACCGGCTCTC GGAACGGGAG GCCGGTGTCC TCGCCTACAG GTTCGGACTG
GCCACCGGGG AACAACTGAC GCTTGATCAG ATCGGCACGA TCTACGGGGT GACCCGGGAA
CGCATCCGTC AGATCGAAGC GAAAGCGTTC GCTGCACTCC GTGAGCCAGC CGCAGCAGCA
CAGCTCAGGG ACTTCTTCGA CGGACGCAGC GATCTTGACG CCGCTGGCAG CGCAGATACA
TCGAAAGCTA CGACCGCACC GGCCGCGAAG AAGCCGACCA AGCCCGCCAA GAAAAAGAGA
AGGAAAGTGG AAGCGGGGCC TGTCGTGGAA ATGCGGAAAG CCAAATGGGA CCGTGCGCTG
GCGAAACTCT CGGCGTTCGT AGCCCGGGAA GGCCACGCCC TGGTCCCCAC CAAACACGTC
GAATCCGGCC ACAACCTGGG GGCTTGGATC AACACCCAGC GCAACGCGCT CCGCCTGGGC
ACCATCAGCC CCGAACGGCT TGCTGAGATC GACGCGATCG ACATTTCGTG GCGCAAAGGC
AAGATCGCCC CGTGGGAAAC ACAGCAGGCC GATACGGCGC GGCAGCCCGC CCCGGAACCT
GAGCCGATCG AAATAGCGCC CGACAATGAG GACCCCTTCG CTGGCCCGCC GGTGGCTGCC
ACCATCAGCC TCATCCTGGA CGAGGATCCT TTCGCTGAGT CTCCGGCAAC GCCCGGCAAA
ACGTGCCTGT TGGACGAAGA GGACCCCTTC GCCTGA
 
Protein sequence
MDRWRAAGRR GVIEAVTGAG KTRVGIASAF EAVRRGFKVL VLVPTAELQI QWLQALARDL 
PGARRGALGD NRHDSLDTVD VLVAIVHSAA TRQTLRDHKA GLIIADECHR YAAPMFAEAL
ETGYTWRLGL SATYERTDGE HLVRLAPFFG DIVFRLWYDR ALAEKIIAPF DVAMVGVELT
PSERDEYDAF TETMAKATGP LEIYLHMPRN PFAPFIAAVA HLADSKIDSP GPALARKYMA
AMTGRQNLLA ETPTKRMALA ALNRAITAAG PTLVFTQTKK SALSAAQVCA AMGNPATTVM
SGMSRDQRAA ALGSFRDGNA KVLTAPRVLD EGIDVPEADL GIMVAASSSQ RQLVQRLGRV
IRKKIDGRAG RFVVLYSRHT VEDPAVRGDE YLGAVLPFAR RQETFRIETD VEAVEAFLAY
TEHETKVPEV PEVPQAPAVE PPAAPFVLER DDSDWSEAPV LEGSIGDDVR LYLEQASSFD
LLTHEEVTDL GMAIEAGLYA EHLLGTARPE GRRAILELES VASAGRDAVL KLVSCNLRLV
VSIAKRHTGR GMDFLDLIQE GNAGLYRAVQ KFDYTKGFKF STYATWWIRQ AITRGIADQS
RTIRLPVHFH EQVVSVFIAE RNFLQSEGRD GTAEEIAEAC GKTVDEINAV RRHRTPPASL
DWEVPNGKGG VELLGETLYD PDEPTAYDAA VLAQLQSAVH GALDRLSERE AGVLAYRFGL
ATGEQLTLDQ IGTIYGVTRE RIRQIEAKAF AALREPAAAA QLRDFFDGRS DLDAAGSADT
SKATTAPAAK KPTKPAKKKR RKVEAGPVVE MRKAKWDRAL AKLSAFVARE GHALVPTKHV
ESGHNLGAWI NTQRNALRLG TISPERLAEI DAIDISWRKG KIAPWETQQA DTARQPAPEP
EPIEIAPDNE DPFAGPPVAA TISLILDEDP FAESPATPGK TCLLDEEDPF A