Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0261 |
Symbol | |
ID | 3833224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 268678 |
End bp | 270063 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637828197 |
Product | sigma-54 (RpoN) |
Protein accession | YP_429139 |
Protein GI | 83589130 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGAAG GTTTAGGCCT TAACCTGGAA CAGACGCAAA AATTGATCAT GACGCCGGAA TTACGCCAGG CCATTGTTAT CCTGCAGCTC TCCACCCTGG AACTGGCCGA CTATATCCAG CAGGAACTCC AGGAAAACCC GGTCCTGGAA GTGCGGGACG ACGGCGAAGT GGCGGCCGGC TGTGAAAGTC GGGAGGAGAA CGATTCCTTC GATATTGACT GGCAGGAATA TTTCCAGGAC GGCAGCGACC TAGGTTATGT CTGCGAGCCC CGGGAAGAAC TGCCGGAATG GTATCCGGAG AATTTCCCTA CCGGCCAGCC CACCTTGCAG GATCACCTCC TGCTCCAGCT GCAGATTGCC GCTACCAACC CGCGCCAGGT GGAAATCGGG AGTTTCTTAA TCGGCAACCT GGACTCTAAC GGTTATTTAA AAATCAGCCT CAAAGAGGCA GCCGCCCTGT TAAAGGTGCC CCAGGCTGAC GTCTGGCAGG TCCTGGACCT TATCCAGCAC TTTGATCCCC CCGGCGTAGG GGCCCGGGAC CTGGTGGAGT GCCTGCTGTT ACAGCTGCGC CAGAGGGGAG AGGCCCCGGC CGGGACGGAA AAAATCATTC GCGGCTTTTT GCCGGATGTG GCTGAAGGGC GCCTGACCCG CATCGCCCGG CGCCTGAATA TGAGCCTGCC GGCCGTCCAG GCAGCGGTAG ACTATATTCG CTCCCTGGAC CCAAAGCCGG GCCGTTCCTT TGGCGGCAGC CTGGATAACC GCTATATTGT CCCTGACGTC ATTATCGAAC GGGTGGGGGA CGAGTATGTG GTCCTGGTAA ATGACCTGGC TATGCCCCGG CTGGGAATCA ACCCGGTTTA TCAGGCCCTC CTACGAAAGG AGGCTGCCTG TGACCCCCGG ACCAGGCAGT TTATCGAACG CAAGCTCAAC GCTGCCGCCT GGCTGATACG CAGCCTGGAG CAGCGGCGCC TGACTGTTTA CCGGGTAGTT AACTGTCTGG TGAAGGAACA GCGGGAGTTT CTGGATAAAG GACTGAAATA TTTGAAACCC CTGACCATGC GCCAGGTAGC CGCTGCCCTT GGTATCCACG AATCCACCGT CAGCCGGGCT ACAGCCAACA AGTATGTCCA GACGCCCCAG GGGGTTTTTG AGCTGAAGTT TTTCTTCGCC AGCGGCGTCG AGAAGCGAGG GGGCGCGGCC GCTGCCGAGA GCATCAAAAA GATGATTTCT GAAGCCATTG GCCGGGAAGA CCCTGCTTCG CCCCTGACTG ACCAGCAACT CCAGGAGATG TTGCAGCGAC AGGGTATCCG CATCTCCCGG CGGACGGTTG CCAAGTACCG GGACGAGCAG GGCATCCCGG CGGCGGCTAA AAGAAAACGA TATTAG
|
Protein sequence | MREGLGLNLE QTQKLIMTPE LRQAIVILQL STLELADYIQ QELQENPVLE VRDDGEVAAG CESREENDSF DIDWQEYFQD GSDLGYVCEP REELPEWYPE NFPTGQPTLQ DHLLLQLQIA ATNPRQVEIG SFLIGNLDSN GYLKISLKEA AALLKVPQAD VWQVLDLIQH FDPPGVGARD LVECLLLQLR QRGEAPAGTE KIIRGFLPDV AEGRLTRIAR RLNMSLPAVQ AAVDYIRSLD PKPGRSFGGS LDNRYIVPDV IIERVGDEYV VLVNDLAMPR LGINPVYQAL LRKEAACDPR TRQFIERKLN AAAWLIRSLE QRRLTVYRVV NCLVKEQREF LDKGLKYLKP LTMRQVAAAL GIHESTVSRA TANKYVQTPQ GVFELKFFFA SGVEKRGGAA AAESIKKMIS EAIGREDPAS PLTDQQLQEM LQRQGIRISR RTVAKYRDEQ GIPAAAKRKR Y
|
| |