Gene Moth_0261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0261 
Symbol 
ID3833224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp268678 
End bp270063 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content59% 
IMG OID637828197 
Productsigma-54 (RpoN) 
Protein accessionYP_429139 
Protein GI83589130 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGAAG GTTTAGGCCT TAACCTGGAA CAGACGCAAA AATTGATCAT GACGCCGGAA 
TTACGCCAGG CCATTGTTAT CCTGCAGCTC TCCACCCTGG AACTGGCCGA CTATATCCAG
CAGGAACTCC AGGAAAACCC GGTCCTGGAA GTGCGGGACG ACGGCGAAGT GGCGGCCGGC
TGTGAAAGTC GGGAGGAGAA CGATTCCTTC GATATTGACT GGCAGGAATA TTTCCAGGAC
GGCAGCGACC TAGGTTATGT CTGCGAGCCC CGGGAAGAAC TGCCGGAATG GTATCCGGAG
AATTTCCCTA CCGGCCAGCC CACCTTGCAG GATCACCTCC TGCTCCAGCT GCAGATTGCC
GCTACCAACC CGCGCCAGGT GGAAATCGGG AGTTTCTTAA TCGGCAACCT GGACTCTAAC
GGTTATTTAA AAATCAGCCT CAAAGAGGCA GCCGCCCTGT TAAAGGTGCC CCAGGCTGAC
GTCTGGCAGG TCCTGGACCT TATCCAGCAC TTTGATCCCC CCGGCGTAGG GGCCCGGGAC
CTGGTGGAGT GCCTGCTGTT ACAGCTGCGC CAGAGGGGAG AGGCCCCGGC CGGGACGGAA
AAAATCATTC GCGGCTTTTT GCCGGATGTG GCTGAAGGGC GCCTGACCCG CATCGCCCGG
CGCCTGAATA TGAGCCTGCC GGCCGTCCAG GCAGCGGTAG ACTATATTCG CTCCCTGGAC
CCAAAGCCGG GCCGTTCCTT TGGCGGCAGC CTGGATAACC GCTATATTGT CCCTGACGTC
ATTATCGAAC GGGTGGGGGA CGAGTATGTG GTCCTGGTAA ATGACCTGGC TATGCCCCGG
CTGGGAATCA ACCCGGTTTA TCAGGCCCTC CTACGAAAGG AGGCTGCCTG TGACCCCCGG
ACCAGGCAGT TTATCGAACG CAAGCTCAAC GCTGCCGCCT GGCTGATACG CAGCCTGGAG
CAGCGGCGCC TGACTGTTTA CCGGGTAGTT AACTGTCTGG TGAAGGAACA GCGGGAGTTT
CTGGATAAAG GACTGAAATA TTTGAAACCC CTGACCATGC GCCAGGTAGC CGCTGCCCTT
GGTATCCACG AATCCACCGT CAGCCGGGCT ACAGCCAACA AGTATGTCCA GACGCCCCAG
GGGGTTTTTG AGCTGAAGTT TTTCTTCGCC AGCGGCGTCG AGAAGCGAGG GGGCGCGGCC
GCTGCCGAGA GCATCAAAAA GATGATTTCT GAAGCCATTG GCCGGGAAGA CCCTGCTTCG
CCCCTGACTG ACCAGCAACT CCAGGAGATG TTGCAGCGAC AGGGTATCCG CATCTCCCGG
CGGACGGTTG CCAAGTACCG GGACGAGCAG GGCATCCCGG CGGCGGCTAA AAGAAAACGA
TATTAG
 
Protein sequence
MREGLGLNLE QTQKLIMTPE LRQAIVILQL STLELADYIQ QELQENPVLE VRDDGEVAAG 
CESREENDSF DIDWQEYFQD GSDLGYVCEP REELPEWYPE NFPTGQPTLQ DHLLLQLQIA
ATNPRQVEIG SFLIGNLDSN GYLKISLKEA AALLKVPQAD VWQVLDLIQH FDPPGVGARD
LVECLLLQLR QRGEAPAGTE KIIRGFLPDV AEGRLTRIAR RLNMSLPAVQ AAVDYIRSLD
PKPGRSFGGS LDNRYIVPDV IIERVGDEYV VLVNDLAMPR LGINPVYQAL LRKEAACDPR
TRQFIERKLN AAAWLIRSLE QRRLTVYRVV NCLVKEQREF LDKGLKYLKP LTMRQVAAAL
GIHESTVSRA TANKYVQTPQ GVFELKFFFA SGVEKRGGAA AAESIKKMIS EAIGREDPAS
PLTDQQLQEM LQRQGIRISR RTVAKYRDEQ GIPAAAKRKR Y