Gene Moth_0725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0725 
Symbol 
ID3831001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp755358 
End bp757268 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content61% 
IMG OID637828656 
Productsigma-54 dependent trancsriptional regulator 
Protein accessionYP_429586 
Protein GI83589577 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR02329] propionate catabolism operon regulatory protein PrpR 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0512499 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.510149 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCACC TGGCCCTGGT GGCTCCTTAT ACTGACCTGG CGGCCCTGGC CAGACAGGTA 
TGCGAAGAAC TGGATGAAGA CGTAGCCGTC GCTACCGGAG ACCTGGCGGA AGGGGTAAGA
GTCGCCCGGG ATCTGGTGAC TAAAGGCGCG GAAGTCATTA TCAGCCGCGG GGGCACGGCC
ACGGCCATTA GCCGGCAGGT AGAGGTGCCG GTAGTGGAGA TCGCTGTCAG CGCCTTTGAC
CTGATTCGGG CCCTGGCCAG GGCCCGGGAT CTCGGCAGCT ATATCGGCGT GGCCGGTTTC
CGCAACGTCA TTTATGGTAC CAAAAGTTTA GAATCTGCCC TGGGAGTTCA CATTGAAGAA
CTAATCATCG AAGCTGAAGA AGAAGCTGCC GGGATAATTG CTGAAGGCCG GTCCATGGGT
CTGGAAGTTA TTGTCGGCGA TGCCGTTTCC GTGCGTTCGG CTAAGGAAAT GGGACTCCAG
GCCATCCTGG TGACCTCCGG TAAAGAAGCC ATCAGCCAGG CCATCCGCGA GGCCCGGGAG
GTAGCCATGG TACGCCGCCG CGAGCGGGCC CGGGCCGAGC AGTTCAAGGC TATCCTGGAT
TTCGCCTATG AGGGCATTGT AGCCACCGAC CAGGAAGGCC GCATTACCCT GGTTAACCCG
GCGGCCGAGA AGATCCTGGG CCTTGCGGCC CACCGAGTAG TAGGACGGCC GGCGCGGGAG
GTATTGCCCG GCGTGCCCCT GAATCAGGTG CTGCAGTCAG GCCAAAAGCG CCTGGGGGAA
CTGCACCGGG CCGGCAATAC CCTGGTGGCG GAGAATATCA TACCGGTCAT TGCCGGCCGG
GAGACCGTCG GCGCCGTGGC TACCTTCCAG GATGTCAGTC ACCTGCAGGC AGTTGAGGCC
AGGGCCCGCC AGGAGCTTTA CCTCAAAGGC CATGTGGCCC GGTATACCTT CGAAGATATC
GTCACCCAGA GTCCGGTCAT GGCCAAGATA ATTGAACGGG CCCGCCAGTT TGCCGCCGCC
GAAGCGACGG TTTTAATCAA CGGGGAAACA GGAACGGGTA AAGAAATGGT AGCCCAGAGT
ATTCATAACG CCAGCCGGCG GCGGAATGGC CCCTTTGTGG CCGTTAACTG CGCCGCCGTA
CCGGAGAATT TGTTGGAAAG CGAGCTCTTC GGCTACGAGG AAGGGGCTTT TACCGGAGCC
CGCAAGGGCG GCAAAAAGGG ACTCTTTGAA CTGGCCCACG GCGGCACCCT TTTCCTGGAC
GAGATCGGCG AGCTGTCTTT AAACTTACAG GCGCGGCTTC TACGGGTGCT ACAACAAAAG
GCCATCATGC GCGTCGGCGG CGACCGGGTG CTGCCCGTGG ACGTGCGCAT CATCGCCGCC
ACCCACCGCA ACCTGAAAGA TGCCATCGCC AGGGATGCCT TCCGCCGTGA CCTGTACTAC
CGCCTCAATG TTTTGCAGAT AAATCTCCCG CCCCTCCGGG AGCGCCCGGA AGACCTGCCT
TTATTAATTA AAGCTTTGGT AGAAAAGATC AGTCGCCGTG CCGGCCGCCT GCCTCCTATC
TTTAGCGAGG AGATTATCGC CAGGATGCAG GCCTATTCCT GGCCGGGTAA CGTACGCGAA
CTGGAGAATA TCCTGGAAAG GCTGGTAGTC CTGCGCAGTG GGGAAGAGGT CGAGGCCGGC
GACCTGGACG AGATCTTGGA GCCGGCGGAA AACCAGCCGC AGCCCGTCCT GCAGCTGGCC
CTGCGGGGCA CCCTGGCGGA AATGGAGGGG GAGATCATCC GCCGGACCCT GGCCCTCACC
GGCAACAATA AGGAAGAGAC CTGCCGGCGC CTGGGCCTCA GTAAGACCAC CCTCTGGCGG
CGGTTAAAAA GCTGGCAGGA TGAAGGACAA CGCAGGGTCA ATGGTAATTA A
 
Protein sequence
MSHLALVAPY TDLAALARQV CEELDEDVAV ATGDLAEGVR VARDLVTKGA EVIISRGGTA 
TAISRQVEVP VVEIAVSAFD LIRALARARD LGSYIGVAGF RNVIYGTKSL ESALGVHIEE
LIIEAEEEAA GIIAEGRSMG LEVIVGDAVS VRSAKEMGLQ AILVTSGKEA ISQAIREARE
VAMVRRRERA RAEQFKAILD FAYEGIVATD QEGRITLVNP AAEKILGLAA HRVVGRPARE
VLPGVPLNQV LQSGQKRLGE LHRAGNTLVA ENIIPVIAGR ETVGAVATFQ DVSHLQAVEA
RARQELYLKG HVARYTFEDI VTQSPVMAKI IERARQFAAA EATVLINGET GTGKEMVAQS
IHNASRRRNG PFVAVNCAAV PENLLESELF GYEEGAFTGA RKGGKKGLFE LAHGGTLFLD
EIGELSLNLQ ARLLRVLQQK AIMRVGGDRV LPVDVRIIAA THRNLKDAIA RDAFRRDLYY
RLNVLQINLP PLRERPEDLP LLIKALVEKI SRRAGRLPPI FSEEIIARMQ AYSWPGNVRE
LENILERLVV LRSGEEVEAG DLDEILEPAE NQPQPVLQLA LRGTLAEMEG EIIRRTLALT
GNNKEETCRR LGLSKTTLWR RLKSWQDEGQ RRVNGN