Gene Information |
Locus tag | Moth_0725 |
Symbol | |
ID | 3831001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 755358 |
End bp | 757268 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637828656 |
Product | sigma-54 dependent transcriptional regulator |
Protein accession | YP_429586 |
Protein GI | 83589577 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR02329] propionate catabolism operon regulatory protein PrpR |
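The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced here for illustration; they are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumption: the CDS length is a multiple of three and, when
    includes_stop is True, the last codon is a stop codon that does
    not contribute a residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above (Moth_0725).
length = gene_length_bp(755358, 757268)
print(length)                     # 1911
print(protein_length_aa(length))  # 636
```

Both results agree with the Gene Length (1911 bp) and Protein Length (636 aa) fields listed in the record.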
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0512499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.510149 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCACC TGGCCCTGGT GGCTCCTTAT ACTGACCTGG CGGCCCTGGC CAGACAGGTA TGCGAAGAAC TGGATGAAGA CGTAGCCGTC GCTACCGGAG ACCTGGCGGA AGGGGTAAGA GTCGCCCGGG ATCTGGTGAC TAAAGGCGCG GAAGTCATTA TCAGCCGCGG GGGCACGGCC ACGGCCATTA GCCGGCAGGT AGAGGTGCCG GTAGTGGAGA TCGCTGTCAG CGCCTTTGAC CTGATTCGGG CCCTGGCCAG GGCCCGGGAT CTCGGCAGCT ATATCGGCGT GGCCGGTTTC CGCAACGTCA TTTATGGTAC CAAAAGTTTA GAATCTGCCC TGGGAGTTCA CATTGAAGAA CTAATCATCG AAGCTGAAGA AGAAGCTGCC GGGATAATTG CTGAAGGCCG GTCCATGGGT CTGGAAGTTA TTGTCGGCGA TGCCGTTTCC GTGCGTTCGG CTAAGGAAAT GGGACTCCAG GCCATCCTGG TGACCTCCGG TAAAGAAGCC ATCAGCCAGG CCATCCGCGA GGCCCGGGAG GTAGCCATGG TACGCCGCCG CGAGCGGGCC CGGGCCGAGC AGTTCAAGGC TATCCTGGAT TTCGCCTATG AGGGCATTGT AGCCACCGAC CAGGAAGGCC GCATTACCCT GGTTAACCCG GCGGCCGAGA AGATCCTGGG CCTTGCGGCC CACCGAGTAG TAGGACGGCC GGCGCGGGAG GTATTGCCCG GCGTGCCCCT GAATCAGGTG CTGCAGTCAG GCCAAAAGCG CCTGGGGGAA CTGCACCGGG CCGGCAATAC CCTGGTGGCG GAGAATATCA TACCGGTCAT TGCCGGCCGG GAGACCGTCG GCGCCGTGGC TACCTTCCAG GATGTCAGTC ACCTGCAGGC AGTTGAGGCC AGGGCCCGCC AGGAGCTTTA CCTCAAAGGC CATGTGGCCC GGTATACCTT CGAAGATATC GTCACCCAGA GTCCGGTCAT GGCCAAGATA ATTGAACGGG CCCGCCAGTT TGCCGCCGCC GAAGCGACGG TTTTAATCAA CGGGGAAACA GGAACGGGTA AAGAAATGGT AGCCCAGAGT ATTCATAACG CCAGCCGGCG GCGGAATGGC CCCTTTGTGG CCGTTAACTG CGCCGCCGTA CCGGAGAATT TGTTGGAAAG CGAGCTCTTC GGCTACGAGG AAGGGGCTTT TACCGGAGCC CGCAAGGGCG GCAAAAAGGG ACTCTTTGAA CTGGCCCACG GCGGCACCCT TTTCCTGGAC GAGATCGGCG AGCTGTCTTT AAACTTACAG GCGCGGCTTC TACGGGTGCT ACAACAAAAG GCCATCATGC GCGTCGGCGG CGACCGGGTG CTGCCCGTGG ACGTGCGCAT CATCGCCGCC ACCCACCGCA ACCTGAAAGA TGCCATCGCC AGGGATGCCT TCCGCCGTGA CCTGTACTAC CGCCTCAATG TTTTGCAGAT AAATCTCCCG CCCCTCCGGG AGCGCCCGGA AGACCTGCCT TTATTAATTA AAGCTTTGGT AGAAAAGATC AGTCGCCGTG CCGGCCGCCT GCCTCCTATC TTTAGCGAGG AGATTATCGC CAGGATGCAG GCCTATTCCT GGCCGGGTAA CGTACGCGAA CTGGAGAATA TCCTGGAAAG GCTGGTAGTC CTGCGCAGTG GGGAAGAGGT CGAGGCCGGC GACCTGGACG AGATCTTGGA GCCGGCGGAA AACCAGCCGC AGCCCGTCCT GCAGCTGGCC CTGCGGGGCA CCCTGGCGGA AATGGAGGGG GAGATCATCC GCCGGACCCT GGCCCTCACC 
GGCAACAATA AGGAAGAGAC CTGCCGGCGC CTGGGCCTCA GTAAGACCAC CCTCTGGCGG CGGTTAAAAA GCTGGCAGGA TGAAGGACAA CGCAGGGTCA ATGGTAATTA A
|
Protein sequence | MSHLALVAPY TDLAALARQV CEELDEDVAV ATGDLAEGVR VARDLVTKGA EVIISRGGTA TAISRQVEVP VVEIAVSAFD LIRALARARD LGSYIGVAGF RNVIYGTKSL ESALGVHIEE LIIEAEEEAA GIIAEGRSMG LEVIVGDAVS VRSAKEMGLQ AILVTSGKEA ISQAIREARE VAMVRRRERA RAEQFKAILD FAYEGIVATD QEGRITLVNP AAEKILGLAA HRVVGRPARE VLPGVPLNQV LQSGQKRLGE LHRAGNTLVA ENIIPVIAGR ETVGAVATFQ DVSHLQAVEA RARQELYLKG HVARYTFEDI VTQSPVMAKI IERARQFAAA EATVLINGET GTGKEMVAQS IHNASRRRNG PFVAVNCAAV PENLLESELF GYEEGAFTGA RKGGKKGLFE LAHGGTLFLD EIGELSLNLQ ARLLRVLQQK AIMRVGGDRV LPVDVRIIAA THRNLKDAIA RDAFRRDLYY RLNVLQINLP PLRERPEDLP LLIKALVEKI SRRAGRLPPI FSEEIIARMQ AYSWPGNVRE LENILERLVV LRSGEEVEAG DLDEILEPAE NQPQPVLQLA LRGTLAEMEG EIIRRTLALT GNNKEETCRR LGLSKTTLWR RLKSWQDEGQ RRVNGN
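The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The sketch below shows one way to strip that formatting and compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGTCCCACC TGGCCCTGGT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 65.0
```

Applied to the complete gene sequence, the same function should give a value close to the 61% reported in the GC content field, depending on how the database rounds.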