Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1910 |
Symbol | |
ID | 3831183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1978082 |
End bp | 1979770 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829843 |
Product | sigma-54 dependent trancsriptional regulator |
Protein accession | YP_430753 |
Protein GI | 83590744 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000822204 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGAGC ATCTGGACTT CTGGGGGGCA ATTCTCGCCG CCCGGCGGCA GGGGATGATT TATGTTGACG CCGGGGGCCT TATTAAGGTG GCCAATTCCC TGGCGACCGC CATCCTGGGA TTGGACGGCT GTGATCTCAG GGGCCGCCGC CTGACGGAAG TCTTTCCTGA GACCCGCCTG GCCGAGGTCC TGACCCGTAA GGGCCCTCTC CTTGGGACCA GCTTTACCGC CGGCAGTAGA ATCTATCGGG TCGATTATTT CCCCGACGAT GGGGGAGGTG TCGTCCTTCT TTTCAGTGAC GAGCAGCCTG AGACGAATTC CCTCAAAGCC CTTAACGAAC TCTACGCCTC CCTCCTTGAC GATTTTCCTC TATACCTGGT GGTCGTCGAC GGCCGGGGGG TGGTGGTCGC CATTAACCAT TTATATGCCA GCCTCCTGGG TGCAAAGGCC GGTGCCCTGA TCGGCCGGGA AGTAAAGGAT ATTTTGCCCT TTACCTCCTT GCCGGAGGTC CTATCTTCGG GGATGGCGGT AGCCGGCCGG GAAGTGGCCT TCCGGGGCAA AAAATTGCTA TTATCGGAAG TGCCGGTTAA AGATGGGGCC GGGCATCTCC TGGGAGGAGT CGGTAAGGCC CTGGATCTCG AGGACCTGGC AGCCGCCGGT TTCAGCGACC TCGCCCGCCG CCTGCAGGTC CTGGAGGGCA AGGTTCTTTT CTACCAGCAG GAACTCCAGT CAATCCAGGG CGACCAGGAC GCCTGGCAGG GCATCCTGGG GGTGAGCCCG GCTATGGTCC GGCTAAAAAA AATGGCGGCC CGGGTAGCCC GCGGCGAGGC TAACGTTTTG ATTACCGGCG AGAGCGGTAC CGGGAAGGAA CTGCTGGCCC AGGCCATCCA CCGGGCCAGT CCTCGCCGGA ATGAACCATT GATCAAGATC AACTGTGCCG CCATCCCGGA AAACCTGCTG GAGGCCGAAC TCTTCGGCTA CGAAGAGGGG GCCTTTACCG GCGCCGCCCG GGGCGGCAAA CCGGGCAAGT TTGAACTGGC TGACGGCGGT ACCATCTTCC TGGATGAAGT CGGCGACCTG CCCCTCAGTA TGCAGCCCAA GCTCCTGCGG GTCCTGCAGG AAAGGGCCTT TGAACGGGTC GGGGGACGCC GGACCATCAA GGTTGATGTA AGGATTATTG CTGCCACCAA CCAGGATCTC CTGGCCCTGG AGGCGGCCGG CAAATTTCGC CTTGACCTTT ACTACCGCCT GGCGGTGGTT ACCCTGACGG TACCACCCTT AAGGGAACGG CCGGGGGACC TCGAAATACT GGTGGCGGCC ATTATCCGGC GTTTAAATGC ACGCTACCAC CAGACGGTCC GGGGCCTGGA TCCGGAGGTA CAGCGGCTTT TTTCCCGCTA TAGCTGGCCG GGGAATGTCC GGGAGCTGGA AAATGTCCTG GAGCATGCCT TTAATTTCCT GGATCCGGGG GAAGAAATAA TTACCTTGAA GCATCTCCCG GCCACCATGC AAAGGACGAT TGGTGGCGGG GGTGGCCTGA AGCTGAAGAG TGCTGTCGCT ACGGCGGAAA GGGAAGTTAT CTTAAAGGCC CTCGCGGCCG CTGGTGGAAA CAAACAGGAA GCGGCCCGTC TCCTGGGTAT TCACCCCTCC GGCCTTTATC AAAAGCTCAA GCGTTATGAT ATCAATTAG
|
Protein sequence | MKEHLDFWGA ILAARRQGMI YVDAGGLIKV ANSLATAILG LDGCDLRGRR LTEVFPETRL AEVLTRKGPL LGTSFTAGSR IYRVDYFPDD GGGVVLLFSD EQPETNSLKA LNELYASLLD DFPLYLVVVD GRGVVVAINH LYASLLGAKA GALIGREVKD ILPFTSLPEV LSSGMAVAGR EVAFRGKKLL LSEVPVKDGA GHLLGGVGKA LDLEDLAAAG FSDLARRLQV LEGKVLFYQQ ELQSIQGDQD AWQGILGVSP AMVRLKKMAA RVARGEANVL ITGESGTGKE LLAQAIHRAS PRRNEPLIKI NCAAIPENLL EAELFGYEEG AFTGAARGGK PGKFELADGG TIFLDEVGDL PLSMQPKLLR VLQERAFERV GGRRTIKVDV RIIAATNQDL LALEAAGKFR LDLYYRLAVV TLTVPPLRER PGDLEILVAA IIRRLNARYH QTVRGLDPEV QRLFSRYSWP GNVRELENVL EHAFNFLDPG EEIITLKHLP ATMQRTIGGG GGLKLKSAVA TAEREVILKA LAAAGGNKQE AARLLGIHPS GLYQKLKRYD IN
|
| |