Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2101 |
Symbol | |
ID | 3832467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2193384 |
End bp | 2194565 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637830026 |
Product | CdaR family transcriptional regulator |
Protein accession | YP_430936 |
Protein GI | 83590927 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3835] Sugar diacid utilization regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAACC AGGAATTAAT CCAGCCCCTG CTGGAACGGG CCGCAGCCCT CTGGCAAACA GCCATTGACG TCGTTGACCC TGGGGGTCAG GTAGTAGCCA GCTCGGAACC TTCACGGTTG CATTTTTACC ATCCGGAGGT GCTGGGACTT TCACTTACGG AAGATGGAGT TCGCGTTCAA GGTGAAAGTT ACTATCTTCC CCTGGTAATA AATGACCGCG TGGCCGGCTG GCTGGTATGG CAGGGGGGTG TTAAGGGCGA AGTTGCCACC CTGTTGCAGG CCGCTCTGGA GGAAATCTTG GCGGCGGCCA GGCGGCGGGA TCAGCAGTAT TTCACCGCCC GGGAGGAAGA AGCCCTGGTT GCCGATCTTC TGGCCCGGGA GGCGGCCTCC CGGGTTGAGG AATTAAAGGT AAGGGCGCTG AATTCGGGTT ATGACTTGAA CCTGCCCCGG GCGGTTATAG TATTACAATT GGAACCCAGG GAAAACCGCT ACTTCAACAT TAATTTGCAC CTGGGGTATG ATGTTACAGA GGAGCAGTTA AAGGAGGAAA TCCTGGAGCA AATAAAGGGC GATCTTTACC TGACCGCCCA GGATCTAGTG GCTTATTATG GCAGAGAGAT GCTGGTCATT TTTAAGGCTT TTTTAGAAGT AGAGAATATG GGCCGCCTCT ACCAGGCCCT GGAGGTTATC TGTCGGCACC TGTATGAGTT GATCAAAGAT AACCGTCTCT TTGCTGTCCG GGTGGCCTAC GGGAGCATTG TCACGGAAAT CGCTGGTTTA AGGCAGTCCT ATAAGGAGGC AGCCGAATTG ATTGCCCTGG GCCATTCCTG CGGTCATAAT AGCGGCTATA TTGATTTCGA AGACATTCTT TTTGAGGCCC TGGTGTGTTC CCTGCCGGGG CGGTTGAGGG GTAGATATCT GGAACCCCTT TATCAGAAAA TTGTAGCGGC TGGCGAAGAA GGTATCCAGT TGCTGGAGAC GGTGGAAGCC TGCATCGACA ACAATATGAA TATCAAGCAG ACGGCTGAGC AACTATACCT GCACCGCAAC ACTGTAACCA ATCGCCTGGA GAGAATCAAG CTCCTGACGG GCCTGGACCC GGGAACGGGT TTTCGCGCCC TTTTCTGGCT GAAAATGCTG GCCGTCTACA GGAGATTGGT GCAGACACGG GATGAGGCGT AA
|
Protein sequence | MLNQELIQPL LERAAALWQT AIDVVDPGGQ VVASSEPSRL HFYHPEVLGL SLTEDGVRVQ GESYYLPLVI NDRVAGWLVW QGGVKGEVAT LLQAALEEIL AAARRRDQQY FTAREEEALV ADLLAREAAS RVEELKVRAL NSGYDLNLPR AVIVLQLEPR ENRYFNINLH LGYDVTEEQL KEEILEQIKG DLYLTAQDLV AYYGREMLVI FKAFLEVENM GRLYQALEVI CRHLYELIKD NRLFAVRVAY GSIVTEIAGL RQSYKEAAEL IALGHSCGHN SGYIDFEDIL FEALVCSLPG RLRGRYLEPL YQKIVAAGEE GIQLLETVEA CIDNNMNIKQ TAEQLYLHRN TVTNRLERIK LLTGLDPGTG FRALFWLKML AVYRRLVQTR DEA
|
| |