Gene Information |
Locus tag | Moth_2223 |
Symbol | |
ID | 3830830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 2317613 |
End bp | 2318731 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637830143 |
Product | PadR family transcriptional regulator |
Protein accession | YP_431053 |
Protein GI | 83591044 |
COG category | [K] Transcription |
COG ID | [COG1695] Predicted transcriptional regulators |
TIGRFAM ID | |
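
The coordinates and lengths reported above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Both are assumptions, though they are consistent with the values in this record. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2317613, 2318731)
print(length)                  # 1119
print(protein_length(length))  # 372
```

Both results match the Gene Length (1119 bp) and Protein Length (372 aa) fields of the record.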
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0957891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000188934 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATTATA CTGATCGCGA GTACTGGAAC GGTATCATCA AGATGTGCCT CTCGAAGTTT TTCATCCTGC GGGTTCTCTA CACCCAGCCC ATGCACGGCT ACGAGATAGC CCGCACCGTA GCCCAGGTCA CCAGGGGGTG CTGCACGCCC ACTGAAGGGA CGATTTACCC GGTACTCAGG GAGTTCGAGG AGGGCGGCTA TGTCACTTCT TCCCTTGAGA TAGCCGGGGG CCGGGAGCGC AAAGTCTATA CCCTTACGCC AAAAGGGCAG GAGGCCTTCC GCGTGGCTGT AGAGGCCTGG AAGGAGGTTA CCGGCTACAT TTTAGAGGCG GTAAAGTTAG AGGATTATGC AAGCACAAGG AGGCGCTGTA TTATGGCAGG AGCGAAAGGT ATACTTTCCA ACTTTTCTGG CCTGCGGGAA GGCAACTGCT ATTCCGTTCG CAACGAGGAA ATCCCCATAG GACAATGCCG TACCGCCGCT AGCAGTTGTA GCTGCGGGAG TTCTTTACCG GGCAGTTTCG ACAAAGAAGA GGGGGGAGAA ATAGGGATAT CGCCCGATAC CCAGGCGTCC CAGCGTTCCC TCGACATCGA GTTTCTCTAC CTGGACCTTG ATTCCTGCAC CAGGTGCGGT GGAACGGCCC GCAACCTGGA AGAGGCCCTC AACGAAGTGG CAGGGGTCTT ACAGGCTACC GGGATCCAGG TCAACCTGCA CAAGATCCAC GTCCAGTCGG AAGAACAGGC CCTGGCCCTG GGATTTGTCA GTTCGCCTAC CATCCGCATC AACGGGCGGG ACATCCAGCT CGACGTCAGG GAAAGCCTCT GCGAATCCTG CGGCGAGATT TGCGGCGAGG ATGTGGACTG CCGCGTCTGG GTGTACCAGG GGAAGGAATA TACCGAAGCT CCCAAGGGTA TGATTATTGA GGCCATCCTG AAGCACGTTT ACGGGGGTGG AAATGAAAGC CCGGCAGAAA GGGAAACACT GCAGGAGCTG CCCGATAATT TAAAGCGCTT CTTTGCGGCC AGGCGTAAGA AAGAAGAAGG TGGAGCCGGC CGCAAGCCCG CGCCGGAACC CCAAGACAGT TGCTGCGGCA TTTCTCCCTT ATCTAAATGC TGCAAGTAA
|
Protein sequence | MNYTDREYWN GIIKMCLSKF FILRVLYTQP MHGYEIARTV AQVTRGCCTP TEGTIYPVLR EFEEGGYVTS SLEIAGGRER KVYTLTPKGQ EAFRVAVEAW KEVTGYILEA VKLEDYASTR RRCIMAGAKG ILSNFSGLRE GNCYSVRNEE IPIGQCRTAA SSCSCGSSLP GSFDKEEGGE IGISPDTQAS QRSLDIEFLY LDLDSCTRCG GTARNLEEAL NEVAGVLQAT GIQVNLHKIH VQSEEQALAL GFVSSPTIRI NGRDIQLDVR ESLCESCGEI CGEDVDCRVW VYQGKEYTEA PKGMIIEAIL KHVYGGGNES PAERETLQEL PDNLKRFFAA RRKKEEGGAG RKPAPEPQDS CCGISPLSKC CK
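
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below is a minimal illustration: it strips the whitespace, computes GC content, and translates DNA using the amino acid assignments of translation table 11 (which are the same as the standard code; table 11 differs only in its permitted start codons). For brevity it runs on the first 30 bases of the gene sequence only; the full sequence is not checked here. All function and variable names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order; the amino acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    return 100 * sum(base in "GC" for base in seq) / len(seq)


def translate(dna: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
fragment = clean("ATGAATTATA CTGATCGCGA GTACTGGAAC")
print(translate(fragment))   # MNYTDREYWN
print(gc_percent(fragment))  # 40.0
```

The translated fragment matches the first block of the protein sequence (MNYTDREYWN). The 40% GC figure applies to this 30-base fragment only, not to the whole gene.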