Gene Information |
Locus tag | Moth_2113 |
Symbol | |
ID | 3833264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2207418 |
End bp | 2209064 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637830038 |
Product | two component AraC family transcriptional regulator |
Protein accession | YP_430948 |
Protein GI | 83590939 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain |
TIGRFAM ID | |
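
The length fields in this record agree with each other: the coordinates are inclusive, so End bp minus Start bp plus one gives the gene length, and 548 codons plus one stop codon give the same figure. A minimal Python sketch of that check follows; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def cds_length_from_protein(protein_aa: int, stop_codons: int = 1) -> int:
    """Nucleotides needed to encode a protein plus its stop codon(s)."""
    return 3 * (protein_aa + stop_codons)


# Values taken from the record above (Moth_2113).
print(gene_length(2207418, 2209064))   # 1647
print(cds_length_from_protein(548))    # 1647
```

Both calls return 1647, matching the Gene Length field.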
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCCACGG ATATCAAGAT CCTGCTTGTC GACGACGAAC CCCTGGAGCG CCAGGCCATC CGCTTTTTGC TGGCCAGGGA GCGCCCTCAT TACCAGATTG CCGGGGAAGC GGGTAATGGA GGCGAGGCGG TCAAACTGGC TGCCAGGTTG CGACCGGACA TCGTCTTCCT GGATATCAAG ATGCCCGTCA TGGATGGGTT GACCGCCGGT CGGGAGATTC GGGCAATCCT ACCCGAGGCC AGGTTGATTT TTGTTACTGC CTATGGCGAA TTCGATTATG CCCGGGAAGC TGTTGCCCTG GGGGCATCCA AATATTTACT AAAGCCGGTG GCGGCCGAAG AAATGCTTCC CCTCCTGGAT GAACTGGCTG CCGGCGTCGC CGCCGCTCGC CGGCGCCAGC AGGAGACAGC AAGGTTGCGG GCCGCTCTGG AGGAAGCGAA GCCCTTTATT CGCCTGGGCT TTATCATGGA CCTGATCAAC GGTAATATCA CCGACGCCGA AGCCGTCAGC CGGGCGCGCT TCCTGGGGAT CGCCACCTTG CCCCGTCTGG CCATGTTGGT GGATATTGAT AATTTTGCCG CTCTGGCCCG GGAGGGGACA GAGGTAGAAC GGCAGATTTT AAAGCAACAG GTCAAGGAAA GTCTGGAAAG GGCGACTGTG TCCTGGCCCG GGGCCCTGGT CGCCCCGGTA ACCAGGGATG AGTTCGCCAT CCTCCTGCCC CTGGACCACC TGGCCCCGGG TGCAGATAGC CACCAGGCCG CCATCGAGCT GGGAGAAGGC ATTTGCCGGC AGGTACGCCG GGATACCAGG GCTACGGTAA CGGTGGGTAT CGGCCGGCCG GTGGCAAAGG TTGCTGAACT GGCCCGTTCC TACGCCGAGG CGGTGGCGGC GGCAGAATTC CGGCTATTTT ACGGCGGGGA CCAGGTTATC CATGCTGACG ACGTTATTGC CCGGCCCAGT GCCGGCCAGT TCCTGCCGGC TCCCGAGGAG CAGGAATTAA CCCAGGCCAT CCGTATGGGT GATAGGCAGG CTGCCTACCG CCAGGCTAAA AATATTTTGA TGCAACTCCT CCTGGAGCAG GAAAAACGGC CGGCTATATT GAAGATGAAA CTCCTGGAAC TGAATACCCT GGCGGCCAGG GCCGCCCTGG AGGGCGGTGC CGACCCGGAG GCGGTTTCCG ACCTTGCCCT GGCCAGCAGC ACTGAGTTTC TTACCCTGGA CAACCTGGCT GATATGCGGG AGCGCATCCT GGAACGTTTA ATGGCCCTGG TGGCCCAGGT GGCGGAAACC CGGGAGCAGC GCAATTCCTC CCTTATTGAC CGGGCCAGCA AGTATATTGA GGCCAATTTC AGCCAGGATC TCACCCTGGA AGAGGTCGCC CGGCAGGTAT ATCTTAGCCC CTGTTATTTC AGCAAGCTGT TCAAGCAGTT CAAGGGCTTG AATTTCATAG ATTATCTAAC AAAGGTACGC CTCAAGGCGG CCAGGGAGTT ATTGCTGAAC ACCAAGCTCC CGGTAGCGGA AATCGCCACT CGCGTTGGTT ATCGTGATGC TCGCTATTTT GGGCAGGTGT TTAAAAAGCA GGAAGGCTAC ACGCCCAGTG TCTTCCGGAA AATAGGGGGT GCCCACTTTG GCAAGAGTAC TAGTTGA
|
Protein sequence | MATDIKILLV DDEPLERQAI RFLLARERPH YQIAGEAGNG GEAVKLAARL RPDIVFLDIK MPVMDGLTAG REIRAILPEA RLIFVTAYGE FDYAREAVAL GASKYLLKPV AAEEMLPLLD ELAAGVAAAR RRQQETARLR AALEEAKPFI RLGFIMDLIN GNITDAEAVS RARFLGIATL PRLAMLVDID NFAALAREGT EVERQILKQQ VKESLERATV SWPGALVAPV TRDEFAILLP LDHLAPGADS HQAAIELGEG ICRQVRRDTR ATVTVGIGRP VAKVAELARS YAEAVAAAEF RLFYGGDQVI HADDVIARPS AGQFLPAPEE QELTQAIRMG DRQAAYRQAK NILMQLLLEQ EKRPAILKMK LLELNTLAAR AALEGGADPE AVSDLALASS TEFLTLDNLA DMRERILERL MALVAQVAET REQRNSSLID RASKYIEANF SQDLTLEEVA RQVYLSPCYF SKLFKQFKGL NFIDYLTKVR LKAARELLLN TKLPVAEIAT RVGYRDARYF GQVFKKQEGY TPSVFRKIGG AHFGKSTS
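
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input. Note that the gene starts with TTG, which translation table 11 accepts as an initiator codon and which is read as Met at the start position, consistent with the protein beginning with M.

```python
def clean_sequence(grouped: str) -> str:
    """Join space-separated sequence blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str) -> list[str]:
    """Split a sequence into complete codons, dropping any trailing partial codon."""
    return [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]


# First two blocks of the gene sequence shown above.
sample = clean_sequence("TTGGCCACGG ATATCAAGAT")
print(len(sample))        # 20
print(codons(sample)[0])  # TTG
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` should give a value that can be compared with the reported GC content.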