Gene Information |
Locus tag | Moth_2125 |
Symbol | |
ID | 3833276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2222659 |
End bp | 2224320 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637830050 |
Product | CdaR family transcriptional regulator |
Protein accession | YP_430960 |
Protein GI | 83590951 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3835] Sugar diacid utilization regulator |
TIGRFAM ID | |
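The length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, not stated in the record) and that the last codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS, assuming the final
    codon is a stop codon and is therefore not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Moth_2125.
length_bp = gene_length(2222659, 2224320)
print(length_bp)                  # 1662
print(protein_length(length_bp))  # 553
```

Under these assumptions the results agree with the Gene Length (1662 bp) and Protein Length (553 aa) values listed in the record.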
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGATTA TGGGAATTAC TGTAAGAGAA GCGCTGCGCT TGCCTCAGTT GGAGGGAGCA GTACTTGTCG GTGGTGAACA GGGACTTGAT CGTATAATCA ATTCTGTGAA CATCATGGAA GTACCGGATA TAGGAAATTA TATAAAACCC GCAGAGCTTT TGCTGACAAC CGCGTATCCT ATTAAAGATG ATATCAGAGC TCTGGAAAAT TTGATACCCG AATTAAACAA ATGCGGATTG GCCGCGCTGG CCATAAAGCC GGAACGCTAT ATCCGTGAGA TTCCAGAGAT TATGATTAAG CAGGCCAACA AATACAAGTT TCCCTTGATT AAATTGCCCA ATAGCGCTTC ATTTAACGAA ATTATAAACC CTATTTTAAC CGAAATTTTA AACCGCCAGG CGGCCATCCT GCAAAGGAAT GAACAACTTA GCAAAGCACT AACCGGTATA GTACTATATG GCGGAGGGCT GGCTGAGATC GCGCGGACCC TAGCTGGCTT ACTGGGGTTG CCAGTCTCCA TTCATGATCC TGCCTTCCGA AAGATAGCTT CATGGTTAGC GCCCCGCGAT TTCGGTGCTG ATAATAACCG GGCACTGGCT GAACTTGTAA ATGACAGCAA ACAGCTGGCA GAAGCCTGTA AGTGTGGTAG GGAACAATTG GTATTTAACT GTGCTGGTGG GTTATATGCC ATATGCCGCC CTGTCACAGT TGCTGAGGAA ATTTATGCAT ACTTGTTTAT CTGGGGGCCC CAAAAGCCTA TTTCTGAACG GGAAATAACC GGCATTGAGC AGGCGGTTAC TGTTATTGCT TTGGAAATAA GCAAGCAACG GGCTGTCTTT GATGTCGAGA GCAGGTTTAA AAGCAGCTTT ATTGAGTATC TCGTGGATGG TAAAATTACC TCAAAGGAAG ATGCGATTAT TTTGGCCGAA AGATTTGGGT GGGAGATTGC TAACGGCTTT ACAGTCATGT TATTTGAATG TGATGATTTG CAGTCGCTTT ACGATCAGGA GCCAGTATTT GCCCGCAAGA AACGGCTTAA GATAATGGAG ATAGTGAATA CAGCGGTTGT TTCGTTATCT CCGGGGTCAG CGGTTGTCGA GAGAGGGAGG CGGTTAATGG TGTTGCACCG CCCTCTGCCA GGAAAAGATA TCAAACAGTT GCGGAGAAAT TCCGAGACTT TGGCCCGTTC CTGTTGTGAA GAGCTAGCGA AGCGGTTAGA TACACGAATA CTCGTCGGGA TTAGCCGCTT TATCGCCGAC CCCATGAAGA TTGCCGAAGG TATAAACCAA TCCCACCAGG CACTGGAAAT AGGGAGACGT GCTTACGAAC GGGGTCAAGT TTTTCACTTT GACGACCTGG GTGTCTACCG CATTTTATTG ACTTCTGATG CTGCAGAGAT GCAACGTTTC TATGACGATA TCTTAGGCCA ACTGGTCTCT TATGATGAAG AAAATAAAGC CGGTCTAATT ACAACGCTGG AGATGTTATT TGCTAATGAT ATGAACCTGC AGAAAACAGC TGATGCCTTG TTTATTCACT ATAATACTTT GAGATACCGC ATAAACCGGA TACAACAAAT TACTGGCCTT GATTTAAAGT CTCCGGATGA TCGCCTGAGC CTGCAGCTTG CCCTTAAAAT CCTGCACATG AAAGGAAATT AA
|
Protein sequence | MLIMGITVRE ALRLPQLEGA VLVGGEQGLD RIINSVNIME VPDIGNYIKP AELLLTTAYP IKDDIRALEN LIPELNKCGL AALAIKPERY IREIPEIMIK QANKYKFPLI KLPNSASFNE IINPILTEIL NRQAAILQRN EQLSKALTGI VLYGGGLAEI ARTLAGLLGL PVSIHDPAFR KIASWLAPRD FGADNNRALA ELVNDSKQLA EACKCGREQL VFNCAGGLYA ICRPVTVAEE IYAYLFIWGP QKPISEREIT GIEQAVTVIA LEISKQRAVF DVESRFKSSF IEYLVDGKIT SKEDAIILAE RFGWEIANGF TVMLFECDDL QSLYDQEPVF ARKKRLKIME IVNTAVVSLS PGSAVVERGR RLMVLHRPLP GKDIKQLRRN SETLARSCCE ELAKRLDTRI LVGISRFIAD PMKIAEGINQ SHQALEIGRR AYERGQVFHF DDLGVYRILL TSDAAEMQRF YDDILGQLVS YDEENKAGLI TTLEMLFAND MNLQKTADAL FIHYNTLRYR INRIQQITGL DLKSPDDRLS LQLALKILHM KGN
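The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a string might be cleaned up and its GC fraction computed is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the gene sequence are used as sample input. Running the same functions on the full gene sequence gives a value that can be compared with the GC content field of the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and
    return it in upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence from the record.
sample = clean_sequence("GTGTTGATTA TGGGAATTAC")
print(sample)               # GTGTTGATTATGGGAATTAC
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.35
```

The cleaned length of the full gene sequence can also be checked against the Gene Length field, and the cleaned protein sequence against the Protein Length field.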