Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1713 |
Symbol | |
ID | 3833163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1754916 |
End bp | 1757162 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829638 |
Product | sigma-54 dependent trancsriptional regulator |
Protein accession | YP_430558 |
Protein GI | 83590549 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000552259 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGGAAA TAGTGAAAAC AATCACAGGC AAATGCCGGA TGTGTTATGC TTGCATCCGC AACTGCCCGG TCAAGGCGAT CAAGGTAGTC GACGGCCAGG CCCGGGTGGT ACCGGAGCTT TGTATCGCCT GCGGCCACTG CGTCCAGGTC TGTGCCCAGG GAGCCAAGCT GGTCGAGCGG GAAATAGACA AAGTAGAGGA ATTCCTGGCT GCCGGCCGGG TCATAGCCTG CCTGGCGCCT TCTTTTGTCG CGGAATTTCA CCCCGCCTGG CCCGGCCAGG TGGTGGCGGC CCTGCGGAAG CTGGGTTTTG ACGAGGTTTA TGAAGTGGCT TACGGCGCCC ATCTGGTTAC CCTGGAGTAT CAAAAGTACC TCCAGGAAGC CGGCGACCGC CAGGGGCTCA TCAGCACCCC ATGCCCGGCG ATAGTCAACC TGGTGACCAA ATACTACCCC CGCCTGCTGC CCCAGCTGGT GCCGGTGGTA TCGCCCATGA TCGCCCTGGG ACGCTACCTG AAAGCCCGCA AGGGTCCCGA TGTCCGCCTG GTCTTCATTG GTCCCTGCGT GGCCAAGAAA GGAGAGGCCA GGGAGGAAGA AGTAGCCGGG GCGATTGACG CCGTCCTGAC CTTTCTGGAG CTAAAGGAGA TGCTGGTCGC CAGGGAGATC GACATTAATC TCTGCGGTAA CAGCGATTTC GATAATGACC CGGCCTATCT GGGCCGCCTC TACCCCGTCG GGGGCGGCCT CTTACGTAGC GCCGGCATCG ATGCCGATAT CCTGACCAAC CAGGTCCTGG TAGTTGAAGG CCGTGAAGAC TGCCTGGATT TTTTAAAAGC AGTAAACGAG GGCAAGATGG TGCCTCAGAT GGCTGATATG CTCTTCTGCA AGGGGTGCAT CAACGGCCCC ATGCAGGAAA ACGATTTTTC GCCCTACCGG CGGCGGAAGA TTGTGGCCGA ATTTACCCGC CAGGGATTGC AAAATACCCG TCCCCAGCCC CTGGATCTCA CAGGGATAAA CTTGCGCCGG GAGTTTCATG ATCAGAGCGT TTTCTTTCCC CGGCCCAGCG AGGCGGATAT CCGGGCCATC CTGGCAGAAC TGGGTAAGTA TACCCCGGAG GACGAGCTCA ATTGCGGTGC CTGTGGCTAT CCCTCCTGCC GGGAGAAGGC CGTTGCCGTT TATCACGGCC TGGCGGAGAA TCGCATGTGC CTGCCCTACT TCATTGCCCA GCTGGAAAAG AGCAACTTCC ACCTGCGCCA GGAGCTTAAA GCCTTCCAGG GCAAGTTTGG CCAGCTGGTG GGCAACAGTG CAGCCATGCA GGAGGTCTAC CGCCTGATAG AGAAGGTAGC TAAAACCGAT GCCACCGTCC TCATCCGGGG GGAGAGCGGT ACCGGCAAGG AGCTGGTGGC TAGGGCCATA CATTATCACA GCCGCCGGAG CGAGGCGCCC TTCGTCAGCA TCAACTGCGC CGCCCTGCCG GAAACCTTGC TGGAGAGCGA GCTCTTCGGC CATGCCCGGG GCGCCTTTAC CGGGGCGGTC ACGGCTAAGA AAGGACTCTT TGAGGAGGCC AACGAAGGCA CCATTTTCCT GGATGAAATC GGCGATATCA GCCTGGGACT CCAGAGCAAG CTCCTGCGCG TCCTCCAGGA AGGGGAATTC TTACGGGTGG GAGAGACCAG GCCGACCAGG GTGAACGTCC GCGTCCTGGC GGCCACCAAC CGGGACCTGG AGAAGGCCGT CAAGGATGGC GCCTTCCGGC CCGAGCTCCT TTATCGCCTC AACGTCATTA CCATCTGGCT GCCGCCCTTG CGGGAACGGC GGGATGATAT CCCCCTGCTG GCCCGTCACT TCCTGGAAAA ATCCTGCCAG CGCCTGGGTA AAGAAGTCAA TGGCTTCACC CACGAGGCCA TGGACGCCCT GATACGGGCC GACTGGCCGG GCAATGTTCG GGAACTGGAG AACGTCATCG AGCGGGCGGT TATTTTGACC GAGAGTAAGA GAATCGAGCT GGGCGATTTA CCGCGGCAGT TCCGTAACAT GGCCGCCGGG GCTTCCGGCT GGGCCTTCAC CCGGGGCATG AGTTTTAAAG AGGCTGTGGC CGCCTACGAA TCGCGCCTTA TTCTGCAGGC TCTGGAGGAG AGCCAGGGGG TCCAGGCCCG GGCGGCAGAG ATCCTGGGCC TGAAGCGGAG CACTTTGAAT GAGATGATAA AACGCTATAA TTTAATGGGC CGCAAAAATG GTGGTACAAA TAAATGA
|
Protein sequence | MREIVKTITG KCRMCYACIR NCPVKAIKVV DGQARVVPEL CIACGHCVQV CAQGAKLVER EIDKVEEFLA AGRVIACLAP SFVAEFHPAW PGQVVAALRK LGFDEVYEVA YGAHLVTLEY QKYLQEAGDR QGLISTPCPA IVNLVTKYYP RLLPQLVPVV SPMIALGRYL KARKGPDVRL VFIGPCVAKK GEAREEEVAG AIDAVLTFLE LKEMLVAREI DINLCGNSDF DNDPAYLGRL YPVGGGLLRS AGIDADILTN QVLVVEGRED CLDFLKAVNE GKMVPQMADM LFCKGCINGP MQENDFSPYR RRKIVAEFTR QGLQNTRPQP LDLTGINLRR EFHDQSVFFP RPSEADIRAI LAELGKYTPE DELNCGACGY PSCREKAVAV YHGLAENRMC LPYFIAQLEK SNFHLRQELK AFQGKFGQLV GNSAAMQEVY RLIEKVAKTD ATVLIRGESG TGKELVARAI HYHSRRSEAP FVSINCAALP ETLLESELFG HARGAFTGAV TAKKGLFEEA NEGTIFLDEI GDISLGLQSK LLRVLQEGEF LRVGETRPTR VNVRVLAATN RDLEKAVKDG AFRPELLYRL NVITIWLPPL RERRDDIPLL ARHFLEKSCQ RLGKEVNGFT HEAMDALIRA DWPGNVRELE NVIERAVILT ESKRIELGDL PRQFRNMAAG ASGWAFTRGM SFKEAVAAYE SRLILQALEE SQGVQARAAE ILGLKRSTLN EMIKRYNLMG RKNGGTNK
|
| |