Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1889 |
Symbol | |
ID | 3831234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1952445 |
End bp | 1954703 |
Gene Length | 2259 bp |
Protein Length | 752 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637829822 |
Product | sigma-54 dependent trancsriptional regulator |
Protein accession | YP_430732 |
Protein GI | 83590723 |
COG category | [K] Transcription [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGGGA TAGTCAACAC TGCTACTGGA AGGTGCCGTC AGTGCTATTC CTGTGTCCGC AATTGCCCGG TCAAGGCCAT CAGAATAAAC AAGGGCCAGG CGGAGGTTAT CGCCGAACGC TGCATCAGCT GCGGCATGTG CCTGGCTTTT TGCTCCCAGG GGGCCAAACA GGTAGCCGGC AGCCAGGCGG CCGTCCTGGC AGCGTTAAAG GAGCACCAGG AGATGGTAGC CTGCCTGGCG CCGTCATTTC CAGCGGCTTT TCCTGGTTGG ACCGCCGGCC AGGTGGCCGG CGCCCTGAAG AAACTTGGTT TTGCCCGGGT ATGGGAGGTG GCCGTGGGGG CGCTGCTGGT TGCCAGGGAG TATCAAAGGG TGCTAAAACA GAGGAATACT CCCGCCATCA GTACGGCCTG CTATGCGGTG GTCAATCTGG TCGAGAGGCA CTTCCCGTCC CTCATTCCTT ACCTGTTACC GGTAGTCTCC CCCTCCATAG CCCTGGGAAG GCTTCTTAAA AAACACCTGG GTCCCGTGAA AGTGGCTTTT ATCGGCCCCT GTATCGCTAA AAAAGAAGAG ATTCTGGATC CGGAGGTAGC CGGCGCTGTA GATTATGTAC TGACATTTGC GGAAATTAAG GAGTTACTTG CCGTTGAGCA TCTGGAACAT CCCGGGGTTG CGGCAGCCCT GGACAGCCCG CCGGTGGCAG TCAGCCGGCT TTTTCCCCTG CCCGGGGGAC TCAGCCGGAG CATGGGCGCG ATCCCGGATA TTGCCGACCA GGATCTTTTG CTGGTTGAAG GGAAAGAAGG TGTGCTGGCG GCCCTGGAGG GCCTGGCACG GGGGGAGATC CGGCCCCGCT TAATCGACGC CCTTTTTTGC GAAGGCTGCG TCATGGGTCC CGGAATGGGT GTTGTGGTCA ACCAGGTAAA GAGAAAGGAG CTGGTAGCCG CCTACTACCG CCGCTGTCAG GAGGCGCGCG AGCCGGAAAT CCTAGCCCCC GACCTGGCGC GGAGCTTTCA CAATAAACAA TCGTCCCTGC CCCTCCCCGG CGAGGAAGAT ATTAAACGCA TCTTACGGCT GACCAACAAA TTTACGCCGG CCGATGAACT GAACTGTGGC GCCTGCGGCT ATCACTCCTG CCGGGAGAAA GCCATAGCCG TTTACCAGGG CCTGGCGGAG ATCGATATGT GCCTGCCCTA TCTCCTGGAA CAGAAGAGCG ACCTGCTGTC CCGGGCGGCC AGCAACCTGA TGCATTTCGT CAATCTATAT AAAAGTCCCG GCGACAGGCC CGGCCCCGGG GTCATGGAAT TGCTCCAGGA AAGAAACATT ATTGTCGCCA GCCCGCGGAT GTTAAGGGTC CTCTACCTGG CGGAACGGGT AGCCAGGGTG GATTCCACGG TGTTAATCCT GGGGGAATCC GGCGTCGGTA AGGAAGTAGT CGCCCGCCTG ATCCATGCCT TAAGCGAGCG CGGCAAGGGG CCGTTTGTGA AAATAAACTG CGGCGCTATT CCGGAAAACC TGCTGGAATC CGAGCTTTTT GGCTACGAAC GGGGGGCTTT TACCGGGGCC AACCGGGAGG GAAAGATGGG CCAGCTGGAG TTGGGCGAGG GGGGAACGGT ATTCCTGGAC GAAATCGCTG AACTCCCCTT AAAGCTACAG GTTAAGCTCC TGCAGGTCTT ACAGGAGCAG CGCCTGGTAC GGGTGGGGGG GATCAGGGAG ATCAAACTCA ATATTCGCAT TATCTCGGCG ACCAATAAAA ACCTCTTGCA GATGGTCCGG GAAGGGACCT TCCGGGAGGA TCTGTATTAC CGCCTGAATG TAATCCCCCT GACCATCCCC CCTTTACGGG AACGGCCGGA AGATATCGAA GCCCTCATCG ACCATTTTAT GGACCGGCTG AACCGGCGTT ACAAGCAAGA AAAAAGGATT AGCCGCCGGG CCAGGAGGTA TCTCCTGGCC TATCCCTGGC CCGGCAATGT AAGGGAACTC CATAACGTCA TCGAGCAGCT TTTCGTCCTG GTAGAAGGGA CGGAGATTCT ACCTGAGCAT TTACCCTATT ATATCCGCGA CGACCCGGCG AGATATAGCT CCCATATGCT GGTAAAAGAT ATTATACCCA TGAAAGAAGC CATTGAAGAG GTTGAAAAAC AGTTGCTGTT AAAGGCCCTG GAAAAGTACA GGAGCACTTA CCAGGTTGCC GAAAAGCTGG GGGTAAACCA GTCGACTGTA GTGCGCAAAA TCAAAAAGTA CGGGCTGGAG CATCAATAA
|
Protein sequence | MGGIVNTATG RCRQCYSCVR NCPVKAIRIN KGQAEVIAER CISCGMCLAF CSQGAKQVAG SQAAVLAALK EHQEMVACLA PSFPAAFPGW TAGQVAGALK KLGFARVWEV AVGALLVARE YQRVLKQRNT PAISTACYAV VNLVERHFPS LIPYLLPVVS PSIALGRLLK KHLGPVKVAF IGPCIAKKEE ILDPEVAGAV DYVLTFAEIK ELLAVEHLEH PGVAAALDSP PVAVSRLFPL PGGLSRSMGA IPDIADQDLL LVEGKEGVLA ALEGLARGEI RPRLIDALFC EGCVMGPGMG VVVNQVKRKE LVAAYYRRCQ EAREPEILAP DLARSFHNKQ SSLPLPGEED IKRILRLTNK FTPADELNCG ACGYHSCREK AIAVYQGLAE IDMCLPYLLE QKSDLLSRAA SNLMHFVNLY KSPGDRPGPG VMELLQERNI IVASPRMLRV LYLAERVARV DSTVLILGES GVGKEVVARL IHALSERGKG PFVKINCGAI PENLLESELF GYERGAFTGA NREGKMGQLE LGEGGTVFLD EIAELPLKLQ VKLLQVLQEQ RLVRVGGIRE IKLNIRIISA TNKNLLQMVR EGTFREDLYY RLNVIPLTIP PLRERPEDIE ALIDHFMDRL NRRYKQEKRI SRRARRYLLA YPWPGNVREL HNVIEQLFVL VEGTEILPEH LPYYIRDDPA RYSSHMLVKD IIPMKEAIEE VEKQLLLKAL EKYRSTYQVA EKLGVNQSTV VRKIKKYGLE HQ
|
| |