Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0853 |
Symbol | |
ID | 3831740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 886419 |
End bp | 887237 |
Gene Length | 819 bp |
Protein Length | 272 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637828783 |
Product | sporulation sigma factor SigG |
Protein accession | YP_429713 |
Protein GI | 83589704 |
COG category | [K] Transcription |
COG ID | [COG1191] DNA-directed RNA polymerase specialized sigma subunit |
TIGRFAM ID | [TIGR02850] RNA polymerase sigma-G factor [TIGR02885] RNA polymerase sigma-F factor [TIGR02937] RNA polymerase sigma factor, sigma-70 family [TIGR02980] RNA polymerase sigma-70 factor, sigma-B/F/G subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0182418 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATAA TTATCCCAGG ACCAAGGTCG GGAGGTTTCT CGCCAGGAAT GCTGAACAAA GTAGAAATCT GCGGTGTGAA TACCTCCAAA TTACCCCTGC TGAAGAGTGA TGAGATCCGC CAGCTCTTTC AACAGATGCA GGCCGGAGAA ACCGGAGCCC GGGAAAAGCT CATTACAGGC AACCTGCGCC TGGTTTTAAG CGTCATCCAG CGCTTCACCA ACCGCGGCGA ACACGTGGAC GACCTGTTCC AGGTGGGTTG TATTGGCCTC ATGAAGGCCA TTGATAACTT TGATCTGAGC CAGAACGTCA AGTTTTCCAC CTACGCCGTG CCCATGATTA TCGGCGAGAT CCGGCGTTAC TTGCGGGACA ATAACCCTAT CCGGGTCAGT CGTTCCTTGC GGGACGTGGC CTACAAGGCC CTCCAGATAA GGGACACCCT GGTGAATAAG CTCGCCAGGG AACCCTCTCT GGCGGAAGTA GCCCAGCAAC TGGACCTGCC CCAGGAAGAA GTAATCTTCG CCCTGGATGC CATTCAGGAG CCGGTGTCCC TTTTCGAACC GATTTATCAT GACGGCGGCG ATCCCATCTT TGTCATGGAC CAGATTGGGG ACGAAAAGAA CCAGGACAGC AGCTGGCTGG AGAATATCGC CATCAAAGAG GCCATGAACA AGTTGAATCC CCGGGAGCGC CTCATCCTAT CCTTACGTTT CTTTGAAGGC AAGACCCAGA TGGAAGTCGC AGACGAAATC GGCATATCCC AGGCCCAGGT CTCCCGGCTG GAAAAAGCAG CCCTGCAGCA TATGCGGAAA TATATTTAA
|
Protein sequence | MAIIIPGPRS GGFSPGMLNK VEICGVNTSK LPLLKSDEIR QLFQQMQAGE TGAREKLITG NLRLVLSVIQ RFTNRGEHVD DLFQVGCIGL MKAIDNFDLS QNVKFSTYAV PMIIGEIRRY LRDNNPIRVS RSLRDVAYKA LQIRDTLVNK LAREPSLAEV AQQLDLPQEE VIFALDAIQE PVSLFEPIYH DGGDPIFVMD QIGDEKNQDS SWLENIAIKE AMNKLNPRER LILSLRFFEG KTQMEVADEI GISQAQVSRL EKAALQHMRK YI
|
| |