Gene Information |
Locus tag | Moth_0624 |
Symbol | |
ID | 3832518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 648290 |
End bp | 649375 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637828565 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_429497 |
Protein GI | 83589488 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain; [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
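The coordinates and lengths in this record are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 648290
end_bp = 649375

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1086
print(protein_length)  # 361
```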
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00076534 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAGAAG AACAGCAAAA GCAAATGGTC AAGGAACTGA TTGAAAAGGG CAAGAGCCAG GGGTCCTTAA CTTACAGTGA GATTATGGAT GCCCTTCAGG GAATAGAACT CTCCCCCGAA CAGATTGACG ATATCTACGA GCAGCTGGGC CATATGGGCA TCGAGGTGGT GCCCGAAGCG GCTGAGCTGG AAGCCCTGGA AAAGGAAGAC AGCGAAGCCG GGGAGACAGA TCTGGATCTC TCCATTCCCG AGAATGTAGC CATAGATGAC CCGGTGCGGA TGTACCTGAA GGAGATTGGC CGGGTACCCC TGCTGACCCC CGAAGAAGAA ATCGAGCTGG CCAAGCGCAT GGAGGCCGGC GACGAAGAAG CCAAGCGCCG TTTGGCCGAG GCCAACCTGC GCCTGGTGGT CAGCATCGCC AAGCGCTACG TCGGTCGCGG CATGCTCTTC CTGGATCTCA TCCAGGAAGG CAACCTGGGC CTTATAAAGG CGGTAGAGAA ATTCGACTAC CGTAAGGGTT ACAAGTTTAG CACCTACGCC ACCTGGTGGA TCCGCCAGGC CATCACCCGG GCCATCGCCG ATCAGGCCCG GACCATCAGG ATCCCGGTGC ATATGGTGGA AACTATCAAT AAATTAATCC GCGTGCAGCG GCAGCTCCTC CAGGAACTGG GACGGGAACC CACCCCGGAG GAGATCGCCC ACGAAATGGA TATCCCCGTA GAACGGGTAC GGGAGATCCA GAAGATCGCC CAGGAACCGG TATCCCTGGA GACACCCATC GGCGAGGAGG AAGACAGCCA TCTGGGAGAC TTTATCGAAG ACGAGGACGC CCAGGCACCG GCAGAAGCGG CTTCCTTCAT GCTCCTGCGG GAGCAGCTAG AGGAGGTCCT GAACTCCCTC ACCCCCAGGG AAAAACGGGT CCTGCGCCTG CGCTTCGGCC TCGATGACGG CCGCGCCCGC ACCCTGGAAG AAGTGGGCCA GGAGTTCGGC GTCACCAGGG AGCGTATCCG CCAGATCGAG GCCAAGGCCC TGCGCAAGCT GCGCCACCCC AGCCGCAGCA AGAAACTCAA GGACTACCTG GAATAG
|
Protein sequence | MKEEQQKQMV KELIEKGKSQ GSLTYSEIMD ALQGIELSPE QIDDIYEQLG HMGIEVVPEA AELEALEKED SEAGETDLDL SIPENVAIDD PVRMYLKEIG RVPLLTPEEE IELAKRMEAG DEEAKRRLAE ANLRLVVSIA KRYVGRGMLF LDLIQEGNLG LIKAVEKFDY RKGYKFSTYA TWWIRQAITR AIADQARTIR IPVHMVETIN KLIRVQRQLL QELGREPTPE EIAHEMDIPV ERVREIQKIA QEPVSLETPI GEEEDSHLGD FIEDEDAQAP AEAASFMLLR EQLEEVLNSL TPREKRVLRL RFGLDDGRAR TLEEVGQEFG VTRERIRQIE AKALRKLRHP SRSKKLKDYL E
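As a spot check on the two sequences, the sketch below translates the first ten codons of the gene sequence and compares the result with the start of the protein sequence. The codon map is deliberately partial (only the codons that occur in this prefix), and the function name is hypothetical. Under translation table 11, GTG is an accepted alternative initiation codon and is read as methionine at the start position, which is why the protein begins with M although the gene begins with GTG.

```python
# First 30 nt of the gene sequence and first 10 residues of the protein,
# copied from the record (grouping spaces removed).
gene_prefix = "GTGAAAGAAG AACAGCAAAA GCAAATGGTC".replace(" ", "")
protein_prefix = "MKEEQQKQMV"

# Partial codon map: only the codons present in this prefix.
CODONS = {
    "GTG": "V", "AAA": "K", "GAA": "E", "CAG": "Q",
    "CAA": "Q", "AAG": "K", "ATG": "M", "GTC": "V",
}

def translate_prefix(dna):
    """Translate a short in-frame DNA prefix that begins at the start codon."""
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODONS[codon] for codon in codons]
    # The initiation codon is read as methionine, even when it is GTG.
    residues[0] = "M"
    return "".join(residues)

print(translate_prefix(gene_prefix) == protein_prefix)  # True
```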