Gene Moth_0624 details


Gene Information

Locus tag: Moth_0624
Symbol:
ID: 3832518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 648290
End bp: 649375
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 59%
IMG OID: 637828565
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_429497
Protein GI: 83589488
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
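
The listed lengths can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated 1086 bp) and that the final codon is a stop codon that adds no residue. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 648290
end_bp = 649375

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1086 361
```

Both values match the Gene Length and Protein Length fields of the record.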


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00076534
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAGAAG AACAGCAAAA GCAAATGGTC AAGGAACTGA TTGAAAAGGG CAAGAGCCAG 
GGGTCCTTAA CTTACAGTGA GATTATGGAT GCCCTTCAGG GAATAGAACT CTCCCCCGAA
CAGATTGACG ATATCTACGA GCAGCTGGGC CATATGGGCA TCGAGGTGGT GCCCGAAGCG
GCTGAGCTGG AAGCCCTGGA AAAGGAAGAC AGCGAAGCCG GGGAGACAGA TCTGGATCTC
TCCATTCCCG AGAATGTAGC CATAGATGAC CCGGTGCGGA TGTACCTGAA GGAGATTGGC
CGGGTACCCC TGCTGACCCC CGAAGAAGAA ATCGAGCTGG CCAAGCGCAT GGAGGCCGGC
GACGAAGAAG CCAAGCGCCG TTTGGCCGAG GCCAACCTGC GCCTGGTGGT CAGCATCGCC
AAGCGCTACG TCGGTCGCGG CATGCTCTTC CTGGATCTCA TCCAGGAAGG CAACCTGGGC
CTTATAAAGG CGGTAGAGAA ATTCGACTAC CGTAAGGGTT ACAAGTTTAG CACCTACGCC
ACCTGGTGGA TCCGCCAGGC CATCACCCGG GCCATCGCCG ATCAGGCCCG GACCATCAGG
ATCCCGGTGC ATATGGTGGA AACTATCAAT AAATTAATCC GCGTGCAGCG GCAGCTCCTC
CAGGAACTGG GACGGGAACC CACCCCGGAG GAGATCGCCC ACGAAATGGA TATCCCCGTA
GAACGGGTAC GGGAGATCCA GAAGATCGCC CAGGAACCGG TATCCCTGGA GACACCCATC
GGCGAGGAGG AAGACAGCCA TCTGGGAGAC TTTATCGAAG ACGAGGACGC CCAGGCACCG
GCAGAAGCGG CTTCCTTCAT GCTCCTGCGG GAGCAGCTAG AGGAGGTCCT GAACTCCCTC
ACCCCCAGGG AAAAACGGGT CCTGCGCCTG CGCTTCGGCC TCGATGACGG CCGCGCCCGC
ACCCTGGAAG AAGTGGGCCA GGAGTTCGGC GTCACCAGGG AGCGTATCCG CCAGATCGAG
GCCAAGGCCC TGCGCAAGCT GCGCCACCCC AGCCGCAGCA AGAAACTCAA GGACTACCTG
GAATAG
 
Protein sequence
MKEEQQKQMV KELIEKGKSQ GSLTYSEIMD ALQGIELSPE QIDDIYEQLG HMGIEVVPEA 
AELEALEKED SEAGETDLDL SIPENVAIDD PVRMYLKEIG RVPLLTPEEE IELAKRMEAG
DEEAKRRLAE ANLRLVVSIA KRYVGRGMLF LDLIQEGNLG LIKAVEKFDY RKGYKFSTYA
TWWIRQAITR AIADQARTIR IPVHMVETIN KLIRVQRQLL QELGREPTPE EIAHEMDIPV
ERVREIQKIA QEPVSLETPI GEEEDSHLGD FIEDEDAQAP AEAASFMLLR EQLEEVLNSL
TPREKRVLRL RFGLDDGRAR TLEEVGQEFG VTRERIRQIE AKALRKLRHP SRSKKLKDYL
E
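
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the pipeline that produced this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code, and that an alternative initiation codon such as GTG is read as methionine at the first position. The `START_CODONS` set is a deliberately partial list, and the `translate` helper is hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 is assumed to share these amino-acid
# assignments with the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a partial set of bacterial initiation codons, enough for this gene.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiation codon is read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGAAAGAAG AACAGCAAAA GCAAATGGTC AAGGAACTGA TTGAAAAGGG CAAGAGCCAG"
print(translate(first_line))  # MKEEQQKQMVKELIEKGKSQ
```

The output corresponds to the first two blocks of the protein sequence, `MKEEQQKQMV KELIEKGKSQ`. The leading GTG codon yields M rather than V because it is the initiation codon.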