Gene Moth_0853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0853 
Symbol 
ID3831740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp886419 
End bp887237 
Gene Length819 bp 
Protein Length272 aa 
Translation table11 
GC content54% 
IMG OID637828783 
Productsporulation sigma factor SigG 
Protein accessionYP_429713 
Protein GI83589704 
COG category[K] Transcription 
COG ID[COG1191] DNA-directed RNA polymerase specialized sigma subunit 
TIGRFAM ID[TIGR02850] RNA polymerase sigma-G factor
[TIGR02885] RNA polymerase sigma-F factor
[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02980] RNA polymerase sigma-70 factor, sigma-B/F/G subfamily 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0182418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATAA TTATCCCAGG ACCAAGGTCG GGAGGTTTCT CGCCAGGAAT GCTGAACAAA 
GTAGAAATCT GCGGTGTGAA TACCTCCAAA TTACCCCTGC TGAAGAGTGA TGAGATCCGC
CAGCTCTTTC AACAGATGCA GGCCGGAGAA ACCGGAGCCC GGGAAAAGCT CATTACAGGC
AACCTGCGCC TGGTTTTAAG CGTCATCCAG CGCTTCACCA ACCGCGGCGA ACACGTGGAC
GACCTGTTCC AGGTGGGTTG TATTGGCCTC ATGAAGGCCA TTGATAACTT TGATCTGAGC
CAGAACGTCA AGTTTTCCAC CTACGCCGTG CCCATGATTA TCGGCGAGAT CCGGCGTTAC
TTGCGGGACA ATAACCCTAT CCGGGTCAGT CGTTCCTTGC GGGACGTGGC CTACAAGGCC
CTCCAGATAA GGGACACCCT GGTGAATAAG CTCGCCAGGG AACCCTCTCT GGCGGAAGTA
GCCCAGCAAC TGGACCTGCC CCAGGAAGAA GTAATCTTCG CCCTGGATGC CATTCAGGAG
CCGGTGTCCC TTTTCGAACC GATTTATCAT GACGGCGGCG ATCCCATCTT TGTCATGGAC
CAGATTGGGG ACGAAAAGAA CCAGGACAGC AGCTGGCTGG AGAATATCGC CATCAAAGAG
GCCATGAACA AGTTGAATCC CCGGGAGCGC CTCATCCTAT CCTTACGTTT CTTTGAAGGC
AAGACCCAGA TGGAAGTCGC AGACGAAATC GGCATATCCC AGGCCCAGGT CTCCCGGCTG
GAAAAAGCAG CCCTGCAGCA TATGCGGAAA TATATTTAA
 
Protein sequence
MAIIIPGPRS GGFSPGMLNK VEICGVNTSK LPLLKSDEIR QLFQQMQAGE TGAREKLITG 
NLRLVLSVIQ RFTNRGEHVD DLFQVGCIGL MKAIDNFDLS QNVKFSTYAV PMIIGEIRRY
LRDNNPIRVS RSLRDVAYKA LQIRDTLVNK LAREPSLAEV AQQLDLPQEE VIFALDAIQE
PVSLFEPIYH DGGDPIFVMD QIGDEKNQDS SWLENIAIKE AMNKLNPRER LILSLRFFEG
KTQMEVADEI GISQAQVSRL EKAALQHMRK YI