Gene Moth_0959 details


Gene Information

Locus tag: Moth_0959
Symbol:
ID: 3832844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 990924
End bp: 992072
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 64%
IMG OID: 637828888
Product: putative transmembrane transcriptional regulator (anti-sigma factor)
Protein accession: YP_429817
Protein GI: 83589808
COG category: [K] Transcription
COG ID: [COG5662] Predicted transmembrane transcriptional regulator (anti-sigma factor)
TIGRFAM ID:
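
The reported gene and protein lengths follow from the start and end coordinates. The sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(990924, 992072)
print(length)                     # 1149
print(protein_length_aa(length))  # 382
```

Both values agree with the Gene Length and Protein Length fields above.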


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.261704
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGTCCGG AAGAGGGTCT TCTTCAGGCA TATCTTGACG GCGAACTGGA TGCGGAGATG 
ACCGCCCGGG TGGCCGATCA CCTGGCGGGG TGCCCTGCCT GCCTGCAACG CCTTCAGGAC
CTGGAGGAAC TGGCGGATAG CACCACTACC GCCCTGGCGG CCTACCGGCG GGAGACCGAA
GCCGCCGGTC GTCCCTTAAG GGCACCTGGG CCCGGCTCTA TGCCCTTCCG CCATCGACCC
CTGACTAAAG CAAGGAGGAT GGTTGATATG ATCCGGCAAC ATAAGTGGTT TTCCGGTGCA
GCCGCGGCCG TCCTGGCCCT GGCCGTCTTC TTAAGCTGGG CCCCGGGCCG CAGCCTGGCC
GCCCAGTTCC TCAACATCTT CCGCATGGAA AAGATCCAGG TGGTGAAGAT CACCCCCGAG
GATATGGCCC AACTGGATAA ACTCTTTAAC GGCCAGGGGG GCGAGGTGGA CATCAGGAAC
TTCGGCCGGG TGGAGGTTAA GCGGCCGGCC CAGCCGCGGG TTGAGGCTGA ACCGGCCCAG
GTGGAGGCCT TGAGCGGGTT AAAACTGGAC CTGCCCGCCA CCCTGGCCGG GCGGGAGAGG
ATGGCCATCA ACGTCGAGCA GTCGCCGACC GTCACCTTTA CCCCCGACGT CGAGAAGCTG
AACAGCTACC TGCAGAAACA CGGCGGCGTC CTGCTGCCGG CCGACCTGGC GGGCAAGTCC
TTCACTTTAA GCATCCCGCC CCTGGTGCGG GCCGACTATG GCCGGCGGCC GTTCCAGCCG
GGGCAGAGCT TTACCATCTA CGCCGCCCGC GACCTGACCA TTGACGCGCC CCAGGGGGTT
GATCTGCTGG CCCTGCGCCA GGCGTTGTTG AACCTGCCCT TCCTGCCGGC CGGCTTACGC
CAGCAGCTGG CGGCCATTAA AGACTGGCAG CATACCTTGC CCATCCCGGA GACCCAGGGT
ATGGCGGCGC GGGAGATCAC GGTCAACGGC AACCAGGGGG TCTACTTTAC CAGCGCAACT
CCAACCCACT ACCCCAACGG GAAAGTACAG GATAGTGTCA TCCTGGCCTG GCGCCAGGGT
AATAGCTGGC GGGCCATCAC CGGCCTATCC CTGGATGAAG CCCTGCGGGT AGCAGCGGAG
GTCAAGTAA
 
Protein sequence
MCPEEGLLQA YLDGELDAEM TARVADHLAG CPACLQRLQD LEELADSTTT ALAAYRRETE 
AAGRPLRAPG PGSMPFRHRP LTKARRMVDM IRQHKWFSGA AAAVLALAVF LSWAPGRSLA
AQFLNIFRME KIQVVKITPE DMAQLDKLFN GQGGEVDIRN FGRVEVKRPA QPRVEAEPAQ
VEALSGLKLD LPATLAGRER MAINVEQSPT VTFTPDVEKL NSYLQKHGGV LLPADLAGKS
FTLSIPPLVR ADYGRRPFQP GQSFTIYAAR DLTIDAPQGV DLLALRQALL NLPFLPAGLR
QQLAAIKDWQ HTLPIPETQG MAAREITVNG NQGVYFTSAT PTHYPNGKVQ DSVILAWRQG
NSWRAITGLS LDEALRVAAE VK
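
The gene and protein sequences can be cross-checked by translating the nucleotide sequence and by computing its GC fraction. The following is a minimal sketch using only the Python standard library. The helpers `clean`, `gc_fraction`, and `translate` are hypothetical names. The codon table is written out in NCBI's TCAG ordering; the amino-acid assignments of translation table 11 are the same as those of the standard code, so a single table is used here. The two fragments shown are copied from the first and last sequence lines above.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_60 = "ATGTGTCCGG AAGAGGGTCT TCTTCAGGCA TATCTTGACG GCGAACTGGA TGCGGAGATG"
print(translate(first_60))     # MCPEEGLLQAYLDGELDAEM
print(translate("GTCAAGTAA"))  # VK
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, or passing it to `gc_fraction` and comparing with the reported GC content, extends the same check to the whole record.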