Gene Moth_1910 details

Gene Information

Locus tag: Moth_1910
Symbol:
ID: 3831183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1978082
End bp: 1979770
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 60%
IMG OID: 637829843
Product: sigma-54 dependent transcriptional regulator
Protein accession: YP_430753
Protein GI: 83590744
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID: [TIGR00229] PAS domain S-box
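
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete with a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record above.
start_bp = 1978082
end_bp = 1979770
gene_length_bp = 1689
protein_length_aa = 562


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end]."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codons in the CDS minus one terminal stop codon (assumes a complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1689
print(expected_protein_length(gene_length_bp))  # 562
```

Both results agree with the Gene Length and Protein Length fields listed above.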


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000822204
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGGAGC ATCTGGACTT CTGGGGGGCA ATTCTCGCCG CCCGGCGGCA GGGGATGATT 
TATGTTGACG CCGGGGGCCT TATTAAGGTG GCCAATTCCC TGGCGACCGC CATCCTGGGA
TTGGACGGCT GTGATCTCAG GGGCCGCCGC CTGACGGAAG TCTTTCCTGA GACCCGCCTG
GCCGAGGTCC TGACCCGTAA GGGCCCTCTC CTTGGGACCA GCTTTACCGC CGGCAGTAGA
ATCTATCGGG TCGATTATTT CCCCGACGAT GGGGGAGGTG TCGTCCTTCT TTTCAGTGAC
GAGCAGCCTG AGACGAATTC CCTCAAAGCC CTTAACGAAC TCTACGCCTC CCTCCTTGAC
GATTTTCCTC TATACCTGGT GGTCGTCGAC GGCCGGGGGG TGGTGGTCGC CATTAACCAT
TTATATGCCA GCCTCCTGGG TGCAAAGGCC GGTGCCCTGA TCGGCCGGGA AGTAAAGGAT
ATTTTGCCCT TTACCTCCTT GCCGGAGGTC CTATCTTCGG GGATGGCGGT AGCCGGCCGG
GAAGTGGCCT TCCGGGGCAA AAAATTGCTA TTATCGGAAG TGCCGGTTAA AGATGGGGCC
GGGCATCTCC TGGGAGGAGT CGGTAAGGCC CTGGATCTCG AGGACCTGGC AGCCGCCGGT
TTCAGCGACC TCGCCCGCCG CCTGCAGGTC CTGGAGGGCA AGGTTCTTTT CTACCAGCAG
GAACTCCAGT CAATCCAGGG CGACCAGGAC GCCTGGCAGG GCATCCTGGG GGTGAGCCCG
GCTATGGTCC GGCTAAAAAA AATGGCGGCC CGGGTAGCCC GCGGCGAGGC TAACGTTTTG
ATTACCGGCG AGAGCGGTAC CGGGAAGGAA CTGCTGGCCC AGGCCATCCA CCGGGCCAGT
CCTCGCCGGA ATGAACCATT GATCAAGATC AACTGTGCCG CCATCCCGGA AAACCTGCTG
GAGGCCGAAC TCTTCGGCTA CGAAGAGGGG GCCTTTACCG GCGCCGCCCG GGGCGGCAAA
CCGGGCAAGT TTGAACTGGC TGACGGCGGT ACCATCTTCC TGGATGAAGT CGGCGACCTG
CCCCTCAGTA TGCAGCCCAA GCTCCTGCGG GTCCTGCAGG AAAGGGCCTT TGAACGGGTC
GGGGGACGCC GGACCATCAA GGTTGATGTA AGGATTATTG CTGCCACCAA CCAGGATCTC
CTGGCCCTGG AGGCGGCCGG CAAATTTCGC CTTGACCTTT ACTACCGCCT GGCGGTGGTT
ACCCTGACGG TACCACCCTT AAGGGAACGG CCGGGGGACC TCGAAATACT GGTGGCGGCC
ATTATCCGGC GTTTAAATGC ACGCTACCAC CAGACGGTCC GGGGCCTGGA TCCGGAGGTA
CAGCGGCTTT TTTCCCGCTA TAGCTGGCCG GGGAATGTCC GGGAGCTGGA AAATGTCCTG
GAGCATGCCT TTAATTTCCT GGATCCGGGG GAAGAAATAA TTACCTTGAA GCATCTCCCG
GCCACCATGC AAAGGACGAT TGGTGGCGGG GGTGGCCTGA AGCTGAAGAG TGCTGTCGCT
ACGGCGGAAA GGGAAGTTAT CTTAAAGGCC CTCGCGGCCG CTGGTGGAAA CAAACAGGAA
GCGGCCCGTC TCCTGGGTAT TCACCCCTCC GGCCTTTATC AAAAGCTCAA GCGTTATGAT
ATCAATTAG
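
The gene sequence above is printed in blocks of ten bases, so the spaces and line breaks need to be stripped before any computation. As a rough sketch of how a GC content figure like the one in the record could be reproduced, the helpers below (hypothetical names, not taken from the source database) clean a block-formatted sequence and compute the percentage of G and C bases. How the database rounds its reported value is not stated, so no rounding is applied here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "GTGAAGGAGC ATCTGGACTT CTGGGGGGCA ATTCTCGCCG CCCGGCGGCA GGGGATGATT"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence block to `gc_percent` gives the GC percentage for the whole gene.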
 
Protein sequence
MKEHLDFWGA ILAARRQGMI YVDAGGLIKV ANSLATAILG LDGCDLRGRR LTEVFPETRL 
AEVLTRKGPL LGTSFTAGSR IYRVDYFPDD GGGVVLLFSD EQPETNSLKA LNELYASLLD
DFPLYLVVVD GRGVVVAINH LYASLLGAKA GALIGREVKD ILPFTSLPEV LSSGMAVAGR
EVAFRGKKLL LSEVPVKDGA GHLLGGVGKA LDLEDLAAAG FSDLARRLQV LEGKVLFYQQ
ELQSIQGDQD AWQGILGVSP AMVRLKKMAA RVARGEANVL ITGESGTGKE LLAQAIHRAS
PRRNEPLIKI NCAAIPENLL EAELFGYEEG AFTGAARGGK PGKFELADGG TIFLDEVGDL
PLSMQPKLLR VLQERAFERV GGRRTIKVDV RIIAATNQDL LALEAAGKFR LDLYYRLAVV
TLTVPPLRER PGDLEILVAA IIRRLNARYH QTVRGLDPEV QRLFSRYSWP GNVRELENVL
EHAFNFLDPG EEIITLKHLP ATMQRTIGGG GGLKLKSAVA TAEREVILKA LAAAGGNKQE
AARLLGIHPS GLYQKLKRYD IN
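
The record lists translation table 11, and the gene begins with GTG while the protein begins with M. That is consistent with an alternative start codon being read as methionine. The sketch below illustrates this under stated assumptions: the codon assignments are those of the standard genetic code (which table 11 shares), and `START_CODONS` is a simplified, illustrative subset of the start codons the table permits. The function name `translate_cds` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a simplified subset of alternative start codons, for illustration only.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(text: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate_cds("GTGAAGGAGC ATCTGGACTT CTGGGGGGCA"))  # MKEHLDFWGA
print(translate_cds("ATCAATTAG"))                         # IN
```

The two outputs match the first ten and last two residues of the protein sequence shown above.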