Gene Moth_1205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1205 
Symbol 
ID3832972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1243512 
End bp1244834 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content49% 
IMG OID637829138 
ProductAraC family transcriptional regulator 
Protein accessionYP_430062 
Protein GI83590053 
COG category[T] Signal transduction mechanisms 
COG ID[COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.909691 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCGC TAAGTTTGAG CGAAAGAATG GAAGACCTCC AAATAAAAAA GAAGATGGAA 
AAGGAGCTTT TCCAGAGGTT GTTAAACTCT TTTTCTTACG CTACGAGGAT GTTTTCCGCC
ATAACAGATT TGGAAGGAAA TTGCATACTC TCCTCCGAGC AGGGTGATTG CGAGTTCTGC
CAGCTGGTCA AATCGAGTCC TACCGGAATG GCCCGTTGCC GGAGTTCTTA TGCCTGGGCC
GGGGAGCAAG CCCTCAAATG GAAAGAGCCT TATATTTTCA AATGCCATGC CGGCCTTATC
TCATGGGTCT GCCCCTTTTT CTACAGGGGC AAGCACATCG GGAATTTTAT TTGCGGCCAG
GTAATGATGT GGCAGCCCGC CGAATTCTGT CATCACTGGA TCAGGGAACT GGCGTCTGAG
ATAGAGCAGG ACCCCAACAT TTTGTTACAA TCCGTAGACA GGGTTAAGTC GGTGTCGTCG
GTAGAGATCC AGGCCGCGGC CGATCTGGTC TTTATCATTA CCAGTTACGT AGCAAAGAGC
GAGGGAGAGA TCTTTGACTT CCAGCAAAAA TTGCGAAGAG TCGGTTCCTG GATATGGACG
GAAAACAAGA AACAGAAGGA TGTCGGGAGC CAGACCGCCG GGGGCAACAC AGAGCAGGAC
CTGAGCAAGA TAGGGAACCA GATCTTTATG GAGATCAGGA GATCAGATAT CGATAAGGCA
AAAAAGCTGC TAGAGCAGCT CGTCCTGCAG ATTTTTATCC AGAGCAAGGG GCAATTGGAA
GTTATCAAGG GGCGCAGCCT GGAACTCCTG AGCTTCCTTA TCCGTACGTC GACCGAATAC
GGAGTAAAGT TCGGGGAAGT AATCCACTTA AGCGATCTGA AGCTGAGGGA GATAGACGAG
GCTGACACCG TAGAAAAGGC TGTCCTCTGG CTTCTGGCGG TGGGAAACGC CTTTATCGAG
TTGATTGCGG AAAGGAATTC CAGCGAGGGA GAGGGCATAA TCGACAGAGT TGTCGAATAT
ATCCAGAAAA ACTATAGTTC GGAGAGCCTC TCTGTTAAAG AAATTGCCAG AGCCAGCTAC
CTGAGCCCGG CATATCTGGG GCAACTGTTC AAAAAAAAGA TGGGCTATTC CCTCACCGAG
CACATTAACA AGGTGAGGAT CGAGCAGGCG AAGCTCTTGC TCAGGCAAAC CGAACAGACC
ATTGAGTCGG TAGCTATACA GACGGGTTTT AAAGAGCGCA GTTATTTCTG CAAGGTTTTT
AAAAAAATTA CCGGCTTGAG TCCTAACGAG TATAGGAGAA AGAATTTCTC TCCATTGGTC
TGA
 
Protein sequence
MLALSLSERM EDLQIKKKME KELFQRLLNS FSYATRMFSA ITDLEGNCIL SSEQGDCEFC 
QLVKSSPTGM ARCRSSYAWA GEQALKWKEP YIFKCHAGLI SWVCPFFYRG KHIGNFICGQ
VMMWQPAEFC HHWIRELASE IEQDPNILLQ SVDRVKSVSS VEIQAAADLV FIITSYVAKS
EGEIFDFQQK LRRVGSWIWT ENKKQKDVGS QTAGGNTEQD LSKIGNQIFM EIRRSDIDKA
KKLLEQLVLQ IFIQSKGQLE VIKGRSLELL SFLIRTSTEY GVKFGEVIHL SDLKLREIDE
ADTVEKAVLW LLAVGNAFIE LIAERNSSEG EGIIDRVVEY IQKNYSSESL SVKEIARASY
LSPAYLGQLF KKKMGYSLTE HINKVRIEQA KLLLRQTEQT IESVAIQTGF KERSYFCKVF
KKITGLSPNE YRRKNFSPLV