Gene Moth_1477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1477 
Symbol 
ID3832358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1524852 
End bp1526231 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content60% 
IMG OID637829410 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_430330 
Protein GI83590321 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR01386] heavy metal sensor kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000762888 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGGCAGA AATTTTCCAC CTGGCTCCGC GGTATTCCCC TGCGGTGGCG GCTGACGGCC 
TGGTATGTTT TCTTGCTGGC CTTGATCCTG GCCGGCTTCA GCGCTTTTAT TTACTTTAAC
ATGTCCCGGA GCCTGAAACA GGGCCTGGAT TCTTTATTAT TCTCCCAGGG GGAACAGGTT
TTAAGCAGCC TGGACAACGA GAACGGCCTG CCACGCCTGG ACCCCAATTT GCCCCTCCTG
CCCGGTACTT ACTTTGGCCT TTACGACACC GGAGGTAAAG TCCTGGATAC AAACATGCCG
GCGGACCTGG CTACCGGCTT CCAGGTGAAG GGCTTAACGG CCAGCCGGCC CGCAACAGTG
GAGATTAAGG GGGCCGAATG GCGGGTGCTC CTGGTCCCGG TAAGGGAAAA AGGCCAGCAA
CCTTACTGGG TCCTGGTTGT ACGCTCGGTC GAAGAGACCG AAAAGCCCCT GGACCGCCTG
CTGTTATTTA TCCTTATTGC CATCCCCATG ACCCTGCTGG TGGCGGCGGG AGGGGGTATT
TTCCTGGCCC GGCGGGCGTT GCAGCCTATT GATAGAATTG CCGCCAAAGC CCGCCAGATC
AGCGCTACTG ACCTGAGCCG GCGCCTGGAC CTGCCCCACG GTAACGACGA GGTGGGGCAC
CTGGTGGCCA CCCTGGACGA GATGCTGGAT CGCCTGGACC GGGCCTTTCA GCGCCAGCGG
CAGTTTACTG CCGACGCCTC CCACGAATTT CGCACCCCCC TGGCCGTCAT CCGCAGCCAG
GCCGAAGCGG CCCTACAGCG GCAGCACTCG CCGGCAGAGT ACCGCCAGGC CCTGGAAATA
ATCCGTGATC AGGCGGAGTG GATGGGTAAC CTGGTCGCCA AGTTATTGCT TTTGGCCCGG
AGCGACGACA GGATGGAACA GATGGAGATG GAACCCCTGG ATTTGGGCGA ACTGGTGGAA
GGCGTCACGG CCGAATTCCA GGGGATGGCG GCGGAAAAGG GCCTGAGGCT GGTAAAAAAA
ATTAAGGAAA AAGTGGTCGT TCGCGGGGAT CAGACACGCT TGACCCAGCT CCTGGCCAAT
CTGGTGGATA ATGCCATCAA ATATACGCCG GAAGGGGAGG TGGTCGTCAG CCTGGAACGG
CGCGGCCGGC AGGCCCTGCT GCAGGTGCAG GATACGGGAG TAGGTATCCC GGAGGAACAT
CTGGCCCATA TCTTTGAGCG ATTCTACCGG GTCGATAAAG CCCGTTCCCG GGCGGAAGGG
GGCTTTGGCC TGGGACTGGC TATCTGCGAC TGGATCGTCC GCGCCCATAA CGGCAAAATT
GAGGTAGAAA GTGCGGTGGG GCGGGGAACA ACCTTTAAAG TATGGTTGCC GGTTGAATGA
 
Protein sequence
MRQKFSTWLR GIPLRWRLTA WYVFLLALIL AGFSAFIYFN MSRSLKQGLD SLLFSQGEQV 
LSSLDNENGL PRLDPNLPLL PGTYFGLYDT GGKVLDTNMP ADLATGFQVK GLTASRPATV
EIKGAEWRVL LVPVREKGQQ PYWVLVVRSV EETEKPLDRL LLFILIAIPM TLLVAAGGGI
FLARRALQPI DRIAAKARQI SATDLSRRLD LPHGNDEVGH LVATLDEMLD RLDRAFQRQR
QFTADASHEF RTPLAVIRSQ AEAALQRQHS PAEYRQALEI IRDQAEWMGN LVAKLLLLAR
SDDRMEQMEM EPLDLGELVE GVTAEFQGMA AEKGLRLVKK IKEKVVVRGD QTRLTQLLAN
LVDNAIKYTP EGEVVVSLER RGRQALLQVQ DTGVGIPEEH LAHIFERFYR VDKARSRAEG
GFGLGLAICD WIVRAHNGKI EVESAVGRGT TFKVWLPVE