Gene Moth_2114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2114 
Symbol 
ID3833265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2209107 
End bp2210930 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content58% 
IMG OID637830039 
Producthistidine kinase 
Protein accessionYP_430949 
Protein GI83590940 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3275] Putative regulator of cell autolysis
[COG5012] Predicted cobalamin binding protein 
TIGRFAM ID[TIGR00640] methylmalonyl-CoA mutase C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.156393 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCCT TTTCCCTGGA AAATCTAATC CAGGCTGTAA TCGACGGCAA CGCTGTCAAG 
GTCCGGGAAG AAGTAAAAAG GGCCTTGGCA GCGGGGATCG AACCGGCCCG GATCATAACC
GATGGCTTTG TGGCAGCCAT GGATGTAGTG GGGGAAAGGT TCGAACGCAA CGAGATTTAC
GTAACTGACT TGATTATTAC CGCCCGGGCC ATGCATACCG GCCTCAAGGA ACTTAAGCCC
CTGATGCTGG CCGGTAAGGT GCAACCGGTG GGGCGGGCCA TTGTCGGTAC TGTCCAGGGG
GACATCCATG ATATCGGCAA GAATCTCCTC GCCATCATGC TCGAGGCATC GGGCTTTGAG
GTTATCGACC TGGGCGTGAA CGTAGCCCCC AGCACCTTTG TGGAGGCGGT CATTAAGCAC
CGTCCCGATG TCCTCTGCCT CTCGGCCCTC CTTTCTTCTA CCCGCAAGGG TATGGAGGAA
ACTATTACTG CCCTGCGGGA GGCCGGCTGG CGGGATAAAG TAAAGGTTGT CGTAGGCGGC
ACCCCCCTGA ATGAAAAGAT TGCCGCCAGG ATGGGAGCTG ACGGCTACGC CCCCGATGCC
ACGGCGGCTA TACCTCTGAT TAAGAGCCTG ATCGGTGCCG ACCGCAAGCG CCGGGCCGTC
CTGGCTCCGG CTACCCTGGA CCTGTTTTTC GGGGAGGGTT CCCTGGAAGA CCTGCAGCGT
GCCTTTACCC GGATGACAGG CCTGCATCTG GTCATGGTGG ATGCCGCCGG CCGCCCATTG
ACTTCCCTGG GCGGTTTCCT GGAGTGTTCC CGCCATTGCC ACCTGCTTAA GGAAAACCCG
GCCAGAGCCC AAGATGTCAC CACCCTCCAG GGCAATTTTA AAGAAGCTTT TATTTATCGC
TGTCATGCCG GGTTGGTGGA AATTTCCTAC CCTCTGGCCA ATGAGGATGG GACGGTGGGG
GCCGTCCTCT GTGGCCACTG CCTCTTGAGA GGCGACCCTG ACCCGGCCGA TTTGAAGGCA
GCCGTCCCAG TCTTATCCCT AACGGACCTG GAAGCTGTTT GCGGTCTTCT CTCCTTTGTA
TCCGGCCAGA TCATGCAGCT CAACACCGTT TTACTGGTCA ATAAGGAACT GGAGGACCAG
CAGGCGAGCT TCATCCACTT CCTTAAGCGG CAGCACCAGC TGGAACAGGC CCTGAAGGAC
GCCGAACTCA AGGCCCTCCA ATCCCAGGTC AACCCCCACT TCCTCTTTAA TTCTTTAAAT
ACCGTAGCCC GCCTGGCCCT CCTGGAAGGG GCGGCCAATA CGGAAAAAAT GGTCCGCGCC
CTGGCCCGCC TCATGCGCTA CAGCCTCTAC CAGGTCAAGG GAACGGTTTC CCTGGCAGAA
GAAATAGCTG CCGTGCGCGA CTACCTTTTT ATCCAGGAAA CGCGGTTTTC AGACCGGGTC
CGGAGCCGGG TGAAAGTAGA AGAGGCCGCC ATGCAGGCCC GGCTGCCCTG CATGGTCCTG
CAACCCCTGG TGGAGAATGC TATTATCCAT GGCCTGGAAC CCAAGGAGGA GGGAGGGGAA
ATCACCGTAT CCGCCCGCCT GGTGGGCGAC CAGGTCCGGG TAGAAATCAA GGATGACGGG
GTCGGAATAC CGCCGGAGGT GAAAAAGGCG ATCTTTGACC TGGAAGTCCG GCGGAGCGGT
AAAGGCCAGG TAAGCGGCCT GGGGATAGTC AATGTCTACC GGCGCCTGCA GCACCATTTT
GGTAGCAACT GCGCCCTGGA TGTAGCCAGT ATGCCGGGAA AAGGTACTTG CGTCCAGCTG
ACTTTTCCTT ATACTGTGGA TTAG
 
Protein sequence
MASFSLENLI QAVIDGNAVK VREEVKRALA AGIEPARIIT DGFVAAMDVV GERFERNEIY 
VTDLIITARA MHTGLKELKP LMLAGKVQPV GRAIVGTVQG DIHDIGKNLL AIMLEASGFE
VIDLGVNVAP STFVEAVIKH RPDVLCLSAL LSSTRKGMEE TITALREAGW RDKVKVVVGG
TPLNEKIAAR MGADGYAPDA TAAIPLIKSL IGADRKRRAV LAPATLDLFF GEGSLEDLQR
AFTRMTGLHL VMVDAAGRPL TSLGGFLECS RHCHLLKENP ARAQDVTTLQ GNFKEAFIYR
CHAGLVEISY PLANEDGTVG AVLCGHCLLR GDPDPADLKA AVPVLSLTDL EAVCGLLSFV
SGQIMQLNTV LLVNKELEDQ QASFIHFLKR QHQLEQALKD AELKALQSQV NPHFLFNSLN
TVARLALLEG AANTEKMVRA LARLMRYSLY QVKGTVSLAE EIAAVRDYLF IQETRFSDRV
RSRVKVEEAA MQARLPCMVL QPLVENAIIH GLEPKEEGGE ITVSARLVGD QVRVEIKDDG
VGIPPEVKKA IFDLEVRRSG KGQVSGLGIV NVYRRLQHHF GSNCALDVAS MPGKGTCVQL
TFPYTVD