Gene Moth_2494 details

Gene Information

Locus tag: Moth_2494
Symbol:
ID: 3831597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2599308
End bp: 2600777
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 55%
IMG OID: 637830416
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_431319
Protein GI: 83591310
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
            [TIGR02966] phosphate regulon sensor kinase PhoR
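
The start and end coordinates, gene length, and protein length listed in this record are mutually consistent, which the short sketch below checks. The variable names are illustrative only. It assumes that the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2599308
end_bp = 2600777

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1470 489
```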


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.228009
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTCAC TACGCTGGAA GATTACCCTA AACTTTTTGA CCCTCCTTTT CTTCACTCTG 
TTGGGGGCCT ATCTCTACTT ACACCAGGCT ATCTTGAAGG CCATGGGATT ACCATGGTTA
CCCCCCTTCC GGGCCGGGTT CCTGGCTGCC AGGTTAGAGG GACAACTGCT GGCCGTCATG
ATCCTAGTTT TGATTATAAT GGGCATTGGC ACCTTTATCC TGGCCCGGGG GATTATAACC
CCCCTGACGG CCCTCCTGCC CCTGACCCGC AGGATTGCCG CCGGTGACCT GGAACAGCGG
GTAGAGATCC AGAGTGACGA TGAGGTGGGT TTATTAAGCC ATCATCTAAA TATCATGGTG
GAAACTCTAC GCAATAATTT CCGGGAAATA GCAGACGAGC GCAATAAAAT GAAGGCTATC
CTGGCCAGTA TAACCGACGG CCTGGTAGCT GTTGACCAGG TGGGCCGGGT TATAATGCTC
AATCCGGCGG CAGAGAAGAT GTTCGGTAAA AAGGGGGCAG AGGTCGAGCA CAAGTATCTC
CTCAAGGTTG TCCGTAACCA TGAAATCGAT GCCATGGTAA AGGAGATCCT GGCCAGTGGC
CTGCCCCTGG AGAATGAGGT CCGGCTCTTC CCGACTACCA GTCAGTTATT CAGAATCTAT
GGTACGCCCA TCACCAGCGA ACAGGGACGA ATAATCGGGG CCGTGCTCAC CATCCGGGAT
ATTACCGACA TCCGCCGCCT GGAGCAGATG CGGACGGAGT TTGTGGCCAA CGTCTCCCAT
GAATTACGTA CCCCCCTGAC CTCAATCCGG GGCTTTGTTG AGACTCTGCT GGAGGGGGCC
CTTGAAGACC CGGAGGTCAG CCGGCGCTTC CTGGGAATTA TCAACCATGA AGCCCAGCGA
TTGCAGCAAT TAATCGAAGA CCTTCTCTCC CTGTCACGAC TGGAGAGCCA ACCAAAGCGA
CAGGATGCTG GGCGTGCGGA CCTGGCGGCC ACCTTGGACC GGGTCCTCAC TACTGTTAAC
CAGTTAGCAA GGGAGAAAGG AGTCGCCCTG GAGAAGGAGA TACCGGCGGA GATACCGGAG
TTGGCCATCA GTGAGAGCTA TCTGAACCAA GTGCTCCTTA ATCTGATTGA TAATGGCATT
AAGTATACCC CCGCCGGTGG CAGGGTAACT ATACGTGCTG CCCGGTTAGG GGAATTAGTT
CAGGTAGAGG TGGCAGATAC CGGCATAGGC ATCCCCCCCG AGAGCCTTCC CCGCGTATTT
GAACGATTCT ACCGGGTAGA TAAGGCGCGT TCCCGGGAGA TGGGAGGCAC CGGTCTGGGC
CTGGCTATCG TCAAGCATAT AGTCGAGTCC CATGGTGGCA GTATCAGTGT GACCAGCAGG
CCGGGCCAGG GCAGCCATTT CTTCTTTACC CTCCCCATTG CCGCTGAGGA AGGGGGGCGA
AGCAATTACC AGGAAGAACC GGGCACCTGA
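
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence in this 10-base block layout. The sketch below is one minimal way to do it. The function name `gc_fraction` is hypothetical, and the example runs on the first line of the gene sequence only, so its result is not expected to equal the whole-gene value.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and newlines."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above, used purely as sample input.
first_line = "ATGCATTCAC TACGCTGGAA GATTACCCTA AACTTTTTGA CCCTCCTTTT CTTCACTCTG"
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full gene sequence, with its line breaks and spaces left in place, gives the GC fraction for the whole gene.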
 
Protein sequence
MHSLRWKITL NFLTLLFFTL LGAYLYLHQA ILKAMGLPWL PPFRAGFLAA RLEGQLLAVM 
ILVLIIMGIG TFILARGIIT PLTALLPLTR RIAAGDLEQR VEIQSDDEVG LLSHHLNIMV
ETLRNNFREI ADERNKMKAI LASITDGLVA VDQVGRVIML NPAAEKMFGK KGAEVEHKYL
LKVVRNHEID AMVKEILASG LPLENEVRLF PTTSQLFRIY GTPITSEQGR IIGAVLTIRD
ITDIRRLEQM RTEFVANVSH ELRTPLTSIR GFVETLLEGA LEDPEVSRRF LGIINHEAQR
LQQLIEDLLS LSRLESQPKR QDAGRADLAA TLDRVLTTVN QLAREKGVAL EKEIPAEIPE
LAISESYLNQ VLLNLIDNGI KYTPAGGRVT IRAARLGELV QVEVADTGIG IPPESLPRVF
ERFYRVDKAR SREMGGTGLG LAIVKHIVES HGGSISVTSR PGQGSHFFFT LPIAAEEGGR
SNYQEEPGT
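
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal illustration, not the database's own pipeline. The names `CODON_TABLE` and `translate` are hypothetical. It relies on translation table 11 sharing the standard amino acid assignments, and it does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid
# assignments (also used by translation table 11); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, used as sample input.
first_line = "ATGCATTCAC TACGCTGGAA GATTACCCTA AACTTTTTGA CCCTCCTTTT CTTCACTCTG"
print(translate(first_line))  # prints: MHSLRWKITLNFLTLLFFTL
```

The output matches the first 20 residues of the protein sequence listed above.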