Gene Moth_0828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0828 
Symbol 
ID3831525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp858289 
End bp860103 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content62% 
IMG OID637828758 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_429688 
Protein GI83589679 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000197924 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.403449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGCA GGAGTATTAC CGCCAAGCTC TGGTTGCTCC TGGTTTCCCT GGTTGTCGTC 
AGCCTCCTGG TCATTGGCAT ATCCCTGCAC GGCCTTCTGG GGAATTACTA CTACCGGCAG
CAAGCCCAGA CCATGCTGGA TAAAGGGGAA CTGCTGGCCC GGAGCCTGGC GACCGGTAGC
GGCGGTGACC TGGCCGGCCA GGCCGACCTC CTGGGCCGGA TGGCCGGTAC CGGAGTGATG
ATCATCGACC GCCAGGGCCT GGTCCTTTCC TGCAGCAGCG AAGCCGGGCC GGGTGACGGG
TCCGGTATGG GGATGATGGG CCGGGGCCGC GGTTACGGCC GCATGATGCA CGGTAACTTT
CCAGTAACCG GTATGCACCT GGAGGGTGCC GAGGTCCAGC AGGTCCTGGC CGGGAATACC
GTCGTGAAAC GGGGTTACCA GCAGGCCTTT AATACCAGTA TGCTGACGGT GGCGGTCCCT
ATCAAGACCG GCAATGAAGT TAACGGGGCA GTTATCCTTT TTGCCCCGGA GGCTTCGTTA
AGCGCCGCCA TGGGCGCCAT GAGTCGACTG ATCCTTTATG CCGGCCTTGT GGCCGTACTC
CTGGCGACCA TCCTGGCCCT CTTTGCCGCC CGCAGGGTGA CCAGGCCTCT GAAAAGCTTA
AGCCTGGCGG CCCGGCAAAT GGCCAGGGGT GATTTCAGCG TCCGGGTGCC GGTAGCTTCA
GCGGACGAAC TGGGCCAGCT GGCCGGGAGC TTTAATTTTC TGGCCGGGGA ATTGTCCCGG
ACGGTGGCCG CCCTCTCCCG GGAGAAGGAA AAGCTGAATA GGGTGGTCCG GGATATGACT
GACGGGGTTC TCGCCTTTAC AGCCAGTGGC CGGGTCCTCT TTGCCAACCC TCGGGCGGAA
AAGCTCCTGG GTTTACCCTT GTCACCGGGG GCGGAACTGC CGGTCGAACT CCTGGACCCC
TTACGGGCGG CAGTGGCCGG GGAGGGAACT ACCGGCGAAA TAAACTGGCA GGAGCGGGTG
CTGGCAGTCC GGGCCGCTCC CCTGCAGGAA GAGGACCCCT GCGGGGAGGC AGCGGTGGCC
ATACTCCAGG ATATAACCAC CCAGAAAAAG ATGGAGCAGA TGCGCCGGGA GTTCCTGGCC
AGCGTCTCCC ATGAATTGCG CACGCCCCTG AGTTTTATCC AGGGTTACGG CGAAGCCCTG
GCCGACGGCC TGGCGACGGG TGAGAAGGAA CGCCAGGAGT ATACCGGCAT TATCCTGGCC
GAGGCCAACC GCCTGCGCCG CCTGGTAGAT GACCTTTTTG ACCTCAACAA GATGGCTGCC
GGGCACCTGC CCCTGGAACT GGCCGAGGTA GACCCCGGGG AACTGGTAAC TGGAGTGGCC
AGGAAGTACC AGCCCTTGCT GGCGGAACAC GGTTTAGTCC TGGAGGTGGA GCTCGAGCCC
TACCTGCCGC CGGTATGGGC TGACGCCGGG CGTCTGGAGC AGGTCCTGGT CAACCTCCTG
GATAACGCCC GGCGACATAC GTCTCCCGGG GGTCGGATTA CCATCAGCGC CGGCCTGGCC
GGTAGGGAGT TAAAGATAAG CGTAGCCGAC ACCGGCAAGG GCATCCCGGC AGGGGAACTG
CCCTATATCT GGGAGCGATT TTACAAGGTG GACAAATCCC GATCCCGGGG CGATAGTGGC
AGCGGCCTGG GCCTGGCCAT CGTTAAGGGC CTGGTAGAAG CCCATGGCGG CCGGGTCGAA
GTAGTAAGTG AACCGGGTCG GGGCAGCATC TTTAGCTTTT ATCTGCCGTT GCATATTGAC
AGCGAAAATG GATAA
 
Protein sequence
MISRSITAKL WLLLVSLVVV SLLVIGISLH GLLGNYYYRQ QAQTMLDKGE LLARSLATGS 
GGDLAGQADL LGRMAGTGVM IIDRQGLVLS CSSEAGPGDG SGMGMMGRGR GYGRMMHGNF
PVTGMHLEGA EVQQVLAGNT VVKRGYQQAF NTSMLTVAVP IKTGNEVNGA VILFAPEASL
SAAMGAMSRL ILYAGLVAVL LATILALFAA RRVTRPLKSL SLAARQMARG DFSVRVPVAS
ADELGQLAGS FNFLAGELSR TVAALSREKE KLNRVVRDMT DGVLAFTASG RVLFANPRAE
KLLGLPLSPG AELPVELLDP LRAAVAGEGT TGEINWQERV LAVRAAPLQE EDPCGEAAVA
ILQDITTQKK MEQMRREFLA SVSHELRTPL SFIQGYGEAL ADGLATGEKE RQEYTGIILA
EANRLRRLVD DLFDLNKMAA GHLPLELAEV DPGELVTGVA RKYQPLLAEH GLVLEVELEP
YLPPVWADAG RLEQVLVNLL DNARRHTSPG GRITISAGLA GRELKISVAD TGKGIPAGEL
PYIWERFYKV DKSRSRGDSG SGLGLAIVKG LVEAHGGRVE VVSEPGRGSI FSFYLPLHID
SENG