Gene Moth_0742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0742 
Symbol 
ID3831134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp777335 
End bp779392 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content60% 
IMG OID637828673 
ProductCheA signal transduction histidine kinases 
Protein accessionYP_429603 
Protein GI83589594 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACC TGGATATGTC CCAGTACCTG GGCATCTTCC TGGATGAGGC CGAGGAACAG 
CTCCAGCAAC TGGACGAGGC CGTTGTCCAG CTGGAGCAGA CACCGGACGA TCAGGAACTG
TTAAATACCA TCTTCCGGGC GGCCCATACC CTGAAGGGCT CCTCGGCCTC CATGGGTTTT
AATCGCCTGG CTACCCTTAC CCACCGCATG GAAAGCGTCC TGGATAGCTT GCGCCAGGGG
AAGCTCGCCG TTTCCCGGGA GATTATCGAC ATCCTCCTGG CCAGTGTTGA TACATTGCGG
GCTTTAAAGG ACAGCATCGC CGCCGGCAAG GGTGAAGAAG GCGACGTGAA CGAGGTAGTT
GCTCGCCTGG AAGCCGTCCT GGCCGGCCCG GCCGCCCCGG TGGCAAACAA AGCTGTTGCT
AATGCAGAAT TGACCCTTGA CGACATTGAG CAGAACGTCA TCCGGGCAGC GGAAGTGAAG
GGCTTTCGCG CCTACGAGAT CCGGGTGCAG CTGGAAGCCG GCTGCCAGAT GAAATCGGCC
CGGGCCTACC TGGTTTTTAA CAACCTGAAG GAGCTGGGGG AAATCATCAA GAGCGTGCCC
CATACCCAGG ATCTGGAGGC GGAGAAGTTT GACGACACCT TCACCCTGGC CTTTGTCAGC
CGGGAGGACG CTGATACCCT GGCCAACGTG GTCAAGTCGG TGTCGGAGAT CAAAGACGTC
CTGGTGCGGC CCATTGTCCT GGAGGAGGAC CAGCCTGCGG CGGCAAGGGC AGGGACGGCC
CCCGGCAAGC CTGGGGTGGA AGCTGCCGGC GGGAAGAATG GTACAACTGC TCCTGCAGGC
GAGCATCACG TCAACCAGAC GGTGCGGGTG GATGTCCAGC GCCTGGAGAA CCTCATGAAC
CTGGTGGGAG AACTGGTCAT CGACCGCACT CGCCTGACCG AGGTGGGCAA CGGTCTCAAG
AACCGCCTGG GCAACGAGGA GCTCCTGGAG ACATTAGAGG AGGTCTCTTT GCACATCGGT
CGCATTACTT CCGACCTCCA GGAGGAGATC ATGAAGGCGC GCATGTTCCC CATCGACCAG
GTCTTCAACC GCTTCCCCCG CATGGTCCGG GATCTGGCCC GCAAGGCCGG CAAGGAGATC
GATTTTATCA TCGAAGGCCG GGAAACGGAA CTGGACCGCA CGGTCATCGA GGAGATCGGC
GATCCCCTTA TCCACCTCTT ACGTAACGCC ATCGACCACG GTATCGAAGA ACCGGAGGTC
CGGCTTCGCC AGGGTAAACC CCGCCATGGC ACGGTACGCC TGAAGGCTTT TCACCAGGAG
AACCAGATCG TCATTACCGT GGAAGACGAC GGTGCCGGTA TGGATGCAGA GAAGATCAAG
GCTAAAGCCA TCGCCAGGGG TCTTATCAGC CCCGAGTCGG CGGCGCGGCT GAGCCGGCGG
GAGGCCCTGG ACCTCATTTT CCTGCCGGGT CTCTCGACCT CCGATAAAGT GACTGACGTC
TCCGGCCGGG GTGTGGGTAT GGATATCGTT CGCAACCATA TTGAAAAGAT CAACGGCACC
ATCGACATCC GCACCACCCC CGGTAAAGGA ACCTGCTTTA CCATCAAGCT GCCCCTTACC
CTGGCCATCA ACCGTTCCCT CCTGGTTCAC GTGGATGGCC GGGTCTATGC CTTCCCCCTG
GCCAACGTGG TCGAGATCAT CGATGTTACT CCCGACAGCA TTCAGCACGT CCATCGTCAG
CAGGTGGTCG TGGTCCGCGG CCGGGTGCTG CCCCTCATCT ACCTCGGCCA GGCCCTGGGG
CTGGGAACAC CGATCCCGGC GGCGGATACT TATGCCGTCG TCATCGTCGG TCTGGCCGAG
AAACAGGTCG GCTTTATCGT CGATGACCTC ATCGGCGAAC AGGAGATCGT TATTAAATCC
CTGGGGAACT TTATAGGTAA GATACCGGGG ATCGCCGGCG CCACCATCAT GGGCGATGGC
AGCGTAGCCC TTATTCTCGA CGTGCGCAGC CTGATGAACT TCGTGGGGGA GGAGGCGGAC
CGTGAACTGG CCAGTTAG
 
Protein sequence
MSDLDMSQYL GIFLDEAEEQ LQQLDEAVVQ LEQTPDDQEL LNTIFRAAHT LKGSSASMGF 
NRLATLTHRM ESVLDSLRQG KLAVSREIID ILLASVDTLR ALKDSIAAGK GEEGDVNEVV
ARLEAVLAGP AAPVANKAVA NAELTLDDIE QNVIRAAEVK GFRAYEIRVQ LEAGCQMKSA
RAYLVFNNLK ELGEIIKSVP HTQDLEAEKF DDTFTLAFVS REDADTLANV VKSVSEIKDV
LVRPIVLEED QPAAARAGTA PGKPGVEAAG GKNGTTAPAG EHHVNQTVRV DVQRLENLMN
LVGELVIDRT RLTEVGNGLK NRLGNEELLE TLEEVSLHIG RITSDLQEEI MKARMFPIDQ
VFNRFPRMVR DLARKAGKEI DFIIEGRETE LDRTVIEEIG DPLIHLLRNA IDHGIEEPEV
RLRQGKPRHG TVRLKAFHQE NQIVITVEDD GAGMDAEKIK AKAIARGLIS PESAARLSRR
EALDLIFLPG LSTSDKVTDV SGRGVGMDIV RNHIEKINGT IDIRTTPGKG TCFTIKLPLT
LAINRSLLVH VDGRVYAFPL ANVVEIIDVT PDSIQHVHRQ QVVVVRGRVL PLIYLGQALG
LGTPIPAADT YAVVIVGLAE KQVGFIVDDL IGEQEIVIKS LGNFIGKIPG IAGATIMGDG
SVALILDVRS LMNFVGEEAD RELAS