Gene Mext_2117 details

Gene Information

Locus tag: Mext_2117
Symbol:
ID: 5833189
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2375020
End bp: 2376915
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 70%
IMG OID: 641367914
Product: histidine kinase
Protein accession: YP_001639583
Protein GI: 163851540
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
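
The length fields above are consistent with the coordinates if the start and end positions are read as 1-based and inclusive, and if the final codon of the CDS is a stop codon. Both readings are assumptions about this record's conventions, not something the page states. A minimal Python sketch of that arithmetic, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(2375020, 2376915)
print(length, protein_length_aa(length))  # prints: 1896 631
```

The result agrees with the listed gene length of 1896 bp and protein length of 631 aa.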


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0208634
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTCAC CGCTCGCGCC GCTGCATCAA GGGCGGCGTG AGCGCATGCG TTACGCTGAT 
TCGGTCGGCG GATGCATGCG GGCGACCTCG AAGACCGCGG AAGGCCTTCA CGCGATGAAC
GACGACGACT GGCTCGAACT CATCCCCGAC GAGGCTGCGG AAGCGCCGAC CCGAGCCGGA
ACCTGGACCA TCGCGGTGGT CGACGACGAT CCGGCCGTCC ACGAGGGCAC CCGCTACGCG
CTCGCCGGCT ACAGCCTCGA CGGGCGGGGC CTCGACATCC TCTCCGCCTA TTCGGCGCAG
GAGGCGCGGG CGCTGCTCGC CGAGCGGCGC GACATCGCCA TCGTGCTGCT CGACGTCGTG
ATGGAGACCG ACGATGCCGG ACTGAGGCTC GTCGATTACA TCCGCCGGGA GCTGAAGGAG
GAGACCGTCC GCATCATCCT GCGCACCGGC CAGCCGGGGC AGGCGCCGGA GCGGCGCGTC
ATCGTCGATT ACGACATCAA CGACTACAAG GCCAAGACCG AACTCACCGC CGACAAGCTC
TTCACCAGCC TCACCGCGGC GCTCCGGGCC TACCAGCAGC TGCGGCGGCT CGACGAGACC
CGGCGTGGCC TGGAAATCAT CATCGACGCC GCGCCGATGC TGCTCGACCA CAAGTCGATG
CAGCGGCTCG CCGAAGGGGT GCTGACCCAG GTCGCCTCGC TCCTCAATGT CGATTGCGCC
GGCATCCTGG TTCTGCGCGA GAGCGAGGAT GCCGCCGACC GGCGGGAGGG CGGATTCTGC
GTGTTGGCGG GCTCGGGGCT CTATGGCGGC TATGTCGGGC GCGACCCCGG CTGGCCGCTC
GATCCCGGCA TCCAGCCCCT GGTCGAGCAA GCCTTCGCCG CGCGCTGCCA CAACTTCGGC
GACAACCTCT CCACGCTCTA TGTCCAGACT GCGAGCGGCA GCGAGATCGT CGCGCTGATC
GACACCGACC GCCCGCTCTC CGACACCGAC CGCGCGCTCA TCACCCTTCT GGCGGGGCGC
CTCTCGGTCG CCTTCGACAA CGTGATTCTC TACGAGCGGC TGCAGCGCGC CAACGTCACC
TTGGAGCAGC GGGTGGTCGA GCGCACCGCC GAGCTGATCC GGGCCAACCG CCGCCTCGAC
ATGCAGCGCT CGGACCTGCG CCGGGCCAAC AGCCTCAAGA CCGAGATTCT GGGCACCATC
GCCCACGACC TGAAGAACCC GCTCTCGGTG ATCCTGGGTC GCTCCGAGAT GCTGGCCGAC
CTGATCGGCC TCGATCAGGG GGATGCCTCC GGCGCGGAGA AGGCGCAGGC GGCGATGCTG
ACCCAGGTCG AGCACATCCG CGCCTCGGCC ACGCGTCTGA TCGACATGAT CGACAGCCTC
ATGGCCGATG CCATGAACGA CGCCCTCGAC ATCACGCTGC GGCGGGAGCC CGTCGATCTC
GCCGGCCTCG CCCGCGAGGT CTGCGAGGCC AACCGGCCGC TCGCCGAATC GAAGGGCCAG
AGCCTGGTGA TGGATCTCGC CGGCCCGCTC ACTCTTTGCG GCGACGCCGA GCGGCTGCGC
GAGGCGCTCG ACAACCTCGT CTCGAACGCG ATCAAGTATT CCTATCCCGG CGGCGCGATC
GCGGTGAGCG TGCGGGAGGA GGGAGGCGAC CTCGTCTGCG CGGTGGCCGA CCAGGGCCCC
GGCCTGTCGC CGGAGGATGC CGGCCGGCTT TTCGGCCGCT ACCAGCGCCT CTCGGCCAAG
CCCACCGGCG GCGAGGGCTC GACGGGGCTC GGCCTCTCCA TCGTCAAGCG CATCGCCGAA
CTCCACGGCG GCCGCGCCGA GGCGTTTTCC GACGGACCGG GGCAGGGGGC CGAGTTCGCG
ATGCGGTTTC CAAGGGAGGC GGTGGGGGTG CCTTAG
 
Protein sequence
MASPLAPLHQ GRRERMRYAD SVGGCMRATS KTAEGLHAMN DDDWLELIPD EAAEAPTRAG 
TWTIAVVDDD PAVHEGTRYA LAGYSLDGRG LDILSAYSAQ EARALLAERR DIAIVLLDVV
METDDAGLRL VDYIRRELKE ETVRIILRTG QPGQAPERRV IVDYDINDYK AKTELTADKL
FTSLTAALRA YQQLRRLDET RRGLEIIIDA APMLLDHKSM QRLAEGVLTQ VASLLNVDCA
GILVLRESED AADRREGGFC VLAGSGLYGG YVGRDPGWPL DPGIQPLVEQ AFAARCHNFG
DNLSTLYVQT ASGSEIVALI DTDRPLSDTD RALITLLAGR LSVAFDNVIL YERLQRANVT
LEQRVVERTA ELIRANRRLD MQRSDLRRAN SLKTEILGTI AHDLKNPLSV ILGRSEMLAD
LIGLDQGDAS GAEKAQAAML TQVEHIRASA TRLIDMIDSL MADAMNDALD ITLRREPVDL
AGLAREVCEA NRPLAESKGQ SLVMDLAGPL TLCGDAERLR EALDNLVSNA IKYSYPGGAI
AVSVREEGGD LVCAVADQGP GLSPEDAGRL FGRYQRLSAK PTGGEGSTGL GLSIVKRIAE
LHGGRAEAFS DGPGQGAEFA MRFPREAVGV P
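
The sequences above are printed in whitespace-separated blocks of ten characters, so they need to be joined before any analysis. The following Python sketch shows one way to do that and to run two simple checks on the gene sequence: the GC percentage and whether the string has the shape of a complete CDS. The function names are hypothetical, the example input is a toy string rather than this gene, and the start-codon check is deliberately simplified.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """True if the length is a multiple of 3, the first codon is ATG,
    and the last codon is a stop codon (TAA, TAG, or TGA).

    Simplification: translation table 11 also allows alternative start
    codons such as GTG and TTG; this sketch only accepts ATG.
    """
    stops = {"TAA", "TAG", "TGA"}
    return len(dna) % 3 == 0 and dna.startswith("ATG") and dna[-3:] in stops


# Toy input in the same blocked layout (not the gene listed above).
toy = clean_sequence("ATGGCG TCACCG\nCTCTAG")
print(len(toy), looks_like_complete_cds(toy), round(gc_percent(toy), 1))
```

Pasting the full gene sequence listing into `clean_sequence` would allow its length and GC percentage to be compared with the Gene Length and GC content fields of the record.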