Gene Mext_0802 details


Gene Information

Locus tag: Mext_0802
Symbol:
ID: 5831478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 859673
End bp: 862870
Gene Length: 3198 bp
Protein Length: 1065 aa
Translation table: 11
GC content: 67%
IMG OID: 641366578
Product: PAS sensor protein
Protein accession: YP_001638278
Protein GI: 163850235
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
        [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID: [TIGR00229] PAS domain S-box
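
The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 859673
end_bp = 862870
gene_length_bp = 3198
protein_length_aa = 1065

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span matches the gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches the protein length
```

Under these assumptions all three checks agree with the listed values.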


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAGA GTGCCGGCTC CCCGAACCTC AACTTCCTCT CGGGCGGGGG ACGGATGGGC 
GAACGTATGC GCGCCCATGA CTGGTCGAAT TCAACGCTCG GGCGGCCGGA GACCTGGCCG
CAGTGCCTGC GCTCGGCCGT GAGCCTGATG CTCGGCTCCA AGTTCCCGAT GTTCGTGGCT
TGGGGGCCGG AGCTCGGATT CCTTTATAAC GAGCCTTACG CGGAAATCCT CGGCGACAAG
CATCCGGCCG CCCTCGGAAG CCGCTTCCAC GACATCTGGG CCGAGATCTG GGACGACATC
GCGCCGCTGA TCGACCGCGC CATGGCGGGC GAGGCGACAT GGGCCGAGAA CTTGCCGCTC
GTCATGAACC GACACGGCTA CGACGAGCGG ACCTGGTTCA CCTTCTCCTA CTCGCCCGTT
CGGGACGACG AAGGCGAGGT CGGCGGCATG TTCTGCGCCT GCACGGAAAC GACCGCCAAG
GTGAAAGCCG AGGAGGCCCT GCGCGCGAGC GAGGCCCGAG CCTCCGGCGT GCTGGAGGGC
ATGGGCGAGG GCTTCATGCT GCTCGACCGT GATTTCCGCA TCCTACAGAT GAACGCGGAA
GGCTTCCGGT TGGAGGAGCG GCCGCCCGGC GAGGTGCTCG GGAGCTCGCA TTGGGAGATC
TATCCGGGCT CGGAGCAGAT GCCCATCGGA CAGATGTATC TGCGAGCCAT GCGCGAGCGG
ATGCCCTTGA CGCTGGAGCA CCGTCACGTC TGGCCCGATG GGCACGCGGC TTGGCTGGAG
GTGCGAGCCT ACCCGCATCC GGAAGGGTTG GCGTTGTTCT ACCGCGATGT GACGGAACGC
CGGGAGCGCG AGAAGGCGCT GCGCGAGGCC GAGGCGCGAC TGCGCGCTCT GGCGGACAAC
CTGCCTGGCG GCATGGTCTA CCAAATCGCG ATGGACAGGG ATGGTTCGAA CCGGCGCTTC
CTCTACGTGT CGCGGGGCTT CGAGCGCATG ACCGGCATGT CGGCCGAGGC CGCGCTGCTC
GACCCGGCCG CGGCCTACGA CCTGATCCTG CCCGAGTACC GCGAGGGCTT GGCCGCAGCC
GAGCACGTCG CGATCCGGGA CCTCGCGCCG TTCGACTTCG AGACGCCCTT CCGTCGTAGC
GACGGGGAGG TGCGCTGGAG CCGCATCATC TCGGCGCCTC GGCGGCAGCC CGACGGTTCG
CTCGTCTGGG ACGGCATCCA GTTCGATGAT ACCGACCGCA AGCAGGTCGA GGAGCGCCTT
CGCGAGAGCG AGGCCAAGTT CCAGGCCATC GCCAACTCCA TCGACCAGAT GGTGTGGTCG
ACGGGTCCCG ACGGCCACCA CGATTACTTC AACCAACGCT GGTACGACTT CACCGGCGTG
CCGGCAGGCT CGACGGACGG CAAGGACTGG GAGGACATCG TCCACCCGGG TGACCGGGCG
CGCACGTGGA CGTCCTGGAG TGCGTGTCTT GCAACGGGCG AGCCCTACCG GATCGAGTAC
CGGCTGCGCC ACCGCACAGG GCAGTACCGC TGGACGCTCG GACGAGCCCT GCCAATGCGC
GACGAGGCGG GCCGGATCAC GCGCTGGTTC GGCACCTGCA CCGACATCCA GGACATCGTC
GAGGCGCGCG AGGTCCTGGC ACGCTCGCGC GAGGACCTTG AACGGTTGGT AGCGGAGCGT
ACGGCCGACC GTGACCGCAT GTGGCGGCTC TCGACGGACG TCATGCTGGT CGCGCGCTAC
GACGCCACCA TCGAGGCGGT GAACCCGGCC TGGACCACGC TGCTCGGGTG GGATGAGCGT
GAGCTCATCG GTAGCGCCTT CATGGACCTC GTCCATCCGG ACGACGTCGC CGCCACGCTT
GCGGAGGTCG GGAAGCTGTC GGGGGGGGTG ACCACCCTAC GGTTCGAGAA CCGCTATCGC
CAGAAGGACG GCAGCTACCG CTGGCTCTCG TGGACGGCCG TTCCCGCCGA GGATCTCATC
CACGCGGTCG GGCGTGACAT CACTGCCGAA AAGGAGGCCG CACAAGCCCT CGCTGAAACC
GAGGAGGCGC TGCGCCAAGC CCAGAAGATG GAAGCCGTGG GCCAACTGAC GGGCGGCATC
GCACACGATT TCAACAACCT CCTGACCGGC ATCGTCGGCT CCCTCGACAT GATGCAGACC
CGCGTCGCGC AGGGTCGAAC CGACACCATC GAGAAGTATG CCAAGGCGGC GATGTCCTCG
GCCAATCGCG CGGCCGCCCT GACGCACCGC CTGCTGGCCT TCGCTCGCCG GCAGCCCCTC
GACCCGAAGC CGGTCAACGC CAACACGCTC GTGACCTCGC TGGAGGATCT GCTGCGCCGG
ACAATCGGCG AGGCCGTCAG CTTGGAGATC GTCACCGCAG GCGGCCTGTG GCCGACGCTT
TGCGACCCGC ACCAGCTCGA AAGCGCTGTC CTGAACCTGG CCATCAACGC GCGCGACGCC
ATGCCAGACG GCGGCAAGCT CACCATCGAG ACCTGCAACA CGCACCTCGA CCGGGCCTAT
GCCAAGCTGC ACCCGGGGGT GGAGCCCGGC CAGTACATTT GCATCTGCGT GACCGATACC
GGCACCGGCA TGCCGCCGGA CGTCGTCGCG CGGGCCTTCG ACCCGTTCTT CACGACGAAA
CCCATCGGCC AGGGAACGGG GCTCGGCCTG TCGATGATCT ACGGCTTCGC GCGCCAGTCG
GAGGGGCATG CCAAGATCTA CTCGGAGGTC GGGCAGGGCA CGTCAGTGAA GATCTACCTG
CCGCGCCATC GCGGAGCTTT GACCGGGGAG GACACGCAGG CATCGGGCCT GACGCAAGCG
CACCGGGCCG AGACCGGCGA GACGGTGCTG GTCGTGGAGG ACGAGCCGGT GGTGCGCGAC
CTCATCGTCG AGGTGCTGCA CGATCTCGGC TACCGGGCCT TGGAGGCGCA GGACGGACCC
TCCGGCCTCG CCGTTCTGCA GTCGCACGAG CGCATCGACC TCTTGGTCAC CGACGTCGGC
CTACCGGGCT TGAACGGGCG CCAGCTCGCC GACCAGGCCC GCGAAGGGCG GCCCGAGCTC
AAGGTGCTGT TCATCACCGG CTATGCCGAG AACGCGATGT TCGGGAACGG CCACCTCGAA
CCAGGCATGC AGATGATGAC CAAGCCCTTC CCGGTTGAAG CCCTGGCGAC GCGCATCCGC
GAGATAATCC AGAGTTGA
 
Protein sequence
MTESAGSPNL NFLSGGGRMG ERMRAHDWSN STLGRPETWP QCLRSAVSLM LGSKFPMFVA 
WGPELGFLYN EPYAEILGDK HPAALGSRFH DIWAEIWDDI APLIDRAMAG EATWAENLPL
VMNRHGYDER TWFTFSYSPV RDDEGEVGGM FCACTETTAK VKAEEALRAS EARASGVLEG
MGEGFMLLDR DFRILQMNAE GFRLEERPPG EVLGSSHWEI YPGSEQMPIG QMYLRAMRER
MPLTLEHRHV WPDGHAAWLE VRAYPHPEGL ALFYRDVTER REREKALREA EARLRALADN
LPGGMVYQIA MDRDGSNRRF LYVSRGFERM TGMSAEAALL DPAAAYDLIL PEYREGLAAA
EHVAIRDLAP FDFETPFRRS DGEVRWSRII SAPRRQPDGS LVWDGIQFDD TDRKQVEERL
RESEAKFQAI ANSIDQMVWS TGPDGHHDYF NQRWYDFTGV PAGSTDGKDW EDIVHPGDRA
RTWTSWSACL ATGEPYRIEY RLRHRTGQYR WTLGRALPMR DEAGRITRWF GTCTDIQDIV
EAREVLARSR EDLERLVAER TADRDRMWRL STDVMLVARY DATIEAVNPA WTTLLGWDER
ELIGSAFMDL VHPDDVAATL AEVGKLSGGV TTLRFENRYR QKDGSYRWLS WTAVPAEDLI
HAVGRDITAE KEAAQALAET EEALRQAQKM EAVGQLTGGI AHDFNNLLTG IVGSLDMMQT
RVAQGRTDTI EKYAKAAMSS ANRAAALTHR LLAFARRQPL DPKPVNANTL VTSLEDLLRR
TIGEAVSLEI VTAGGLWPTL CDPHQLESAV LNLAINARDA MPDGGKLTIE TCNTHLDRAY
AKLHPGVEPG QYICICVTDT GTGMPPDVVA RAFDPFFTTK PIGQGTGLGL SMIYGFARQS
EGHAKIYSEV GQGTSVKIYL PRHRGALTGE DTQASGLTQA HRAETGETVL VVEDEPVVRD
LIVEVLHDLG YRALEAQDGP SGLAVLQSHE RIDLLVTDVG LPGLNGRQLA DQAREGRPEL
KVLFITGYAE NAMFGNGHLE PGMQMMTKPF PVEALATRIR EIIQS
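
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up plus a GC percentage calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, used as a small example.
first_line = "ATGACCGAGA GTGCCGGCTC CCCGAACCTC AACTTCCTCT CGGGCGGGGG ACGGATGGGC"
dna = clean_sequence(first_line)

print(len(dna))                   # number of bases after removing the grouping spaces
print(round(gc_percent(dna), 1))  # GC percentage of this one line only
```

Passing the full gene sequence block to the same helpers would allow a comparison against the listed gene length and GC content.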