Gene Mext_4542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4542 
Symbol 
ID5832113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5073567 
End bp5076857 
Gene Length3291 bp 
Protein Length1096 aa 
Translation table11 
GC content68% 
IMG OID641370336 
ProductPAS sensor protein 
Protein accessionYP_001641981 
Protein GI163853938 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.220973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0899725 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCGC GTATTCGTGC CTACGATTGG GCACGGACAC CTCTTGGCGC CATTGGTGAT 
TGGCCGCAGA GTTTGAGGAC GATGGTCGAG TTGATGCTCG GCTCTCCTTT GCCGGCCGCG
ATCGCCTGGG GTCCTGAACT TACGGTAATC TATAACGACG GTTTCGACGC GATCGACAAC
TTCAGCGATC CGCCGCCCCT TGGCCGCCCG TTCACGAAAG TCTGGCCAAA GCCGGAAAGC
GACGGGATCG TCGACTCGAT ACGACAGGGG CAGGCCCGTC AGATCATTGA TCGCTACTGG
GATCTTCCCG CGCGGCCGGA GCGACCCTTC GGCTGGTTCA CGTCCCAGTG GACGCCGTTG
CGGGACGAAG CGGGAGGCTT CGCCGGGTTC TATCTCGCCG CCTTCGAGAC GACCGATCGC
GTCCTCGTCG AGCGGGCCTT GCTCGAGCGC GAGGAGCAGC AGGCCTTCGT GCTCGGCCTC
AGCGACGCGC TGCGCCCCCT GGCCGATCCG CTCGAAGTCC AGGCCGTGGC CTGCCGCTTG
CTCGGCGAGC ACCTTCAGGC GGACTACACG TACTACCTCA ACCTGTACGA GGCGGAGGGT
TTCGCGATCA TCGCACAGGA TTTCGCGCGC GCGGGTCTCC CCTCGCGAGC CGGGAAATAT
CCCCTCAGCC TCGTCGGTTG GGCGATGCCC CATCAGGGCC AGGGCCAGCC GATCGCGATC
ACGGACGTGC AGACATCCCC CCTGGTCCCG GACGACTGCC GGGAGAGGGT GCTGGCCGAC
CGTGTCGTGA GCCTGCTCGG CATTCCGCTG GCGAAGCAGG GCCGCCTCGT CGGCGCACTT
GCGCTCTCCA TGACGACGCC GCGAGAATGG ACCCCTGCGG AGATCGCGCT GGCCACCGAG
ATCGGCGAGC GCACCTGGGC GGCGATGGAG CGGGCCCGCG CGGAGAAGAC GGCGCGCGAG
GCCGACATTC GCCTGCGCAC CCTGGCCGAT GCCGCGCCCG TCCTGATCTG GGACGCGGAT
TCGAGCGGCA CGATCCTCGT CAACGACCAC TATCTCGACT TCTTCGGTGT CGGCTTCGAG
GCCGTGGCGG GGTATGGCTG GCAGAAGTTC CTGCATCCCG AGGATGCCGA GCGACACCTT
TCTCTCTTCC AAGAGGCGTT CGCGCAGCGC CATTGCTTCA CCGACGAGGC CCGGCTCCGC
CGTGCCGACG GGCAGTACCG CTGGCTCAGC ACGTCCGGCC GGCCGCTGGA GGACGGGCGC
TTCGTCGGCG TCTCGATCGA CGTCACCGAG CGGCGCCTCG CCGAGCAGCG CGTCAGGCGC
AACAATGCCG TCCTCCAAGC GATCAACCTC GTCTTCAGCG AGACGCTCGG CGCCTCCTCG
GAGGAGGAAT TGGCGCGCAT CTCCCTCGAA GTCGCCGAAG AGCTGACCGG CAGCGCCATC
GGCTTCATCG GTGAGATCAA TGAGACCACC GGGCGCCTCG ACGGCCTGTT CTTCAGCGAC
CGGAGCCGGG CCCGCTACGC CGCGCACGCA TCGGAGGGGG ATGACAGCTT CCCGATGGGC
AAGTCGGCGC TGGGGCTTGC GATTCACGGC ATCTACGGGC GGGTGTTGCA GGATGGCGCG
GGCGTCATCG TCAACGATCC CGCCTCCCAC CCCGACCGCA TCGGCACGCC GGCCGGCCAC
CTGCCGCTGA CGGCCTTCCT CGGCGTGCCG CTGAAGCAGG GTGACAGGAC GCGGGGTCTG
ATCGGGCTCG GCAACCGCCC GGGCGGCTAC CGGCCGGAGG ACCTTGAGGC GGCGGAGGCG
CTGGCGCCCG CGATCTGGCA CGCCTTGCGC AGCAAGCGGG CGGAGCTGCG CCTGCGCGAG
AGCGAGGAGC GTTTCCGGCA ATTCGCCGAG GCTTCCTCCG ACGTCCTCTG GATCCGCGAT
GCGGAGACGT TCGAGATGGA GTTCGTCAGC CCGGCGCTGC GGACGGTCTA CGGCATCGAG
CCCAACGTGC TCGGCCCCGA GATCCGGCGG TGGGCCGGCC ACATGCTGCC GGAGGATCGC
GAGAACGCCC TGCAGAATCT GCAGCGGGCG GCGACCGGCC AATCGCTGCT GAACGAGTTC
CGCATCAAGC GGCCGAGCGA CGGCGCCTTC CGCTGGATCC GCAGCACGCT GTTTCCCCTG
CGCGACGAGC AGGGCCGGGT GCGGCGCATC GGCGGTCTGT CCTCCGACAT GACCGAGGCC
AAGCTGCTGA TCGGGCATCA GGCCGTGCTG CTCGCCGAAT TGCAGCACAG GGTGCGCAAC
ATCATGGCGG TGACCCGCTC GATCGTCGCC CGCACGGGCG AGCGGGCGGA GACCGTGTCC
GATTACGCGT CCCTGGTGGG AGGGCGCCTC CTGACGCTGG CCCGCGTCCA GGCCCTGCTG
ACCCGCTCGC CCAATGCCGG CGTGCCGGTC GCCACCATCG TCCGCGACGA GATCGACGCC
CAGGACCTGC GCGCGGACCA ATACGATCTG TCCGGACCCA AAATCGAACT CTCGCCCAAG
GCGGCGGAGA TCCTGACGCT CGCCGTTCAC GAACTGGCGA CCAATGCCCT GAAATATGGA
GCCTTGTCGG TGCCGGATGG GCGCGTGCGG GTGAATTGGT CCTCGTTCGA GAAGAGAGGC
GAGCCCTGGC TCGGCTTCGA TTGGGCAGAG GATGGTGTGC CGGAAGCGAA GATTGCGGGC
GAGCGTGAGG CGAATCGTCG CACCGGCGCG CGCCGTGGTT TCGGCCGCGA ACTGATCGAG
GTGCGCCTGC CTTACGAACT CGGGGGGCGC GGTCGCCTGG AGATCGGTCC GGAGGGGGCG
CGGTGCCGCC TCGAATTTCC GCTCCGTGAC GGCGCCAGCA TTCTTGAAAC CGATGCCCCG
CAACGAGCGA CCGTGTTCGG AGGAGCGCTC GACATGACCG GCGAACCAGA TTTGAGCGGC
TATCGTATCC TCGTGGTGGA GGACGATTAC TATCTCGCCA CCGATACGGC ACGCGCGCTC
CAGGGAGCGG GCGCGGAGGT GGTTGGCCCC TGCTCCAGCG AGGAGGCGGC GCGCGAGGCG
CTCGATGAAG GGGCGCTGGC AGCGGCGCTG GTGGATATCA ATCTCGGATC AGGGCCGTCC
TTCACCCTGG CGGCCCTGCT TCGGGAGCGC GGCGTGCCGT TCGTGTTCAT CACCGGTTAC
GACGAGGGGG TGATTCCGCC GGATTTCGCC GATGTCGAGC GTCTCCAGAA GCCCGTCGAG
CTGAAACGCG TCGTCAATTT CCTCGCCGAC ACGCTGCAGG CGGCGCAGTG A
 
Protein sequence
MSARIRAYDW ARTPLGAIGD WPQSLRTMVE LMLGSPLPAA IAWGPELTVI YNDGFDAIDN 
FSDPPPLGRP FTKVWPKPES DGIVDSIRQG QARQIIDRYW DLPARPERPF GWFTSQWTPL
RDEAGGFAGF YLAAFETTDR VLVERALLER EEQQAFVLGL SDALRPLADP LEVQAVACRL
LGEHLQADYT YYLNLYEAEG FAIIAQDFAR AGLPSRAGKY PLSLVGWAMP HQGQGQPIAI
TDVQTSPLVP DDCRERVLAD RVVSLLGIPL AKQGRLVGAL ALSMTTPREW TPAEIALATE
IGERTWAAME RARAEKTARE ADIRLRTLAD AAPVLIWDAD SSGTILVNDH YLDFFGVGFE
AVAGYGWQKF LHPEDAERHL SLFQEAFAQR HCFTDEARLR RADGQYRWLS TSGRPLEDGR
FVGVSIDVTE RRLAEQRVRR NNAVLQAINL VFSETLGASS EEELARISLE VAEELTGSAI
GFIGEINETT GRLDGLFFSD RSRARYAAHA SEGDDSFPMG KSALGLAIHG IYGRVLQDGA
GVIVNDPASH PDRIGTPAGH LPLTAFLGVP LKQGDRTRGL IGLGNRPGGY RPEDLEAAEA
LAPAIWHALR SKRAELRLRE SEERFRQFAE ASSDVLWIRD AETFEMEFVS PALRTVYGIE
PNVLGPEIRR WAGHMLPEDR ENALQNLQRA ATGQSLLNEF RIKRPSDGAF RWIRSTLFPL
RDEQGRVRRI GGLSSDMTEA KLLIGHQAVL LAELQHRVRN IMAVTRSIVA RTGERAETVS
DYASLVGGRL LTLARVQALL TRSPNAGVPV ATIVRDEIDA QDLRADQYDL SGPKIELSPK
AAEILTLAVH ELATNALKYG ALSVPDGRVR VNWSSFEKRG EPWLGFDWAE DGVPEAKIAG
EREANRRTGA RRGFGRELIE VRLPYELGGR GRLEIGPEGA RCRLEFPLRD GASILETDAP
QRATVFGGAL DMTGEPDLSG YRILVVEDDY YLATDTARAL QGAGAEVVGP CSSEEAAREA
LDEGALAAAL VDINLGSGPS FTLAALLRER GVPFVFITGY DEGVIPPDFA DVERLQKPVE
LKRVVNFLAD TLQAAQ