Gene Mext_4259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4259 
Symbol 
ID5833933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4736842 
End bp4740102 
Gene Length3261 bp 
Protein Length1086 aa 
Translation table11 
GC content70% 
IMG OID641370050 
ProductPAS sensor protein 
Protein accessionYP_001641699 
Protein GI163853656 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.127746 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGACG GGCCCTTCTT CCTCGACGGC GGCGGCGAGG CCGGCGCGCT GATCCGAGGG 
ATCGACTGGG CGGCGACGCC GCTCGGGAGC CCGGAGGCGT GGCCGGCGGC GCTCAAGACT
CTGGTCGGCG TGATGCTCGG CTCGCAACAG CCCATGCTGA TCGTTTGGGG CGAGGCGCAC
GTCACCCTCT ACAACGACGG CTACGCACCG ATGTGCGGCG CCCGCCACCC GCGCGCGCTG
GGGCAGGCGT TCGACGAGGT GTGGCACGAC ATCTGGGACC AGGTGGGGCC GATCCTGTCG
CGGGCCTATG CGGGCGAGGG CACGCATATG GATGACATCG CCCTCACGAT GCATCGCAAC
GGCTACCCGG AGGAGACCCA CTTCGCCTTC GGTTATACGC CGGTGCGGAT CGAGGACGGC
ACGGTCGCGG GCATGTTCTG CGCCTGCTCG GAGACCACCG CCTCGGTGCG GGCCGGCCGG
CAGATGCATG CCGAGCGCGA GCGCTTCGCG CGGCTGTTCG AGCAATCGCC GAGCTTCGTC
GCGGTGCTCG ACGGGCCCGA CCACGCCTTC GCCTTCGCCA ACGCCGCCTA TCGCCAACTC
ATCGCCCATC GCGACGTTCT CGGTCGGCCC GTGCGGGAGG CTCTGCCGGA AATCGAGGGC
CAGGGCTTCT TCGAGCTGCT CGACGACGTG TTCTCCACCG GCCGCACTCA CACGGCGCAC
GGCGCACCGG TGACGATCCT GCGCGTGCCC GGCGGCATGC CGGAGCGGCG CTTCCTCGAT
TTCGTCTACC AGCCCATGCG GGACGCGGCC GGGACGATCA CCGGCGTGTT CGTCGACGGC
TCGGACGTGA CCGAGCGGAT CACCGGCAAT GCGGCGCTCG CCGAGAGCGA GGCGCGCTTC
CGCACCATGG CGGACGACGC GCCGGTGATG ATGTGGGTGA CCGGCCCGGA CGGCGCCTGC
CAGTATCTCA ACCGGCGCTG GTACGACTTC ACCGGCCAGA ACGAGGCGCA GGCCCTCGGC
CTCGGCTGGC TCGAGGCGGT GCATCCGGAC GATCGCGGCT GGTCGGGCGA CACTTTCCTG
CGGGCGAATG CCCGGCACGA GGGCTTCAGC CTCGAATACC GCCTGCGCCG CCGCGACGGC
GTCTACCGGT GGGCGATCGA CACCGCGAGC CCGCGCTTTG CCGCCGACGG GAGCTTCCTC
GGCTATATCG GCTCGGTGGT CGATATCGAG GAGCGTCGCG CTGCCGAACT CGCGCTCGCC
GAGAGTGAGG AGCGCCTGCG GCTCGCCGTC GAGAGCGGCG AGATCGGCCT GTGGGACTTC
GATCCGCGGG CCGGCACCCT GTTCTGGCCG CCGCGGATCA AGGCGATGTT CGGGTTGGCC
CCGGGCGCGG ACGTGACCCT CGACGACTTC GCCGACGGCC TCCACCCGGA CGACCGGGCG
CGGGTCACCG CCGCCTTCGC CGCCGCCCTC GATCCCGTGG CCCGGGCCTT CTACGACGAG
GAGTTCCGCG CGATCGGCCG CGCCGACGGC ACGGTCCGCT GGGTCGCGGC CAAGGGGCGC
GGCGTGTTCG ACGCGGAGGG CCGCTGCCTG CGCGGCGTCG GCAGCGCCAT CGACATCACC
GCACGGAAGG CGACCGAGGA GCGTCTCGTC GAGGCCACCC GCCGCCTCGA CGCGGTGCTC
GACAACGCGA CGCAGGCAAT CTTCATGATG GATGAGCGCC AGCACTGCGC CTACATGAAC
CGCGCTGCCG AGCGGCTGAC CGGCTACAGC CTGGACGAGA CGCACGGCAA GGCGCTGCAC
GACGTCGTCC ACCATACCCG GCCGGACGGC AGCCCCTACC CGCTCCACGA ATGCCCGATC
GATCAGGCCT TCCCCGAGAA CAACCAGGAG CAGGGCCAGG AGATCTTCGT CCACCGCGAC
GGCTCGTTCT ACCCCGTCGC CTTCACCGCC AGCCCGATCC GCGACGAGGG CGGCACGCCG
ATCGGCACGG TGATCGAGGC GCGCAACATC GAGGGCGAGC TGCGTGCCAA GGCGCAGCTG
GAGGCCTTCA ACGCGAGCCT GGAGCAGCAG GTCGCGGCTC GAACGGCCGA GTTGATGCGG
ACCGAGGAGG CCCTGCGCCA GAGCCAGAAG ATGGAGGCGG TGGGCCAGCT CACCGGTGGG
CTCGCCCATG ACTTCAACAA CCTGCTCACC GGCATCACCG GCTCGCTCGA ACTGCTGCAG
ACCCGCCTCG CGCAGGGGCG GATCACCGAG ATCGACCGCT ACGTCAACGC CGCGCAGGGC
GCCGCCAAGC GCGCGGCGGC GCTGACCCAC CGCCTTCTCG CCTTCTCGCG CCGCCAGACC
CTGGACCCGA AGCCCACCGA CGTGAACCGG CTGGTGATGG GCATGGAGGA TCTGCTCCGC
CGCACCATCG GCCCGTCGAT CACCCTCGAA GTGGTAGCCG CGGGCGGGCT GTGGTCGGTG
CTGGTCGATC CGAGCCAGCT CGAGAACGCG CTGCTGAACC TCTGCATCAA CGCCCGTGAC
GCGATGCCGG ATGGCGGGCG CATCACCATC GAGACCGCCA ACAAGTGGCT CGACGACCGC
GGCGCCAAGG AGCGCGACCT CGATCCCGGC CAGTACCTCT CGCTCTGCGT GACCGATTCC
GGCACCGGCA TGAGCCCGGA TGTCATCGCC AAGGCGTTCG ACCCGTTCTT CACGACGAAG
CCGATCGGCC AGGGCACGGG CCTCGGCCTG TCGATGATCT ACGGCTTCGT GCGCCAATCG
GGCGGGCAGG TGCGGATCTA CTCGGAGGTC GGCCAGGGCA CGACGATGTG CCTCTACCTG
CCGCGCCACT ACGGTGCAGC GGAAGAGCCG GAGGCAGCCC TGGATCTGGC CGCCGCGCCC
CGCGCCGAGC AGGGCGAGAC AGTGCTGATC GTCGATGACG AGCCGACGGT GCGGATGCTG
GTGACGGAGG TGCTGGAGGA TCTCGGCTAC ACCGCGATCG AGGCCGCTGA CGGCCCGGCC
GGCCTCAAGG TGCTGCAGTC GGACGTGCGC CTCGACCTTC TGGTCACCGA TGTCGGGCTT
CCCGGCGGCA TGAACGGGCG CCAGGTCGCC GATGCAGGCC GTGTCCTGCG GCCGGACCTG
AAGGTGCTGT TCATCACCGG CTACGCCGAG AACGCGGCGG TGGGCAACGG CCATTTGGAG
CCCGGCATGC AGGTCATCAC CAAGCCCTTC GTGATGGAAG TGCTGGCCGC GCGCATCAAG
GAGATGATCA ACTCGCGGTG A
 
Protein sequence
MSDGPFFLDG GGEAGALIRG IDWAATPLGS PEAWPAALKT LVGVMLGSQQ PMLIVWGEAH 
VTLYNDGYAP MCGARHPRAL GQAFDEVWHD IWDQVGPILS RAYAGEGTHM DDIALTMHRN
GYPEETHFAF GYTPVRIEDG TVAGMFCACS ETTASVRAGR QMHAERERFA RLFEQSPSFV
AVLDGPDHAF AFANAAYRQL IAHRDVLGRP VREALPEIEG QGFFELLDDV FSTGRTHTAH
GAPVTILRVP GGMPERRFLD FVYQPMRDAA GTITGVFVDG SDVTERITGN AALAESEARF
RTMADDAPVM MWVTGPDGAC QYLNRRWYDF TGQNEAQALG LGWLEAVHPD DRGWSGDTFL
RANARHEGFS LEYRLRRRDG VYRWAIDTAS PRFAADGSFL GYIGSVVDIE ERRAAELALA
ESEERLRLAV ESGEIGLWDF DPRAGTLFWP PRIKAMFGLA PGADVTLDDF ADGLHPDDRA
RVTAAFAAAL DPVARAFYDE EFRAIGRADG TVRWVAAKGR GVFDAEGRCL RGVGSAIDIT
ARKATEERLV EATRRLDAVL DNATQAIFMM DERQHCAYMN RAAERLTGYS LDETHGKALH
DVVHHTRPDG SPYPLHECPI DQAFPENNQE QGQEIFVHRD GSFYPVAFTA SPIRDEGGTP
IGTVIEARNI EGELRAKAQL EAFNASLEQQ VAARTAELMR TEEALRQSQK MEAVGQLTGG
LAHDFNNLLT GITGSLELLQ TRLAQGRITE IDRYVNAAQG AAKRAAALTH RLLAFSRRQT
LDPKPTDVNR LVMGMEDLLR RTIGPSITLE VVAAGGLWSV LVDPSQLENA LLNLCINARD
AMPDGGRITI ETANKWLDDR GAKERDLDPG QYLSLCVTDS GTGMSPDVIA KAFDPFFTTK
PIGQGTGLGL SMIYGFVRQS GGQVRIYSEV GQGTTMCLYL PRHYGAAEEP EAALDLAAAP
RAEQGETVLI VDDEPTVRML VTEVLEDLGY TAIEAADGPA GLKVLQSDVR LDLLVTDVGL
PGGMNGRQVA DAGRVLRPDL KVLFITGYAE NAAVGNGHLE PGMQVITKPF VMEVLAARIK
EMINSR