Gene Mext_1190 details


Gene Information

Locus tag: Mext_1190
Symbol:
ID: 5832821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1310553
End bp: 1313678
Gene Length: 3126 bp
Protein Length: 1041 aa
Translation table: 11
GC content: 71%
IMG OID: 641366983
Product: PAS sensor protein
Protein accession: YP_001638663
Protein GI: 163850620
COG category: [T] Signal transduction mechanisms
COG ID: [COG2203] FOG: GAF domain; [COG3920] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
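
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1310553
end_bp = 1313678
gene_length_bp = 3126
protein_length_aa = 1041

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# One codon is 3 bp; the final stop codon is assumed not to count as a residue.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 3126 0 1041
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.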


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.0314947
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCCGC CCGGTGAGGC GCTGCCTTCC CGTCGAAACG AGGTCCGGCC TCACTTTCGC 
CTCCGCCGGA CCTCCATGGT GGAACACTGC GTCGAGCCCC AGCCTGTCCG AACGATGAAT
CCCGCCCCGA TGAGCGAGCA AGAGGCTGCG CGCCTTCGTG CCTTGGACCG CTATCGGTTG
CTCGACACCC CGCGCGAGCA GGATTTCGAC GAGATCGCCG AGGCCGCCGC CGAGCTGTGC
GAGGCGCCGA TCGCGGTGGT CAATCTCGTC GGCGACGGGC GGCAGTTCTT CAAGGCGGAG
GTCGGCCTCG GTGTGCGCGA GACGCCGCTC GAAACCTCCT TCTGTCGGCA GGCCATCCTG
CACGACGACT TCCTCTACGT GCCCGACACC GCGCGCGATC CGCGCTTCGA AGGCAACCCG
CTCGTCAGCG GCGATCCCGG CCTGCGCTTC TACGCCGGCG CCCTGCTGAG GACCGACGAG
GGGCAGCCGA TCGGGACCGT CTGCGTCCTC GATACCCGCC CGCGCGAACT CTCGGAGCGG
CAGCGCCGCG GCCTGATGCG GCTCGCCCGC CAGGCCATGA CGCAGATGGA ACTGCGCCGC
TCGCTGCGTG AGCAGGCGGA GCAGCGCCTG CTGCACGAGC GCATCCTCGA CAGCGCCACC
GACTACGCGA TCGTGGCCAT GGACCCCCAG GGCCGCGTCA CGCGCTGGAA CACCGGCGCC
GAGCGCATCC TCGGCTGGAC CGAGGCCGAG ATGCTGGGCC GGACGGTCGA TGCGTTCTTC
ACGCCGGAGG ATCGGGCGGG CGACCGGCCC GATGTCGAGA AGCGCCTGGC GGCGCAGACC
GGCAGCGCCC CGGACGAGCG CTGGCACATG CGCAAGGACG GAACCCGCTT CTGGGCCTCG
GGCGAGATGA TGCCGCTGAC GGCCGAGGAC GGCGGGCTCA TCGGCTTCCT CAAGATCCTG
CGCGACCGCA CCGGGCAACG CGCCTCGGAG GCGGCTTTGC AGGCGAGCGA GTTGCGCTAC
CGCTCTCTTG TCGAGGTCAG CCCGCAGGTC GTCTGGTTCG GCGACGAGGC CGGCCGTGTC
ACCTACTGCA ACACCTATTG GTACGACTAT ACGGGGCTGC CCCCCGGCGA GACCGGCGAG
GCGAGCTGGA TGGGTGTGAT CCATCCCGAT CACCGCGAGC GCGTTCGCGA TGCGTGGCTT
GCCGCGGCGC GAAGCCGGGG GGGCTACGAG GTCGAGTTCC CCCTTCGCCG CGCCGACGGG
CAGTACCGCT GGTTCCTGTC GCGGGCGCGG CCCGTGCGCG ACGAGGCCGG GCATCTCAGG
AGCTGGATCG GCACCACCCT CGACATCCAC GAGCGCAAGG TGGCCGAGGA GCGCTTCGCG
GCGCTCACCG AACTGGCACC GGCCATCATC TGGTTCGGGA ATCCGGACGG CAGCCTCAGC
TACCTCAACG ACCGCTGGTA CGCCTATACC GGCCAGACCC CCGAGCAGGC ACTGCCGCTC
GGCTGGGGTG AGGCGATCCA CCCGGACGAC GTGGACGGCC TTCTCAAGGT CTGGGAGGCG
GCCCGCACCC ACGAGACCGT CTACGACACC GAGGCACGCC TGCGGGCGCG CGACGGAACC
TACCGCTGGT TCCTGATCCG TGCGGAGCCG CGTCGGGACG CGAGCGGCGC GGTGGTCGGC
TGGCTCGGCA GCAACAGCGA CATCCACGAC CGTCGGCAGG CGGACGAGGA TCTGCGCCGG
GCGCGGGAGC AGTTGCACCT CGCCGTCGAG GCGACCGGAA CCGGCATCTT CGACTACGAC
CTCGTCACGG ACACGCTGGA ATGGGACGCG CGCACCCGCG CGCTGTTCGG CCTGGGACCG
GAGGCGCCGG TCAGCTACGA CGTGTTCCTG GCCGGGCTGC ATCCGGAGGA CCGGTCCTGG
GTCGATCGGG CGGTCGAGGC CGCGCTCGAT CCGGCCGGCA GCGGCACCTA CGACATTGCC
TACCGGACCA TCGGCCTGGA GGATGGTATC GAGCGCTGGG TCGCCGCCAA GGGACAGGCC
TTCGTTGCCA GCGGCCGCAC CGTGCGCTTC ATCGGCACCG TGCGCGACGT CACGCAGAGC
CGGCGGGCCG AGCAGACCCT GCGCGAGACC GAGGAGCGTT ACCGCCTCGC GGCGCGTGCC
ACCAACGACG CGATCTGGGA CTGGAACCTC GCGACCAACC AAGTCCTCTG GAACGAGGCG
CTCACGGTCG CCTACGGCTA TCCGCCGGAG GCGGTCGATC CGACCGGCGA TTGGTGGATC
ACCCATATCC ATCCCGACGA CCGGGCGCGG ATCGACACCT CCATCCACGC GGTCATCGAC
GGGACCGGCA CCGCCTGGAG CGACGAGTAT CGTTTCCTGC GCGCGAACGG CACCTATGCC
GACATCCTCG ACCGAGGCTA CGTCATCCGT GACGGGCACG GGGCGGCGGT GCGGATGATT
GGGGCGATGC TCGACATCAG CGAGCGCAAG CGGGCCGAGG AGCACCAGCG CCTGCTCACC
GGCGAGTTGC AGCATCGGGT CAAGAACACG CTCACCCTCG TTCAGGCGAT CGCCAGCCAG
ACCCTCCGCA ACGCCCCGGA TCTCGATGCG GCCCGCGAGG CTTTCGCCGC GCGCCTGATC
TCGCTCGGCC GCGCGCACGA CATCCTGACC CGGTCGAGCT GGACCGAGGC GCCCATCGCG
GAAGTCGTGG AGGGGGCTCT GGCGGTCCAT CGCGGCGCTG CCATGGCGCG CATCCGCGCG
AGCGGGCCGA GCGTGCTGCT CGGCGCCAAG GCGGCGCTCT CGCTCGCGCT CGCCCTGCAC
GAGCTTGCCA CCAACGCGAC CAAGTACGGC GCCCTCGCCA ACGAGGTGGG ATGCGTCGAA
CTGCGCTGGC ACGTGGTGCA TGAGGACGAG GCACCCCGCT TCTGCCTGAC ATGGTCCGAG
CAGGGCGGTC CGCCCATCCT GAGCCAGCCC TCGCGCCGCG GCTTCGGCTC GCGCCTGATC
GAGCGCAGCT TCGCTGCCGA GGTCGGCGGA GAGGTCAAGC TTACCTACGC GCCGACCGGC
CTCGTCTGCC GCCTGGAAGC CCCCCTCGCA TCGATGCAGG AGCCGCGCGA CGAGGTCGCC
GCCTGA
 
Protein sequence
MRPPGEALPS RRNEVRPHFR LRRTSMVEHC VEPQPVRTMN PAPMSEQEAA RLRALDRYRL 
LDTPREQDFD EIAEAAAELC EAPIAVVNLV GDGRQFFKAE VGLGVRETPL ETSFCRQAIL
HDDFLYVPDT ARDPRFEGNP LVSGDPGLRF YAGALLRTDE GQPIGTVCVL DTRPRELSER
QRRGLMRLAR QAMTQMELRR SLREQAEQRL LHERILDSAT DYAIVAMDPQ GRVTRWNTGA
ERILGWTEAE MLGRTVDAFF TPEDRAGDRP DVEKRLAAQT GSAPDERWHM RKDGTRFWAS
GEMMPLTAED GGLIGFLKIL RDRTGQRASE AALQASELRY RSLVEVSPQV VWFGDEAGRV
TYCNTYWYDY TGLPPGETGE ASWMGVIHPD HRERVRDAWL AAARSRGGYE VEFPLRRADG
QYRWFLSRAR PVRDEAGHLR SWIGTTLDIH ERKVAEERFA ALTELAPAII WFGNPDGSLS
YLNDRWYAYT GQTPEQALPL GWGEAIHPDD VDGLLKVWEA ARTHETVYDT EARLRARDGT
YRWFLIRAEP RRDASGAVVG WLGSNSDIHD RRQADEDLRR AREQLHLAVE ATGTGIFDYD
LVTDTLEWDA RTRALFGLGP EAPVSYDVFL AGLHPEDRSW VDRAVEAALD PAGSGTYDIA
YRTIGLEDGI ERWVAAKGQA FVASGRTVRF IGTVRDVTQS RRAEQTLRET EERYRLAARA
TNDAIWDWNL ATNQVLWNEA LTVAYGYPPE AVDPTGDWWI THIHPDDRAR IDTSIHAVID
GTGTAWSDEY RFLRANGTYA DILDRGYVIR DGHGAAVRMI GAMLDISERK RAEEHQRLLT
GELQHRVKNT LTLVQAIASQ TLRNAPDLDA AREAFAARLI SLGRAHDILT RSSWTEAPIA
EVVEGALAVH RGAAMARIRA SGPSVLLGAK AALSLALALH ELATNATKYG ALANEVGCVE
LRWHVVHEDE APRFCLTWSE QGGPPILSQP SRRGFGSRLI ERSFAAEVGG EVKLTYAPTG
LVCRLEAPLA SMQEPRDEVA A
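
The two sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how they could be checked against each other, the Python below strips the block formatting, translates DNA codon by codon, and compares the result with the listed protein. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one written in NCBI "TCAG" order; translation table 11 uses the same amino-acid assignments and differs only in which start codons it permits. Only the first line of each sequence is used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks of the ten-character block layout."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; assumes a non-empty sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence and the matching start of the protein sequence.
first_gene_line = "ATGCGGCCGC CCGGTGAGGC GCTGCCTTCC CGTCGAAACG AGGTCCGGCC TCACTTTCGC"
first_protein_blocks = "MRPPGEALPS RRNEVRPHFR"

dna = clean(first_gene_line)
print(translate(dna))                                 # MRPPGEALPSRRNEVRPHFR
print(translate(dna) == clean(first_protein_blocks))  # True
```

Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field, and `translate` on the full sequence should end in `*` for the terminal TGA codon.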