Gene Mext_3826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3826 
Symbol 
ID5835276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4248231 
End bp4249928 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content64% 
IMG OID641369617 
ProductPAS sensor protein 
Protein accessionYP_001641270 
Protein GI163853227 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.464112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTGT TCTTTCGCGG CGATCATGCC GTGTTGCACG CTTTCGGTCG TGCGCATGGC 
ATGGTGGAGC TCACGCCGGC TGGCACCGTC TTGAGTGCCA ATGCATGCTA TCTCGACTGG
CTTGGGTATA GCTTCGACGA ATTGCAGGGC CACACACTGG ATGTCGTCCG GGACCCGTAT
GAGGCGACCC GCGCGGAGCA GCAGGAACTC TGGGCGGCGG TGACCCGAGG CGAATCCCGG
ACCGCGTGCT CTCGGCATCT CACCAAGCAG CGCGACGCGA TCTGGCTGCA GGTCAGCTAC
TGCCCCGTGC TCGACCGTAC CGGCCGGGTG AGCAAAATCG TTCTCCTCGC GGTCGATGTC
ACGGCACAGC AGCAGAGCAG CGCCGATCAC GCGGCGGTCC TTGAGGCCAT CAGCTGCTCG
CGCGCCACAA TCGAGTTTGC CCTGGATGGC ACCATCCTCA TGGCCAATAC CGGCTTCCTC
GACGCTGTCG GCTACAAGCT GGAAGAGGTG CAGGGCCGGC ACCACAGCCT GTTTGTCGCT
CCGGCAGAAC GTGAAAGCCA AGCCTATCGC GCGTTCTGGG CGGCTCTGGC GCGTGGCGAG
TTCCAGCGCG CCGAGTACAA GCGCATCGGC AAGGATGGAC GGGAGATTTG GCTGCAGGCC
ATCTACAATC CGGTTCGCGA CAGCCAGGGC AACCCGTACA AGGTCGTGAA GTTCGCGACC
GACATCACTC AGGACAAGCT GCACCGGGCC GACGTGGCAG GGCAGATCGC GGCCATCAAC
CGGTCACAGG GCGTCATCCA TTTCGACATG GATGGCACCA TCCTCGACGC CAACGAGAAC
TTCCTGAGCG TTGTCGGCTA TCGGCTGGAC GAGGTCCGCG GCCAGCACCA TCGGACCTTT
GTCGAGCCGT CTCAGGTCCA GTCCGCGGAG TACCTGCAGC TGTGGGAGGC CCTGCGGCGG
GGTGAGTACC GGGCGGGCGT GTTCAAGCGC CTCGGCAGGG GCGGGCGGGC GGTCTGGATC
CAGGCGAGCT ATAACCCGAT CTTCGATGCG GACGGCAAGC CGTTCAAGGT GGTGAAGTAC
GCCACCGACA TCACGGCCAA AACGGCGGCT CAGCACGCGG CGACCGGCGC CTCAAGCCAG
ACGCTGATGA ACGTGCAAAC CGTTGCCTCG GCTGCGGAGG AGCTGAGTGC TTCGATCGGT
GACATCGCGC AGAGCCTTGC CCGCTCGCGC TCCGAAGTAG ACGCCATCCA CGATCAGACC
GTCGTGGCGG ACCGCTCCAC AGCGAAGCTG AAGGACGCCG CCGCGGCGAT GAACGGCGTC
GTCCAACTCA TCCAGGGCGT TGGTCAGCAG ATCAATCTTC TGGCCCTCAA TGCGACGATC
GAGGCGGCGC GGGCGGGAGA GGCCGGACGC GGTTTTGCTG TGGTGGCCGG CGAGGTGAAG
AACCTGTCGA ACCAAGTGAC CCAGGCGACG ACGCGGATTG CACAGGACAT CCATGGGATG
CAGGGAATCG CGGAAGACGC GGTGGGTGCC CTGCTCTCGA TCGTGCAGGC GATCGGGTCG
GTACGGGAGA CGGTGACCGG TATCGCCTCG GCCGTCGATG AGCAAAACGC CATCACCCAG
GAGATCTCGT CCAGCATGCA GACGGCCGCA CACGGTGTCG GCGAGATCAA CACCAGTCTG
CAGGCTCTCA CGGGCTGA
 
Protein sequence
MSLFFRGDHA VLHAFGRAHG MVELTPAGTV LSANACYLDW LGYSFDELQG HTLDVVRDPY 
EATRAEQQEL WAAVTRGESR TACSRHLTKQ RDAIWLQVSY CPVLDRTGRV SKIVLLAVDV
TAQQQSSADH AAVLEAISCS RATIEFALDG TILMANTGFL DAVGYKLEEV QGRHHSLFVA
PAERESQAYR AFWAALARGE FQRAEYKRIG KDGREIWLQA IYNPVRDSQG NPYKVVKFAT
DITQDKLHRA DVAGQIAAIN RSQGVIHFDM DGTILDANEN FLSVVGYRLD EVRGQHHRTF
VEPSQVQSAE YLQLWEALRR GEYRAGVFKR LGRGGRAVWI QASYNPIFDA DGKPFKVVKY
ATDITAKTAA QHAATGASSQ TLMNVQTVAS AAEELSASIG DIAQSLARSR SEVDAIHDQT
VVADRSTAKL KDAAAAMNGV VQLIQGVGQQ INLLALNATI EAARAGEAGR GFAVVAGEVK
NLSNQVTQAT TRIAQDIHGM QGIAEDAVGA LLSIVQAIGS VRETVTGIAS AVDEQNAITQ
EISSSMQTAA HGVGEINTSL QALTG