Gene Mext_4107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4107 
Symbol 
ID5832572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4569163 
End bp4570926 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content71% 
IMG OID641369898 
ProductPAS sensor protein 
Protein accessionYP_001641548 
Protein GI163853505 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0206968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACTG AGCCCGGCGC CACGGCTTTG GACGAGGCGC AGGCGCGCAT CCGCGCGCTG 
GAAACGCGGC TGGAGGAGGT CGAGGACACC CTGGCGGCGA TCCGCCGGGG CGATTTCGAT
GCGATCGTGG TCGAGGGTCC GGGTGGCGAG CGCCTGGTCT ACACCCTGGA GAATGCCGAC
CGGCCCTACC GGGTGCTGAT CGAGCAGATC CAGGAGGGCG CCTTCACCCT CGGGCCGGAC
GGAACACTGC TCTACTGTAA CCGCCGGCTC GCCGAGACCC TGGGCACTCC GCAGGAGCGC
CTGATCGGCC AGCCCTTCGC GCGCTTCGCC GCCGGGGGCC AGGACGCGGC GCTCGCCGCT
CTGATCACGC AGGCGGCGCG GATGCCGGCC CGCGGCGAGA TCGCCCTGTG CGACGAGGCG
GGGGTCGAGC GCCCCGCCTA CCTGTCGCTG AACCGCCTCG ACGGCGACGG CGACACCCTG
CTCCTTTGCG GCGTGATCAC CGACCTCACG GCGGAGCGGC TGCGCCTGCA CGAGCTCGCC
GAGGCCAACG AGCGCCTGCG CGCCGAGGTG GCCGAGCGCG AGCGGATCGA GGAGGCATTG
CGCCAGAGCC AGAAGATGGA GGCGGTGGGC CAGCTCACCG GCGGCGTCGC GCACGACTTC
AACAACCTGC TCACCGTCAT CAAGTCCTCC ACCGATCTGC TCAAGCGCCC CGATCTCGCC
GAGGAGCGCC GCCGCCGCTA TGTCGACGCC ATCTCCGACA CGGTGGCACG GGCGGCCAAG
CTGACCGGCC AGCTCCTCGC CTTCGCCCGG CGCCAAGCCT TGAAGCCCGA GGTGTTCGAT
GCCGGCCGCG GCGTCGCGTC GGTGGCCGAC ATGGTCGGCA CGCTCACCGG CGCGCGCATC
CAGGTGAAGA CCCTGATCGA GCCCTGTTTC GACGCCGCGG GCGAGCCCTT CGCCTGCCTC
GTGGAGGCCG ATCCGAGCCA GTTCGACACG GCGCTCGTCA ACATGGTCGT CAACGCCCGC
GACGCCATGG ACGGCGAGGG CACGCTCACC ATCCGCGTCG GCCATGCCGG GCAGATTCCG
GCCCTGCGCA CCCATCCGGC CGTGCCCGGT GACTTCGTGG CGGTGGCGAT CTCCGACACG
GGGACGGGCA TCGCGCCCGC CGACCTGTCG CGGATCTTCG AGCCGTTCTT CACGACGAAG
GGCGTCGGCC AGGGCACTGG GCTCGGGCTG TCCCAGGTGT TCGGCTTCGC CAAGCAATCG
GGCGGCGACA TCGCGGTGGA GAGCGTGCTG GGACAGGGCA CGACCTTCAC CCTGTTCCTG
CCGCGGGCCA AGGTCGCCCC AACGATGGTC GAGTCCGCCG ACGAGCCCGA GCCGCTAGCC
CCCGGCCACG GCACCTGCGT GCTCCTCGTC GAGGACAACC GCGAGGTCGG CGCCTTCGCG
ACGCAAGCGC TGGCCGAACT CGGCTACGGC ACCGTCTGGG CGATGGATGC GGAGCAGGCG
CTGGCCGAGC TCGACCGGAC GCCGGAGCGC TTCGACGTGG TGTTCACCGA CGTGGTGATG
CCCGGCATGA ACGGGGTCGA ACTCGCCCGC ACGATCCTGG GGCGGACGCC CGGCATGCCG
ATCGTGCTCT CCTCCGGCTA CAGCCACGTG CTGGCCGAGG ACGGGCACCA CGGCTTCCCG
CTGCTGCACA AGCCCTACTC GGTGGAGGAT CTCTCGCGCA TCCTGCGTCG GGCGATCCAG
CGGCGCGCCG CGCGGGCGCG GTGA
 
Protein sequence
MSTEPGATAL DEAQARIRAL ETRLEEVEDT LAAIRRGDFD AIVVEGPGGE RLVYTLENAD 
RPYRVLIEQI QEGAFTLGPD GTLLYCNRRL AETLGTPQER LIGQPFARFA AGGQDAALAA
LITQAARMPA RGEIALCDEA GVERPAYLSL NRLDGDGDTL LLCGVITDLT AERLRLHELA
EANERLRAEV AERERIEEAL RQSQKMEAVG QLTGGVAHDF NNLLTVIKSS TDLLKRPDLA
EERRRRYVDA ISDTVARAAK LTGQLLAFAR RQALKPEVFD AGRGVASVAD MVGTLTGARI
QVKTLIEPCF DAAGEPFACL VEADPSQFDT ALVNMVVNAR DAMDGEGTLT IRVGHAGQIP
ALRTHPAVPG DFVAVAISDT GTGIAPADLS RIFEPFFTTK GVGQGTGLGL SQVFGFAKQS
GGDIAVESVL GQGTTFTLFL PRAKVAPTMV ESADEPEPLA PGHGTCVLLV EDNREVGAFA
TQALAELGYG TVWAMDAEQA LAELDRTPER FDVVFTDVVM PGMNGVELAR TILGRTPGMP
IVLSSGYSHV LAEDGHHGFP LLHKPYSVED LSRILRRAIQ RRAARAR