Gene Mext_2103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2103 
Symbol 
ID5833210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2359837 
End bp2361804 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content68% 
IMG OID641367900 
ProductPAS sensor protein 
Protein accessionYP_001639569 
Protein GI163851526 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.00365518 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATGGGA GCGACAGTCC ACCTGCGCAG TCGCGCCGCG GCGAGATGGC CGAGCGGATC 
CGGGCGCATG ACTGGACCGC GACGGCGCTC GGTGCGGCGG ACGCTTGGCC GCCGAGCCTC
AGGGCCACGA TCAGCCTGAT CCTCGGCTGC GGTTTCCCCA TGATCGCGCT GTGGGGCCGG
GATCTGATCC AGGTCTACAA CGACGACTTC CGGGACCTGA TGGGATCGAA GCACCCGGCC
GGGCTGGGTC AGCCGGCACG CGCGTGCTGG CCCGAGATCT GGCACATCAC CGCGCCGATC
TACGAGCGGG TCTGGAACGG CGAGACGTTC ACCTTCGAAG ACGCGCTCTA CCCGCTGTTC
CGGTCGGGCC GGCTCGAAGA CGCGTGGTTC ACGCTGACCT ACAGCCCGCT GCGGGACGAG
GCGGAGCGGA TTGCGGGCAT TCTGGTCACC CTGGTCGAGA GTACGGCCCG CGTGCTGTCC
GACCGGGCGT TGCGCGAGAG CGAGGCGCGC TTCCGGGCCG AGCTGGAACG TCAGGTCCAG
GAGCGGACGG CGGAGCTTCA GGCGAGCCGA GACCTGCTCA AGGCGACCAT GGACAGTTCC
ATGGACATGA TTCAGGTCTT GAAGGCCGTG CGCGATCCGG CGGGCGAGAT CATCGATTTC
CGCTGGCTCC TGAACAACTC CACCTCGGCG AGCCGCTACG GCGATGTGGG CGGTCAGAGT
TTGCTCGAAC GCAATCCGGG CGTGATTCAG GAGGGCATCT TCGACACCTT CAAGCGTGTC
ACGGAAACAG GCCAGCCCGC GACCGCGGAG CGCCGCTACG CCCACGAGCA GTTCGACGGC
TGGTTCTTCC AGTGCGCGGT GAAGCTTGGT GACGGAGTCG CTATCACCAC CAAGGAAATT
TCGGCGTGGA AGGCGGCGCA GAACGAGATG CTGCGGCTTC GCGACGAGAG CGCAAACGCG
GCCCTGCGCG AGAGCGAGGA ACGCTTTCGC ACCCTGGCGA GCCTCATCCC CGTCCTGCTG
TGGCGCTCGG ACGAGAGCGG GCAGCACAAC TCCCTCAACG AGGCCTGGCT CACCTATACC
GGCCAGACCC TGCAGCAATC CCAGGCCGGC GGCTGGCTCG AAGCGATCCA TCCCGCCGAC
CGCGACGCGG TGAGCGAGGC CTTCCGCTCC GGACGCGAGC AGCAGCGGTT GATCGAGGTG
CAGCAGCGCA TCCGCCGGTA CGACGGGCAG TATCGCTGGT TCCTCGTACG GCAGGCGCCC
CTCCTCGACA CCGAGGGGCA GGTCACGCAG TGGATCGGTG CCGCCATGGA CATCCACGAT
CTGCACGATC TGCAGGAGCG CCAGACCATC CTCGTCGCCG AGTTGCAGCA CCGCACCCGC
AACCTGCTCG GCGTCGTGCG CTCCATCGCC CACCAGACCA TGGCGCAGAC CGGTCCGACG
GAGCGCTTCC GCGAGCAGTT CAACGACCGG CTCGCCGCCT TGTCGCGGGT TCAGGGGCTG
CTGTCGCGCT CGGAGCAGGA GCCGATCACC CTGCGCACCC TGATTCGAAC GGAGCTGGAC
GCCCTCGGGG GCGGCGACTT CGCCGATCGA ATCCATATCG CCGGGCCGCC GGTGCGCCTG
CGCAAGGCGT CGGTGCAGAC CCTGGCGCTC GCCGTGCACG AACTGGCCAC CAATGCCCGC
AAGTACGGTG CCCTGACGAC CGAGCACGGC CGCCTCTCGG TGACATGGCG CGCCGACCGG
GACGACCAGG GCGGAGGAAA CCTGCTGATC GAGTGGATCG AGGAGGGCAT CAGCCGGCCG
CGCGAGGAAC AGAGCCCGAC GCGGCGCGGC TACGGACGCG AGTTGATCGA GCAGGCGATG
CCCTACGCGC TCAACGCCAA GACGCACTAC GAACTCGGTG AGACGCGGCT GCGCTGCGCC
ATCGAACTGC CGCTGGGCGA GCGGTTCGGG CAGGTGAGCA CGGCCTGA
 
Protein sequence
MDGSDSPPAQ SRRGEMAERI RAHDWTATAL GAADAWPPSL RATISLILGC GFPMIALWGR 
DLIQVYNDDF RDLMGSKHPA GLGQPARACW PEIWHITAPI YERVWNGETF TFEDALYPLF
RSGRLEDAWF TLTYSPLRDE AERIAGILVT LVESTARVLS DRALRESEAR FRAELERQVQ
ERTAELQASR DLLKATMDSS MDMIQVLKAV RDPAGEIIDF RWLLNNSTSA SRYGDVGGQS
LLERNPGVIQ EGIFDTFKRV TETGQPATAE RRYAHEQFDG WFFQCAVKLG DGVAITTKEI
SAWKAAQNEM LRLRDESANA ALRESEERFR TLASLIPVLL WRSDESGQHN SLNEAWLTYT
GQTLQQSQAG GWLEAIHPAD RDAVSEAFRS GREQQRLIEV QQRIRRYDGQ YRWFLVRQAP
LLDTEGQVTQ WIGAAMDIHD LHDLQERQTI LVAELQHRTR NLLGVVRSIA HQTMAQTGPT
ERFREQFNDR LAALSRVQGL LSRSEQEPIT LRTLIRTELD ALGGGDFADR IHIAGPPVRL
RKASVQTLAL AVHELATNAR KYGALTTEHG RLSVTWRADR DDQGGGNLLI EWIEEGISRP
REEQSPTRRG YGRELIEQAM PYALNAKTHY ELGETRLRCA IELPLGERFG QVSTA