Gene Mext_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2022 
Symbol 
ID5833494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2254176 
End bp2256329 
Gene Length2154 bp 
Protein Length717 aa 
Translation table11 
GC content71% 
IMG OID641367822 
ProductPAS sensor protein 
Protein accessionYP_001639491 
Protein GI163851448 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.620506 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCCGAC GCATGCTCTC CCGGCTCGGC CAGCGTGGCT TCTCCACGCA GTTCTATCTG 
ATCATGCTCG TGATCGCGCT GATCGGCCCC GGCCTGATCT TCACCGCGAT CCTGCTGACC
CGCTACGCCG CCACCGAACG CGCCCGCTTC GAGCAGGATG CACGGGAGAA CGTGCGTGGC
GTCGCGCTCT CGATTGATCG CGACACCGCC GGTCTCGTCT CGGTGCTGCA GACGCTCGCC
ACCTCGCCGC GGCTCAAGGA CAGCGAGTTC GCGATTTTCG AGAATCAGGC ACGCCTCGTG
CGCGAGGCGG TCGGCCTCGA TCTCGTTCTG CGCCGGCCGG ACGGCCAGCA GATCGTCAAC
ACCGCCCTCA AGCCGGGGGC GCCCCTGCCG GTGACGACGC TGCCGATCGA CCGCGAACTC
ATCGAGAGCG GGCAGCGCTC CATGGTCACC GGCTATCTCG CCGGGGCCAT GCCGGATCAG
GCGCATTACG CGGTCGCGGT GCCGGTGCGG ATCGAGGAGA CGCCCGCCTT CATCCTCAGC
TTCTCGGTGC CCCTGAGCCG CATCGCCGGC ATCCTCGCCC GCGAGCAGAT CCGGGGTTGG
GTCACGGGGG TCTCGGATCG CGACAGCGTC GTGCTGGCGC GCCTGCCGGA ATTACCCGGC
GTGGTCGGGA ATCCCCGTCT CGCGACCCTG CGCCAGACGG CGACGGGCAC CCCCGGCGTC
TGGGAGGGGC GCGACCGCGA CTTCAAGCCC GTCACCGTGG TCGAGGCGCG CTCGCGGCTC
AACGGCTGGA CCGTGGGGGC GAGCATCCCG CGCGAACTGG TCGAAGCGCG GCTGCGGCGC
TGGATCTGGG CCTTCGGCGG CTTCGGCCTT CTCGTGCTCG CCACCTCCTC GGTGCTCGCC
GTCCACCTGT GGTCGCGGGT CTCGAAGCCC CTGCGCCAAC TCGCGGCGAC GGGGCCGGCC
CTCGCGCGGG GGCAGGCGAT CCCGCGGATC GCTTCGCCGA TCCACGAGAT CCGCCGCCTT
GCCAACGTGC TCTCGGAAGC CTCGCTGCGG CTGCGCACCC GCAGCGAGGA GCGCGACCGG
GCACTCGCCG AAACCCAGCG CGGCCTCGCG GCTTTGCGTG AGAGCGAGGC GCGCTTCCGC
CACATGGCGG ATTCGGCCCC GGCGCTGATC TGGATGACCG ACGAGACCGC CGAGGTGGTC
TTCGCCAATA TGCATTTCGA CCACCTGTTC GGCCGTCCGG CGGCGGAGAT GGCCGGCGGC
GGCTGGGAGT CGATCGTGCA TCCGCCGGAC CTGCCGGCTT TCCAGGCGAC GTTCCAAGAG
GCGTTCGAGC ACCGGCATCC GTTCCGGGCC GAGATGCGGG TCGTCGATCG CAACGGGGAG
ATTCGCTGGC TGCGCTGCGA GGGCGTGCCG CGCCTCGACG ACCACGGCAC CTTCCTCGGC
TTCACCGGGT GCAGCGTCGA TGTCACCGAT GCCAAGCGGG CCGAAGAGCA TCTGCGCCTG
CTCATCAACG AGTTGAACCA CCGGGTGAAG AACACCCTGG CCACCGTCCA ATCGATCGCC
ATGCAGTCCC TGCGGGGGCT CGACGGCGAA GAGGCGCTCG CGGCCCGCGC CGCCTTCGAG
GCACGGCTGC TGGCGCTCGC CCGCGCCCAC GACGTGCTGA CCCGCGAGAG CTGGGAAGGC
GCCGAGCTGA AGACCGTGGT GGGCGATGCG ATCCGCCCGC TGGAGGCGGG GGACGGGCAG
GAATCGCGCT TCGTCGTATC GGGCCCGCGC CTGCGGCTCG CGCCGCGGCT GGCATTGTCC
ATCGCCATGG CCCTGCACGA ACTCGGCACC AATGCCGTGA AGTACGGGGC TTTGTCCAAG
GAGGGCGGGC GGGTGACGAT CACCTGGACC GTGCAGCGCC GGCCCGAACT CTCGCTGTCC
CTGCGCTGGA CCGAGAGCGG CGGCCCGCCG GTCTCGCCTC CGACTCGCCG CGGCTTCGGC
TCGCGCCTGA TCGAGCGCAG CCTCGCCCGT GAACTCGCGG GCAAGGTCGA GCTGCTTTAC
GAGCCGGACG GCGTCGTCTG CACCATCGAT GCGCCGGTGC CGCCGCCGGG TTTGCTGGAG
CGCAAGGGCG GCACGCAACT CGCTGCAACG AAGCCGCTGC CGCTGGCGGG GTAA
 
Protein sequence
MIRRMLSRLG QRGFSTQFYL IMLVIALIGP GLIFTAILLT RYAATERARF EQDARENVRG 
VALSIDRDTA GLVSVLQTLA TSPRLKDSEF AIFENQARLV REAVGLDLVL RRPDGQQIVN
TALKPGAPLP VTTLPIDREL IESGQRSMVT GYLAGAMPDQ AHYAVAVPVR IEETPAFILS
FSVPLSRIAG ILAREQIRGW VTGVSDRDSV VLARLPELPG VVGNPRLATL RQTATGTPGV
WEGRDRDFKP VTVVEARSRL NGWTVGASIP RELVEARLRR WIWAFGGFGL LVLATSSVLA
VHLWSRVSKP LRQLAATGPA LARGQAIPRI ASPIHEIRRL ANVLSEASLR LRTRSEERDR
ALAETQRGLA ALRESEARFR HMADSAPALI WMTDETAEVV FANMHFDHLF GRPAAEMAGG
GWESIVHPPD LPAFQATFQE AFEHRHPFRA EMRVVDRNGE IRWLRCEGVP RLDDHGTFLG
FTGCSVDVTD AKRAEEHLRL LINELNHRVK NTLATVQSIA MQSLRGLDGE EALAARAAFE
ARLLALARAH DVLTRESWEG AELKTVVGDA IRPLEAGDGQ ESRFVVSGPR LRLAPRLALS
IAMALHELGT NAVKYGALSK EGGRVTITWT VQRRPELSLS LRWTESGGPP VSPPTRRGFG
SRLIERSLAR ELAGKVELLY EPDGVVCTID APVPPPGLLE RKGGTQLAAT KPLPLAG