Gene Mext_4217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4217 
Symbol 
ID5833285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4692975 
End bp4695353 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content69% 
IMG OID641370008 
ProductPAS sensor protein 
Protein accessionYP_001641657 
Protein GI163853614 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.423052 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.639944 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGTAC AGGACGGGAA GGTGCCGGAA ACCGGGCCGA CCATGGCCGA TTTCCGCAAT 
CTCGCCGACG CTTTGCCGCA ACTCGCCTGG ATCGCCGAGG CCGACGGCAC CCTCGTCTGG
TACAATCGCC GCTGGTACGA CTACACCGGA ACGACGCCCG CCGAGATGGC CGACCATGGC
TGGCGCAAGC TGCACCACCC GGACCATCTC GAGCGCGCCG CGGAGCGCTT CGCCAGCTGC
ATCGCGGCGG GCGAGTCGTG GCAGGACACC TTCCCCCTGC GCGGCAAGGA CGGCCGCTAC
CGCTGGTTCC TGTCGATGGC CGAGCCGGTC CGCAATGCGG AGGGCACCGT CCTGCGCTGG
TACGGCACCA ACACCGACGT CACCGAGACG CGGCTCGCGC AGGACGCGCT GGGCCATTCC
GAGCAGCGCT TCCGCGCGCT GGTGGATGCC TCCGCCGCCG TCATCTGGAG CACGACCGCG
CAAGGGGAGC TGATGCCGCC CCAGCCGAGC TGGGCCGCCT ATACCGGCCA GAGCGACGAG
GCCTACCAGG GCTGGGGCTG GATTGACGCG ATCCATCCCG ACGACCGGGC GCGGGTCGCC
GAGGTCTGGG CCGAATGCGT CGAACGGGTG ACCGTGTTCG AGGTCGAGTA CCGCCTGCGC
CGCCACGACG AGGTCTGGCG CGACATGGAG GTGCGCGGCG TGCCGGTGCT CGCCGAGGAC
GGCTCCATCC GTGAATGGGT CGGGCTCAAC ATCGACATCA CCGCCCGGAA GGAGGCCGAG
GCCGCGATCG AGCACGCCCG TGCCGCGGCG GAGGCTGCCA ACCTCGCCAA GAGCCAGTTC
CTGGCCAATA TGAGCCACGA GTTGCGCACG CCGCTCTCGG CGGTGATCGG CTATTCCGAG
ATGCTCGGCG AGGAGCTGGA GGATCTGGGC CAGGCCGAGC TGCTGCCGGA TCTGCGCAAG
ATCGAGTCGT CGGCCCGCCA CCTGCTCGGC CTCATCAACG ACGTGCTCGA TATCTCCAAG
ATCGAGGCCG GTCGCCTGAC CCTGGCGGCG GAGACCTTCG ATGTCGTGTC CCTGATCGAG
GATGTCACGG CCGCCACGCA GAGCCTCATC ACGAAGAAGC GCAACCGCTT CCGCCTCGAT
TTCGAGGGGG ATCTGGGCGC CATGCATCAG GACCAGCTGA AGCTGCGCCA ATCGCTGATC
AACCTGATCG GCAACGCGGC GAAGTTCTCG GAGGATGGCG AGATCATCCT CGGTGTGCGG
CGCCTGCGGG AGGGCGGGGC CGACTGGCTG AGCTTTTCCG TCTCGGATAC CGGCATCGGC
CTGACGCAAG AGCAGATCGG CCGGCTGTTC GAGCGCTTCT CTCAAGCCGA CGAATCGACC
ACGCGCCAGT TCGGCGGCAC CGGCCTCGGC CTCGCCATCA CCCGCGCCTT CGTCGAGCGG
ATGGGCGGCA CGATCGGCGT GGAGAGCACC TTCGGCGAGG GCGCGACCTT CACGATCCGC
CTGCCGGCCG AACTCGCCGC CCACGAGGAA GAGGTCGAGG CGGAAAGCGT CGCCGCCCGC
GTTCAGGAGA TCACCGAGGG CGAGGCGCAT CTGCACGACG TGGTGCTGCT CGTCGATGAC
GACCCGGCTG CCCGCGATCT GCTTCAGCGC TTCCTCGAAC GCGAGGGGTT CCGCGTGCGC
ACCGCCAATG ACGGGCGGGC CGGGCTGACC TTGGCCCGGG CGCTGAAGCC GCGGGCGATC
TTGCTCGATA TCGAGATGCC GCGCATGGAC GGCTGGGCGG TGCTGCACGC GATCCGCACC
GATCCCGAGA TCGCCGAGAC GCCGGTCATC ATCACCAGCG TGGTCAACGA GTTCAGCCTC
GCCCACGTGC TCGGCGCCAC CGACTACATG GTGAAACCAA TCGATTGGGG TGCGCTCAAG
GATGCGATGG AGCGCTACCG CCCCGTCGAC CGCGAGGGCA GTGTGCTCGT GGTCGATGAC
GACGCCGATG CCCGCGAGCG GGTGCGCCGC ACGCTCCAGC GCGACGGCTG GCAGGTGCGC
GAGGCCGAGA ACGGCGCCGC CGCCCTGGAG AGCCTGGATC AGGTCCGCCC GAGCCTGATC
CTGCTCGACC TGATGATGCC GGTGATGGAC GGCTTCGCCT TCCTGCGGGC GCTGCGCGGC
CGCCCGGACG GCGACAGCAT CCCCGTGGTG GTGCTCACCG CCAAGGAGAT CACGAGCGAA
GAGAAGGAGA GCCTCGGCCG GCAGGCCGAC CGGCTCATCG TCAAGGGCAC GATGAGCCTC
TCCGAGATCG GCCGGCAATT GCGCGACCTC TACAGCCACC AGGACGGCAC GCCGCTGCCG
GGCAAGATCC AGAGCTTGAT CGACAAGCTG TCGCCGTAG
 
Protein sequence
MTVQDGKVPE TGPTMADFRN LADALPQLAW IAEADGTLVW YNRRWYDYTG TTPAEMADHG 
WRKLHHPDHL ERAAERFASC IAAGESWQDT FPLRGKDGRY RWFLSMAEPV RNAEGTVLRW
YGTNTDVTET RLAQDALGHS EQRFRALVDA SAAVIWSTTA QGELMPPQPS WAAYTGQSDE
AYQGWGWIDA IHPDDRARVA EVWAECVERV TVFEVEYRLR RHDEVWRDME VRGVPVLAED
GSIREWVGLN IDITARKEAE AAIEHARAAA EAANLAKSQF LANMSHELRT PLSAVIGYSE
MLGEELEDLG QAELLPDLRK IESSARHLLG LINDVLDISK IEAGRLTLAA ETFDVVSLIE
DVTAATQSLI TKKRNRFRLD FEGDLGAMHQ DQLKLRQSLI NLIGNAAKFS EDGEIILGVR
RLREGGADWL SFSVSDTGIG LTQEQIGRLF ERFSQADEST TRQFGGTGLG LAITRAFVER
MGGTIGVEST FGEGATFTIR LPAELAAHEE EVEAESVAAR VQEITEGEAH LHDVVLLVDD
DPAARDLLQR FLEREGFRVR TANDGRAGLT LARALKPRAI LLDIEMPRMD GWAVLHAIRT
DPEIAETPVI ITSVVNEFSL AHVLGATDYM VKPIDWGALK DAMERYRPVD REGSVLVVDD
DADARERVRR TLQRDGWQVR EAENGAAALE SLDQVRPSLI LLDLMMPVMD GFAFLRALRG
RPDGDSIPVV VLTAKEITSE EKESLGRQAD RLIVKGTMSL SEIGRQLRDL YSHQDGTPLP
GKIQSLIDKL SP