Gene Information |
Locus tag | Mext_4542 |
Symbol | |
ID | 5832113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 5073567 |
End bp | 5076857 |
Gene Length | 3291 bp |
Protein Length | 1096 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641370336 |
Product | PAS sensor protein |
Protein accession | YP_001641981 |
Protein GI | 163853938 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains; [COG3920] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| |
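The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information table for Mext_4542.
length = gene_length_bp(5073567, 5076857)
print(length)                     # 3291
print(protein_length_aa(length))  # 1096
```

Both results agree with the "Gene Length" and "Protein Length" rows of the record.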
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.220973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0899725 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCGC GTATTCGTGC CTACGATTGG GCACGGACAC CTCTTGGCGC CATTGGTGAT TGGCCGCAGA GTTTGAGGAC GATGGTCGAG TTGATGCTCG GCTCTCCTTT GCCGGCCGCG ATCGCCTGGG GTCCTGAACT TACGGTAATC TATAACGACG GTTTCGACGC GATCGACAAC TTCAGCGATC CGCCGCCCCT TGGCCGCCCG TTCACGAAAG TCTGGCCAAA GCCGGAAAGC GACGGGATCG TCGACTCGAT ACGACAGGGG CAGGCCCGTC AGATCATTGA TCGCTACTGG GATCTTCCCG CGCGGCCGGA GCGACCCTTC GGCTGGTTCA CGTCCCAGTG GACGCCGTTG CGGGACGAAG CGGGAGGCTT CGCCGGGTTC TATCTCGCCG CCTTCGAGAC GACCGATCGC GTCCTCGTCG AGCGGGCCTT GCTCGAGCGC GAGGAGCAGC AGGCCTTCGT GCTCGGCCTC AGCGACGCGC TGCGCCCCCT GGCCGATCCG CTCGAAGTCC AGGCCGTGGC CTGCCGCTTG CTCGGCGAGC ACCTTCAGGC GGACTACACG TACTACCTCA ACCTGTACGA GGCGGAGGGT TTCGCGATCA TCGCACAGGA TTTCGCGCGC GCGGGTCTCC CCTCGCGAGC CGGGAAATAT CCCCTCAGCC TCGTCGGTTG GGCGATGCCC CATCAGGGCC AGGGCCAGCC GATCGCGATC ACGGACGTGC AGACATCCCC CCTGGTCCCG GACGACTGCC GGGAGAGGGT GCTGGCCGAC CGTGTCGTGA GCCTGCTCGG CATTCCGCTG GCGAAGCAGG GCCGCCTCGT CGGCGCACTT GCGCTCTCCA TGACGACGCC GCGAGAATGG ACCCCTGCGG AGATCGCGCT GGCCACCGAG ATCGGCGAGC GCACCTGGGC GGCGATGGAG CGGGCCCGCG CGGAGAAGAC GGCGCGCGAG GCCGACATTC GCCTGCGCAC CCTGGCCGAT GCCGCGCCCG TCCTGATCTG GGACGCGGAT TCGAGCGGCA CGATCCTCGT CAACGACCAC TATCTCGACT TCTTCGGTGT CGGCTTCGAG GCCGTGGCGG GGTATGGCTG GCAGAAGTTC CTGCATCCCG AGGATGCCGA GCGACACCTT TCTCTCTTCC AAGAGGCGTT CGCGCAGCGC CATTGCTTCA CCGACGAGGC CCGGCTCCGC CGTGCCGACG GGCAGTACCG CTGGCTCAGC ACGTCCGGCC GGCCGCTGGA GGACGGGCGC TTCGTCGGCG TCTCGATCGA CGTCACCGAG CGGCGCCTCG CCGAGCAGCG CGTCAGGCGC AACAATGCCG TCCTCCAAGC GATCAACCTC GTCTTCAGCG AGACGCTCGG CGCCTCCTCG GAGGAGGAAT TGGCGCGCAT CTCCCTCGAA GTCGCCGAAG AGCTGACCGG CAGCGCCATC GGCTTCATCG GTGAGATCAA TGAGACCACC GGGCGCCTCG ACGGCCTGTT CTTCAGCGAC CGGAGCCGGG CCCGCTACGC CGCGCACGCA TCGGAGGGGG ATGACAGCTT CCCGATGGGC AAGTCGGCGC TGGGGCTTGC GATTCACGGC ATCTACGGGC GGGTGTTGCA GGATGGCGCG GGCGTCATCG TCAACGATCC CGCCTCCCAC CCCGACCGCA TCGGCACGCC GGCCGGCCAC CTGCCGCTGA CGGCCTTCCT CGGCGTGCCG CTGAAGCAGG GTGACAGGAC GCGGGGTCTG ATCGGGCTCG GCAACCGCCC GGGCGGCTAC CGGCCGGAGG ACCTTGAGGC GGCGGAGGCG 
CTGGCGCCCG CGATCTGGCA CGCCTTGCGC AGCAAGCGGG CGGAGCTGCG CCTGCGCGAG AGCGAGGAGC GTTTCCGGCA ATTCGCCGAG GCTTCCTCCG ACGTCCTCTG GATCCGCGAT GCGGAGACGT TCGAGATGGA GTTCGTCAGC CCGGCGCTGC GGACGGTCTA CGGCATCGAG CCCAACGTGC TCGGCCCCGA GATCCGGCGG TGGGCCGGCC ACATGCTGCC GGAGGATCGC GAGAACGCCC TGCAGAATCT GCAGCGGGCG GCGACCGGCC AATCGCTGCT GAACGAGTTC CGCATCAAGC GGCCGAGCGA CGGCGCCTTC CGCTGGATCC GCAGCACGCT GTTTCCCCTG CGCGACGAGC AGGGCCGGGT GCGGCGCATC GGCGGTCTGT CCTCCGACAT GACCGAGGCC AAGCTGCTGA TCGGGCATCA GGCCGTGCTG CTCGCCGAAT TGCAGCACAG GGTGCGCAAC ATCATGGCGG TGACCCGCTC GATCGTCGCC CGCACGGGCG AGCGGGCGGA GACCGTGTCC GATTACGCGT CCCTGGTGGG AGGGCGCCTC CTGACGCTGG CCCGCGTCCA GGCCCTGCTG ACCCGCTCGC CCAATGCCGG CGTGCCGGTC GCCACCATCG TCCGCGACGA GATCGACGCC CAGGACCTGC GCGCGGACCA ATACGATCTG TCCGGACCCA AAATCGAACT CTCGCCCAAG GCGGCGGAGA TCCTGACGCT CGCCGTTCAC GAACTGGCGA CCAATGCCCT GAAATATGGA GCCTTGTCGG TGCCGGATGG GCGCGTGCGG GTGAATTGGT CCTCGTTCGA GAAGAGAGGC GAGCCCTGGC TCGGCTTCGA TTGGGCAGAG GATGGTGTGC CGGAAGCGAA GATTGCGGGC GAGCGTGAGG CGAATCGTCG CACCGGCGCG CGCCGTGGTT TCGGCCGCGA ACTGATCGAG GTGCGCCTGC CTTACGAACT CGGGGGGCGC GGTCGCCTGG AGATCGGTCC GGAGGGGGCG CGGTGCCGCC TCGAATTTCC GCTCCGTGAC GGCGCCAGCA TTCTTGAAAC CGATGCCCCG CAACGAGCGA CCGTGTTCGG AGGAGCGCTC GACATGACCG GCGAACCAGA TTTGAGCGGC TATCGTATCC TCGTGGTGGA GGACGATTAC TATCTCGCCA CCGATACGGC ACGCGCGCTC CAGGGAGCGG GCGCGGAGGT GGTTGGCCCC TGCTCCAGCG AGGAGGCGGC GCGCGAGGCG CTCGATGAAG GGGCGCTGGC AGCGGCGCTG GTGGATATCA ATCTCGGATC AGGGCCGTCC TTCACCCTGG CGGCCCTGCT TCGGGAGCGC GGCGTGCCGT TCGTGTTCAT CACCGGTTAC GACGAGGGGG TGATTCCGCC GGATTTCGCC GATGTCGAGC GTCTCCAGAA GCCCGTCGAG CTGAAACGCG TCGTCAATTT CCTCGCCGAC ACGCTGCAGG CGGCGCAGTG A
|
Protein sequence | MSARIRAYDW ARTPLGAIGD WPQSLRTMVE LMLGSPLPAA IAWGPELTVI YNDGFDAIDN FSDPPPLGRP FTKVWPKPES DGIVDSIRQG QARQIIDRYW DLPARPERPF GWFTSQWTPL RDEAGGFAGF YLAAFETTDR VLVERALLER EEQQAFVLGL SDALRPLADP LEVQAVACRL LGEHLQADYT YYLNLYEAEG FAIIAQDFAR AGLPSRAGKY PLSLVGWAMP HQGQGQPIAI TDVQTSPLVP DDCRERVLAD RVVSLLGIPL AKQGRLVGAL ALSMTTPREW TPAEIALATE IGERTWAAME RARAEKTARE ADIRLRTLAD AAPVLIWDAD SSGTILVNDH YLDFFGVGFE AVAGYGWQKF LHPEDAERHL SLFQEAFAQR HCFTDEARLR RADGQYRWLS TSGRPLEDGR FVGVSIDVTE RRLAEQRVRR NNAVLQAINL VFSETLGASS EEELARISLE VAEELTGSAI GFIGEINETT GRLDGLFFSD RSRARYAAHA SEGDDSFPMG KSALGLAIHG IYGRVLQDGA GVIVNDPASH PDRIGTPAGH LPLTAFLGVP LKQGDRTRGL IGLGNRPGGY RPEDLEAAEA LAPAIWHALR SKRAELRLRE SEERFRQFAE ASSDVLWIRD AETFEMEFVS PALRTVYGIE PNVLGPEIRR WAGHMLPEDR ENALQNLQRA ATGQSLLNEF RIKRPSDGAF RWIRSTLFPL RDEQGRVRRI GGLSSDMTEA KLLIGHQAVL LAELQHRVRN IMAVTRSIVA RTGERAETVS DYASLVGGRL LTLARVQALL TRSPNAGVPV ATIVRDEIDA QDLRADQYDL SGPKIELSPK AAEILTLAVH ELATNALKYG ALSVPDGRVR VNWSSFEKRG EPWLGFDWAE DGVPEAKIAG EREANRRTGA RRGFGRELIE VRLPYELGGR GRLEIGPEGA RCRLEFPLRD GASILETDAP QRATVFGGAL DMTGEPDLSG YRILVVEDDY YLATDTARAL QGAGAEVVGP CSSEEAAREA LDEGALAAAL VDINLGSGPS FTLAALLRER GVPFVFITGY DEGVIPPDFA DVERLQKPVE LKRVVNFLAD TLQAAQ
|
| |
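The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the gene sequence so the result can be compared with the listed protein sequence. It assumes the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the tables differ in which codons may act as start codons). All names here are illustrative, and only the first 30 bases of the gene are used as sample input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# these same amino-acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    this gene starts with ATG, so that does not matter for this record.
    """
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the "Gene sequence" field.
prefix = clean_sequence("ATGTCCGCGC GTATTCGTGC CTACGATTGG")
print(translate(prefix))  # MSARIRAYDW
```

The translated prefix matches the first block of the "Protein sequence" field. Running `gc_fraction` on the full cleaned gene sequence would allow a comparison with the "GC content" row.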