Gene Information |
Locus tag | Mext_3897 |
Symbol | |
ID | 5834885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4331459 |
End bp | 4334251 |
Gene Length | 2793 bp |
Protein Length | 930 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641369688 |
Product | PAS sensor protein |
Protein accession | YP_001641339 |
Protein GI | 163853296 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2203] FOG: GAF domain; [COG3920] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
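The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information table for Mext_3897.
START_BP = 4331459
END_BP = 4334251
GENE_LENGTH_BP = 2793
PROTEIN_LENGTH_AA = 930


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the CDS includes exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in the table.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the 2793 bp span corresponds to 931 codons, of which 930 encode amino acids and the last is the stop codon.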
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.240133 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGAC CGCCCGACGA TTGTCCAGCA TCCGGTGAGG CCGGCGCAGA GGCCGAGAGG CTGGCCGCGC TGGCCTGCTA CGGCATCCTC GACACGCCCG CCGAGGCCGC GTTCGACGAT GCCGTGGCCT TGGCCGCCCA GCTCTGCGCC ACGCCGACCG CCCTCGTCAG CCTGGTGACC GGCGACCGGC AATGGTTCAA GGCGCGGCTC GGCTTCGCCC CCGGCGAGAC CGGGCTCGAC CGCTCGGTCT GCGTTCACGC CCTGGCCGGG CGGGGCTTGC TGGTCATCCC CGATCTCGCG GCCGACCCGC GCACCCGCAC CAACCCGCTG GTCACGGGAG AGCCCGGCAT CCGCTTCTAT GCCGGCGCCC CCCTGGTGAC GCCTGAGGAT CGGGCGATCG GCACCCTGTG CGTGCTCGAC ACCAAGCCCC GGCCGGAGGG TCTGAGCGCG GCCCAAGGGG CCGGTCTCGA AGCGCTCGCC CGCACGGTGA TGACGCAGCT CGAACTGCGC CGGGGCATCG CGGCCCGGAA GACCGAGGCC GCGGCCCTGG CCGACAGCGA GTTCCGTCTG CGGCTCGCGA TCGAAGCGGC CGGTGCCGGC ATCTTCGACT ACGACCTCGT GGCGGGCACC CTCGCCTGGG ACGGGCGGAC CCGCGCCCTG TTCGGGGTCG GGCCGGACGA GGCCGTGAGC TATGCCGGCA CGTTCCTCGC CCGCCTCCAC CCCGAGGACC GGGCGCGCAC CGATGCCGCC GTGCAGGCGG CCCTCGACCC GGCCGGGCCC GGCCTGTTCG ACGCCACCTA CCGCACCGTG ACGGCGGACG GCACGGTGAT CGCCTGGGTC GCGGCCCGCG GCACCCTCGT CGTCGAAACG GATGAGGGGA TCAAACGGGC GCGGCGCTTC GTCGGCACGG TGCGCGACGT CACCGCCGAG CGCACGGCGC AGGTCGCCGT CGCCGCCACC GAGGAGCGCT ACCGCCTCGT CACCCGCGCC ACCAACGACG CGATCTGGGA CTGGGACCTC GCCGCCGACC ACGTGCTGTG GAACGAGGCC CTTCAGGCCG CCTATGGCTG GGCGCCGGAG ACGGTCGAGC CGACGGGCCG GTGGTGGCTC GACCACATCC ACGCCGAGGA TCGCCCCCGC GCCGAGGCGG GTATCCGCCG CGTCATCGGC GGCGGCGGCC ACGATTGGCA CCACGAATAC CGCTTCTGCC GCGCCGACGG CGCCTACGCC GACGTGCTCG ATCGCGGTTC GATGGTGCGC GGGGCCGACG GCAGGCCGCT GCGCATGATC GGCGCCATGC TGGATCTGAC CGAGCGCAAC CGCGTCGCCG CCCAGCTCCG GGCGGTGGTC GAAGGCGCGA ATATCGGCAT CGTGCAGATC GATCCGCGCA CCATGATCGC GCTGGAGGCC AACCCGAAGC TCTGCGCGAT CTGGGGGGCG GAGGAATCCG ACATCGTCGG GCATTCCATT GCCAAGTGGA CGCCGGAAGC GGATGCGGCG GAGCGCGACC AGCTCCACCG CCGGCTCGCC GCGGGCGAGA TCGTGCGCGA GACCCTGGAG AAGCGCTACC GCCGCAAGGA CGGGCGCCTG ATCTGGGGCC GGGTCAACCT CGTCTCGCAG GCCCGCGGCG AGGCGCTCCA GGCCACGGCG ATGATCGAGG ACATCACCGC GGAGAAGGCG ACCGAGGCAC GCCAGACGGC GCTGATCGAA CTCGGTGACA CCTTGCGCGA CGCCGCCGGC CCCGCCGAGA TCCGTGGGAT CGCGGCCCGG ATCCTTAGGC GCAGCCTCGA CCTCTCGGAG 
GCGGGCTACG CGGCCATCGA TGCCACCGTC GGCGGCTTTG CGATCGGGCG CGCGAGACCG GACGGGACGA TGAGCCCCGC GCCGTTTCCG GCCATGCTCG CGCGGCTGCG CCGCGGCGAG ATCCTGGCCG TGCCCGATCT GACCGCCGAA CCGGACCTCG CGCCGGATGC AGGCGGCTAC GCGGCGGCCG GCGCCCGTGC GCTGATCGGC GTGCCCTTGA CCCGGCGGGG CGTCCTCGTC GGCCTCGTCT ATGCCCACGC CGCCGAACCC CGGACCTGGG ACGCGGGCGA GGTCGATTTC GTCCGCGAGG TGGCCGGACG GATCTCCGTG GCGCTCGCCC GCATCCAGGC CGAGGAGCAG CAGCGCTTCC TCAACCGCGA ACTGAGCCAC CGGTTGAAGA ACACCCTGAC CATGGCCCAG GCCATCGCCT CGCAGACGCT GCGCAACGCC ACCGACATCG CCTCGGTGAA GGAGGCGCTG GTGGCCAGGC TGGTGGCGCT CGGCAAGGCG CACGACATCC TGCTCTCGGG CGAGGGCGAG GGGGCGGCGC TGCAGGCGGT GATCGCCGGC GCGCTCACCA TCCACGACGA CGGCGAGCCC GGCCGCATCC ACCTGTCCGG CCCAGCCCTG GAGGTCGGGC CGAAAGCCGC GCTGTCGCTG GCGCTGATGA TCCACGAACT CGCCACCAAT GCCGCCAAGT ACGGCGCCTT CTCGGTGCCG GGCGGGCGCG TCGGGGTGAA CTGGCACGTC GCGCGGGCGC GTTGGCCGGA GGACGCGGAG GATGCGGGGG AGGCCGAGCC GGTCATCACG ATGACCTGGG CCGAGACCGG CGGACCGCCG GTCGCCGCGC CCACCCGCAA GGGCTTCGGC TCGCGGCTGA TCGAGCGCGG GTTCTCCGGG GCGGTCGGCG GCGAGACGCA GATGATCTAC GCCCGAGAAG GGGTGACGTG CCGGATCAGG GCGCCCCTGA AGGGTCTTCT CGAAAAAGAA TAG
|
Protein sequence | MSRPPDDCPA SGEAGAEAER LAALACYGIL DTPAEAAFDD AVALAAQLCA TPTALVSLVT GDRQWFKARL GFAPGETGLD RSVCVHALAG RGLLVIPDLA ADPRTRTNPL VTGEPGIRFY AGAPLVTPED RAIGTLCVLD TKPRPEGLSA AQGAGLEALA RTVMTQLELR RGIAARKTEA AALADSEFRL RLAIEAAGAG IFDYDLVAGT LAWDGRTRAL FGVGPDEAVS YAGTFLARLH PEDRARTDAA VQAALDPAGP GLFDATYRTV TADGTVIAWV AARGTLVVET DEGIKRARRF VGTVRDVTAE RTAQVAVAAT EERYRLVTRA TNDAIWDWDL AADHVLWNEA LQAAYGWAPE TVEPTGRWWL DHIHAEDRPR AEAGIRRVIG GGGHDWHHEY RFCRADGAYA DVLDRGSMVR GADGRPLRMI GAMLDLTERN RVAAQLRAVV EGANIGIVQI DPRTMIALEA NPKLCAIWGA EESDIVGHSI AKWTPEADAA ERDQLHRRLA AGEIVRETLE KRYRRKDGRL IWGRVNLVSQ ARGEALQATA MIEDITAEKA TEARQTALIE LGDTLRDAAG PAEIRGIAAR ILRRSLDLSE AGYAAIDATV GGFAIGRARP DGTMSPAPFP AMLARLRRGE ILAVPDLTAE PDLAPDAGGY AAAGARALIG VPLTRRGVLV GLVYAHAAEP RTWDAGEVDF VREVAGRISV ALARIQAEEQ QRFLNRELSH RLKNTLTMAQ AIASQTLRNA TDIASVKEAL VARLVALGKA HDILLSGEGE GAALQAVIAG ALTIHDDGEP GRIHLSGPAL EVGPKAALSL ALMIHELATN AAKYGAFSVP GGRVGVNWHV ARARWPEDAE DAGEAEPVIT MTWAETGGPP VAAPTRKGFG SRLIERGFSG AVGGETQMIY AREGVTCRIR APLKGLLEKE
|
| |
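Although the gene lies on the minus strand, the gene sequence listed above begins with ATG and ends with TAG, so it appears to be given in coding orientation. As a rough spot check, the first three 10-base blocks of the gene sequence can be translated and compared with the first block of the protein sequence. The sketch below uses a deliberately partial codon table containing only the codons needed for this fragment (these assignments are the same in translation table 11 and the standard code). The helper name `translate_fragment` is hypothetical.

```python
# Partial codon table: only the codons that occur in the fragment below.
# This is an illustrative subset, not a full translation table.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "AGC": "S", "CGA": "R", "CCG": "P", "CCC": "P",
    "GAC": "D", "GAT": "D", "TGT": "C", "CCA": "P", "GCA": "A",
}


def translate_fragment(dna: str) -> str:
    """Translate a DNA fragment codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("fragment length is not a multiple of 3")
    codons = (dna[i:i + 3] for i in range(0, len(dna), 3))
    # A KeyError here means the codon is outside the partial table.
    return "".join(PARTIAL_CODON_TABLE[codon] for codon in codons)


if __name__ == "__main__":
    # First three blocks of the gene sequence, as listed in the record.
    fragment = "ATGAGCCGAC CGCCCGACGA TTGTCCAGCA"
    print(translate_fragment(fragment))  # MSRPPDDCPA
```

The result matches the first ten residues of the listed protein sequence. Translating the full gene would need a complete table 11 codon map, which is omitted here.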