Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4875 |
Symbol | |
ID | 5833374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 5447189 |
End bp | 5449087 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641370673 |
Product | PAS sensor protein |
Protein accession | YP_001642314 |
Protein GI | 163854271 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3300] MHYT domain (predicted integral membrane sensor domain) [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.152822 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTTCCA CCGGCTACAA TCCGGCTCTC GTGGCGCTTT CCCTCGCCAT CGCCGTGCTC GCCTCCTACA CGGCGCTCGA TCTCGGCGGC CGCGTGCGGG CCGCCGCATC CGGACTGGGC TGGGCTTGGC TTCTGGGTGC CGCCCTGGCG ATGGGGGGCG GCATCTGGTC GATGCACTTC GTCGGGATGC TGGCCTTCGA GATGGGCCTG CCCGCCGCCT ACGACCTCGG CCACACCCTC CTGTCCCTGC TCATCGCCAT TGGGACGACC GGCGCGGCCC TGGCCTGGGT CGGCCGTCGG CGGGCCGGCC GCTGGGATGT CCTCGTCGCT GGCCCCGTGA TGGGGATCGG CGTGGCGGCG ATGCACTATA CCGGGATGGC CGCGATGCGG ATGCCCGGCC ATCTCGCCTA CAGCGTGCCG GTCGTCGCGC TCTCGGTCGG GATCGCCATC ACCGCGGCGA CCATCGCCCT GTGGCTGACC TTTCGGCCGA CGAGCGTCTG GCAGCGCTTC GCCGCCGCCC TGATCATGGG CGTGGCCGTG GCCGGGATGC ACTATACCGG CATGGCGGCC GCGACGATCA CCGCCGAGGA AGCCGGCGCC CATTCCGCTC ATCTCGCCGC GACGAGCGTG GATCAGCAGA ATCTCGCCCT CTACGTCGCC GGGGTGACCT TCGTGATCCT GTTCCTCGCG ATGCTCGCCT CCGCTTTCGA TCAGCAGAGG ATCCAGGGCG ATTTGCGGGC GAGCGAGGAG CGTTTCCGGG CCGCCGTACA GGCCGTGCGC GGCGTGCTGT GGACCAACGA CCCGAAGGGG CGGATGACGG GCGAACAACC CGGCTGGACC GCCCTCACGG GGCAAACCCG CGCCGAGTAC GAGGGCTTCG GCTGGGCCGA CGCCGTGCAT CCGGAGGACC GGCAAGCGAG CGTCGAGGCT TGGAATGCGA CTGTCGCCGC CCGCAGCACC TTTCTGCACG AGCATCGCGT GCGCGCCCGG GACGGCCTGT GGCGACATTT CTCGATCCGC GCAATCCCCG TGCTCGACCC GCACGGCGCC ATCCGCGAAT GGGTCGGCGT CCATACCGAC ATCACGGAGC AGCGCGAGGC CGAGGCCGAA TTGCGGGAGT CGAACGACGA GATCCAGCGC TACGCCTACA TCGTCAGCCA CGACCTGCGC GCGCCGCTCG TCAACGTCAT GGGTTTCACG AGCGAGTTGG AGGCGGTGCG CCAGGAATTG CGTACGGTGC TGCGTGACCA TCCGCAGGGC GCGCGGATCG ATGACGACAT GACCGAGGCG CTGAGCTTCA TCCAGGCGGC GATCGTCAAG ATGGAGCGGT TGATCGCAGC CATCCTGAAG CTCTCGCGAG AGGGGCGGCG CCGGTTCAGC CCGGAGCCGC TCGCGATGAC GCCCGTCATC CGCGGCATCG CCGACGCGCA GCGTCATCAG GCCGGGCGCA AGGGTGTGAC GGTGACGGTG GCCGACGATC TTCCCTCGAT CGTCGCCGAC CGGCTCGCCG TGGAGCAGGT CTTCGGCAAC CTGATCGACA ACGCTCTCAA ATATCTCGAT CCGGCCCGTC CCGGAACGAT CGAGGTCACG GCCCGGCCTG CACCCGGCAA CCGGATCCGT TTTGACGTCT CCGATACCGG GCGAGGTATC GCACCGCAGG ATCATGGCCG CATCTTCGAG CTGTTCCGGC GCTCCGGCAC GCAGGATCAG CCGGGGGAGG GGATCGGGCT CGCCAGCGTG AAAGCTTTGG TGCGCGCGCT CGGCGGCCGG ATCGAGGTTT CGTCACAACC CGGTGTCGGC ACAACCTTTA TCGTGACGCT GCCGCGCGAG CCGGTTACAG GCCGAGGCGG GGCCGCGACA CTCGATCCTC CCGATACGAT GACCCTGGCC GCGGAGTAG
|
Protein sequence | MVSTGYNPAL VALSLAIAVL ASYTALDLGG RVRAAASGLG WAWLLGAALA MGGGIWSMHF VGMLAFEMGL PAAYDLGHTL LSLLIAIGTT GAALAWVGRR RAGRWDVLVA GPVMGIGVAA MHYTGMAAMR MPGHLAYSVP VVALSVGIAI TAATIALWLT FRPTSVWQRF AAALIMGVAV AGMHYTGMAA ATITAEEAGA HSAHLAATSV DQQNLALYVA GVTFVILFLA MLASAFDQQR IQGDLRASEE RFRAAVQAVR GVLWTNDPKG RMTGEQPGWT ALTGQTRAEY EGFGWADAVH PEDRQASVEA WNATVAARST FLHEHRVRAR DGLWRHFSIR AIPVLDPHGA IREWVGVHTD ITEQREAEAE LRESNDEIQR YAYIVSHDLR APLVNVMGFT SELEAVRQEL RTVLRDHPQG ARIDDDMTEA LSFIQAAIVK MERLIAAILK LSREGRRRFS PEPLAMTPVI RGIADAQRHQ AGRKGVTVTV ADDLPSIVAD RLAVEQVFGN LIDNALKYLD PARPGTIEVT ARPAPGNRIR FDVSDTGRGI APQDHGRIFE LFRRSGTQDQ PGEGIGLASV KALVRALGGR IEVSSQPGVG TTFIVTLPRE PVTGRGGAAT LDPPDTMTLA AE
|
| |