Gene Information |
Locus tag | Mpal_0994 |
Symbol | |
ID | 7271727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 1024681 |
End bp | 1026603 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643569633 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_002466068 |
Protein GI | 219851636 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
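
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1024681
end_bp = 1026603
protein_length_aa = 640

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons; the last one is the stop.
codons, remainder = divmod(gene_length_bp, 3)
encoded_residues = codons - 1

print(gene_length_bp, remainder, encoded_residues)  # 1923 0 640
```

Under these assumptions the computed values agree with the Gene Length (1923 bp) and Protein Length (640 aa) fields.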
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.620121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0418978 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCCTG GAATTGAGAT GTACCTGCTC AGATTATATG TGGCTGGTGA GACTCAGGAA TCTGCCCGGG CAATCCAGAA CCTTGTGCGT ATCTGTGAGA CATACCTGGT GGGACGGTAC GACCTGGAGG TGATCGATAT CTCCCGGCAT CCGAGTCTCT CACGGGATGA GCAGATCGTT GTGACCCCGA CGCTGATCAG GAAGGGGCCG GCTCCTGAGC GGCGACTGAT CGGCGACCTG TCAGACACCC GGGTGGTCCT TCGGGGTCTT GGTTTACCTG AGGGGGGGCT TCAAAACGAA CCGAACGATT CAGGATCTTC TGCAAACCCG CAGGTCATAA AACTCTGTGA ACGTTTGCGG GAGACTGGCG TCGATAGGAT GGATATCTGG ACCGGCCAGG TGGATGGAAT CGTCGTCCCG ACCTCTGATG AAGACAGGCT ATTCATAAAC CCCGGGGGGG CATCTCCGTA CTGGACCTTT GTCGAGACGA TGAATGAGGG GGCGGTGGTC ATCGATCGGG CCGGAACGAT TCTGTACTGC AACCGACGGT TTGCTGCGAT CATCGAGGCG CGTATGGAGA TGATCTCAGG TTCATTCTTC GATAGTTGGG TATGTTCTCA TGATCACCTC CTCTTTGAAG CACTCTCTGT GGCCGGTGCA GATCGGCAAT CTTCAGGGGA TCTGCAGATG GTCAATACCC GGGGGCGCCT GGTCCCGGTT CATCTCTCGC TCAGCCCGTT TACGGGCGGA GGGGTCTCTG AAATTTCTAT CGTGGTGACC GATCTGACCA CGCGGAAACA TAACTGTGCG TTGCTTCGAT CCGAACATCT GGCCCATTCG ATCCTCGAAC AGGCCGCCGA CCCGATCGTC GTGATCAACG CTGAAAAGGT GATCATCAGG GCCAATACCG CGGCAGTGGA GATGGCCGGG ACCAACCCCC TTCTTGCACA GTTTGATTCG GTCTTTCCAC TCTTCCAGGT GATTGGGGAT GAAGAAATAC CCTTCTCACC GTCCAGAATC TCCAGTGCCG GCAAGCAGTT ACAGGGGATG GAGGTTCTCT TTCGAAGAAC GGACAGGTCA CTCTTTTCCT TGCTGATCAG CGCGGCACCG ATAATCGGAG AATTCGGGGA ATCGCCCGGA TGCGTGGCTG TGATGACCGA TATCACGGCC CAGAAACAGG TCGAAGGGGA ACTTCGAAGG ACACTCGACG ACCTGGCCCA CTCCAATCAG GACCTGCAGC AGTTTGCGTA CATCGCCTCG CATGACCTGC AGGAACCACT CAGGATGGTG GCCAGTTACC TGCAGCTCCT CGAGCGAAAG TATCGGGACC GGCTCGATTC GGACGCCCAG GAGTTCATCG GGTTTGCGGT CGAGGGTGCG AACCGGATGC AGCAGCAGAT CAACGACCTG CTGGCGTACT CGCGGGTCAC GAGCCGGGGC CAGCCTCTCA AGCCGGTGAG CGCCGAAGAG GCATTGGCCT CTGCACTGAG TCACCTGGCC CTGAAGATCG AAGAGACTGG TGCCACGGTC ACCCATGACC CACTTCCGAT GGTCAGGGCC GATCTCCCGC AGCTGGTTCA GGTCTTTTCG AACCTTCTCG ACAACGCACT CACATTCCTC CGTCCCCAGG TCGCCCCGGT GATCCATCTC TCGGTGGAAG ATCAGGCTGG CTGGGTGGTC TTTTCTCTGC ACGACAATGG GATCGGGATC GACCCGGAGT TCTATCAGCG GATCTTTCAG ATGTTCCAAC GACTTCACTC TCGCGCGGAG TATCCCGGGA CCGGTATCGG GCTTGCGATC 
TGCCAGCGAA TTATCGAGCG GCACCATGGG CGGATCTGGG TCACTTCGGT TCCCGGCAGT GGATCGACCT TCTCCTTTAC GATCCCTGGT GTTAATGGTC ACCTTCATGG TCCTGATCGC TGA
|
Protein sequence | MAPGIEMYLL RLYVAGETQE SARAIQNLVR ICETYLVGRY DLEVIDISRH PSLSRDEQIV VTPTLIRKGP APERRLIGDL SDTRVVLRGL GLPEGGLQNE PNDSGSSANP QVIKLCERLR ETGVDRMDIW TGQVDGIVVP TSDEDRLFIN PGGASPYWTF VETMNEGAVV IDRAGTILYC NRRFAAIIEA RMEMISGSFF DSWVCSHDHL LFEALSVAGA DRQSSGDLQM VNTRGRLVPV HLSLSPFTGG GVSEISIVVT DLTTRKHNCA LLRSEHLAHS ILEQAADPIV VINAEKVIIR ANTAAVEMAG TNPLLAQFDS VFPLFQVIGD EEIPFSPSRI SSAGKQLQGM EVLFRRTDRS LFSLLISAAP IIGEFGESPG CVAVMTDITA QKQVEGELRR TLDDLAHSNQ DLQQFAYIAS HDLQEPLRMV ASYLQLLERK YRDRLDSDAQ EFIGFAVEGA NRMQQQINDL LAYSRVTSRG QPLKPVSAEE ALASALSHLA LKIEETGATV THDPLPMVRA DLPQLVQVFS NLLDNALTFL RPQVAPVIHL SVEDQAGWVV FSLHDNGIGI DPEFYQRIFQ MFQRLHSRAE YPGTGIGLAI CQRIIERHHG RIWVTSVPGS GSTFSFTIPG VNGHLHGPDR
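
As a quick consistency check between the two sequences, the first and last codons of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch: `translate_fragment` is a hypothetical helper, and the codon map is deliberately partial, covering only the codons that occur in the two fragments (their assignments are the same in translation table 11 and the standard code).

```python
# Partial codon map: only the codons present in the fragments below.
CODONS = {
    "ATG": "M", "GCT": "A", "CCT": "P", "GGA": "G",
    "GGT": "G", "GAT": "D", "CGC": "R", "TGA": "*",  # "*" marks a stop codon
}

def translate_fragment(dna: str) -> str:
    """Translate an in-frame DNA fragment codon by codon (hypothetical helper)."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("fragment length must be a multiple of 3")
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))

# First 12 and last 15 nucleotides of the gene sequence listed above.
five_prime = "ATGGCTCCTGGA"
three_prime = "GGTCCTGATCGCTGA"

print(translate_fragment(five_prime))   # MAPG
print(translate_fragment(three_prime))  # GPDR*
```

The results match the start (MAPG) and end (GPDR) of the listed protein sequence, with TGA as the terminating stop codon. A full translation would need the complete table 11 codon map.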