Gene Mpal_0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0053 
Symbol 
ID7272222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp54621 
End bp57527 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content54% 
IMG OID643568710 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_002465170 
Protein GI219850738 
COG category[T] Signal transduction mechanisms 
COG ID[COG2202] FOG: PAS/PAC domain
[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGTT CCTTCCTTCA GCACCTGCAA GATCGTCCAA ATCTACTTGC TGCACTGATC 
GCAGTCAGTA CCGGCATAAC TCTGATCCTG AACGAATACG AACTCATGAT CGGAACGACC
AGTGTCCTCC CCCACCTCTT TTATATTCCG ATCATCCTCA CCGCGTACTT TTTTCCCCGA
CGCGGAATCA TCTTCTCAGC GGTTATTTCA GCGATCTATT GTGGTATGAC CTATATCTCG
AATCCAATCA TCCCCGGGGA ACTGCTATTG GTCGGGGGAC GGGTGATCAT GTTCATCCTG
ATTGCGACCG TTGTCTCGTC CCTGACTGCC CGCCAGAGGG AGAGCGAGAC CCTCTTCCGG
GGTGTGGCTG AACGGAGTTC CGATATTATC CTGCTCACCG ACAGGGAAGG ACGGGCAACG
TATGTCTCCC CTTCAGCCAG AAAGATCCTG GGCTACGATC CTGCTGAGAT CATCGGGAAA
CTCTCCGGAG CTTTCATCCA CCCCGATGAT CTCGGCCAGT TAGAGAAGTC GTTCCCTGAT
CTCCTGGGAG GGGGGGTCAC CGAAGGGATC ACGGTCAGGT TCAGAAAAAA ATACGGAGGC
ACTGCATTCA TAGAATTTTT TGGTGCACCC ATCATAAACG ATGGAACGAT ATCAGGAGTC
CAGGTCATCG GGAGGGATAT CACAGAGAGA AAGCGGACAG AAGACGAACA AAAAATCCGC
GATGATCTCC TCAACGCCAT TCTGGAATCG ATGATCGCCG GCGTGGTGGT GATCGATCCA
GAGGACCATA CCATCGTCGA GGTGAACGCC GTTGCAGCGG CTATGATCGG GGCAAAAAAA
GAGGAGATCA TCGGTTCGGT CTGCACCAGT TACATCTGTC CTGCCCTGAA CGGAGGATGC
CCGATCACCG ATCTCCATCA GATCGTCGAC CGGTCAGAGA AGGTGCTGAT CCAGGCGGAT
GGTGAACGAC GCCCGGTCCT GAAATCGGTG GTACCGATCA TCCTTCACGA CCGCACCTAC
CTTCTTGAAA GTTTCATCGA TATATCTGAA CTCACCCTGA CGCAGAATGC CCTCAAGGAG
AGTGAGGAAA AATATCGAAC ACTCGCCGAT TACACCTACG ACTGGGAGTA CTGGATCGGT
CCTGATGAAT CGATCCTGTA TACGACTCCT TCCTGCGAAC GGATCACCGG CTACACCCCG
CAGGAATTCT ATACCACCAG GCGTCTGATC AATACCATCC TGCATCCTGA GGACCGGGAT
GCCCTTGAGC ACCACATGTC CCGTTTTTTT CCGCCCCATA CCCTGGAAAC AGTTGATTTC
CGGATAGTTC ATCGGGATGG GAGCATTCGC TGGATCGGGC ATGTCTGTCA GCCCCTCTAT
AACGCAAAGG GAGCGTTCAT CGGAAGACGT GCCAGCAATC GGGATATCAC CGAACAAAAA
CAGGCGGAAG AGGCCTTTCG TGAGACCAGC CGGCGTCTTG CCGAGATCAT CGATTTCCTT
CCTGAACCGA CGATGGTCAT CGACAGGACT GGTGTTGTTG TAGCATGGAA CCATGCTCTG
GAACTGTTGA GCGGGGTTTC GGCATCGGAT ATACTCGGCA AGGAGAGAGA ATCGTATACT
GCATGGATCT CCGATCACAC CGGTCCAATC CTCATCGATT ATGTCCTGCA ACAGGATCAT
GAAGAGATAA AAAAAGCGTA TCCCAATGTC CACTTCAAAG GAACTACGGT GATGACCGAG
ACAGAGATCT CCCGTATGGA TGGCGCCCGT TTTTCGCTCT GGGCCAGTGC GACGCCGCTG
ATCGATCAGA AGGGGGAGAT CACCGGTGCG ATCGAGTCGA TCCGGGATGT CACCGACCAG
AAGATGGTCC AGCGGGCGTT ACAGGAATCG AATGCATACC TGGACACGGT GATCAATACC
CTGGCGGACC CGCTCTTCAT CAAGGATCAC AAGCACCATT TTGTCAAAGT GAACACCAGT
TTCTGTCAGT TGACCGGACA CACCCGGGAG GAATTGCTGG GAAAGACCGA TTCTGATCTC
TTCAGAAGGG AGGAAGCGGA TGTCTTTTTG GAGAAAGATG ATGAGGTTCT CCGGACCAGT
CAGGGGAATG AAAACGAGGA GAAGATCACC GACTTACGGG GAAATACTCA TACGGTCATC
ACCAAAAAAG CCCTCAACAC CAATACCGCC GGCGATGAGT TCATCGTCGG GATCATCAGA
GACATCACCG AACGGAAACA GACTGAGCTG GCGCTGCAGG AAGCCCTCAA GAAACTCAAC
ATGCTCTCGT CCATCACCCG ACATGACATT CTCAATCAGA TCATCGGCCT GCGTGCCTAT
CTCGAACTCT CGATTGAACG TGTACAGGAT CCCGACCTCC AAGAGTACCT GAAGAAGGGG
GACGTGGCAG CGGATGCAAT ACGCGAACAG ATCGAATTCA CCCAGTATTA CGAGGATCTC
GGGGTCCAGA CACCCCAATG GTGTACCATC GCAGAGATCT TCCGATCTGC CGCGTCGCAG
CTCCCGCTCG GGGAGATCGG TGTCGAGGTG CAGGTCTCCA ATCTCCTCGT CTATGCAGAC
CCCCTGATCG AGAAGGTCTT CTACAATCTC ATCGAAAACT CCCTGCGCCA TGGGGGTCAT
GTGACCGTGA TTCGGCTCAC CGCAGACGAG ACCCCGGACG GTATCGTGAT CACCTACCAT
GACGACGGTG CCGGTATCGC CTTTGATGAC AAGGACAGGC TCTTCCAGAA GGGGTTTGGG
AAACATACCG GTCTCGGTCT CTTTCTGATC CGGGAGATCC TCTCGATCAC CGGGATCTCC
ATCACCGAGA CCGGCGAACC CGGTGCCGGC GTCCGGTTTG AGATCCATGT TCCCAGGGGG
ACGTACCGGT CTACTGACCC CACGTAA
 
Protein sequence
MDRSFLQHLQ DRPNLLAALI AVSTGITLIL NEYELMIGTT SVLPHLFYIP IILTAYFFPR 
RGIIFSAVIS AIYCGMTYIS NPIIPGELLL VGGRVIMFIL IATVVSSLTA RQRESETLFR
GVAERSSDII LLTDREGRAT YVSPSARKIL GYDPAEIIGK LSGAFIHPDD LGQLEKSFPD
LLGGGVTEGI TVRFRKKYGG TAFIEFFGAP IINDGTISGV QVIGRDITER KRTEDEQKIR
DDLLNAILES MIAGVVVIDP EDHTIVEVNA VAAAMIGAKK EEIIGSVCTS YICPALNGGC
PITDLHQIVD RSEKVLIQAD GERRPVLKSV VPIILHDRTY LLESFIDISE LTLTQNALKE
SEEKYRTLAD YTYDWEYWIG PDESILYTTP SCERITGYTP QEFYTTRRLI NTILHPEDRD
ALEHHMSRFF PPHTLETVDF RIVHRDGSIR WIGHVCQPLY NAKGAFIGRR ASNRDITEQK
QAEEAFRETS RRLAEIIDFL PEPTMVIDRT GVVVAWNHAL ELLSGVSASD ILGKERESYT
AWISDHTGPI LIDYVLQQDH EEIKKAYPNV HFKGTTVMTE TEISRMDGAR FSLWASATPL
IDQKGEITGA IESIRDVTDQ KMVQRALQES NAYLDTVINT LADPLFIKDH KHHFVKVNTS
FCQLTGHTRE ELLGKTDSDL FRREEADVFL EKDDEVLRTS QGNENEEKIT DLRGNTHTVI
TKKALNTNTA GDEFIVGIIR DITERKQTEL ALQEALKKLN MLSSITRHDI LNQIIGLRAY
LELSIERVQD PDLQEYLKKG DVAADAIREQ IEFTQYYEDL GVQTPQWCTI AEIFRSAASQ
LPLGEIGVEV QVSNLLVYAD PLIEKVFYNL IENSLRHGGH VTVIRLTADE TPDGIVITYH
DDGAGIAFDD KDRLFQKGFG KHTGLGLFLI REILSITGIS ITETGEPGAG VRFEIHVPRG
TYRSTDPT