Gene Mboo_1806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1806 
Symbol 
ID5411985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1883384 
End bp1885429 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content53% 
IMG OID640869041 
Productsignal transduction histidine kinase 
Protein accessionYP_001404965 
Protein GI154151347 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.286721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGAAA TCAGGAAGTC CCGGTTTGTC ATCATCCTGC TGGCGGCCCT CCTGTCCATT 
GCCAGTATCC TTATTACGAT CTTTTCTCTC AGGAACGGCC TTGTTGATGT ATACCCCTAC
CTGTACCTCG TCCCTGTTAT CCTTGTTGCC TATGCGTGGC CGAGGATGGG AATTTATTTT
ACCATTGTTC TTGGGTGGCT GTATCTGGGC ACCGTGTACA TGTATGGCGC GCTCGACCTG
CACCTTCTTG CTTCCAGTAG TATCTGGTTC TATATCTTTG TCTCGCTGGG TGTCCTGATC
TCTGCGTACA CAAACGAGCT GGCAAAGCAG CGTACATTCA GGGACATCTT CATGAGTTCC
CAGTCCGGGA TCTTCACTTT TGATCTTGAG ACTCTGAGAA TCCAGCAGCA CAACATCCGG
CTCAATAATC TCCTGGGGTA CGATGAGCGG GAGCTGGACG ATAAAAAACT CTCTGATATA
TGGGAATCCC CCGGGGAGAT GAATGCACTC ATCTCCCGGA TCCGGTCGGG CAGCGCGGTA
AATGATGCAG AAACCGTGTT TGCAAAAAAG GATGGCAGCC GGGTCTGGTG CCAGATTACG
ATGTCCTGTA GTACTGAGAA CCTTGTTACC ACTTCGGTTG CAGATATCAG TGAGCGCAAA
AAGGTGTGTG ACCAGCTCCG TGAAACCGAG CTTCAGTACC AGATGCTCTT TGACCGGGCC
GGCGATGCCA TTGTCATCCA TGACAGGAGC GGCCTGATCC TGGCAGCAAA CCGGATTGCC
TGCCAGCGCT CGGGCTATTC CGCAGAAGAG CTTAAAACGA TGCACTTTAC GGATCTCGAT
CTTGCGTCAT GCGCCGAAAA AACGCGTATA TCCCTGCGGG AACTTGAACA CAAAGGTTTC
CTGATTTACG AAACAATTCA CCACTACCGG TCCGGAACGA AAATCCCTAC TGAGATCAGC
AGCAGCCTTA TCGAATTCCG AGGCCGCCCT GCGGTTATCA GTATTATCCG GGATATCTCG
GAGCGAAAAC GGGCCGAAGC CCAGATCCGC GAACGCGAAC AGCGCTTCCG GAAGACCGGC
GAGCTCATTC CCTATGGCGT CTGGATTGCC GATGCAGCCG GCCAGTTTAC CTACTGCTCG
GATTCCTTCC TGGCACTTCT TGACATGAAA CTTGAGGAAT GCGCACACTT TGGCTGGATG
AAACGTCTCT CGCCGGAAGA TGCGGAGCGC ACGAAAAACG ACTGGATTGA GACTGTGCGG
ACCGGCGGGT TCTGGGACTA CGAGTACCGC CTCTTTGATA AGGCCGGGGG AGAGCACTTC
GTTTTGAGCC GCGGTTCGCC TTTACGGGAT GGTACAGGAA CGATCCTCTC CTTTGTGGGC
ATCCACCTGG ATATCACGGA CAGGCGACGG TACGAGAACC AGCTCGAAGA ATCTCTCCGG
GAAAAAGAGG TCATTATCAA GGAAGTCCAT CACCGGGTCA AGAATAACAT GCAGGTGATC
TCGGGGTTTT TGCAACTCCA GTCCAATTAT ATCAGTGACC CGGAATCGGC CGAGAAGCTC
AACGAGTGCC AGCGCCGCGT GCGCTCCATG GCACTGGTTC ACGAGAAACT TTACCAGTCG
CGGCACCTCG GGTTCATCAA TGTTGCAGAG TATATCAAGT CCCTGGTGTC AGAACTTCAG
GAAGCTTACG TGGTCCAGGC CGATATCCGG TTTGAGGTGA ATGTCGAGAA TGTCAATATC
AACCTGGATA CTGCAATTCC CTGCGGCCTT ATCATCAACG AGCTGCTGAC CAATTCCCTG
AAGTATGCCT TCAAGGGGAG AGCTTCAGGG ACTGTGAAGA TCTCCCTGAA CCTTTCCCCG
GACCACCGGT TCACTCTTAA GGCAGGTGAT GATGGGACCG GTCTCCCGGC GAATTTCGAT
CTCCACAGCA CAGCCACGCT CGGGATGCAG CTGGTCCAGG TACTGGTGCG CCAGCTCGGC
GGGGAGATTG CCATCGTCTC CCAACAAGGG ACAAGTTTTG TGATCACATT CCCGGAAAAA
TTCTAA
 
Protein sequence
MPEIRKSRFV IILLAALLSI ASILITIFSL RNGLVDVYPY LYLVPVILVA YAWPRMGIYF 
TIVLGWLYLG TVYMYGALDL HLLASSSIWF YIFVSLGVLI SAYTNELAKQ RTFRDIFMSS
QSGIFTFDLE TLRIQQHNIR LNNLLGYDER ELDDKKLSDI WESPGEMNAL ISRIRSGSAV
NDAETVFAKK DGSRVWCQIT MSCSTENLVT TSVADISERK KVCDQLRETE LQYQMLFDRA
GDAIVIHDRS GLILAANRIA CQRSGYSAEE LKTMHFTDLD LASCAEKTRI SLRELEHKGF
LIYETIHHYR SGTKIPTEIS SSLIEFRGRP AVISIIRDIS ERKRAEAQIR EREQRFRKTG
ELIPYGVWIA DAAGQFTYCS DSFLALLDMK LEECAHFGWM KRLSPEDAER TKNDWIETVR
TGGFWDYEYR LFDKAGGEHF VLSRGSPLRD GTGTILSFVG IHLDITDRRR YENQLEESLR
EKEVIIKEVH HRVKNNMQVI SGFLQLQSNY ISDPESAEKL NECQRRVRSM ALVHEKLYQS
RHLGFINVAE YIKSLVSELQ EAYVVQADIR FEVNVENVNI NLDTAIPCGL IINELLTNSL
KYAFKGRASG TVKISLNLSP DHRFTLKAGD DGTGLPANFD LHSTATLGMQ LVQVLVRQLG
GEIAIVSQQG TSFVITFPEK F