Gene Mboo_1248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1248 
Symbol 
ID5412077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1269096 
End bp1272041 
Gene Length2946 bp 
Protein Length981 aa 
Translation table11 
GC content53% 
IMG OID640868476 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001404409 
Protein GI154150791 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.206477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTTTA TTGATGAAAT GAAACTGGGA AAAGAGTTCA ACACATTGCT TGTCGATATA 
GAGCACATGT CGGACGCGCA CGACAAAGGG GATATTGATG TACTTATACC GGTCGAAAAG
TACCAGGGTT CGGCAAAGAC CGCGGCCCAG TGTATCAACA ACATGGTTAA CGGGCACATC
ACGGTAAAGA AGAAGGCTAT GGCAACTGTT GCCGAGTTTG CCAAGGGCAA CTTCGAAGCA
CCGCTGGAGA AATTCCCGGG CAAGAAGGCG TTTATCAACG ACAATATCGA ACTCCTGCGC
ACGAATCTCA AGGCCCTTGT CGCAGATGCG AACATGCTCA GCAAGGCCGC AGTCGAGGGC
AAACTTGATA CCCGTGCTGA TGCATCCAAA CACCAGGGAG ACTACCACAA GATTGTCCAG
GGCGTCAACG ATACACTTGA CTCGGTCATC GGCCCGCTCA ATGTAGCAGC GGAGTATGTT
GACCGGATCA GCAAGGGAGA TGTCCCGGCC AAGATCACCG ACAATTACAA CGGTGACTTC
AACGAGATCA AGAACAATCT CAACCAGTGT ATTGACGCAG TCAACCTCCT GGTAAAGGAT
GCAAACCTGC TCAGCAAGGC CGCAGTCGAG GGTAAGCTCG ATACCCGTGC TGATGCATCC
AAACATGAAG GCGACTTCAG GAAGATCGTT GAGGGCGTTA ACCAGACCCT TGACTCGGTC
ATCGGCCCGC TCAACGTTGC CGCAGAGTAT GTAGACCGGA TCAGCAAGGG AGATGTCCCG
GCCAAGATCA CCGACAATTA CAACGGTGAC TTCAACGAGA TCAAGAACAA TCTCAACCAG
TGTATTGACG CAGTCAACCT CCTTGTTGCA GATGCAAACA TGCTCAGCAG GGCAGCAGTT
GAAGGTAAAC TCGATACCCG TGCTGATGCA TCCAAACACC AGGGCGACTT TAGGAAGATT
GTCCAGGGTG TTGACGACTG TCTTGACGCA GTGATCGGCC CGCTCAACGT TGCCGCAGAG
TATGTAGACC GGATCAGCAA GGGAGATGTC CCGGCCAAGA TCACGGACAA CTACAACGGC
GACTTCAACG AGATCAAGAA CAACCTCAAC CAGTGTATCG ACGCAGTCAA CCTCCTGGTC
CAGGAAGCAA ATATGCTCAG CAAGGCAGCA ATTGAGGGCA AGCTCTCCAC CCGTGCTGAC
GCATCCAAAC ACCAGGGTGA CTTCAGGAAG ATCGTCCAGG GTGTTGATGA CTGCCTGGAT
GCCGTTATCG GTCCGCTCAA CGTAGCTGCC GATTATGTAG ACAAGATCTC CAAGGGCAAC
ATCCCGTCAA GGATTACTGA CAACTACAAC GGCGACTTCA ACGTTATCAA AAACAACCTC
AACCAGTGCA TTGACGCTGT CAATCTCCTG GTCCAGGACG CAAACATGCT CAGTAAGGCG
GCAGTTGAAG GCAAACTCGA TACCCGCGCT GATGCATCCC GACACCAGGG TGATTTCAGG
AAGATTGTCC AGGGTGTTGA TGACTGCCTT GACTCGGTCA TCGGTCCGCT CAATGTAGCT
GCTGATTATG TAGACAAGAT CTCCAAAGGC AACATCCCGG CAAGGATCAC TGACAACTAC
AACGGCGACT TCAACGTTAT CAAAAACAAC CTCAACCAGT GCATTGACGC TGTCAATCTC
CTGGTTAAAG ATGCAAACAT GCTCTCGGAA GCAGCGGTTG CAGGAAAACT CGGGACCCGT
GCCGATGCAT CCAAGCACCA GGGTGACTTT AAGAAGATCG TTGACGGTGT TAACGACACG
CTCGATGCAG TCATCGACCC AGTAAACGAG GCCATGCGCA TCTCAGATGA GTACGCAAAG
CAGAACTTTA CCGCCCGCGT AAACGAGGAC CTCCAGGTCC GCGGGGACTT TATCCGGTTC
AAGAAATCCC TCAATAACGT GGGTATCCAG GTTTCTTCGG CACTCAGCAA GGTCAATGAC
CGTGTCAACG AACTTTCAGC GAGCGCCCAG GAAGCCAGCG CAAGCGCAGA GGAGGTCTCT
GCCGGTTCAA ACCAGGTGGC AAAGAACGCC GGTGCAGTCA GCTCGAATGC AGACAAGAGC
GGGGAAGGAA TCAAGCAGGT ACTCAAGGCA ATGGAAGACC TCTCAAACAC GGTCCAGGAA
GTTGCCTCCC GTGCTGATGC AGTTGCCCAG CTTGCCAAAC AGTCAGAAGA ACTTTCCCGG
AAAGGTACCG ATCTCGCAAA TAAGGCAGAC CGGGGAATGG ACGGCATCAC CAAGTCCTCC
GCAGAGGTTA ACACGATTAT CCTTGATATC AAGTCCCAGA TGGACAAGAT CGGGGACATC
GTGGTCCTGA TTGCAAACCT GGCAAACCAG ACCAATCTCC TTGCCCTCAA CGCGGCAATC
GAAGCAGCCC GCGCCGGTGA AGCAGGCCGC GGATTTGCGG TCGTCGCCAC CGAGGTCAAG
TCGCTTGCCG AGGAGTCGGA GAACTCCGCA GAGAAGATCC GCCAGATGAT CGGTGAGCTC
CAGAAGCAGA CCCAGCGTGC AGTTGAGGCC GTGGAATCTG CAAATGATGG TGTCAGGGAA
GGCAGCGGCG CCCTTACCCA GACCCTCGAG GTATTCAACA AGATTGTCAC CTCGATCGAC
GACATCAACA AGAACATTTC AAGCGTAGCT GCATCCGCAG AAGAGCAGGC CGCATCAGTC
GAAGAAGTTA CTGCCAGTGT CACAACCGTA AGCACCTTGA TCAGTGAGAC CGCAAAAGAG
GCAACCGACG CAGCTGCAGC AACCGAAGAG GCGTCCGCAT CCATCGACCA GATCACCAAA
GTCATCCAGA ACGTGAACAC CATTGTCGAG GATGTCAGTC ACGAGATCGC AGGGTTTAAG
GTCGATGCAT CGGTACGGAT CAGCGCAGAC AGAGGCGCGG CATCGAACAC AGCGAGCGCC
CAGTAA
 
Protein sequence
MTFIDEMKLG KEFNTLLVDI EHMSDAHDKG DIDVLIPVEK YQGSAKTAAQ CINNMVNGHI 
TVKKKAMATV AEFAKGNFEA PLEKFPGKKA FINDNIELLR TNLKALVADA NMLSKAAVEG
KLDTRADASK HQGDYHKIVQ GVNDTLDSVI GPLNVAAEYV DRISKGDVPA KITDNYNGDF
NEIKNNLNQC IDAVNLLVKD ANLLSKAAVE GKLDTRADAS KHEGDFRKIV EGVNQTLDSV
IGPLNVAAEY VDRISKGDVP AKITDNYNGD FNEIKNNLNQ CIDAVNLLVA DANMLSRAAV
EGKLDTRADA SKHQGDFRKI VQGVDDCLDA VIGPLNVAAE YVDRISKGDV PAKITDNYNG
DFNEIKNNLN QCIDAVNLLV QEANMLSKAA IEGKLSTRAD ASKHQGDFRK IVQGVDDCLD
AVIGPLNVAA DYVDKISKGN IPSRITDNYN GDFNVIKNNL NQCIDAVNLL VQDANMLSKA
AVEGKLDTRA DASRHQGDFR KIVQGVDDCL DSVIGPLNVA ADYVDKISKG NIPARITDNY
NGDFNVIKNN LNQCIDAVNL LVKDANMLSE AAVAGKLGTR ADASKHQGDF KKIVDGVNDT
LDAVIDPVNE AMRISDEYAK QNFTARVNED LQVRGDFIRF KKSLNNVGIQ VSSALSKVND
RVNELSASAQ EASASAEEVS AGSNQVAKNA GAVSSNADKS GEGIKQVLKA MEDLSNTVQE
VASRADAVAQ LAKQSEELSR KGTDLANKAD RGMDGITKSS AEVNTIILDI KSQMDKIGDI
VVLIANLANQ TNLLALNAAI EAARAGEAGR GFAVVATEVK SLAEESENSA EKIRQMIGEL
QKQTQRAVEA VESANDGVRE GSGALTQTLE VFNKIVTSID DINKNISSVA ASAEEQAASV
EEVTASVTTV STLISETAKE ATDAAAATEE ASASIDQITK VIQNVNTIVE DVSHEIAGFK
VDASVRISAD RGAASNTASA Q