Gene Mboo_1104 details


Gene Information

Locus tag: Mboo_1104
Symbol:
ID: 5411264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1099118
End bp: 1100827
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 55%
IMG OID: 640868330
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001404265
Protein GI: 154150647
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
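
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1099118, 1100827)
print(length_bp)                  # 1710
print(protein_length(length_bp))  # 569
```

Both values agree with the Gene Length and Protein Length fields in the record.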


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.53532
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.324425
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCTCC AGTATTCCCC GTTTTTTTTC CCCCTTCTCC TGTCCGGAGG GCTGACCGGT 
ATTATAGCCT ATATCAGCAT ACGGAACCGT TCCAACCCGG TGGCCCTGCC GTTTGCAGTC
CTTATGGGTG CAGTCACGCT CTGGACAACC GCATACGCAG TGCAGCTGAT CACAGCCGAT
CTGGCCACCA CCCTGTTCTT CACGGATCTG GAGTATATCG GGATTGTAAC CGTACCTGTG
GCCTGGCTGC TGGTTGTCCT GTGTTATACC GGTCGTCCCC GGTATGTCAC GGCCCGGAAT
GTCGGCCTGC TCATGATAGT GCCTGCCATT GTAGTCCTGC TGGTGGCAAC AGATCCCCTT
CACCATCTCT ATTATTCCCT GATATACCCC GGGCAGGTTG CGGGCTCTGT TCTCTGGATC
TTTACAAGGG GGCCGCTTTT CTGGATAAAT GTCGGCTATA ATTACATTCT GCTGATAGTC
TCCCTTATCC TCCTTGCTTC GCGCTTTTCC GGCGCTCCTG CAGTATACCG GAGGCAGATC
CTGATCCTCG GGATAGCCGT GATCGTGCCG GTACTGGCAA ACTTTGTCTA CCTCTCGCCC
ATTGACCCGG TCCCAGGCCT GGATCTGACC CCCTTTACCT TCACAGCGGT AGGAATCATT
CTGGCGGTTG GTCTCTTTCG CTACCATCTC TTCCTTACCC TGCCGGTTGC GTATCCGCAG
GTCTTCTCTG CTATAAGTGA TGGCATTATT GTTGCCGACA TCAGCAACCG GATCCTTGAC
CTCAACCCCG CAGCCCAAAC GATCGCCCAG GTACCGGGCG AACTTATTGG CAAGGTCATT
ACCTCCCCCT TTCCCCAGTT ATCCGTGTTT GTTGCCAATG ACGGCTGCAT AGCAGAAAGC
CATCAGGAGA TCGTAATCCC AAATGAAAGC AGCTCCCGGT GGTACGATGT CACGTGCCGG
CAGCTTCGTA TCTCCGGCCA GTCACCGACC GGACATCTCT TTATACTCCG CGACATCACC
GACCGCCATA TGGCTCTCGA TGCTCTTGCT TCCGCTCACC GGAAACTCAA CCTCCTTTCT
ACGGTTACCC GGCACGATAT GATGAACAAG CTTACCGGTC TCATGGTATA CCTTGACCTG
ATCAGGAGTA CCCATGATCC GGCCGTACGG GACAGGTACC TCCGGCAGGT AGACGAAATT
GCACGGATGA TCCGGGATGA GGTTGCCTTT ACCCGGGATT ACCAGGAGAT GGGGGTGAAA
TCACCTGCCT GGCAGGATCT CTCTGCCTGC ATAGCCTCGG CAAAGAGTCA GGTTGATCTG
GGGAAAATAC GGGTAACAGA GGACTGCAGG GGAACCGAGC TGTTTGCGGA TCCACTGCTG
GTCAAAGTGA TCATCAATCT CCTGGAAAAC GCCGTCCGGC ATGGAGGCAG CCGGTTGAGC
ATGGTCCGGT TCTCCTGCCG GCATGAAGGA GATTCCCTGG TTATCGTCTG CGAAGATGAT
GGTACCGGAA TCGGGGATAC AGACAAGGGG CGGCTCTTTG CCCGGGGTTT TGGAAAAAAC
ACCGGACTTG GGCTCTTCCT CTCACATGAG ATTCTCGCTG GCACCGGTCT TTTTATTCGC
GAAAACGGTA TCCCGGGGCA GGGGGCACGC TTCGAGATCA CTGCTCCGTC GGGATCTTTC
CGGGTTACAG GGGATACCTT TCTCAGGTAG
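
The listing above is printed in space-separated blocks of 10 bases, so it has to be joined into one string before any computation. The sketch below shows one way to do that and to compute a GC fraction like the one reported in the GC content field. The helper names are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(listing: str) -> str:
    """Join a blocked listing into one uppercase string without whitespace."""
    return "".join(listing.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGATCCTCC AGTATTCCCC GTTTTTTTTC CCCCTTCTCC TGTCCGGAGG GCTGACCGGT"
seq = clean_sequence(first_line)
print(len(seq))                  # 60
print(f"{gc_content(seq):.1%}")  # GC fraction of this first line only
```

Passing the full listing through `clean_sequence` and `gc_content` gives the whole-gene figure to compare against the record.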
 
Protein sequence
MILQYSPFFF PLLLSGGLTG IIAYISIRNR SNPVALPFAV LMGAVTLWTT AYAVQLITAD 
LATTLFFTDL EYIGIVTVPV AWLLVVLCYT GRPRYVTARN VGLLMIVPAI VVLLVATDPL
HHLYYSLIYP GQVAGSVLWI FTRGPLFWIN VGYNYILLIV SLILLASRFS GAPAVYRRQI
LILGIAVIVP VLANFVYLSP IDPVPGLDLT PFTFTAVGII LAVGLFRYHL FLTLPVAYPQ
VFSAISDGII VADISNRILD LNPAAQTIAQ VPGELIGKVI TSPFPQLSVF VANDGCIAES
HQEIVIPNES SSRWYDVTCR QLRISGQSPT GHLFILRDIT DRHMALDALA SAHRKLNLLS
TVTRHDMMNK LTGLMVYLDL IRSTHDPAVR DRYLRQVDEI ARMIRDEVAF TRDYQEMGVK
SPAWQDLSAC IASAKSQVDL GKIRVTEDCR GTELFADPLL VKVIINLLEN AVRHGGSRLS
MVRFSCRHEG DSLVIVCEDD GTGIGDTDKG RLFARGFGKN TGLGLFLSHE ILAGTGLFIR
ENGIPGQGAR FEITAPSGSF RVTGDTFLR
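
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the NCBI compact amino-acid string, which for translation table 11 has the same codon-to-residue assignments as the standard code. It does not handle the alternative start codons that table 11 permits; that does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 9 bases of the gene sequence in the record.
print(translate("ATGATCCTCCAGTATTCCCCG"))  # MILQYSP
print(translate("CTCAGGTAG"))              # LR
```

The two outputs match the first seven and last two residues of the protein sequence listed above.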