Gene Mboo_1119 details

Gene Information

Locus tag: Mboo_1119
Symbol:
ID: 5411363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1119490
End bp: 1121280
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 54%
IMG OID: 640868345
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001404280
Protein GI: 154150662
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
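
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1119490
end_bp = 1121280

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be a stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 1791 bp
print(protein_length, "aa")   # 596 aa
```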


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACGAC AGGTTACCCT CCGGGGATAT TCATTCTCAC ATACCGATCT GGTCCGTTCC 
ATCGTCATTG CCTCGCTGAC TGTTTCTGTT ATCCTGATAA CCGCACTGGC TCTTGCCAAA
AATGCCGGGG ACCTCTATCC CCAGCTCTTT TATTTTCCCA TCCTGTATGC AACGTATTTC
TATCCCAAGC GCGGGATCAT CCTTTCCGGC CTTTGCGGGG TCATCTACGA ATGCCTGGTC
TACTTCTCGC TCTACCCGGA TGTTCTTGCA CTCTGGTCGG CTACCGCCCA GGCCATCCTT
TTTATCTGCA TCGCGCTTGC CGTGGCTTAC TTCACCAACC TGATCCGGGT ATCCGAGGCC
CGGTACCGGA GCATCTTTGA AAATTCCCTT CTCGGGATCA TCCTTTTTGA CAAGAACCGG
TTTACGATCC GGCTTGCAAA CCAGCAGGTG GCAACCATGC TCGGATATGA GGCAGAGGAA
CTGGGCGGGA TTGCATTCTC CGATCTCTTC TTCTCGCAGG ACTTCAAACG CAGGTTCTTT
GAGCACCTTG GATCCGGTGA GGATATCAGG AATTTCGAGA CCTGTTTTGT CACAAAGGAC
AAAAGGCCAC ACTGGGTCAA CCTTTCCTGG AGCCGGATCG ATGACACCAT CGTGAGCTGC
TCAATCACCG ATATTGACGC GGAGAAATCT GCACGGGAAC TTGCGGCAGA CAGTTCGATC
CAGTACCGCC AGGTGACCGA AAACTCGCCC ACAGGTATGG TGATCACGGA TCGCACCACA
ATCCTCTTTG CAAACCCGGC GTTTTTTTCC TTCTCCGGCT ATGGGCAGGA AGAATGCTCC
GGGATGAATC TTGCCGATCT GGTCATTCCT GAAGACAAAG ACCGGTTCCG GTCGTTTTCT
GATCGCTGGG GGTTTCTGGA GCCTGCCCCT GATCGTGATG AATTCCGGTT TCTCACCAAA
AACGGGGAGA CAAGGAGGGC TGTGCTCTAC TTTACCCCGA TTATCCGGAA TAACCGTCCT
GCGGGACTCG TCAATATCAT TGACAATACC GAATGGGAGG AATACCGCGA ACGGGTGGAG
CAGACCAAGG AGCGGAGGCG CGAGATGATG CGGGCGGTTG CCCACGAGCT GCGGACCCCA
CTCCAGCCGG TGCTCGGGTA CCTTAACCTC CTTCTCCAGG ATCCTCCCGC TTTTGGGGTG
ACCGAGGAGA CCCGGCAGAT CCTTGAACGG TGTGCAAAGA GCGTGGACCG CGAGCGCCAG
ATCATCAACC AGATGCTTGA ACTCTCGGTT CTCGAAGAAG AGGAATCCAG CCTTGACTAC
TCGGTCTTTC CCGTTGCCGG TATGATAAAC AACGTAATCT CAGGAGGCGG GTATGCACTC
AAGGCAGAGA TCGCAGTTGA TGTCCCGGCA GATCTTCTTT TTGATGCAGA TCGCCAGAAA
CTCAGCTACG TTATCGATGT GCTGGTGGCA AACGGGGTGG CATATTCCAA GCCGCCACGG
AAGATCTGGA TTACGTACCG CGACTCGCCA TCGCACCCTT TCCACCGGCT TGCTATCCAG
GATAACGGTG TCGGGATCAC TGAAGCCCAG CTTGATGAGA TCTTCAAATC CGACGGAGGA
ACCGGGCCGG CACGTGAAGG TGTCGGCGGT ACTGGCCTTT CACTTGCCAT TGCAAAAAAG
TATGTCCAGC TGCATGGGGG ATATATCAGC GTGGACAGTA TGGTAAACAT CGGGAGCACC
TTTACCCTCC ATATCCCCAA AAAACGACCT GACGGGACGG AATTACCATG A
 
Protein sequence
MERQVTLRGY SFSHTDLVRS IVIASLTVSV ILITALALAK NAGDLYPQLF YFPILYATYF 
YPKRGIILSG LCGVIYECLV YFSLYPDVLA LWSATAQAIL FICIALAVAY FTNLIRVSEA
RYRSIFENSL LGIILFDKNR FTIRLANQQV ATMLGYEAEE LGGIAFSDLF FSQDFKRRFF
EHLGSGEDIR NFETCFVTKD KRPHWVNLSW SRIDDTIVSC SITDIDAEKS ARELAADSSI
QYRQVTENSP TGMVITDRTT ILFANPAFFS FSGYGQEECS GMNLADLVIP EDKDRFRSFS
DRWGFLEPAP DRDEFRFLTK NGETRRAVLY FTPIIRNNRP AGLVNIIDNT EWEEYRERVE
QTKERRREMM RAVAHELRTP LQPVLGYLNL LLQDPPAFGV TEETRQILER CAKSVDRERQ
IINQMLELSV LEEEESSLDY SVFPVAGMIN NVISGGGYAL KAEIAVDVPA DLLFDADRQK
LSYVIDVLVA NGVAYSKPPR KIWITYRDSP SHPFHRLAIQ DNGVGITEAQ LDEIFKSDGG
TGPAREGVGG TGLSLAIAKK YVQLHGGYIS VDSMVNIGST FTLHIPKKRP DGTELP
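
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch rather than the database's own procedure: it assumes the gene sequence is given 5' to 3' on the coding strand, and it uses the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs in its set of permitted start codons). The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 21 bases of the gene sequence shown above.
print(translate("ATGGAACGAC AGGTTACCCT CCGGGGATAT"))  # MERQVTLRGY
print(translate("GACGGGACGG AATTACCATG A"))           # DGTELP
```

Passing the full gene sequence block to `translate` should yield the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.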