Gene Mboo_1112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1112 
Symbol 
ID5411272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1111215 
End bp1112696 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content53% 
IMG OID640868338 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001404273 
Protein GI154150655 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.607083 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTTA CTCTTTCGTC GGAAAAGAAG CACCAGATCT TCTGGACGAT CACCCTCATT 
ATTACAATCT TCGTTACCAT CTTATCGACG ATTTACTCCC TCTCGCTTGG GATATACGAT
GTATTCCCCT TCATCTACTT CCTGCCGATC ATCCTCTTTG TCTACATCTA CCCGAGCAGG
GGCGTATATT TCTCCCTGGG GTTAAGCTCT GCCTATATTA TCCTTGTTTA CCTGTACAGC
GGATTTAATC CCCAGGATGT GGCGATCTCC ACTGCATGGT TTGTGGTCTT TGTCACCATT
GGCGTTGTCA CCTCCTCTTT TGCCGAGGGA TTGCGCGAAG AAGAACGGAA ATACCGGGGA
ATCTTCGAGA ACTCCCAGGC AGGAATCTTT ACATTCGATC TCAGTACCCT GAGGATCGCG
GAGCTGAACG CAAAATGCGC ACAGATGCTC CGGTATGAGA AAGGAGAGCT GGCCGGGACT
GACCTGGCCC GCATCATACC TGATGCAGGA AACCGCGACC ATTTCCTCCG GCAGGTTCGA
TCTTCATGGG AAACCGGGGA TCTCGAACTG CTCTTTACCG CGCGTGACGG GACAATCCGC
CAGATGCTTG TCTCTGCCTC GGTCGCATCC GGTAACATCG TGATCTGCTC TGCAATCGAC
ATTACCGCAA GAAAACTTGC CGAGCAGGTG ATCGAACGGG CACGGGAAGA TCTTGAACGC
CGGGTGAATG AACGTACTGA AGAGCTCCTG CGCGTTAACC GGGAACTTTC CGCGGAGATC
GAGGAAAGAA AACGCTTTGA GGCTGCGATC AGGCTTGCCA ACCACAAGAT TAACACCCTC
TCCGGGGTCA CCCGCCATGA TATCCTCAAC CAGATCACAG TGATTGTTAT GTACCTCGCG
CTCATACGGG AGACCGAGAC CGATCCCCAC GTGACCGGGT ATATCGACAA GATTGGCGAG
GTTACAGATA TGATCCAGAA ACAGATCCGA TTCACCCGAG AGTACCAGGA TATCGGATCC
GGGGAACCAC GCTGGCACAA TATCAATGAG GTAATCGGGG AAGCCGCATC GGGAGTGGGA
AATGATGATG TAATCATCGA CCGCCAGGTA GCTGACCTGG AGATCTTTGC CGATGCCGGC
TTTCCCAAAG TATTCGCAAA CCTGATCGAG AACGCGCTGG TCCACGGCAG GCATGTAACC
CTTATCCGGT TCTCGTACTA TGAGACAGAT ACCGGGCTGG TACTCTGCTG CGAGGACAAC
GGGATCGGTA TTCCCGATGA TGCAAAGGCA CGGATCTTCC GGCGCGAATA CTTCCGTAAC
ACCGGGTACG GGCTGTTTTT AATTGTCGAG ATCCTCAGCA TTACCGGGCT CTCCATAAAA
GAGACGGGTA CTCCCGGAAC TGGTGCCCGG TTCGAGATGC ATGTCCCGAA AGGGAATTAC
CGGTTTGTTG CAAGAAACCA CCGGAGCGAC CGCTGGCAGT AA
 
Protein sequence
MSVTLSSEKK HQIFWTITLI ITIFVTILST IYSLSLGIYD VFPFIYFLPI ILFVYIYPSR 
GVYFSLGLSS AYIILVYLYS GFNPQDVAIS TAWFVVFVTI GVVTSSFAEG LREEERKYRG
IFENSQAGIF TFDLSTLRIA ELNAKCAQML RYEKGELAGT DLARIIPDAG NRDHFLRQVR
SSWETGDLEL LFTARDGTIR QMLVSASVAS GNIVICSAID ITARKLAEQV IERAREDLER
RVNERTEELL RVNRELSAEI EERKRFEAAI RLANHKINTL SGVTRHDILN QITVIVMYLA
LIRETETDPH VTGYIDKIGE VTDMIQKQIR FTREYQDIGS GEPRWHNINE VIGEAASGVG
NDDVIIDRQV ADLEIFADAG FPKVFANLIE NALVHGRHVT LIRFSYYETD TGLVLCCEDN
GIGIPDDAKA RIFRREYFRN TGYGLFLIVE ILSITGLSIK ETGTPGTGAR FEMHVPKGNY
RFVARNHRSD RWQ