Gene Mboo_1107 details

Gene Information

Locus tag: Mboo_1107
Symbol:
ID: 5411267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1104407
End bp: 1105756
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 51%
IMG OID: 640868333
Product: putative PAS/PAC sensor protein
Protein accession: YP_001404268
Protein GI: 154150650
COG category: [T] Signal transduction mechanisms
COG ID: [COG2202] FOG: PAS/PAC domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
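
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state its convention) and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1104407, 1105756)
print(length, protein_length(length))  # 1350 449
```

Under these assumptions, 1105756 - 1104407 + 1 = 1350 bp, and 1350 / 3 - 1 = 449 aa, matching the values listed in the record.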


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.316445
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.383649
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGGAC AACCATCTCT CATGGGGGAA CGAGAAGAAA GTTTTGTAGA CCTCTGGCTG 
TTTATTATCG TTGCCACAAC CATCATTGCA ATCCTCATCA ACATTCTTGC GCTCCATTAT
GGAACAGCGG CCGTGGCTGC AAATCTTCTC TATATCCCGA TTGTCCTTGC AGCGTACTGG
TACCCCCGCT GGGGAATCTC GTACGCAATC GGGGTCTCTG CCCTGTTTAT TGCCATTGTC
GCTTTCGTGA CCGGGGGTAC GGTTGCCCAG GTAGCGGCTT CCTTTGTTAC ATGCCTTGTG
GTTATCGGGG TAGCAGCGGT TGTTTCAAGC CTTGCCATTC ATATGCGCAA AAACGAGGTA
AAATACCGGG GTATCTTCAA CCATTCTGAG GCAGGTATCG GCCTTGTGAA TAACCCGGAT
CATAAAGTAA ATGAGGTAAA CCGACGCTTT GCAGATACAC TCGGGTACGA GCCCGCCGAA
ATCGAAGCAA GAACATTCGT TGACCTGTGG GCCGATGCGG CAGACCGGGA CCGGTTTTTC
CAGCGCCTTG CCAGCCAGGG TAATGTGGAG AACCTGGAAA CCCGGTTTGT GACAAAGGGC
GGCGCCACCC GGTGGATGCT GCTTTCTGCA GGAATGCTTC CGGATGACCA GTTTGTCTGT
ACGATCGTAG ATATTACCGC CCGTAAGCAG GCCGAGGAAT CGCTTATTAT CAAAGATCAT
GCGATCAGTT CCTCGCTGAA TGCGATTGCG ATTATGGATC TTGATTTTTC GATCACGTAC
GTGAACCATT CTCTGATTTC CATGATGGGC TCCCGCAGCG AAAGGGAGTT TGCCGGCACA
AATCTCTGGA AATGTATGGC ATCACCTCAG GAGATCGAAA AGATACGGGA CACACTCTCG
CACAAAGGGA GCTGGCTTGG CGAGATCCTG CTCAAAAAAA CGGATCAGAC GCAGTTTTAT
GTCATGCTTT GGATTAACCT GGTAAGGAAT GAGACCGGCA ACCCGGTCTG CATCATGGCC
TCGTTTATAG ATATCACCGA CCGCAAGCAG ATGGAATCCG TAAAACGGCA GGCCCTGGAG
CAGATTGAGA AGAATATCGA GCAGTTTGCC ATCCTTGGCG ATCACATCAG GAATCCGCTT
GCTGTCATTG TCGGTCTCTC CAGTCTTGCA CCCGGGGATG TATCGGATAA GATCATCCTG
CAGGCCCGTG AAATCGACCG GATCGTAACC CAGCTCGACA TGGGCTGGAT AGAATCGGAG
AAAGTGAGGG AATTTATCAA GCGGTATTAC ATGGTAGGTA TCCAGGATAT CAGCGATACC
GGTGGGGCCC GGGAAGGCCT GGTTCGCTGA
 
Protein sequence
MTGQPSLMGE REESFVDLWL FIIVATTIIA ILINILALHY GTAAVAANLL YIPIVLAAYW 
YPRWGISYAI GVSALFIAIV AFVTGGTVAQ VAASFVTCLV VIGVAAVVSS LAIHMRKNEV
KYRGIFNHSE AGIGLVNNPD HKVNEVNRRF ADTLGYEPAE IEARTFVDLW ADAADRDRFF
QRLASQGNVE NLETRFVTKG GATRWMLLSA GMLPDDQFVC TIVDITARKQ AEESLIIKDH
AISSSLNAIA IMDLDFSITY VNHSLISMMG SRSEREFAGT NLWKCMASPQ EIEKIRDTLS
HKGSWLGEIL LKKTDQTQFY VMLWINLVRN ETGNPVCIMA SFIDITDRKQ MESVKRQALE
QIEKNIEQFA ILGDHIRNPL AVIVGLSSLA PGDVSDKIIL QAREIDRIVT QLDMGWIESE
KVREFIKRYY MVGIQDISDT GGAREGLVR
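
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the pipeline that generated the record: it assumes the sequence is given 5' to 3' on the coding strand, strips the 10-base spacing used above, and relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in permitted start codons). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Codon assignments are the same in NCBI tables 1 and 11.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGACAGGAC AACCATCTCT CATGGGGGAA"))  # MTGQPSLMGE
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_fraction` applied to the full sequence can be compared against the GC content field in the record.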