Gene Mboo_1933 details


Gene Information

Locus tag: Mboo_1933
Symbol:
ID: 5411018
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1998409
End bp: 2000223
Gene Length: 1815 bp
Protein Length: 604 aa
Translation table: 11
GC content: 56%
IMG OID: 640869172
Product: DNA-directed RNA polymerase subunit B'
Protein accession: YP_001405091
Protein GI: 154151473
COG category: [K] Transcription
COG ID: [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit
TIGRFAM ID: [TIGR03670] DNA-directed RNA polymerase subunit B
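
The coordinate and length fields in this record can be cross-checked against each other with simple arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1998409
END_BP = 2000223
GENE_LENGTH_BP = 1815
PROTEIN_LENGTH_AA = 604


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the interval spans 1815 bp, which is 605 codons, or 604 amino acids plus a stop codon.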


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAT CACGCGTATT TGTTGATGGT GCCTTAATCG GCCTTGTCGA TGACCCAAAG 
GGCCTTGTGG AGAATATCCG TTCCATGCGC CGGCAGGGCG CCATATCCTC CGAGGTCAAT
GTCTCGTTCA AGGAATTCAA CGGCGATGTG ATCCTTCTCA CCGACCGCGG CCGCGCCCGC
CGACCCCTGA TCGTATTAAA GGACGGCAAG AGCCTGATCT CTGAAGACGA TATTAAGAAA
CTTGCCAAGC GTGAGATCGA TTTCTCCTCA TTCGTACAGC GGGGCCTCAT CGAGTTCGTG
GATGCCGAGG AAGAAGAAGA TCTCCTCATT GCCATGAATC CCGCCGATAT TACGCCGGCG
CACACCCACC TGGAGATTGA CCCCTCGCTT ATCCTTGGGA TCGGTGCGGC GCATGTCCCG
TTCCCGGAGC ACAATGCAAG TCCGCGTGTC ACAATGGGAG CAGGGATGGT CAAGCAGGCG
CTGGGGTTTG CCGCGGCAAA CATGAAGCTC CGGCCGGATA CCCGTGGCCA CATGCTTCAC
TATGTGCAGA AACCGCTCAC CCACACCCAG ACCTCCGAAT TGATTGGATC AGACGACCGG
CCTGCCGGGC AGAACTTTGT GGTTGCCATC CTCTCGTACG AAGGTTACAA CATTGAAGAT
GCGCTTATCT TCAACAAGGC CTCGATTGAC CGTGGCCTGG GCCGTTCGCA CTTCTTCAGG
ACCTATGAAG GTGAAGAACG CCGGTACCCC GGTGGCCAGA TTGACCGCAT CGAAGTCCCA
GATGAAGATG TGAGCGGTGC CCACGGTGCC GAATCCTATA AGAATCTGGA TGAGGATGGC
GTGATCAATC CCGAAACGAT CGTTGCCGAA AAGGATGTCC TGATCGGTAA GACTTCGCCG
CCCCGGTTCC TTGAAGAGCC CTCCGGAGAA CTTATCGCGG TGGAGAAGCG CCGCGACACC
TCGGTCACGA TGCGCAGCAA CGAGCACGGC ATCGTGGACA CGGTTATTAT CACTGAGGGT
GAGAACAGTT CACGCCTCGT GAAGGTGCGG ACCCGTGACC TCCGTGTCCC AGAGATCGGC
GACAAGTTTG CATCCCGACA CGGACAGAAA GGGGTTATCG GGCTTATTGT CAACCAGGAA
GACATGCCCT TTACCGAAAG TGGTCTCTCT CCCGATCTCG TGATCAATCC CCATGCAATC
CCGAGTCGTA TGACCATCGG GCACATGCTC GAGATGATCG GTGGTAAGGT CGGGTCACTC
GAAGGCCGAA GAATCAATGC CACCGCATTT GGAGGCGAGA GCGAGGCTGA CCTCCGTGCC
TCTCTCAGAA AACTCGGCTA CTCTCATACC GGCCGAGAAG TGATGTATGA CGGGTATACC
GGAAAATCTT TTAAGGCCGA TATCTACATC GGCGTAATTT ACTACCAGAA GTTGTACCAC
ATGGTCAGCT CCAAGATGCA CGCCCGGTCC CGCGGGCCGG TGCAGGTGCT TACCCGGCAG
CCCACCGAGG GCCGTGCCCG TGAAGGAGGT CTCCGGTTCG GTGAGATGGA GCGTGATGTC
ATGATCGGCC ACGGTGCCGC TATGGCATTA AAGGAGCGTC TGCTTGACGA ATCAGACAAG
GTGCAGGAGT ATGTCTGTGC CCACTGTGGA ATGGTGGCAA TGCTTGACCG CAAGCGCAAC
ATGACCCGCT GCCTGGCCTG CGGCAACGAG ACTGACATTT ACCCGGTCGA GATGAGCTAC
GCATTCAAGC TCCTGCTGGA TGAGATGAAG AGCATGGGAA TAGCACCCCG CCTGAGATTA
GAGGATGTAG TATAA
 
Protein sequence
MKKSRVFVDG ALIGLVDDPK GLVENIRSMR RQGAISSEVN VSFKEFNGDV ILLTDRGRAR 
RPLIVLKDGK SLISEDDIKK LAKREIDFSS FVQRGLIEFV DAEEEEDLLI AMNPADITPA
HTHLEIDPSL ILGIGAAHVP FPEHNASPRV TMGAGMVKQA LGFAAANMKL RPDTRGHMLH
YVQKPLTHTQ TSELIGSDDR PAGQNFVVAI LSYEGYNIED ALIFNKASID RGLGRSHFFR
TYEGEERRYP GGQIDRIEVP DEDVSGAHGA ESYKNLDEDG VINPETIVAE KDVLIGKTSP
PRFLEEPSGE LIAVEKRRDT SVTMRSNEHG IVDTVIITEG ENSSRLVKVR TRDLRVPEIG
DKFASRHGQK GVIGLIVNQE DMPFTESGLS PDLVINPHAI PSRMTIGHML EMIGGKVGSL
EGRRINATAF GGESEADLRA SLRKLGYSHT GREVMYDGYT GKSFKADIYI GVIYYQKLYH
MVSSKMHARS RGPVQVLTRQ PTEGRAREGG LRFGEMERDV MIGHGAAMAL KERLLDESDK
VQEYVCAHCG MVAMLDRKRN MTRCLACGNE TDIYPVEMSY AFKLLLDEMK SMGIAPRLRL
EDVV
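
As a minimal sketch of how the gene sequence relates to the protein sequence, the snippet below translates the first 30 and last 15 nucleotides of the gene sequence listed in this record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative, not taken from any particular library.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides, copied from the gene sequence above.
FIRST_30 = "ATGAAGAAAT CACGCGTATT TGTTGATGGT"
LAST_15 = "GAGGATGTAG TATAA"

print(translate(FIRST_30))
print(translate(LAST_15))
```

The two fragments translate to `MKKSRVFVDG` and `EDVV`, which match the start and end of the listed protein sequence. The same function can be applied to the full gene sequence.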