Gene Mlab_1162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1162 
Symbol 
ID4794938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1181994 
End bp1183907 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content54% 
IMG OID640099834 
ProductDNA-directed RNA polymerase subunit B' 
Protein accessionYP_001030598 
Protein GI124485982 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR03670] DNA-directed RNA polymerase subunit B 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.595392 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGA AAATGGCACG TGTATTCGTG GACGGTGCTT TGATTGGCAA AGTCGACGAT 
GCCTACGGCT TCACCAGAAA CTTCAGACAG ATGCGCAGAT CGGGGCACGT CTCGACCGAA
GTCAATATTT CATACAAAGA TTACAGTAAC GAGATCGTCA TCAATACCGA CCGCGGACGG
GCCCGCAGAC CCCTGATCGT CGTCGAAAAC GGCGTCCCGG CAGTCACCCC CGAAGATATC
GAGATCGTCA GATCCGGTGC TGAAGACTTT ATGGCACTGG TATCTCAGGG TAAGATCGAG
TTCATCGATG CGGAAGAGGA GGATGACCTC TTTATCGCGG TGCAGGAATC CGATCTGACT
CCAGAACACA CCCACCTTGA GATCGATCCG TCCCTTATCC TCGGTATAGG AGCAGCACAC
GTTCCGTTCC CGGAACACAA CGCATCGCCG CGTGTTACGA TGGGTGCGGG TATGATCAAA
CAGGCACTGG GTTTTGCCCA GGCAAACATG AAGCTTAGGC CCGATACCCG CGGTCATATG
CTTCACTACG CCCAGGTCCC GATGGTGCAC ACGCAGGCAG CCGAACTTAT CGGTTCCGAC
AACCGCCCGC AGGGTCAGAA CTTCTGTGTG GCCATCATCT CCTACGAAGG ATACAATATT
GAAGATGCAC TCATCTTCAA CAAAGCCTCG GTCGAACGCG GAGTAGGCCG TTCCCACTTC
TTCAGAACCT ATGAAGGAGA GGAACGCAGA TATCCCGGAG GACAGGTCGA CAGGATCGAG
CTGCCCGAAG AGGATGTCCA GGGTTCCCAT GGAATCGGTT CCTACGCAAA CCTCGATGTT
GATGGAATCA TCAACCCCGA GACCGTCGTC AATGAAAAGG ATGTTCTCAT CGGCAAGACC
TCTCCGCCGC GTTTCCTCGA GGAACCGACC GGAGAACTCA TCGCCGTTGA AAAACGCCGC
GACACGTCCA TCACGATGAG AAGTAACGAG ACCGGTATTG TAGACACCGT CATCATCACC
GAGTCCGAGA ACTCCTCCCG CCTCGTCAAA GTCAGAACAC GTGATCTCCG TATCCCGGAG
ATCGGCGACA AGTTCGCATC CCGTCACGGT CAGAAAGGTG TCTGCGGTCT TATCACTCCG
CAGGAAAACA TGCCTTTCAC CGCTGCAGGT ATCGCGCCCG ATCTCGTCAT CAACCCGCAC
GCAGTCCCGT CCCGTATGAC CATCGGTCAT ATGCTCGAGA TGATCGGCGG CAAAGCCGGA
GCACTCGAAG GTTCCCGCGT GAATGCAACT TCCTTCCGAA AGACCCAGGT CGAGAAGATC
TTCGACTCGT CCTTCGCACA GAACCGTCTC TTAAACGACA AAGAGATCGG CAAAGACGTG
AAGTCGAGCA CCCTCACCCG TGAGCAGGTC TTACGACACC AGCTTGCCGA GGCAGGATTC
GCCCACACCG GCCGTGAAGT CATGTACGAC GGTATCACCG GCCGCAGATT CCAGGCCGAC
ATTTACATCG GTGTTATCTA CTACCAGAAA CTCTACCACA TGGTATCTAC CAAGATGCAT
GCACGGTCCA GAGGTCCGGT CCAGGTCCTT ACCCGTCAGC CGACCGAAGG ACGTGCCCGT
GAGGGAGGTC TTCGTTTCGG TGAAATGGAA CGTGATGTGA TGATCGGTCA CGGTGCCGCA
ATGGCTCTCA AAGAGCGTCT CCTTGACGAA TCCGATGCAG TAAAGCAGTA CGTCTGCGCA
CGCTGCGGAA TGGTCGCCAT GTATGATGCC AAGCAGAAGA TGACCCGCTG TCTCGCCTGC
GGCGCAGAAA CAGATATTTA TGAAGTCGAA ATGAGTTACG CATTCAAACT TCTCCTTGAT
GAGATGAAGA GTATGGGTAT AGCTCCCAGA CTGAGACTGG AGGATATGGT ATGA
 
Protein sequence
MEKKMARVFV DGALIGKVDD AYGFTRNFRQ MRRSGHVSTE VNISYKDYSN EIVINTDRGR 
ARRPLIVVEN GVPAVTPEDI EIVRSGAEDF MALVSQGKIE FIDAEEEDDL FIAVQESDLT
PEHTHLEIDP SLILGIGAAH VPFPEHNASP RVTMGAGMIK QALGFAQANM KLRPDTRGHM
LHYAQVPMVH TQAAELIGSD NRPQGQNFCV AIISYEGYNI EDALIFNKAS VERGVGRSHF
FRTYEGEERR YPGGQVDRIE LPEEDVQGSH GIGSYANLDV DGIINPETVV NEKDVLIGKT
SPPRFLEEPT GELIAVEKRR DTSITMRSNE TGIVDTVIIT ESENSSRLVK VRTRDLRIPE
IGDKFASRHG QKGVCGLITP QENMPFTAAG IAPDLVINPH AVPSRMTIGH MLEMIGGKAG
ALEGSRVNAT SFRKTQVEKI FDSSFAQNRL LNDKEIGKDV KSSTLTREQV LRHQLAEAGF
AHTGREVMYD GITGRRFQAD IYIGVIYYQK LYHMVSTKMH ARSRGPVQVL TRQPTEGRAR
EGGLRFGEME RDVMIGHGAA MALKERLLDE SDAVKQYVCA RCGMVAMYDA KQKMTRCLAC
GAETDIYEVE MSYAFKLLLD EMKSMGIAPR LRLEDMV