Gene Mboo_0126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0126 
Symbol 
ID5410066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp115449 
End bp119372 
Gene Length3924 bp 
Protein Length1307 aa 
Translation table11 
GC content60% 
IMG OID640867341 
ProductDNA polymerase II large subunit 
Protein accessionYP_001403293 
Protein GI154149675 
COG category[L] Replication, recombination and repair 
COG ID[COG1933] Archaeal DNA polymerase II, large subunit 
TIGRFAM ID[TIGR00354] DNA polymerase, archaeal type II, large subunit
[TIGR01443] intein C-terminal splicing region
[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAGA TCTCCCCCGA GATGGAGCAG TACGGGAACG GCCTGATGGA CGGCCTTAAC 
CGTGCCATCG CACTTGCAAA AGAGGCGCGG GCGCGGGGCC TTGACCCAAG CATGGAGGTC
GAGATCCCGA TCGCAAGCGA TCTTGCCGAC AGGGTCGAAG TACTGGTCGG GATCAAGGGA
GTGGCGGCCA GGATCCGGGA ACTTGAGTCG CAGATGTCCC GTGAGGAGGC GGCACTCAAG
ATCGGCGATG ACTTTGTCGC AAAAATGTTC GGGGAGAAGA ACAATAACGA AATTCTTGAC
CATTCTATCC GCACCGCGAT GGCTCTCCTC ACCGAAGGCG TGGTCTCGGC GCCAACCGAA
GGTATCGCAA AAGTAGGGCT TGGCAAGAAC GACGATGGCA GCCAGTACCT GATGATCTAT
TATGCCGGCC CGATCCGGAG CGCCGGCGGG ACGGCACAGG CGCTCTCGGT GCTTGTCGGG
GACTATGTGC GGGCCAAACT GGGGATCGGG CGTTACCGCC CCCGCCCGGT CGAAGTCGAG
CGGTACATCG AGGAGATCCG GCAGTACAAC ACCATCATGA GCCTCCAGTA TCTCCCCAGC
GAAGCGGAGA TCCGGCTGAT CGTGGAGAAC TGCCCGGTCT GCATTGACGG CGAGGCAACG
GAAAAAGAGG AGGTCTCGGG ACACCGTAAC CTCGAACGCG TGGAGACAAA CGCGGTGCGG
GGCGGCATGG CGCTTGTGCT TGCAGAAGGG ATTGCGGGAA AAGCCCGCAA GCTCAAGAGC
AAAGTCGAGA AGATGAACAT GGCCGGCTGG GACTGGCTGG ACAAGCTCAT CGCCGGCGCG
GGAAAACCGA GCGCCGATGA TGAAAGCCAT GCCCCCGGTG TCAAGCCGCT GGACAAGTAC
CTCCGGGATC TTATTGGCGG CCGCCCGGTA TTTTCGTACC CGATGAGGAA AGGTGGCTTC
CGGCTCCGGT ACGGGCGTTC GCGGAACACC GGCTTTGCAA CGGCGGGGTT TAACCCGGCA
ACGCTCCACA TCCTCGGGGG ATTTCTCGCG GTCGGCACCC AGATGAAACT GGAACGGCCG
GGCAAGGCAT GCGGGGTTGT CCCGGTCGAC ACCATCGAGG GTCCGACCGT GCGCCTTACC
GGGGGCGAGG TCATCCGGGT GGATGACGAG AAGACCGCCC TTGCAGTTGC TTCAAAGGTA
GAGCGGATTC TCGATGTCGG GGAGATCCTG ATCGCGTACG GGGAATTTTT GGAGAACAAC
CACCCGCTTG TGCCCGCCGG TTACTGCGAG GAATGGTGGC TGCTCGAAGT GCCCCCGGGC
ACAAAGCCGC CGCAGGATGA ATCCGAGGCA CTTGTCCAGG CAAAGAAGGG GGCATACCTG
TACCCGGCCT ATACGTGGTT CTGGGATGAC ATCTCCGTTG ACCAGATCCG GCTCCTTGCC
GATGCGGTTT CCGGCACCGG TGCGATCGAA GAGGAAACAC TCGTTTTTCC ACTCGATTCC
ACGGCAAAAG AGGCGCTTGA GCTCCTGCTC ATCCCCCATA AGGTAAACGG CGGAATGATC
CGGATCAAAA CATTCCGGGC ATTTATCGCC GGCCTGGGCC TTGACGGGAA TTTAAAGAAA
TGCGAAACGT GGAAGACGGC ACCGGCCGAT GCCCAGCCTC TTGCTCTTGT CATGCACCTG
AGCGGTCTCT TGCTCCGCTC CCGCTCGGGG CTCCGGATCG GGGGAAGGAT GGGCAGGCCC
GGCAAGTCCA AGCCGCGCAA GATGAGCCCG CCGCCTCACG GGCTCTTCCC GCTCGGGGAG
TCCGGGGGTG CCCGGCGGTC GTTCCAGGAA GCGTCGATCC ATACCGAGGA GTCGGATGCT
CATGCCACGG AGATCGATTT CGAGAAAGAA GGGGGCATCA TCGAGATCGA AGTGGGGCGC
CGGCGCTGCC CCGGATGCGG TGAGATTACG TACCTGAACC GGTGCCAGAA GTGCGGGACG
CACACCACGC CGATCAATAC CTGCCCCAAA TGCGGCCACG AGGTGCCCGG GGAGCGCTGC
CCCAACTGCG ATGTCCCGGC CACCTGCAGC CAGCGGATCA CCTTAAACGT GAAAGCGGAG
TATGCGCAGG CCATGGAGCG CCTCGGGATC AAAAAAGAGA GCATCGCGCT TGTCAAGGGA
GTCAAAGGCG TTATCTCTCG GGAGAAGACG GTCGAGGCCA TGGAGAAAGG CATTCTCCGC
GCCCAGCAGG ACATCTACGT CTTCAAGGAT GGCACAACGC GCTTTGACAT GATCGACCTC
CCGCTCACCC ACATCCGGCC GGACGAGGTC CGGGTCCCGG TTGAAAAGAT GCGTGAGCTC
GGGTATGTCC AAGACATCAA CGGCTACGAT CTCCAGAACG GGAAGCAGGT GATCGAGCTG
CACGCGCAGG ATATCCTCCT CTCCGATTCG TGCGCCGACT ACATGATCAA GGTCACGCAG
TTCATCGATG ACGAACTCAC CCGGCTCTAT GGTCTCCCGA CCTTTTATAA TGTAAAAACC
CGGGATGATC TGGTCGGCCA CCTGGTGATC GGCCTTGCCC CCCATACGAG CGCCGGGGTG
CTTGCCCGAA TCGTTGGTTT TACCCGGGCA AATGTCGGGT ACGCCCACCC GTTTTTCCAT
GCGGCCAAGC GCCGGAACTG CTTCTACGGC GAGACAAAGA TCGAAATTTT CGATGGGAGA
TCGTGGGCAA CCTTCCCTAT CCGGAAATTT GTGATGGAGA ACTTCGATGT CAGCCGGCCC
GGCCTTGACC GGCTGGGCAC GTACTACTCC GATCCGGTGC GCCCGTACTA CACGCGGACC
GTGGACACAA ATGGCGGCAT GCATCTGCGC CGGATTACCT CGGTTTCCAT CCACCGTTCG
CCGGCGAGCA TGATCCGGTT TGTCACGGCC CGGAACCGCG AACTGACGGT CACCCCGGAT
CACGCGATGG TGGTCTGGGA TACCGGGTAC CTGCGGAAGA TAAAAGCGCT GGAGATCAAG
GCAGGCGATG CGGTGCCCAT TCTTGAAGGC GGGGTGGTAA TTGCGGACCG GATCGTTACG
GCCGAGACGG TTGCCTCCCT TGAGGACCGG GTGTACTGCC TGACCGTGGC CGAGGACCAT
ACACTTGTTG CAAACGGCGT CTTCACCGGC CAGTGCGATG GCGATGAGGA CTGCATCATG
CTCCTCATGG ACGGCCTGTT AAATTTCTCC CGCTCGTTCC TGCCCCAGAA CCGGGGCGGC
ACGATGGATG CGCCGCTTGT GCTCACGAGC CGTATCGACC CGGCGGAGAT CGACAAGGAG
TCCCTGAACG TGGACGTTGG GAAAAGTTAC CCGAAGGAAT TGTATGAGGC GGGCCTTGTG
TATGCCAAAG CAAAAGATGT CGAGCCGCTC ATTGACCGGG TGGAGCGGAG GCTTGGGACG
CCGCGGCAGC TCGAAGGGTT CTTCTTCACC CACGATACCT CGGACATCTC GGCCGGGCCC
CTGGAATCCA CCTACACCCA GCTCAAGACC ATGGCTGAGA AACTCGAGGC CGAGCTCGAC
CTTGCCGAAA AGATCCGAGC GGTGGATGCC GATGATGTGG CCGAGCGCGT ACTTAACACG
CATTTTATCC GCGACCTGAT GGGCAACCTC TCCGCGTTCT CCAAGCAGAA GTTCCGGTGC
ACCAAGTGCA ACACGAGCTA CCGGCGGATG CCGCTTGCCG GCAAGTGCAC GAAGTTTAAG
GGCAAGGGGA TCTGCAACGG CAACATTATC CCGACTGTGC ACGAGGGATC GGTCAAGAAA
TACCTTGAAG TCTCCCGTGC AATGGTCAAA AAATACAAGG TGTCGGAATA CTGCCGGCAG
CGGGTCGAGG TGCTCGACCT TGCCATCGAG TCGACCTTTG GCGAGGAGAA ACAGGAGCAG
CTGGGCTTAG CGGATTTCAT GTGA
 
Protein sequence
MLKISPEMEQ YGNGLMDGLN RAIALAKEAR ARGLDPSMEV EIPIASDLAD RVEVLVGIKG 
VAARIRELES QMSREEAALK IGDDFVAKMF GEKNNNEILD HSIRTAMALL TEGVVSAPTE
GIAKVGLGKN DDGSQYLMIY YAGPIRSAGG TAQALSVLVG DYVRAKLGIG RYRPRPVEVE
RYIEEIRQYN TIMSLQYLPS EAEIRLIVEN CPVCIDGEAT EKEEVSGHRN LERVETNAVR
GGMALVLAEG IAGKARKLKS KVEKMNMAGW DWLDKLIAGA GKPSADDESH APGVKPLDKY
LRDLIGGRPV FSYPMRKGGF RLRYGRSRNT GFATAGFNPA TLHILGGFLA VGTQMKLERP
GKACGVVPVD TIEGPTVRLT GGEVIRVDDE KTALAVASKV ERILDVGEIL IAYGEFLENN
HPLVPAGYCE EWWLLEVPPG TKPPQDESEA LVQAKKGAYL YPAYTWFWDD ISVDQIRLLA
DAVSGTGAIE EETLVFPLDS TAKEALELLL IPHKVNGGMI RIKTFRAFIA GLGLDGNLKK
CETWKTAPAD AQPLALVMHL SGLLLRSRSG LRIGGRMGRP GKSKPRKMSP PPHGLFPLGE
SGGARRSFQE ASIHTEESDA HATEIDFEKE GGIIEIEVGR RRCPGCGEIT YLNRCQKCGT
HTTPINTCPK CGHEVPGERC PNCDVPATCS QRITLNVKAE YAQAMERLGI KKESIALVKG
VKGVISREKT VEAMEKGILR AQQDIYVFKD GTTRFDMIDL PLTHIRPDEV RVPVEKMREL
GYVQDINGYD LQNGKQVIEL HAQDILLSDS CADYMIKVTQ FIDDELTRLY GLPTFYNVKT
RDDLVGHLVI GLAPHTSAGV LARIVGFTRA NVGYAHPFFH AAKRRNCFYG ETKIEIFDGR
SWATFPIRKF VMENFDVSRP GLDRLGTYYS DPVRPYYTRT VDTNGGMHLR RITSVSIHRS
PASMIRFVTA RNRELTVTPD HAMVVWDTGY LRKIKALEIK AGDAVPILEG GVVIADRIVT
AETVASLEDR VYCLTVAEDH TLVANGVFTG QCDGDEDCIM LLMDGLLNFS RSFLPQNRGG
TMDAPLVLTS RIDPAEIDKE SLNVDVGKSY PKELYEAGLV YAKAKDVEPL IDRVERRLGT
PRQLEGFFFT HDTSDISAGP LESTYTQLKT MAEKLEAELD LAEKIRAVDA DDVAERVLNT
HFIRDLMGNL SAFSKQKFRC TKCNTSYRRM PLAGKCTKFK GKGICNGNII PTVHEGSVKK
YLEVSRAMVK KYKVSEYCRQ RVEVLDLAIE STFGEEKQEQ LGLADFM