Gene Mboo_0127 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0127 
Symbol 
ID5410546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp119909 
End bp123154 
Gene Length3246 bp 
Protein Length1081 aa 
Translation table11 
GC content49% 
IMG OID640867343 
Productvon Willebrand factor, type A 
Protein accessionYP_001403294 
Protein GI154149676 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATTT ATCGTTTCAT GATCCTGATT ATCGGACTGT TCCTGCTTGC AGGGGCGGGC 
AGTGCCGCAG CGATACCGGA TCAAAATTCA ACAATTACCA GCAGTACCAG TTGGGTGGTG
GTGAATCACC AGGCAACCAT CACCATTACG GCACTCAATA CAACATCATC CATTCCGGTC
CCGGGCGCCT CCGTAACGGC GACGCTTAAC TCGACAACTC TGGGATCGCT GACGGTAAGC
AGCGGGACAA CGGATACCAC CGGCCAGGTA AACTTTCCTT TTTTGGCAGG AACAAAAACC
GGAGCAGTCA ATATCACGGC CACGATAACC TACAATGATA ATGGAAACCT CGTTTCGGTG
ACCAAAGTCT ATACCCAGAA TATCGATCAC GATGTTGCAC AGAATGCAGT GTTCGATTTC
CAGAATGAAG TGACCGTGGG AACAGAAACA CCCTTCAACA TCAGCTTTAC CGACCAGTAC
GGTAACCCGA TTGATAACAG GAATCCCTAT AATCCCCCCA TCATATCATT GTGGATATCG
GCATCCTCGG ACAACATGGC GGCTTTCAAT ATCAGCGGAA GCCATGTCCA GTATACATCG
CAGGAGCTTG ATGCCAACGG GAATATTTCT GTCAGCGTTG TGACTGACAC TGCCCCGGGA
GAAAACGAGA TCCTGCTCTC TCACTTCGGC AATGTCGGTA TCCCATGGGG GTTACCGCAG
TTTATCTATG GCATAAATAA CGGTGTACCC TATTCGATAG ATGAATCTGT TTTACCGTCC
TCTCTGACCC AGCCGGCAGA TAATGGAGTG GACCACGTTT TTACCATCAA CAATACGGTA
TACGATAAGT ACGGTAACCC TACGGAAAAT CAAGGGATCC TGGTGCAATC GAACTGGGCA
GGAGATCCTT CCGCAACTAT GCTGTCAAAC CCCTATGGTC AGATCATGTA TACGTATGGT
CCTCACAGCG CATCCGGCTA CGTTGTCTTA ACAGCTACCT CAGTAGCGAA TTCATCGGTC
AGTATCTCCA AGACCGTCCA ATTCTATAAC ACCGCACCGG TAAACATGCT CTTCTCGGCA
ACACCTCAGG TCATGCCAAG TCTTGATGCA AAATCCGGGC AGACTGCACA GCTTGTCGCA
AAAGTGGTAG ATACCATGGG TAACCCGGTG GCTAACCAGA CGGTTACATT CAGTCTGGGA
ACGCCCACAT ACACCTACGC GAACAGTTCG CTTGCCGGAC CACAGCTCCT CAATACGACA
GCGGTCACCA ATATAAACGG AACGGCCACA GTCCTTTTTG TTCCCGGAAG ATTTGATACG
AACAAGGAGA ACGTCAGTTA TGATGCAACA GATTCCGGAT ACGTTACGGC AACGGCAACC
TGGAGTTCGA TCAGCCAACC GCTCCAGCTG AGCTGGAAGA ACTATCCCTA CCTGAGCGAA
CTGACCACAA TATCATCCCA GAATGTTAAC GTCAACGGAA AATTCTATGT TACGGTCAGC
CTCACCGCTG ACGGATGGAA CCTGACCGGA ACACCGGCAG ATGTCGTGAT AGTATCTGAT
CTCGCGGCAG GTATCGGAGG AGCGACCAGG CTGACCCACA CAAAAGCTGC AGAGGTCGGC
TTTATTAAAA ATGCCACTGA CAACACCTAT GTCGCCCTTG CTTCCTTTGG AAGCGCTCCC
AATGCAGGAT CTACACCCTA TGACTCCTCA GATACCCAGA ACCTCTGGAA CCTCCAGCTC
TCCAAATCCA ATACTACGTA TACATACAGA CCCTTCAACC CCTATGGCAA TGTCTGGGAT
TACAATCTGG TCAATCCCGC TAACTGGAAC AGTATCTCAA GTTCGACTGC ATATTGTTTT
AATTCAAGTT CGCAGAAAAA TCCGTCCAGT CAAAGCTTGG GTCTGAGGGC CTGTGTGAAT
GCCACAAGCC CCTATGGGTA CACCTACCTC AACCCCTGGT CTGATTCAAA AATAGATGCT
GATCTGATGA ACGCCGGACC CACCTATAAG ACCACCAACC AGAACGCTCT TGTCAATACA
GTCAATGCAT ATACAGCGTA CGGAGGGACG GATTATGCAG CAGGAATTAA TGCTGCCCTC
CAGGAGCTCC AGAGCAAAGG CAATCCCTCC CATAACCAGA CCATCATTAT TATGGGAGAT
GGTGTCAACA TGATGGCCCC GATCGCACCG GGATCCTTTG AATCATACTG GCCATCGGAC
TGGAACCCCC GGAACGGGAC CGGTATCTCT GAAGGCCCCA ACCTTTGGTA TTTGGACGAA
AGCGACGTTG GAAAAGCAGC AGCATTGAAT GCATCAACGA CGGCAAAAAA CCTGGGTATC
ACCATCTATG GGATCCAGTT CCCGACACCA GACAATTACG GCCATAACAT CAACGATACT
GCATTCTTCC AGCAAATGGT GTCATCACCG ACAAGCACAT GGTACTATGC GCCGGATCCG
ACAACGATGA CCGGGATATT CCAGCAGATT GAGGGGCAAA TTCAGAATAC CGCAGGAGTA
AATACAACGA TGGCATTTAA CCTGCAGAGT GTTGCCGTCA ATTACAACAA CGTTTCCACT
ACTTATCCGG GAAATGAGGT ATTCTCATAC GTATATGATG CAAATGATAC TCCTTCGTCA
ACCCAAGTCA TCGATCAAAA TGGAATTACC ACGGTGATCA ACCAGACGGA TCAGTGGAAC
AATAACCAGA CCCTGATATT CAATATAGGC AAGATGACCG TAGGGCAAAC CTGGAGTACG
ACATTTGAGT TGCAACTGCT GAAACCGGGT ACAGTAAATG TTCCAGGAAA CCTTTCAACA
ATATGTTTTA ATACCGGAGA ATGCATGTCG CCAAATCAGG GGACTATTAA CGGAGTATAT
AACTATTCCA ATACCGGTTT TTCCATGCCA ACGATCAGCA TCACTGATCT CCAGGCATTC
GAAGAGAGTC AGTCTACAAA TATTGTACCC TTACAATGGA ATATCAGTTA CCCGGGCAGC
CAGTATGCCA CGGAAACCCT GTATTATAAC TCCCAGGCCG ATCCGACATG GAGATATATC
TACCAGCAGC CGGTCAATCC GGGGAACTGG ACACAAGCCT ATGACTGGAA CGTAGGAGGT
CTCCCGGCGG GAACCTATTC CGTTGAATTG ATCGCATATG CCCATGATGC AAACAGCGGC
AAAGAGATCA TCCCAACCGG AATTCAACTC GGGCTCTCCC CGAAAGCGTT TATCAGACTC
CAGTAA
 
Protein sequence
MQIYRFMILI IGLFLLAGAG SAAAIPDQNS TITSSTSWVV VNHQATITIT ALNTTSSIPV 
PGASVTATLN STTLGSLTVS SGTTDTTGQV NFPFLAGTKT GAVNITATIT YNDNGNLVSV
TKVYTQNIDH DVAQNAVFDF QNEVTVGTET PFNISFTDQY GNPIDNRNPY NPPIISLWIS
ASSDNMAAFN ISGSHVQYTS QELDANGNIS VSVVTDTAPG ENEILLSHFG NVGIPWGLPQ
FIYGINNGVP YSIDESVLPS SLTQPADNGV DHVFTINNTV YDKYGNPTEN QGILVQSNWA
GDPSATMLSN PYGQIMYTYG PHSASGYVVL TATSVANSSV SISKTVQFYN TAPVNMLFSA
TPQVMPSLDA KSGQTAQLVA KVVDTMGNPV ANQTVTFSLG TPTYTYANSS LAGPQLLNTT
AVTNINGTAT VLFVPGRFDT NKENVSYDAT DSGYVTATAT WSSISQPLQL SWKNYPYLSE
LTTISSQNVN VNGKFYVTVS LTADGWNLTG TPADVVIVSD LAAGIGGATR LTHTKAAEVG
FIKNATDNTY VALASFGSAP NAGSTPYDSS DTQNLWNLQL SKSNTTYTYR PFNPYGNVWD
YNLVNPANWN SISSSTAYCF NSSSQKNPSS QSLGLRACVN ATSPYGYTYL NPWSDSKIDA
DLMNAGPTYK TTNQNALVNT VNAYTAYGGT DYAAGINAAL QELQSKGNPS HNQTIIIMGD
GVNMMAPIAP GSFESYWPSD WNPRNGTGIS EGPNLWYLDE SDVGKAAALN ASTTAKNLGI
TIYGIQFPTP DNYGHNINDT AFFQQMVSSP TSTWYYAPDP TTMTGIFQQI EGQIQNTAGV
NTTMAFNLQS VAVNYNNVST TYPGNEVFSY VYDANDTPSS TQVIDQNGIT TVINQTDQWN
NNQTLIFNIG KMTVGQTWST TFELQLLKPG TVNVPGNLST ICFNTGECMS PNQGTINGVY
NYSNTGFSMP TISITDLQAF EESQSTNIVP LQWNISYPGS QYATETLYYN SQADPTWRYI
YQQPVNPGNW TQAYDWNVGG LPAGTYSVEL IAYAHDANSG KEIIPTGIQL GLSPKAFIRL
Q