Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_0127 |
Symbol | |
ID | 5410546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | - |
Start bp | 119909 |
End bp | 123154 |
Gene Length | 3246 bp |
Protein Length | 1081 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640867343 |
Product | von Willebrand factor, type A |
Protein accession | YP_001403294 |
Protein GI | 154149676 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGATTT ATCGTTTCAT GATCCTGATT ATCGGACTGT TCCTGCTTGC AGGGGCGGGC AGTGCCGCAG CGATACCGGA TCAAAATTCA ACAATTACCA GCAGTACCAG TTGGGTGGTG GTGAATCACC AGGCAACCAT CACCATTACG GCACTCAATA CAACATCATC CATTCCGGTC CCGGGCGCCT CCGTAACGGC GACGCTTAAC TCGACAACTC TGGGATCGCT GACGGTAAGC AGCGGGACAA CGGATACCAC CGGCCAGGTA AACTTTCCTT TTTTGGCAGG AACAAAAACC GGAGCAGTCA ATATCACGGC CACGATAACC TACAATGATA ATGGAAACCT CGTTTCGGTG ACCAAAGTCT ATACCCAGAA TATCGATCAC GATGTTGCAC AGAATGCAGT GTTCGATTTC CAGAATGAAG TGACCGTGGG AACAGAAACA CCCTTCAACA TCAGCTTTAC CGACCAGTAC GGTAACCCGA TTGATAACAG GAATCCCTAT AATCCCCCCA TCATATCATT GTGGATATCG GCATCCTCGG ACAACATGGC GGCTTTCAAT ATCAGCGGAA GCCATGTCCA GTATACATCG CAGGAGCTTG ATGCCAACGG GAATATTTCT GTCAGCGTTG TGACTGACAC TGCCCCGGGA GAAAACGAGA TCCTGCTCTC TCACTTCGGC AATGTCGGTA TCCCATGGGG GTTACCGCAG TTTATCTATG GCATAAATAA CGGTGTACCC TATTCGATAG ATGAATCTGT TTTACCGTCC TCTCTGACCC AGCCGGCAGA TAATGGAGTG GACCACGTTT TTACCATCAA CAATACGGTA TACGATAAGT ACGGTAACCC TACGGAAAAT CAAGGGATCC TGGTGCAATC GAACTGGGCA GGAGATCCTT CCGCAACTAT GCTGTCAAAC CCCTATGGTC AGATCATGTA TACGTATGGT CCTCACAGCG CATCCGGCTA CGTTGTCTTA ACAGCTACCT CAGTAGCGAA TTCATCGGTC AGTATCTCCA AGACCGTCCA ATTCTATAAC ACCGCACCGG TAAACATGCT CTTCTCGGCA ACACCTCAGG TCATGCCAAG TCTTGATGCA AAATCCGGGC AGACTGCACA GCTTGTCGCA AAAGTGGTAG ATACCATGGG TAACCCGGTG GCTAACCAGA CGGTTACATT CAGTCTGGGA ACGCCCACAT ACACCTACGC GAACAGTTCG CTTGCCGGAC CACAGCTCCT CAATACGACA GCGGTCACCA ATATAAACGG AACGGCCACA GTCCTTTTTG TTCCCGGAAG ATTTGATACG AACAAGGAGA ACGTCAGTTA TGATGCAACA GATTCCGGAT ACGTTACGGC AACGGCAACC TGGAGTTCGA TCAGCCAACC GCTCCAGCTG AGCTGGAAGA ACTATCCCTA CCTGAGCGAA CTGACCACAA TATCATCCCA GAATGTTAAC GTCAACGGAA AATTCTATGT TACGGTCAGC CTCACCGCTG ACGGATGGAA CCTGACCGGA ACACCGGCAG ATGTCGTGAT AGTATCTGAT CTCGCGGCAG GTATCGGAGG AGCGACCAGG CTGACCCACA CAAAAGCTGC AGAGGTCGGC TTTATTAAAA ATGCCACTGA CAACACCTAT GTCGCCCTTG CTTCCTTTGG AAGCGCTCCC AATGCAGGAT CTACACCCTA TGACTCCTCA GATACCCAGA ACCTCTGGAA CCTCCAGCTC TCCAAATCCA ATACTACGTA TACATACAGA CCCTTCAACC CCTATGGCAA TGTCTGGGAT TACAATCTGG TCAATCCCGC TAACTGGAAC AGTATCTCAA GTTCGACTGC ATATTGTTTT AATTCAAGTT CGCAGAAAAA TCCGTCCAGT CAAAGCTTGG GTCTGAGGGC CTGTGTGAAT GCCACAAGCC CCTATGGGTA CACCTACCTC AACCCCTGGT CTGATTCAAA AATAGATGCT GATCTGATGA ACGCCGGACC CACCTATAAG ACCACCAACC AGAACGCTCT TGTCAATACA GTCAATGCAT ATACAGCGTA CGGAGGGACG GATTATGCAG CAGGAATTAA TGCTGCCCTC CAGGAGCTCC AGAGCAAAGG CAATCCCTCC CATAACCAGA CCATCATTAT TATGGGAGAT GGTGTCAACA TGATGGCCCC GATCGCACCG GGATCCTTTG AATCATACTG GCCATCGGAC TGGAACCCCC GGAACGGGAC CGGTATCTCT GAAGGCCCCA ACCTTTGGTA TTTGGACGAA AGCGACGTTG GAAAAGCAGC AGCATTGAAT GCATCAACGA CGGCAAAAAA CCTGGGTATC ACCATCTATG GGATCCAGTT CCCGACACCA GACAATTACG GCCATAACAT CAACGATACT GCATTCTTCC AGCAAATGGT GTCATCACCG ACAAGCACAT GGTACTATGC GCCGGATCCG ACAACGATGA CCGGGATATT CCAGCAGATT GAGGGGCAAA TTCAGAATAC CGCAGGAGTA AATACAACGA TGGCATTTAA CCTGCAGAGT GTTGCCGTCA ATTACAACAA CGTTTCCACT ACTTATCCGG GAAATGAGGT ATTCTCATAC GTATATGATG CAAATGATAC TCCTTCGTCA ACCCAAGTCA TCGATCAAAA TGGAATTACC ACGGTGATCA ACCAGACGGA TCAGTGGAAC AATAACCAGA CCCTGATATT CAATATAGGC AAGATGACCG TAGGGCAAAC CTGGAGTACG ACATTTGAGT TGCAACTGCT GAAACCGGGT ACAGTAAATG TTCCAGGAAA CCTTTCAACA ATATGTTTTA ATACCGGAGA ATGCATGTCG CCAAATCAGG GGACTATTAA CGGAGTATAT AACTATTCCA ATACCGGTTT TTCCATGCCA ACGATCAGCA TCACTGATCT CCAGGCATTC GAAGAGAGTC AGTCTACAAA TATTGTACCC TTACAATGGA ATATCAGTTA CCCGGGCAGC CAGTATGCCA CGGAAACCCT GTATTATAAC TCCCAGGCCG ATCCGACATG GAGATATATC TACCAGCAGC CGGTCAATCC GGGGAACTGG ACACAAGCCT ATGACTGGAA CGTAGGAGGT CTCCCGGCGG GAACCTATTC CGTTGAATTG ATCGCATATG CCCATGATGC AAACAGCGGC AAAGAGATCA TCCCAACCGG AATTCAACTC GGGCTCTCCC CGAAAGCGTT TATCAGACTC CAGTAA
|
Protein sequence | MQIYRFMILI IGLFLLAGAG SAAAIPDQNS TITSSTSWVV VNHQATITIT ALNTTSSIPV PGASVTATLN STTLGSLTVS SGTTDTTGQV NFPFLAGTKT GAVNITATIT YNDNGNLVSV TKVYTQNIDH DVAQNAVFDF QNEVTVGTET PFNISFTDQY GNPIDNRNPY NPPIISLWIS ASSDNMAAFN ISGSHVQYTS QELDANGNIS VSVVTDTAPG ENEILLSHFG NVGIPWGLPQ FIYGINNGVP YSIDESVLPS SLTQPADNGV DHVFTINNTV YDKYGNPTEN QGILVQSNWA GDPSATMLSN PYGQIMYTYG PHSASGYVVL TATSVANSSV SISKTVQFYN TAPVNMLFSA TPQVMPSLDA KSGQTAQLVA KVVDTMGNPV ANQTVTFSLG TPTYTYANSS LAGPQLLNTT AVTNINGTAT VLFVPGRFDT NKENVSYDAT DSGYVTATAT WSSISQPLQL SWKNYPYLSE LTTISSQNVN VNGKFYVTVS LTADGWNLTG TPADVVIVSD LAAGIGGATR LTHTKAAEVG FIKNATDNTY VALASFGSAP NAGSTPYDSS DTQNLWNLQL SKSNTTYTYR PFNPYGNVWD YNLVNPANWN SISSSTAYCF NSSSQKNPSS QSLGLRACVN ATSPYGYTYL NPWSDSKIDA DLMNAGPTYK TTNQNALVNT VNAYTAYGGT DYAAGINAAL QELQSKGNPS HNQTIIIMGD GVNMMAPIAP GSFESYWPSD WNPRNGTGIS EGPNLWYLDE SDVGKAAALN ASTTAKNLGI TIYGIQFPTP DNYGHNINDT AFFQQMVSSP TSTWYYAPDP TTMTGIFQQI EGQIQNTAGV NTTMAFNLQS VAVNYNNVST TYPGNEVFSY VYDANDTPSS TQVIDQNGIT TVINQTDQWN NNQTLIFNIG KMTVGQTWST TFELQLLKPG TVNVPGNLST ICFNTGECMS PNQGTINGVY NYSNTGFSMP TISITDLQAF EESQSTNIVP LQWNISYPGS QYATETLYYN SQADPTWRYI YQQPVNPGNW TQAYDWNVGG LPAGTYSVEL IAYAHDANSG KEIIPTGIQL GLSPKAFIRL Q
|
| |