Gene Information |
Locus tag | Mlab_1200 |
Symbol | |
ID | 4795428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | - |
Start bp | 1224818 |
End bp | 1228048 |
Gene Length | 3231 bp |
Protein Length | 1076 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640099874 |
Product | hypothetical protein |
Protein accession | YP_001030636 |
Protein GI | 124486020 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
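
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 1224818
END_BP = 1228048


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues in the product, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 3231 1076
```

Both results agree with the Gene Length (3231 bp) and Protein Length (1076 aa) fields.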
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.563874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0639023 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGCA AAATCACCTT CAAATTCGTC GACCTCCCCT ATCAGATAGA CGCCGTCAAT TCCGTCGTCG ACCTCTTTTC GGGACAAAAC AAAACCACCG GCGAATCCCT CTACAAAACA CCCGGACGTG CTGAACTCAT CGAAACACGC CTCGGACGAA ATCCCAGACT CGAGATAGGC GACACTCGAC TCCTCAACAA CCTCAAATCC ATCCAGACCC AAAACGACCA CCTCCTCCCC GACGACGAAC TCTTCAACTA CAACTTCTCC GTCGACATGG AAACTGGAAC CGGAAAAACC TACGTCTACC TCAGGACCAT CCTCGAACTC CATAAACAAT ATGGATTTAC CAAATTCATC ATCGTCGTCC CCAGAATCGC CATCCTTCAG GGCGTCAAAA AAAGCATCGA ACAACTCACA GAAACCTTCA AAGCACTCTA CGACGGAATC GACATAAACG CCCGATCCTT CATCTACACA TCATCAAAAA TCGACGAACT CAGAAGCAAA TTCATCGAAG GAACAGACCT TTCCATCGTC ATAATGAACA AAGATGCCTT CAACAAAAGC GGCATCAATA TCATTCAGAA AGACCGTGAA GGCGGGACAA AACTTTGGGA CTTAATCAAA GCGGTAAAAC CAATCATCAT AATCGACGAA CCGCAACTTA TCGAAGGGAC CACCTCCAAA AAGAGTTCAT CCCTTCGCGA ACTCGAAAAT CTTGCACCCC TCTTCACTTT ACGCTACTCC GCGACCCACA AAAATCCCTA TAACTGTGTC TACCGCCTGA CAAGCTATGA TGCATACAAC CAGAACCTCG TCAAAAAGAT CCGGGTAAAA ACCGTATACG GTGAAGTCCC AACTGATTTC CCCTACATCA GATATGTAAC TTTTACAACA AACCTCAAGG CAAGGATCGA AATATTCTCC CGTGATGAAA AACGCGGTAT AAGCAAAAAA CAATACGACG TAAGCGATGG TGACTCGCTT CACGAACTCT CCGGTGGCCT CGAACAGTAT CAAAACATGT TCATTGCCGG AAACCCCCAC AAACTCGATG GACTCACCAT CTCACGCGGG AAACTTGGTC AGCTCACCCT CTTCAATGAA ACCACCGGAT CATCCAACTG GGGAATCACC ATCAATCAGG GTGATGACAG CCTCACACTT ATGCCCGGAG ATTGTACATA TAATGAAAAA CTCCGAGAAG CCGATTCAAC TGTTGTCAGG GTCCAGATCA GAACTGCAAT AAAAAATCAT CTCGACACCC AGTTTAACCT CCTCGAGAAC AAAAAACACA TCAAAGCTAT CTCCCTCTTC TTTATTGACG AAGTTAAACG CATCCGTGAC GATACGCAAG CTGACGGCCG CGGAATCTTC CTTCGGATCT TTGACGAAGA ATACAACACC ATCATCCACG AAAAAAAATA CCAGGACTAC TTCAAAAAAT ACGAAGCTCT ATTCCCGGAA TACAAAAACG TCCTCAACAT CAGAGAAGGC TACTTCGCCA TAGATAAAAA CAACAAAGTC GTCGAGCCTG AAAGCTCCAA AAAGAACAGA CTCAGCATGC TCGAAGAAGA CGACTACAAT AAAAAATCCA AGGAAGACAT CGAACGGGGT ATTGAACTGA TCCTCAAGAA AAAAGAGGAA CTCATATCAT TTCAAACACC CCTGGCCTTC ATCTTCTCCC ACTCCGCGCT AAGAGAGGGG TGGGACAACC CAAACGTCTT CACCCTCTGC ACCCTCAAGG AATCTTCAAA TGAAATGGCA AAAAAACAGG AGATCGGCAG AGGACTCCGT 
CTTCCTGTAG ACATACACGG GGAACGCTGC TATGATGAAG ATGTAAACAT GCTGACGGTC GTTGCAAACT CCTATTACGA CGAATTTGCA GCCCATCTCC AGGCAGATTT CGACGCAGAA CAAAACTTCA ACAAAGACAT AGCTACAGTC TACGAATTTA TCGAGACCCT GAAACAAGCC GGACTACCAC AGAAAAAAAT CACCAAAGAC CTTGCCAAGA CCTTGGAAAA AGAGCTTCGG ATAAAAGCCG TCATCAACTC GCAGGGTAAA CTCAACCCAA AAATCGACAT ACACAACATA TCGTTCACTG ACGCCACGTT AAGCGAGCAT GAAGTCGCCA TCAAAAATGC TTTCATCCAA GTCATGACCG AAAAGGGCAC AAAGAGGATA TATATCGAGA ATGGCGATGA AAAATCCGAA GAAAACTCCC CTCATTCCTA CATAGGTGAA GAATCCTTCA AAAACCTTCT AAATGAACTC ACACTCCGCC TGGAAAAACG CACATTCTAT CAGATCAACA TTGACTCTGC AGACTTCATT CAGGAAGCGG GAAGAAGACT CAATCGTCTC TTAACAACAA AATGGGCAAC CGCCCAGTCG ATCACCACCA CTACCGGCGA AGTCCGTATG AAAGCAGACG GAAAAACCTA CGTCAATGAA CAAAAAGAGG AGTACACAAC CGACAAAACA CCCTTGGTCT GGTACAAAAA AACCGACTTC CAGATAATCA ACTACATCAT GCAGCAGACA AGACTCCCGC GTCTTGCCAT ATACGCAATA CTCAATGAAC TGAGTCCTGA ACTCAAAGAA TACCTCAGGG TTCAGGATAT ATTAGATAAT GCAACACTCG AATTACAAAA AACTCTCACC GAATTCAAAT CTGCCCACAT ATCCGGATAT CAGGTAATCG ATAACTACCT CTTCGACAAA AAAGAAATCT TCGTTCCTGA CAAAATCGAC TCAGAAACGC TCAAAAAATT AGATGATGAT GATGCATTCA CTTTTGATGC ATACAAATCC AAACCAGAAA AGCGTCATGC ACTCTACAAA TACTACAAGA CAGACAGCAA AGGTGAAAAA GAGTTTGCTC AGCAGTTGGA CGAAGATGAA CGCGTCCTCC TCTTCACCAA ACTCCACAAA GGTGGGTTTG TCATCGACAC ACCTGAAGGA AATTACTCTC CAGACTGGGC AGTCATCTAT AAACACTCCG ACGAATTGAC GAGACTCTAT TTCATCGTTG AGACAAAAAT CGACAAAGAA CGTAAAGATC TCAGTGAGGT TGAAAGAGTT AAGATTAGTT GTGGAGAAAT GCACTTCGAT GCAGTATCTA AAGCAATAGG AAAAGATGTT GAGTATCTTT ATGCTAAAAA CTACCGGCAT TTCACCGAGC AGATTGAACG TTTCGAAGAG ATCAACATAC GATCTCCTTA A
|
Protein sequence | MTGKITFKFV DLPYQIDAVN SVVDLFSGQN KTTGESLYKT PGRAELIETR LGRNPRLEIG DTRLLNNLKS IQTQNDHLLP DDELFNYNFS VDMETGTGKT YVYLRTILEL HKQYGFTKFI IVVPRIAILQ GVKKSIEQLT ETFKALYDGI DINARSFIYT SSKIDELRSK FIEGTDLSIV IMNKDAFNKS GINIIQKDRE GGTKLWDLIK AVKPIIIIDE PQLIEGTTSK KSSSLRELEN LAPLFTLRYS ATHKNPYNCV YRLTSYDAYN QNLVKKIRVK TVYGEVPTDF PYIRYVTFTT NLKARIEIFS RDEKRGISKK QYDVSDGDSL HELSGGLEQY QNMFIAGNPH KLDGLTISRG KLGQLTLFNE TTGSSNWGIT INQGDDSLTL MPGDCTYNEK LREADSTVVR VQIRTAIKNH LDTQFNLLEN KKHIKAISLF FIDEVKRIRD DTQADGRGIF LRIFDEEYNT IIHEKKYQDY FKKYEALFPE YKNVLNIREG YFAIDKNNKV VEPESSKKNR LSMLEEDDYN KKSKEDIERG IELILKKKEE LISFQTPLAF IFSHSALREG WDNPNVFTLC TLKESSNEMA KKQEIGRGLR LPVDIHGERC YDEDVNMLTV VANSYYDEFA AHLQADFDAE QNFNKDIATV YEFIETLKQA GLPQKKITKD LAKTLEKELR IKAVINSQGK LNPKIDIHNI SFTDATLSEH EVAIKNAFIQ VMTEKGTKRI YIENGDEKSE ENSPHSYIGE ESFKNLLNEL TLRLEKRTFY QINIDSADFI QEAGRRLNRL LTTKWATAQS ITTTTGEVRM KADGKTYVNE QKEEYTTDKT PLVWYKKTDF QIINYIMQQT RLPRLAIYAI LNELSPELKE YLRVQDILDN ATLELQKTLT EFKSAHISGY QVIDNYLFDK KEIFVPDKID SETLKKLDDD DAFTFDAYKS KPEKRHALYK YYKTDSKGEK EFAQQLDEDE RVLLFTKLHK GGFVIDTPEG NYSPDWAVIY KHSDELTRLY FIVETKIDKE RKDLSEVERV KISCGEMHFD AVSKAIGKDV EYLYAKNYRH FTEQIERFEE INIRSP
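
The relationship between the two sequences can be spot-checked by translating the first and last few codons of the gene. This is a minimal sketch, not the database's own method. It assumes the gene sequence is listed in the coding direction (it begins with ATG and ends with TAA, even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper `translate` is a hypothetical name introduced here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 21 bases of the gene sequence, copied from the record.
gene_start = "ATGACCGGCA AAATCACCTT CAAATTCGTC"
gene_end = "ATCAACATAC GATCTCCTTA A"

print(translate(gene_start))  # MTGKITFKFV
print(translate(gene_end))    # INIRSP
```

The outputs match the first ten and last six residues of the protein sequence listed above.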
|
| |