Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlab_0843 |
Symbol | |
ID | 4794830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | + |
Start bp | 822986 |
End bp | 825937 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640099506 |
Product | hypothetical protein |
Protein accession | YP_001030281 |
Protein GI | 124485665 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.10122 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACCA ACACAAAAGA AATCGGACTC GAAGACTTGA TCGTACAACA TCTCACAAGT CAGAACGGAT ACGAACTGGG GATCGCAAGT GAATATATCC GGTCTTCGGC CTTTGATGAA GGAAGGCTGT TTAGATTCCT TGAAACGACC CAGGGAACCA AACTCTCACA ATACGGAATA CTCGAAAGCG AGAAAAAAAG AGAACAACTT GCAACAAGAA TACAAGGGGA AATTGAAAAA CGCGGAACGA TCGATGTACT CAGAAATGGT GTTAACTTCT ATCCCGCTGG AAACATCGAC CTCTACTACT TCCAGCCCTC AGAAAAAAAT CCGGCTGCCA AAGCAAACTT TGAAAAAAAT ATCTTCAGTG TAACAAGACA ACTGATGTAT GCCCAGAGCG GCTCAAATGA AGCTCTCGAC TTCTGCATAT TCATCAACGG ACTCCCGGTA ATAACCGCTG AACTCAAAAA CCAGTTCACC CATCAGAATT ACGAAAACGC GATCGAACAA TACAAAAAAG ATAGAAATCC CCGTGAATTA CTGTTTCACT TTGGAAGATG CATAGTTCAC TTCGCAATTG ATGACAGTGA AATTCACATG TGTACAAAAC TGGAAGGCCA AAACTCCTGG TTCTTACCCT TCAACAAAGG ATACAAAGAC GGAGCCGGCA ACCCCCCAAA TCCCGCAGGA CTCAAAACAG ACTACCTCTG GACCCATACA CTCACAAAAA ATGAACTATC CGACATCATA GAAAACTATG CTCAGATACT TGAAGAAAAA GATCCTGACA CAGGAAAAAC AAAACGAAAA CAAATATTCC CCAGATACCA TCAGCTGTCA GTCGTCAGAG CCTTACTCGC CGACGCAAAA GAAAACGGAG TCGGGAAAAG ATACCTCATT CAGCACTCCG CAGGAAGCGG AAAATCCAAC TCCATAGCCT GGCTTGCTCT TCAAATCGTA TCACTTGAAA AAAATGGGAA AGCACTCTTC GATTCTGTCA TCGTAATTAC CGATAGAGTC AACCTCGACA ACCAGATAAA AGGAACCATC AAAGGCTTCA CCCAGATGTC GAATACCATT GGTCATGCAG AGAGCGCCGA AGATCTAAGA AAACTCCTGA CCAACGGAAA AAACATCATT ATCTCAACAA TTCACAAATT CCCCTACATA TTAGACAGCA TTAAAGACGA ACACAAAGGA CAGAACTTCG CGATCATAAT AGACGAAGCA CATTCAAGCC AGAGCGGAAG GATGGCTGGA TCAATGAACG TGGTTCTGAA CAGCGCCCCT GAATACGAAA ACGAAACAGA ATCCATTGAA GACATAATCA ACAACATAGT TGAAAAACGG AAAATGCTCA AAAATGCAAG CTATTTTGCA TTTACCGCCA CACCCAAAAA CAAAACACTC GAAACATTCG GTGTTCCCTA CGAAGACGAA GGAAAAATCA AACACAGACC TTTCCACGAA TACACCATGA AACAGGCAAT TCAGGAAGGA TTCATCCTTG ACGTTCTTCA GAATTACACA CCTGTAAAAA GCTACTTCAA ACTTATAAAG AAAGTTGAAA ATGATCCTGA GTTTGACAAA AGAAAAGCCC TGAAAAAATT AAAGGCACTC GTGGAAAGAG ACATTTATCC AATCTCTCAG AAAGCAGAGA TCATCGTTGA ACACTTCCAC ACACAGGTGA TTTCACCCGG GAAAATTCAT GGAAAAGCCC GGTGTATGGT TGTCTGCAGG GACATCGACG CAGCTATAAA ATACTACAAC ACAATAAGCA AAGCCCTTGA AAACAGAAAA AGTCAGTACA AAGCAATAAT AGCATTTTCC GGCGAAAAAG AGATCGCCGG AGAAACATTC ACCGAATCAT CAATAAACAA ATTCCCAAGC AGTCAAATCG AAAAGAAATT CAAAGAAGAA CCTTATCGTT TCTTAGTAGT TGCAAACAAA TTTCAGACCG GATACGACGA ACCTCTTCTT CATACGATGT ACGTTGACAA ACCTCTCTCT GATATAAAAG CAGTCCAGAC GTTATCCAGA CTCAACCGGG CATACCCTGG CAAATATGAT ACATTCGTCC TTGACTTTTT CAATGACAGC GATATAATCA AACAAGCCTT TGACAGATAT TATAAGACCA CCATACTCTC TGATGAAACA GATGTAAACA AACTCTATGA TTTAATTAGA GAGATGGAAG ATGCTGAAGT ATTCACAACC AATGACGTGA ATAAAGTTGT CACTGATTAT CTCAGCGGAG TAGAAAGAAA CAGATTTGAT CCAACACTTG ACTCCTGCGC TGCAAACTAT GAAAATTATC TTGATCTGGA CGGTCAGATT AAATTCAAGA GCTCAGTAAA ATCATATCTG AGGACATATG GATTCCTTGC TTCGATTCTC CCTATTGGAA ATCCAGAGTG GGAAAAACTA TCAATCTTCC TGAATTATCT GCTGCCAAAA TTAAAGTCAC CTGATGATGA TGATCTCGCA AAAGGCATTC TGAGTGCGGT TGATCTGGAA AGTTATCGTG CTGAAGCTTT GACAACAATA AATATAATCC TCGAAGATGA AGACGCTGAA ATAAATCCGA TCCCGACAAA AGACCCTAAA GGAATAGTTG AGCCTGAAAT GGATCGCTTA TCACACATAT TAGAAGAGTT CCAGAATCTC TGGGGTAACA TCAAATGGAA TGATAAAGAC AGAATTATCA GAGACATCAA AGAAATTCCG GGACGTGTTG CAAAAGATGA AGCCTATCAG AATGCAATCA AAAACTCTGA CAGAGAAAAT GCACGTCTGG AAAGTGAGAG AGCACTGAAG AAAGTAATTG ATTCTATGAT TGAAGATAAT TTGGAACTTT ACAGATTATA TGTCGATAAT CTATCTTTCA GGAAGGAGTT GGAGAGTCTG GTTTTCAACG CAACCTATGA GAAAGACTGC TCAAAAGCAT AG
|
Protein sequence | MPTNTKEIGL EDLIVQHLTS QNGYELGIAS EYIRSSAFDE GRLFRFLETT QGTKLSQYGI LESEKKREQL ATRIQGEIEK RGTIDVLRNG VNFYPAGNID LYYFQPSEKN PAAKANFEKN IFSVTRQLMY AQSGSNEALD FCIFINGLPV ITAELKNQFT HQNYENAIEQ YKKDRNPREL LFHFGRCIVH FAIDDSEIHM CTKLEGQNSW FLPFNKGYKD GAGNPPNPAG LKTDYLWTHT LTKNELSDII ENYAQILEEK DPDTGKTKRK QIFPRYHQLS VVRALLADAK ENGVGKRYLI QHSAGSGKSN SIAWLALQIV SLEKNGKALF DSVIVITDRV NLDNQIKGTI KGFTQMSNTI GHAESAEDLR KLLTNGKNII ISTIHKFPYI LDSIKDEHKG QNFAIIIDEA HSSQSGRMAG SMNVVLNSAP EYENETESIE DIINNIVEKR KMLKNASYFA FTATPKNKTL ETFGVPYEDE GKIKHRPFHE YTMKQAIQEG FILDVLQNYT PVKSYFKLIK KVENDPEFDK RKALKKLKAL VERDIYPISQ KAEIIVEHFH TQVISPGKIH GKARCMVVCR DIDAAIKYYN TISKALENRK SQYKAIIAFS GEKEIAGETF TESSINKFPS SQIEKKFKEE PYRFLVVANK FQTGYDEPLL HTMYVDKPLS DIKAVQTLSR LNRAYPGKYD TFVLDFFNDS DIIKQAFDRY YKTTILSDET DVNKLYDLIR EMEDAEVFTT NDVNKVVTDY LSGVERNRFD PTLDSCAANY ENYLDLDGQI KFKSSVKSYL RTYGFLASIL PIGNPEWEKL SIFLNYLLPK LKSPDDDDLA KGILSAVDLE SYRAEALTTI NIILEDEDAE INPIPTKDPK GIVEPEMDRL SHILEEFQNL WGNIKWNDKD RIIRDIKEIP GRVAKDEAYQ NAIKNSDREN ARLESERALK KVIDSMIEDN LELYRLYVDN LSFRKELESL VFNATYEKDC SKA
|
| |