Gene Mlab_0843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0843 
Symbol 
ID4794830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp822986 
End bp825937 
Gene Length2952 bp 
Protein Length983 aa 
Translation table11 
GC content41% 
IMG OID640099506 
Producthypothetical protein 
Protein accessionYP_001030281 
Protein GI124485665 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.10122 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACCA ACACAAAAGA AATCGGACTC GAAGACTTGA TCGTACAACA TCTCACAAGT 
CAGAACGGAT ACGAACTGGG GATCGCAAGT GAATATATCC GGTCTTCGGC CTTTGATGAA
GGAAGGCTGT TTAGATTCCT TGAAACGACC CAGGGAACCA AACTCTCACA ATACGGAATA
CTCGAAAGCG AGAAAAAAAG AGAACAACTT GCAACAAGAA TACAAGGGGA AATTGAAAAA
CGCGGAACGA TCGATGTACT CAGAAATGGT GTTAACTTCT ATCCCGCTGG AAACATCGAC
CTCTACTACT TCCAGCCCTC AGAAAAAAAT CCGGCTGCCA AAGCAAACTT TGAAAAAAAT
ATCTTCAGTG TAACAAGACA ACTGATGTAT GCCCAGAGCG GCTCAAATGA AGCTCTCGAC
TTCTGCATAT TCATCAACGG ACTCCCGGTA ATAACCGCTG AACTCAAAAA CCAGTTCACC
CATCAGAATT ACGAAAACGC GATCGAACAA TACAAAAAAG ATAGAAATCC CCGTGAATTA
CTGTTTCACT TTGGAAGATG CATAGTTCAC TTCGCAATTG ATGACAGTGA AATTCACATG
TGTACAAAAC TGGAAGGCCA AAACTCCTGG TTCTTACCCT TCAACAAAGG ATACAAAGAC
GGAGCCGGCA ACCCCCCAAA TCCCGCAGGA CTCAAAACAG ACTACCTCTG GACCCATACA
CTCACAAAAA ATGAACTATC CGACATCATA GAAAACTATG CTCAGATACT TGAAGAAAAA
GATCCTGACA CAGGAAAAAC AAAACGAAAA CAAATATTCC CCAGATACCA TCAGCTGTCA
GTCGTCAGAG CCTTACTCGC CGACGCAAAA GAAAACGGAG TCGGGAAAAG ATACCTCATT
CAGCACTCCG CAGGAAGCGG AAAATCCAAC TCCATAGCCT GGCTTGCTCT TCAAATCGTA
TCACTTGAAA AAAATGGGAA AGCACTCTTC GATTCTGTCA TCGTAATTAC CGATAGAGTC
AACCTCGACA ACCAGATAAA AGGAACCATC AAAGGCTTCA CCCAGATGTC GAATACCATT
GGTCATGCAG AGAGCGCCGA AGATCTAAGA AAACTCCTGA CCAACGGAAA AAACATCATT
ATCTCAACAA TTCACAAATT CCCCTACATA TTAGACAGCA TTAAAGACGA ACACAAAGGA
CAGAACTTCG CGATCATAAT AGACGAAGCA CATTCAAGCC AGAGCGGAAG GATGGCTGGA
TCAATGAACG TGGTTCTGAA CAGCGCCCCT GAATACGAAA ACGAAACAGA ATCCATTGAA
GACATAATCA ACAACATAGT TGAAAAACGG AAAATGCTCA AAAATGCAAG CTATTTTGCA
TTTACCGCCA CACCCAAAAA CAAAACACTC GAAACATTCG GTGTTCCCTA CGAAGACGAA
GGAAAAATCA AACACAGACC TTTCCACGAA TACACCATGA AACAGGCAAT TCAGGAAGGA
TTCATCCTTG ACGTTCTTCA GAATTACACA CCTGTAAAAA GCTACTTCAA ACTTATAAAG
AAAGTTGAAA ATGATCCTGA GTTTGACAAA AGAAAAGCCC TGAAAAAATT AAAGGCACTC
GTGGAAAGAG ACATTTATCC AATCTCTCAG AAAGCAGAGA TCATCGTTGA ACACTTCCAC
ACACAGGTGA TTTCACCCGG GAAAATTCAT GGAAAAGCCC GGTGTATGGT TGTCTGCAGG
GACATCGACG CAGCTATAAA ATACTACAAC ACAATAAGCA AAGCCCTTGA AAACAGAAAA
AGTCAGTACA AAGCAATAAT AGCATTTTCC GGCGAAAAAG AGATCGCCGG AGAAACATTC
ACCGAATCAT CAATAAACAA ATTCCCAAGC AGTCAAATCG AAAAGAAATT CAAAGAAGAA
CCTTATCGTT TCTTAGTAGT TGCAAACAAA TTTCAGACCG GATACGACGA ACCTCTTCTT
CATACGATGT ACGTTGACAA ACCTCTCTCT GATATAAAAG CAGTCCAGAC GTTATCCAGA
CTCAACCGGG CATACCCTGG CAAATATGAT ACATTCGTCC TTGACTTTTT CAATGACAGC
GATATAATCA AACAAGCCTT TGACAGATAT TATAAGACCA CCATACTCTC TGATGAAACA
GATGTAAACA AACTCTATGA TTTAATTAGA GAGATGGAAG ATGCTGAAGT ATTCACAACC
AATGACGTGA ATAAAGTTGT CACTGATTAT CTCAGCGGAG TAGAAAGAAA CAGATTTGAT
CCAACACTTG ACTCCTGCGC TGCAAACTAT GAAAATTATC TTGATCTGGA CGGTCAGATT
AAATTCAAGA GCTCAGTAAA ATCATATCTG AGGACATATG GATTCCTTGC TTCGATTCTC
CCTATTGGAA ATCCAGAGTG GGAAAAACTA TCAATCTTCC TGAATTATCT GCTGCCAAAA
TTAAAGTCAC CTGATGATGA TGATCTCGCA AAAGGCATTC TGAGTGCGGT TGATCTGGAA
AGTTATCGTG CTGAAGCTTT GACAACAATA AATATAATCC TCGAAGATGA AGACGCTGAA
ATAAATCCGA TCCCGACAAA AGACCCTAAA GGAATAGTTG AGCCTGAAAT GGATCGCTTA
TCACACATAT TAGAAGAGTT CCAGAATCTC TGGGGTAACA TCAAATGGAA TGATAAAGAC
AGAATTATCA GAGACATCAA AGAAATTCCG GGACGTGTTG CAAAAGATGA AGCCTATCAG
AATGCAATCA AAAACTCTGA CAGAGAAAAT GCACGTCTGG AAAGTGAGAG AGCACTGAAG
AAAGTAATTG ATTCTATGAT TGAAGATAAT TTGGAACTTT ACAGATTATA TGTCGATAAT
CTATCTTTCA GGAAGGAGTT GGAGAGTCTG GTTTTCAACG CAACCTATGA GAAAGACTGC
TCAAAAGCAT AG
 
Protein sequence
MPTNTKEIGL EDLIVQHLTS QNGYELGIAS EYIRSSAFDE GRLFRFLETT QGTKLSQYGI 
LESEKKREQL ATRIQGEIEK RGTIDVLRNG VNFYPAGNID LYYFQPSEKN PAAKANFEKN
IFSVTRQLMY AQSGSNEALD FCIFINGLPV ITAELKNQFT HQNYENAIEQ YKKDRNPREL
LFHFGRCIVH FAIDDSEIHM CTKLEGQNSW FLPFNKGYKD GAGNPPNPAG LKTDYLWTHT
LTKNELSDII ENYAQILEEK DPDTGKTKRK QIFPRYHQLS VVRALLADAK ENGVGKRYLI
QHSAGSGKSN SIAWLALQIV SLEKNGKALF DSVIVITDRV NLDNQIKGTI KGFTQMSNTI
GHAESAEDLR KLLTNGKNII ISTIHKFPYI LDSIKDEHKG QNFAIIIDEA HSSQSGRMAG
SMNVVLNSAP EYENETESIE DIINNIVEKR KMLKNASYFA FTATPKNKTL ETFGVPYEDE
GKIKHRPFHE YTMKQAIQEG FILDVLQNYT PVKSYFKLIK KVENDPEFDK RKALKKLKAL
VERDIYPISQ KAEIIVEHFH TQVISPGKIH GKARCMVVCR DIDAAIKYYN TISKALENRK
SQYKAIIAFS GEKEIAGETF TESSINKFPS SQIEKKFKEE PYRFLVVANK FQTGYDEPLL
HTMYVDKPLS DIKAVQTLSR LNRAYPGKYD TFVLDFFNDS DIIKQAFDRY YKTTILSDET
DVNKLYDLIR EMEDAEVFTT NDVNKVVTDY LSGVERNRFD PTLDSCAANY ENYLDLDGQI
KFKSSVKSYL RTYGFLASIL PIGNPEWEKL SIFLNYLLPK LKSPDDDDLA KGILSAVDLE
SYRAEALTTI NIILEDEDAE INPIPTKDPK GIVEPEMDRL SHILEEFQNL WGNIKWNDKD
RIIRDIKEIP GRVAKDEAYQ NAIKNSDREN ARLESERALK KVIDSMIEDN LELYRLYVDN
LSFRKELESL VFNATYEKDC SKA