Gene Mlab_1200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1200 
Symbol 
ID4795428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1224818 
End bp1228048 
Gene Length3231 bp 
Protein Length1076 aa 
Translation table11 
GC content44% 
IMG OID640099874 
Producthypothetical protein 
Protein accessionYP_001030636 
Protein GI124486020 
COG category[V] Defense mechanisms 
COG ID[COG3587] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.563874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0639023 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCA AAATCACCTT CAAATTCGTC GACCTCCCCT ATCAGATAGA CGCCGTCAAT 
TCCGTCGTCG ACCTCTTTTC GGGACAAAAC AAAACCACCG GCGAATCCCT CTACAAAACA
CCCGGACGTG CTGAACTCAT CGAAACACGC CTCGGACGAA ATCCCAGACT CGAGATAGGC
GACACTCGAC TCCTCAACAA CCTCAAATCC ATCCAGACCC AAAACGACCA CCTCCTCCCC
GACGACGAAC TCTTCAACTA CAACTTCTCC GTCGACATGG AAACTGGAAC CGGAAAAACC
TACGTCTACC TCAGGACCAT CCTCGAACTC CATAAACAAT ATGGATTTAC CAAATTCATC
ATCGTCGTCC CCAGAATCGC CATCCTTCAG GGCGTCAAAA AAAGCATCGA ACAACTCACA
GAAACCTTCA AAGCACTCTA CGACGGAATC GACATAAACG CCCGATCCTT CATCTACACA
TCATCAAAAA TCGACGAACT CAGAAGCAAA TTCATCGAAG GAACAGACCT TTCCATCGTC
ATAATGAACA AAGATGCCTT CAACAAAAGC GGCATCAATA TCATTCAGAA AGACCGTGAA
GGCGGGACAA AACTTTGGGA CTTAATCAAA GCGGTAAAAC CAATCATCAT AATCGACGAA
CCGCAACTTA TCGAAGGGAC CACCTCCAAA AAGAGTTCAT CCCTTCGCGA ACTCGAAAAT
CTTGCACCCC TCTTCACTTT ACGCTACTCC GCGACCCACA AAAATCCCTA TAACTGTGTC
TACCGCCTGA CAAGCTATGA TGCATACAAC CAGAACCTCG TCAAAAAGAT CCGGGTAAAA
ACCGTATACG GTGAAGTCCC AACTGATTTC CCCTACATCA GATATGTAAC TTTTACAACA
AACCTCAAGG CAAGGATCGA AATATTCTCC CGTGATGAAA AACGCGGTAT AAGCAAAAAA
CAATACGACG TAAGCGATGG TGACTCGCTT CACGAACTCT CCGGTGGCCT CGAACAGTAT
CAAAACATGT TCATTGCCGG AAACCCCCAC AAACTCGATG GACTCACCAT CTCACGCGGG
AAACTTGGTC AGCTCACCCT CTTCAATGAA ACCACCGGAT CATCCAACTG GGGAATCACC
ATCAATCAGG GTGATGACAG CCTCACACTT ATGCCCGGAG ATTGTACATA TAATGAAAAA
CTCCGAGAAG CCGATTCAAC TGTTGTCAGG GTCCAGATCA GAACTGCAAT AAAAAATCAT
CTCGACACCC AGTTTAACCT CCTCGAGAAC AAAAAACACA TCAAAGCTAT CTCCCTCTTC
TTTATTGACG AAGTTAAACG CATCCGTGAC GATACGCAAG CTGACGGCCG CGGAATCTTC
CTTCGGATCT TTGACGAAGA ATACAACACC ATCATCCACG AAAAAAAATA CCAGGACTAC
TTCAAAAAAT ACGAAGCTCT ATTCCCGGAA TACAAAAACG TCCTCAACAT CAGAGAAGGC
TACTTCGCCA TAGATAAAAA CAACAAAGTC GTCGAGCCTG AAAGCTCCAA AAAGAACAGA
CTCAGCATGC TCGAAGAAGA CGACTACAAT AAAAAATCCA AGGAAGACAT CGAACGGGGT
ATTGAACTGA TCCTCAAGAA AAAAGAGGAA CTCATATCAT TTCAAACACC CCTGGCCTTC
ATCTTCTCCC ACTCCGCGCT AAGAGAGGGG TGGGACAACC CAAACGTCTT CACCCTCTGC
ACCCTCAAGG AATCTTCAAA TGAAATGGCA AAAAAACAGG AGATCGGCAG AGGACTCCGT
CTTCCTGTAG ACATACACGG GGAACGCTGC TATGATGAAG ATGTAAACAT GCTGACGGTC
GTTGCAAACT CCTATTACGA CGAATTTGCA GCCCATCTCC AGGCAGATTT CGACGCAGAA
CAAAACTTCA ACAAAGACAT AGCTACAGTC TACGAATTTA TCGAGACCCT GAAACAAGCC
GGACTACCAC AGAAAAAAAT CACCAAAGAC CTTGCCAAGA CCTTGGAAAA AGAGCTTCGG
ATAAAAGCCG TCATCAACTC GCAGGGTAAA CTCAACCCAA AAATCGACAT ACACAACATA
TCGTTCACTG ACGCCACGTT AAGCGAGCAT GAAGTCGCCA TCAAAAATGC TTTCATCCAA
GTCATGACCG AAAAGGGCAC AAAGAGGATA TATATCGAGA ATGGCGATGA AAAATCCGAA
GAAAACTCCC CTCATTCCTA CATAGGTGAA GAATCCTTCA AAAACCTTCT AAATGAACTC
ACACTCCGCC TGGAAAAACG CACATTCTAT CAGATCAACA TTGACTCTGC AGACTTCATT
CAGGAAGCGG GAAGAAGACT CAATCGTCTC TTAACAACAA AATGGGCAAC CGCCCAGTCG
ATCACCACCA CTACCGGCGA AGTCCGTATG AAAGCAGACG GAAAAACCTA CGTCAATGAA
CAAAAAGAGG AGTACACAAC CGACAAAACA CCCTTGGTCT GGTACAAAAA AACCGACTTC
CAGATAATCA ACTACATCAT GCAGCAGACA AGACTCCCGC GTCTTGCCAT ATACGCAATA
CTCAATGAAC TGAGTCCTGA ACTCAAAGAA TACCTCAGGG TTCAGGATAT ATTAGATAAT
GCAACACTCG AATTACAAAA AACTCTCACC GAATTCAAAT CTGCCCACAT ATCCGGATAT
CAGGTAATCG ATAACTACCT CTTCGACAAA AAAGAAATCT TCGTTCCTGA CAAAATCGAC
TCAGAAACGC TCAAAAAATT AGATGATGAT GATGCATTCA CTTTTGATGC ATACAAATCC
AAACCAGAAA AGCGTCATGC ACTCTACAAA TACTACAAGA CAGACAGCAA AGGTGAAAAA
GAGTTTGCTC AGCAGTTGGA CGAAGATGAA CGCGTCCTCC TCTTCACCAA ACTCCACAAA
GGTGGGTTTG TCATCGACAC ACCTGAAGGA AATTACTCTC CAGACTGGGC AGTCATCTAT
AAACACTCCG ACGAATTGAC GAGACTCTAT TTCATCGTTG AGACAAAAAT CGACAAAGAA
CGTAAAGATC TCAGTGAGGT TGAAAGAGTT AAGATTAGTT GTGGAGAAAT GCACTTCGAT
GCAGTATCTA AAGCAATAGG AAAAGATGTT GAGTATCTTT ATGCTAAAAA CTACCGGCAT
TTCACCGAGC AGATTGAACG TTTCGAAGAG ATCAACATAC GATCTCCTTA A
 
Protein sequence
MTGKITFKFV DLPYQIDAVN SVVDLFSGQN KTTGESLYKT PGRAELIETR LGRNPRLEIG 
DTRLLNNLKS IQTQNDHLLP DDELFNYNFS VDMETGTGKT YVYLRTILEL HKQYGFTKFI
IVVPRIAILQ GVKKSIEQLT ETFKALYDGI DINARSFIYT SSKIDELRSK FIEGTDLSIV
IMNKDAFNKS GINIIQKDRE GGTKLWDLIK AVKPIIIIDE PQLIEGTTSK KSSSLRELEN
LAPLFTLRYS ATHKNPYNCV YRLTSYDAYN QNLVKKIRVK TVYGEVPTDF PYIRYVTFTT
NLKARIEIFS RDEKRGISKK QYDVSDGDSL HELSGGLEQY QNMFIAGNPH KLDGLTISRG
KLGQLTLFNE TTGSSNWGIT INQGDDSLTL MPGDCTYNEK LREADSTVVR VQIRTAIKNH
LDTQFNLLEN KKHIKAISLF FIDEVKRIRD DTQADGRGIF LRIFDEEYNT IIHEKKYQDY
FKKYEALFPE YKNVLNIREG YFAIDKNNKV VEPESSKKNR LSMLEEDDYN KKSKEDIERG
IELILKKKEE LISFQTPLAF IFSHSALREG WDNPNVFTLC TLKESSNEMA KKQEIGRGLR
LPVDIHGERC YDEDVNMLTV VANSYYDEFA AHLQADFDAE QNFNKDIATV YEFIETLKQA
GLPQKKITKD LAKTLEKELR IKAVINSQGK LNPKIDIHNI SFTDATLSEH EVAIKNAFIQ
VMTEKGTKRI YIENGDEKSE ENSPHSYIGE ESFKNLLNEL TLRLEKRTFY QINIDSADFI
QEAGRRLNRL LTTKWATAQS ITTTTGEVRM KADGKTYVNE QKEEYTTDKT PLVWYKKTDF
QIINYIMQQT RLPRLAIYAI LNELSPELKE YLRVQDILDN ATLELQKTLT EFKSAHISGY
QVIDNYLFDK KEIFVPDKID SETLKKLDDD DAFTFDAYKS KPEKRHALYK YYKTDSKGEK
EFAQQLDEDE RVLLFTKLHK GGFVIDTPEG NYSPDWAVIY KHSDELTRLY FIVETKIDKE
RKDLSEVERV KISCGEMHFD AVSKAIGKDV EYLYAKNYRH FTEQIERFEE INIRSP