Gene Mbar_A1014 details


Gene Information

Locus tag: Mbar_A1014
Symbol:
ID: 3625593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 1237407
End bp: 1240304
Gene Length: 2898 bp
Protein Length: 965 aa
Translation table: 11
GC content: 43%
IMG OID: 637699903
Product: type I restriction-modification system restriction subunit
Protein accession: YP_304562
Protein GI: 73668547
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
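
The length fields above are consistent with the coordinates if the start and end positions are read as 1-based and inclusive, which is an assumption here rather than something the record states. A minimal sketch of that arithmetic in Python follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length(1237407, 1240304)
print(length)                  # 2898
print(protein_length(length))  # 965
```

Both results match the Gene Length (2898 bp) and Protein Length (965 aa) fields.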


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCTTTG ACTTTTTGAG GACTACACAA CCTGAGACAT GGGATGAATT TAAAAAACGC 
AGGGGACCGA GGCATAGGGA ACTTTTTCTG GAAAGACTTC AGAAACAGAT TGATAGACGC
GGAGTGCTTG ACCTTCTGCG GAAAGGGATT AAGGACAGTG GGTGCCATTT CAAGCTCGCT
TATTTCAAGC CGGAAAGTGA TCTTAATGAA GAACACAGAC GGCTTTACAA TGGGAATATC
TTTTCTGTAG TGCGGCAGCT TCATTACAGT AAAAAGGAAC CTTTGCTTTC CCTTGATCTT
GCGCTTTTTT TGAATGGGTT TCCGGTAGTT ACAGCAGAAC TTAAAAATCC TCTGAAGGAT
CAGAACGTTC AGGATGCCGT AAAACAGTAC AGGGATACCA GAAACCAGAA CGAGCCTCTT
TTTAAGCTGG GACGTTGTTT TGCTCATTTT GCGGTTGATC CTGACCTTGT ATTCATGACT
ACGGAACTGA AAGGGAGTTA CACGGATTTT CTGCCATTTA ACAGGGGAAG AAACGGAGGA
GCTGGGAATC CTGACAACCC GAATGGGTAC AGGACTTTTT ACCTCTGGAA GCGGGTCTGG
CAGAAAGACC AGATGCTTGA GATTATCCAG AATTTCCTGC AGTTCGTAAA GGAAGAAAAG
GAAGGAAGAA CCGGAAAGAA TAAAAAAGAT GGAAAAGATG GAAAAGAAAA GAAAATTGTC
CGGAAATTCA TTTTCCCGCG CTATCACCAG CTCTATTCTG TAAAAAATCT TGTTACACAT
GCCAAATATC ACGGTACAGG TTATAATTAC CTTATCCAGC ACAGCGCAGG AAGTGGGAAA
AGCAACACCA TAGCCTGGCT CTGTCTGCAA CTTTGCGGGC TGCATGATAC CGAAAACAAG
CGGGTTTTTG ACTCAATTAT TGTTGTTACG GACCGAAGAG TGCTTGACAG GCAGCTCCAG
GAAACCATCA CACAGTTCGA ACATGTCAAG GGCACAGTTG CAACCATCAA AAAAGACAAA
GCAAAAAGCC TGACAGCCGC ACTTCAGGAA GGAAAGGATA TCATCATCTG TACACTTCAG
ACTTTTCCCT TTGCCGTAAA CGCGATCAGT GAGATGTCGG GAAAGCACTT TGCCGTAGTC
GTCGATGAAG CCCATTCTTC CCAGAGTGGG GAAGGGGCAG ACAGCGTAAA AAAGATCCTT
GGATCTGCGG ATCTCGAAAG TGATGAGCAG GAAGAAGAGC CTGAAGATGA TGAAGACCTC
GTAAACCAGA GAGTAGAGGC AGAACTGAAG CACAAAGGCA GGCTTCCCAA TGTCAGTTTC
TTTGCCTTTA CCGCAACCCC GAAAACCAAA ACCCTTGAAC TCTTCGGTAC CCAGCAGCCT
GACGGGAGCT ATGAGCCTTT TAGCCTCTAT ACAATGAAGC AGGCAATTGA GGAAAAATTC
ATCCTGGATG TGCTTGAAAA TTACACTACC TTCAAAGTTT ATTTCAACCT GCTCAAAACA
ATAGAAGACG ACCCCAGATA CAGCAAAAAG AAAGGCATGC ATCTCCTCAA AGCTTATGTG
GACAACCATG ACCACTCCAT AAGGACAAAA ACGGTCATCA TTGTCGAGAA CTTCCGTGAA
CAGGTCATGC ATAGAATTGA CGGGCAGGCA AAGGCAATGC TTGTCACAAA ATCCAGGGCA
AGCGCAGTTA AATATAAACT GGCTTTTGAC AGGTACATAA AGACCCATGA ATATCCTTTC
AAAGCCCTTG TAGCCTTTTC AGGTACGGTT CAGGATGGGG AAACCGGCAG AGAATTTACA
GAAGCAATCA TGAACGGTTT TTCGGAAAGC CAGACTGCAG AAGTTTTCAA GCGGGAAGAA
TACCGCATCC TGATAGTGGC CAATAAATTC CAGACCGGCT TTGACCAGCC TTTATTGTAC
GCAATGTATG TGGATAAAAG GCTCGGGGGC GTAAATGCTG TCCAGACTTT AAGCCGCTTG
AACCGTAAAT ATCCAGATAA AGAAGATACG ATGGTCCTGG ACTTTGAAAA CGAAGCAGAC
CTTATTCAGC GGAGTTTTCA GCCCTACTAC GAAAAAACCC TGCTCACTGA AGCAACTGAC
CCCAATAAGC TCTACGATTT TGAGTATGAG CTTAAAGAAT ACCATATTTT TGAAGAAGAA
GATGTAGAAA CCTTTGCCAG GGAATATTTC TCACCAAAAG GCAAACAAGA AAAGCTTCAT
TCCATACTTT CCCCGGTTGT TGACCGCTAC CTCGCAAAAC CAAAAGAGAA ACAGCATGAT
TTCAGAACGC TGGCTAAAAG ATATGTAAGG CTCTATTCCT ACCTTTCTCA TGTAATTCCC
TTCAAAGACG TTGATCTGGA AAAACTTTAC CAGTTTTTGC GGCACCTGCA CAGGAAACTT
CCAGTTTCAA GAGACCGCCT TCCTGTGGAA ATTACTGAAA ACATAAACAT GGATTCTTAC
CGTATTCAGC AGACCAGCAA CGGAAAGATA GCTCTTGCGG ACGAGGAAGG AAAGCTTGGC
CCTAGCCGGG AACCCGAAAA CTTCAGTGAA CCTGAAGAAG AACTCACACC CCTCTCCATA
ATTATTCAGG AAATAAATGA AAGATTTGGA GATATTGATT TTACAGACGC AGATAGACTA
AGATACTTTT CAGAAGATAT GGAGCGCCGT CTTGTAGAAA ACGAAAACCT TGCACGGGCT
GCAAATCCGG AAATAAACAC AAAGGACAAC TTCAAACTTA TCTATAACGA CTACTTTGAC
GATATCCTGA ACGATATGAT AGATTCAAAC TTTGATCTCT ACACCAAAAT CAATGATGAC
AGAGATTTCG GAAATATATT CAGAAAAGCT CTCTTTGAAA GTGTTTACAG GCATTTGACT
AAAGGAAAGA AAAAATAA
 
Protein sequence
MLFDFLRTTQ PETWDEFKKR RGPRHRELFL ERLQKQIDRR GVLDLLRKGI KDSGCHFKLA 
YFKPESDLNE EHRRLYNGNI FSVVRQLHYS KKEPLLSLDL ALFLNGFPVV TAELKNPLKD
QNVQDAVKQY RDTRNQNEPL FKLGRCFAHF AVDPDLVFMT TELKGSYTDF LPFNRGRNGG
AGNPDNPNGY RTFYLWKRVW QKDQMLEIIQ NFLQFVKEEK EGRTGKNKKD GKDGKEKKIV
RKFIFPRYHQ LYSVKNLVTH AKYHGTGYNY LIQHSAGSGK SNTIAWLCLQ LCGLHDTENK
RVFDSIIVVT DRRVLDRQLQ ETITQFEHVK GTVATIKKDK AKSLTAALQE GKDIIICTLQ
TFPFAVNAIS EMSGKHFAVV VDEAHSSQSG EGADSVKKIL GSADLESDEQ EEEPEDDEDL
VNQRVEAELK HKGRLPNVSF FAFTATPKTK TLELFGTQQP DGSYEPFSLY TMKQAIEEKF
ILDVLENYTT FKVYFNLLKT IEDDPRYSKK KGMHLLKAYV DNHDHSIRTK TVIIVENFRE
QVMHRIDGQA KAMLVTKSRA SAVKYKLAFD RYIKTHEYPF KALVAFSGTV QDGETGREFT
EAIMNGFSES QTAEVFKREE YRILIVANKF QTGFDQPLLY AMYVDKRLGG VNAVQTLSRL
NRKYPDKEDT MVLDFENEAD LIQRSFQPYY EKTLLTEATD PNKLYDFEYE LKEYHIFEEE
DVETFAREYF SPKGKQEKLH SILSPVVDRY LAKPKEKQHD FRTLAKRYVR LYSYLSHVIP
FKDVDLEKLY QFLRHLHRKL PVSRDRLPVE ITENINMDSY RIQQTSNGKI ALADEEGKLG
PSREPENFSE PEEELTPLSI IIQEINERFG DIDFTDADRL RYFSEDMERR LVENENLARA
ANPEINTKDN FKLIYNDYFD DILNDMIDSN FDLYTKINDD RDFGNIFRKA LFESVYRHLT
KGKKK
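
The gene and protein sequences can be cross-checked by translating the DNA. The sketch below strips the 10-base grouping, translates codon by codon, and stops at the first stop codon. It uses the standard amino acid assignments, which translation table 11 shares (the two tables differ only in permitted start codons). The helper names (`clean`, `translate`, `gc_fraction`) are illustrative, and only the first and last stretches of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 18 bases of the gene sequence above.
first_bases = "ATGCTCTTTG ACTTTTTGAG GACTACACAA"
last_bases = "AAAGGAAAGA AAAAATAA"
print(translate(first_bases))  # MLFDFLRTTQ
print(translate(last_bases))   # KGKKK
```

The two outputs agree with the start and end of the protein sequence listed above. Running `gc_fraction` on the full gene sequence gives a value to compare against the GC content field.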