Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A1014 |
Symbol | |
ID | 3625593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | - |
Start bp | 1237407 |
End bp | 1240304 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637699903 |
Product | type I restriction-modification system restriction subunit |
Protein accession | YP_304562 |
Protein GI | 73668547 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCTTTG ACTTTTTGAG GACTACACAA CCTGAGACAT GGGATGAATT TAAAAAACGC AGGGGACCGA GGCATAGGGA ACTTTTTCTG GAAAGACTTC AGAAACAGAT TGATAGACGC GGAGTGCTTG ACCTTCTGCG GAAAGGGATT AAGGACAGTG GGTGCCATTT CAAGCTCGCT TATTTCAAGC CGGAAAGTGA TCTTAATGAA GAACACAGAC GGCTTTACAA TGGGAATATC TTTTCTGTAG TGCGGCAGCT TCATTACAGT AAAAAGGAAC CTTTGCTTTC CCTTGATCTT GCGCTTTTTT TGAATGGGTT TCCGGTAGTT ACAGCAGAAC TTAAAAATCC TCTGAAGGAT CAGAACGTTC AGGATGCCGT AAAACAGTAC AGGGATACCA GAAACCAGAA CGAGCCTCTT TTTAAGCTGG GACGTTGTTT TGCTCATTTT GCGGTTGATC CTGACCTTGT ATTCATGACT ACGGAACTGA AAGGGAGTTA CACGGATTTT CTGCCATTTA ACAGGGGAAG AAACGGAGGA GCTGGGAATC CTGACAACCC GAATGGGTAC AGGACTTTTT ACCTCTGGAA GCGGGTCTGG CAGAAAGACC AGATGCTTGA GATTATCCAG AATTTCCTGC AGTTCGTAAA GGAAGAAAAG GAAGGAAGAA CCGGAAAGAA TAAAAAAGAT GGAAAAGATG GAAAAGAAAA GAAAATTGTC CGGAAATTCA TTTTCCCGCG CTATCACCAG CTCTATTCTG TAAAAAATCT TGTTACACAT GCCAAATATC ACGGTACAGG TTATAATTAC CTTATCCAGC ACAGCGCAGG AAGTGGGAAA AGCAACACCA TAGCCTGGCT CTGTCTGCAA CTTTGCGGGC TGCATGATAC CGAAAACAAG CGGGTTTTTG ACTCAATTAT TGTTGTTACG GACCGAAGAG TGCTTGACAG GCAGCTCCAG GAAACCATCA CACAGTTCGA ACATGTCAAG GGCACAGTTG CAACCATCAA AAAAGACAAA GCAAAAAGCC TGACAGCCGC ACTTCAGGAA GGAAAGGATA TCATCATCTG TACACTTCAG ACTTTTCCCT TTGCCGTAAA CGCGATCAGT GAGATGTCGG GAAAGCACTT TGCCGTAGTC GTCGATGAAG CCCATTCTTC CCAGAGTGGG GAAGGGGCAG ACAGCGTAAA AAAGATCCTT GGATCTGCGG ATCTCGAAAG TGATGAGCAG GAAGAAGAGC CTGAAGATGA TGAAGACCTC GTAAACCAGA GAGTAGAGGC AGAACTGAAG CACAAAGGCA GGCTTCCCAA TGTCAGTTTC TTTGCCTTTA CCGCAACCCC GAAAACCAAA ACCCTTGAAC TCTTCGGTAC CCAGCAGCCT GACGGGAGCT ATGAGCCTTT TAGCCTCTAT ACAATGAAGC AGGCAATTGA GGAAAAATTC ATCCTGGATG TGCTTGAAAA TTACACTACC TTCAAAGTTT ATTTCAACCT GCTCAAAACA ATAGAAGACG ACCCCAGATA CAGCAAAAAG AAAGGCATGC ATCTCCTCAA AGCTTATGTG GACAACCATG ACCACTCCAT AAGGACAAAA ACGGTCATCA TTGTCGAGAA CTTCCGTGAA CAGGTCATGC ATAGAATTGA CGGGCAGGCA AAGGCAATGC TTGTCACAAA ATCCAGGGCA AGCGCAGTTA AATATAAACT GGCTTTTGAC AGGTACATAA AGACCCATGA ATATCCTTTC AAAGCCCTTG TAGCCTTTTC AGGTACGGTT CAGGATGGGG AAACCGGCAG AGAATTTACA GAAGCAATCA TGAACGGTTT TTCGGAAAGC CAGACTGCAG AAGTTTTCAA GCGGGAAGAA TACCGCATCC TGATAGTGGC CAATAAATTC CAGACCGGCT TTGACCAGCC TTTATTGTAC GCAATGTATG TGGATAAAAG GCTCGGGGGC GTAAATGCTG TCCAGACTTT AAGCCGCTTG AACCGTAAAT ATCCAGATAA AGAAGATACG ATGGTCCTGG ACTTTGAAAA CGAAGCAGAC CTTATTCAGC GGAGTTTTCA GCCCTACTAC GAAAAAACCC TGCTCACTGA AGCAACTGAC CCCAATAAGC TCTACGATTT TGAGTATGAG CTTAAAGAAT ACCATATTTT TGAAGAAGAA GATGTAGAAA CCTTTGCCAG GGAATATTTC TCACCAAAAG GCAAACAAGA AAAGCTTCAT TCCATACTTT CCCCGGTTGT TGACCGCTAC CTCGCAAAAC CAAAAGAGAA ACAGCATGAT TTCAGAACGC TGGCTAAAAG ATATGTAAGG CTCTATTCCT ACCTTTCTCA TGTAATTCCC TTCAAAGACG TTGATCTGGA AAAACTTTAC CAGTTTTTGC GGCACCTGCA CAGGAAACTT CCAGTTTCAA GAGACCGCCT TCCTGTGGAA ATTACTGAAA ACATAAACAT GGATTCTTAC CGTATTCAGC AGACCAGCAA CGGAAAGATA GCTCTTGCGG ACGAGGAAGG AAAGCTTGGC CCTAGCCGGG AACCCGAAAA CTTCAGTGAA CCTGAAGAAG AACTCACACC CCTCTCCATA ATTATTCAGG AAATAAATGA AAGATTTGGA GATATTGATT TTACAGACGC AGATAGACTA AGATACTTTT CAGAAGATAT GGAGCGCCGT CTTGTAGAAA ACGAAAACCT TGCACGGGCT GCAAATCCGG AAATAAACAC AAAGGACAAC TTCAAACTTA TCTATAACGA CTACTTTGAC GATATCCTGA ACGATATGAT AGATTCAAAC TTTGATCTCT ACACCAAAAT CAATGATGAC AGAGATTTCG GAAATATATT CAGAAAAGCT CTCTTTGAAA GTGTTTACAG GCATTTGACT AAAGGAAAGA AAAAATAA
|
Protein sequence | MLFDFLRTTQ PETWDEFKKR RGPRHRELFL ERLQKQIDRR GVLDLLRKGI KDSGCHFKLA YFKPESDLNE EHRRLYNGNI FSVVRQLHYS KKEPLLSLDL ALFLNGFPVV TAELKNPLKD QNVQDAVKQY RDTRNQNEPL FKLGRCFAHF AVDPDLVFMT TELKGSYTDF LPFNRGRNGG AGNPDNPNGY RTFYLWKRVW QKDQMLEIIQ NFLQFVKEEK EGRTGKNKKD GKDGKEKKIV RKFIFPRYHQ LYSVKNLVTH AKYHGTGYNY LIQHSAGSGK SNTIAWLCLQ LCGLHDTENK RVFDSIIVVT DRRVLDRQLQ ETITQFEHVK GTVATIKKDK AKSLTAALQE GKDIIICTLQ TFPFAVNAIS EMSGKHFAVV VDEAHSSQSG EGADSVKKIL GSADLESDEQ EEEPEDDEDL VNQRVEAELK HKGRLPNVSF FAFTATPKTK TLELFGTQQP DGSYEPFSLY TMKQAIEEKF ILDVLENYTT FKVYFNLLKT IEDDPRYSKK KGMHLLKAYV DNHDHSIRTK TVIIVENFRE QVMHRIDGQA KAMLVTKSRA SAVKYKLAFD RYIKTHEYPF KALVAFSGTV QDGETGREFT EAIMNGFSES QTAEVFKREE YRILIVANKF QTGFDQPLLY AMYVDKRLGG VNAVQTLSRL NRKYPDKEDT MVLDFENEAD LIQRSFQPYY EKTLLTEATD PNKLYDFEYE LKEYHIFEEE DVETFAREYF SPKGKQEKLH SILSPVVDRY LAKPKEKQHD FRTLAKRYVR LYSYLSHVIP FKDVDLEKLY QFLRHLHRKL PVSRDRLPVE ITENINMDSY RIQQTSNGKI ALADEEGKLG PSREPENFSE PEEELTPLSI IIQEINERFG DIDFTDADRL RYFSEDMERR LVENENLARA ANPEINTKDN FKLIYNDYFD DILNDMIDSN FDLYTKINDD RDFGNIFRKA LFESVYRHLT KGKKK
|
| |