Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A2662 |
Symbol | |
ID | 3626713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | - |
Start bp | 3370021 |
End bp | 3373251 |
Gene Length | 3231 bp |
Protein Length | 1076 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637701518 |
Product | putative type I restriction enzyme R protein |
Protein accession | YP_306148 |
Protein GI | 73670133 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.322424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.707209 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGAAAA TGGTTGCCGA GATAAAAAAT AATGACGGAT CAGCGAGAAA CGATGTTTTA AAAGAAGAAG ATGAATCAGG TTCTGGTGAT TATAGTGAAG GCACCCTCGT AGAAGAGTCC GCAATCCGAG AATTCGAAAA ACTGGGTTAT ACTTTTTTAA ACTGCTTCAA TGAAAAAATC AGGCTTGACG GAAAAGGGAC ACTGGGCAGG AAAACAAAAT CCGAAGTATT GCTTTTCAGG AAACTGAGAG AGGCAATCAA AAAACTCAAT CCTGAGATTT GCCCCGAAGC TGAAGAGTCC GCAATCCGGG AACTGGCAAA AGACAGGAGC AGGCTTAGCC CTGTAAAGGC AAATCAGGAA ATTTATTCAT TAATCAAAAA CGGGGTCAAG GTCAAAGTCC GAAATGAAAA AGGGGAAATT GAAGACCGGA CCATAAAGAT TATCGATTTT GAAAATCCCG AAAATAACGA CTTCTTTTTA GCTTCCCAGT TCTGGATTAT GGGAGAAATG GAATCAAGGC GGACCGACCT TCTGGGATTT GTTAACGGGA TTCCTTTAAT TTTTCTTGAG CTAAAATCGA CTGCCAGAAG GGTCAAAGAG GCTTTTGACG ATAATCTTAC AGACTATAAA GAAACAATTC CACAGCTTTT CTGGTACAAT GCATTTATAA TCCTCTCCAA CGGCAGAGAC TCAAAAATCG GCACTGTTAC AAGCGGTTTT GATCATTTTG GGGAATGGAA AAGGGTTGAA GATGAAAATG AAACAGGAAG TACCCTGCTG GATACAATGA TAAAGGGAAC CTGCGAAAAA AGCCGCTTCC TTGATATCCT GGAAAACTTC ACGCTTTTTT CTAGCAGTGA AGGGCACTCA GTCAAAATTG TTGCCAAAAA TCACCAGTAC CTTGGAGTTA ATAATGCCAT AGAGTCCTTC AAAAAACGCA ATGAGGATGA AGGAAAAATC GGTGTCTTCT GGCACACTCA GGGTTCTGGA AAAAGCTATT CCATGATATT TTTTACCCAA AAAATCCTGC GAAAATTTCC TGGCAATTAC ACCTTCGTGG TCGTAACCGA CAGGGATGAA CTGGACGAGC AGATATATCA GAACTTCCAG AACGCAGGCG TAATAAGCGA AGTTGGCGTA CAGGCCAGAA GTAGTAGGCA TCTCAGGCAA CTGCTAACTG AAGACCACAG GCTGGTTTTC ACTTTGATCC ACAAATTCGG GACTAACAAA GGAGTTAAGC ATCCCCTACT TTCGGATAGA GATGACATTA TAGTAATTGC AGACGAAGCC CACAGGACCC AGTACGACAC CCTGGCGCAA AACATGAGAG ATGCTCTGCC GAATGCGAAT TTTATAGGTT TTACAGGCAC GCCTTTAATG GCTGGCGAAG AAAAGACAAA AGACACATTC GGAGACTACG TAAGCATCTA CAACTTCATA CAATCGATAG AAGATGGAGC AACCGTCCCC CTCTATTATG AAAATCGCGT GCCTGAAGTC CAGTTGCATA ACGAGAGCCT CAATGATGAT ATTTATCGAG AAATCGAAAA AGCAGGTTTG AATGATGAAG AAGAATCCAA ACTTGCTACC GACTTTGCAA AGGAATACCA GATAATCGTA AGGGAAGAAC GCCTGGAAAC GATTGCAAAA GACATTGTAA CGCATTATAC CACACGTGGT TACGCCGGAA AAGCCCTTGT AGTATCCATT GATAAGCTTA CAACGGTCAG GATGTACGAT AAAGTCCAGA AATACTGGAA AATGCATATT AACGAACTCA AAGAAAAGAG AAAAGAAATA ACTGAAGGTA AGAAAGCTAT AGATCTGGAA AAACAGATCT CCGAACTTGA AAACACTGAT ATGGCAGTCG TTATCAGCGA AGGGCAGAAT GAGGTAAAGA AATTTGAGGA AAATGGGCTG GATATAAGGC CACACAGGAA AAGAATATCT GTGGAAGATC TGGAAGAAAT TTTCAAGGAT TCAACAAGTA AGCTCAAAAT TGTTTTCGTG TGCAGCAAAT GGAGAGAAGG GTTCGATGTC CCCTCGCTTT CCACGATTTA TCTTGACAGG CCCATGAAGG ACCACAGCCT CATGCAAACG ATTGCCAGGG CAAACAGGGT CTTTGGAGAT AAACCCGGAG GCTTCATTGT CGACTATGCT GGTATTTTCA GAAATCTTGA ACAGGCCCTC AAGATTTATG CAACGCCCAG ATCTGGCGGA GTTGAAATCC CACTCAAGCC GAAAGAAAAA CTTGTGGAAG CCCTTGAACA GAAGCTTAAA GAAATAAATA AATTCCTTTC CGGCCTTTCG GTAAATCCTG ACAAAATCAT CAAGATGAAG TCGAGTTTTG ATAAAATAAA TCTCTTGAAA AACGCAACTG ATGCTATTCT CATAAACGAG TCAACAAAAA AGAAGTTTTT AACCGAAGCT GGGACTGCTC TAAAAATATA CAAATCTATA CTTCCCCACA AACGGGCCTC AGAGTTTTTT CCACAGGTAA CTCTTTATGA AGAACTGGTA AACGAAATTC GTTCTCTTGA TCCTGAAGTC GATATCTCAA GAGTAACGGA CGGTATACAG CGAGTCCTCG ATACATCGAT TAAGCCCAGA GAATATATAA TCAAGGAATC AAAAAAGGGC AGAATAATTG CTCTTAGAGA TATCGACTTT GATGCCCTGG CAGATAGATT CGATAAGCAG CATAAGAACA CTGAGTTTGA ATGGCTAAAA AATCTTCTCT CGTACAAACT AAAAGAAATG GTTAAAATTA ATAACAGCCG CCTTGATTAC CAGAAAAGTT TTGAAAAACT GATCGAAAAT TATAATTCAG GCTCTGATAA CTCAGATTAT CCTTACAGAG AACTAATCGA ATTTGCTAAA AAGCTGAAAA AAGAAGATGA GAGAGCAATA AAGGAAAATT TAACTGAAGA AGAACTCTCC CTCTTTGACA AGCTAAAAAA ACCTGATCTA ACCGAGAAAG ATAAAAAACA GGTAAAGCAG GTTGCAAGAG ATCTGCTTTC CACACTAAAA GCTGAGAAAC TCGTTCTGGA CTGGCGGAAA AAACAGCAGG CAATGGCCGC AGTTAAAAAA GAAATTGAAG ACGAACTTGA TAAGGGACTG CCCGAATCAT ACGACTCCAG AATCTATGAA GAGAAATGCA ATACAGTCTT TCAGCACATT TATGATTCTT ATGCCGGTGA CAGCCATAGC ATTTATGAAG CAATAGCCTG A
|
Protein sequence | MLKMVAEIKN NDGSARNDVL KEEDESGSGD YSEGTLVEES AIREFEKLGY TFLNCFNEKI RLDGKGTLGR KTKSEVLLFR KLREAIKKLN PEICPEAEES AIRELAKDRS RLSPVKANQE IYSLIKNGVK VKVRNEKGEI EDRTIKIIDF ENPENNDFFL ASQFWIMGEM ESRRTDLLGF VNGIPLIFLE LKSTARRVKE AFDDNLTDYK ETIPQLFWYN AFIILSNGRD SKIGTVTSGF DHFGEWKRVE DENETGSTLL DTMIKGTCEK SRFLDILENF TLFSSSEGHS VKIVAKNHQY LGVNNAIESF KKRNEDEGKI GVFWHTQGSG KSYSMIFFTQ KILRKFPGNY TFVVVTDRDE LDEQIYQNFQ NAGVISEVGV QARSSRHLRQ LLTEDHRLVF TLIHKFGTNK GVKHPLLSDR DDIIVIADEA HRTQYDTLAQ NMRDALPNAN FIGFTGTPLM AGEEKTKDTF GDYVSIYNFI QSIEDGATVP LYYENRVPEV QLHNESLNDD IYREIEKAGL NDEEESKLAT DFAKEYQIIV REERLETIAK DIVTHYTTRG YAGKALVVSI DKLTTVRMYD KVQKYWKMHI NELKEKRKEI TEGKKAIDLE KQISELENTD MAVVISEGQN EVKKFEENGL DIRPHRKRIS VEDLEEIFKD STSKLKIVFV CSKWREGFDV PSLSTIYLDR PMKDHSLMQT IARANRVFGD KPGGFIVDYA GIFRNLEQAL KIYATPRSGG VEIPLKPKEK LVEALEQKLK EINKFLSGLS VNPDKIIKMK SSFDKINLLK NATDAILINE STKKKFLTEA GTALKIYKSI LPHKRASEFF PQVTLYEELV NEIRSLDPEV DISRVTDGIQ RVLDTSIKPR EYIIKESKKG RIIALRDIDF DALADRFDKQ HKNTEFEWLK NLLSYKLKEM VKINNSRLDY QKSFEKLIEN YNSGSDNSDY PYRELIEFAK KLKKEDERAI KENLTEEELS LFDKLKKPDL TEKDKKQVKQ VARDLLSTLK AEKLVLDWRK KQQAMAAVKK EIEDELDKGL PESYDSRIYE EKCNTVFQHI YDSYAGDSHS IYEAIA
|
| |