Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A3119 |
Symbol | |
ID | 3626570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | - |
Start bp | 4009517 |
End bp | 4011169 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637701958 |
Product | CRISPR-associated Cas1/Cas4 family protein |
Protein accession | YP_306586 |
Protein GI | 73670571 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1468] RecB family exonuclease [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR00372] CRISPR-associated protein Cas4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000141365 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGAAC CTGTAGCTAG GTATGAAGAG CCGAAACTGA TTCCTGCAAG AATGCTAAAT GAATTTGTAT ATTGTCCACG TTTATGTTAT ATGGAGTGGG TACAGGGAGA ATTTGAGCAT AGTGAAGATA CTTTGGAAGG AAAATTTGTT CATCGAAATG TAGATCAAGA AAAAACCAAA GATTTACAAG GTGAAGAAAA GAAGATTCAT AGCACTTCTG TGATGCTTTC GGGTTATGAG ACTGGTGTGA TTACAAGAAT TGACCTTCTA GAAGAATCAA ACGGAAAAGC CGTGCCAGTT GAATACAAAA AGGGCTATGT TCCAGATATT CCTGAAAAAG TCTATGAACC GGAGAGAATT CAATTATGTG CTCAAGGCTT GGTGTTGAAA GAAAAGGGAT TTGATTGCAC GGAAGGAGTT ATTTATTTTG TTAATTCAAA AAAAAGAGTG GCAGTTGACT TCGATGAGGA GCTTATCCAA AAAACTAAAG AAACAATTTT GAGGTTTCTT GAGACAATTG GAAAAAAAGA GATTCCGGCT CCTCTTGAAA ATAGTCCTAA ATGCTCCAGA TGTTCACTTT CGGGAATTTG TCTTCCCGAT GAGACTAATA TTTTAAAAGG TTCAGCCTCA CAAGTTAGAT CATTAAATGT ATCTAAAGAC GACAAAAAAC CCGTTTATGT TACTGGATGG GGAACTTCAG TACACAAGAA AGGAGATAGG TTGGTTATTA AGAAGAATGA TGAAGAACTG CAGAGCGTCC CGTTAAGGCA AATATCCCAA CTATCAATTT ATGGGGATGC TCATATTTCT TTGCCGGTAC TAAGAAGTTT AATTGAAATG AATGTGCCTG TGTGTTATTT TTCCTTTGGG GGTTGGTTTT ATGGGCTATC ACATGGGGTT ATGAGTAAAA ATGTTGATTT AAGGATTCAT CAATATCAGA CTGCTTTTGA TTCAGAAAGG TCACTGGCAA TTTCTCGCAA GATGATTGCT GGCAAAATCA AAAATTGTCG GACTTTATTA AGAAGAAACG ACACTGAAGT TTCAGAGAAA ATTCTCTCTC AATTAAATTC TCTTGAAAAA AAGGCATCGA ATGCAAAAGA AATCGGACAG CTTTTAGGTA TAGAAGGAAC TGCAGCCCAA ATTTATTTTT CGAGATTTGG CAATATGTTG AAACAAGATC TTGACTGTAA ATTTGAAAAT CGAAATAAAA GGCCTCCTAC CGACCCGGTA AATGCTGTTC TTTCATATTT ATATGGTATA TTGACAAAAG AAGTCTTTGT AACTTTATTT TCGGTAGGTT TTGATCCTTA TATGGGTTTC TATCACCAAC CTAAATATGG AAAACCAGCT CTAGCACTTG ATTTAATGGA AGAGTTTCGG CCATTGATAG CAGATTCTGT TGCTCTAACG CTTTTCAACA ACAAAACAGT GACACTAGAA GATTTCGAAA TAACAAATTT TGGAGTTTCG TTAAAGGACA ATACAAAAAA GAAAATAATT AGTGGATATG AGAGAAGAAT AAATACGGAA ATAACTCACC CTATCTTTGG GTACAAAGCA AGTTATAGAA GAATTTTGGA AATACAGGTT AGACTTTTAG GCAGAACCGT TACTAAAGAA ATAGAATCAT ACACTCCATT TTGTACAAGA TAA
|
Protein sequence | MDEPVARYEE PKLIPARMLN EFVYCPRLCY MEWVQGEFEH SEDTLEGKFV HRNVDQEKTK DLQGEEKKIH STSVMLSGYE TGVITRIDLL EESNGKAVPV EYKKGYVPDI PEKVYEPERI QLCAQGLVLK EKGFDCTEGV IYFVNSKKRV AVDFDEELIQ KTKETILRFL ETIGKKEIPA PLENSPKCSR CSLSGICLPD ETNILKGSAS QVRSLNVSKD DKKPVYVTGW GTSVHKKGDR LVIKKNDEEL QSVPLRQISQ LSIYGDAHIS LPVLRSLIEM NVPVCYFSFG GWFYGLSHGV MSKNVDLRIH QYQTAFDSER SLAISRKMIA GKIKNCRTLL RRNDTEVSEK ILSQLNSLEK KASNAKEIGQ LLGIEGTAAQ IYFSRFGNML KQDLDCKFEN RNKRPPTDPV NAVLSYLYGI LTKEVFVTLF SVGFDPYMGF YHQPKYGKPA LALDLMEEFR PLIADSVALT LFNNKTVTLE DFEITNFGVS LKDNTKKKII SGYERRINTE ITHPIFGYKA SYRRILEIQV RLLGRTVTKE IESYTPFCTR
|
| |