Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbur_0599 |
Symbol | |
ID | 3996572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | + |
Start bp | 609520 |
End bp | 610431 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637958402 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_565320 |
Protein GI | 91772628 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000000773136 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCCAA AGTTAAAACC TGTTGCAATT AAGGAACGAT TGTCTTTGTT GTTTCTGGAA AAGGGCGAAC TTGATGTAAT CGATGGTGCT TTTGTTCTTG TGGATAAGAA TGGAATACGT TCACAGATAC CAGTAGGTGG TATTGCTTGT TTGATGCTGG AGCCAGGCAC GAGAGTATCA CATGCAGCGA TATCTCTTGC TGCTCGTGTT GGCTGTCTGC TTATATGGGT AGGAGAGGCA GGAGTACGTC TTTATTCCGC AGGGCAGCCA GGGGGTGCTC GTGCCGATCG TTTGTTGTAT CAGGCAAAGC TTGCGTTAGA TGAGGAACTT CGGCTTAAGG TCGTTCGTAA TATGTATGAG ATGCGGTTCA ATGAACCAGT ATCTGAGGGT TATAGTGTTG AACAGATGCG GGGTATGGAG GCTGCACGTG TTAAGAAGAT GTATAAGATA TTCGCTCAGC AATATGGTAT TGAGTGGAAA GGGCGAAATT ATGATGTCGA TGACTGGGAT GCCGGGGATG TTCAGAACAG ATGTCTAAGT TCTGCAACTT CCTGTATATA TGGAGTTGCA GAGGCTGCAA TTCTTGCTGC AGGCTATTCT CCTGCAGTTG GTTTCATTCA TACGGGTAAG CCACGTTCCT TCGTGTATGA TGTTGCTGAT ATATTCAAGT TCGAGACCGT TGTTCCCATT GCATTTAGGA TCGCCTCTGA GAAACATACC AATTATGAAA GAGCCGTGAG ACTTGCTTGT AGAGATGCAT TTAGGCAAAC TCGATTGTTG AAAAGGATCA TTCCAAGTAT TGAGGAGATG CTTTCAGCGG GTGGTATTGA TCTACCTTCT GCAGCAAAGG ATACTCTCCC TCCTGCTATC CCTAATGAAA GAGGTATTGG CGATGTTGGT CATCGTGCTT GA
|
Protein sequence | MLPKLKPVAI KERLSLLFLE KGELDVIDGA FVLVDKNGIR SQIPVGGIAC LMLEPGTRVS HAAISLAARV GCLLIWVGEA GVRLYSAGQP GGARADRLLY QAKLALDEEL RLKVVRNMYE MRFNEPVSEG YSVEQMRGME AARVKKMYKI FAQQYGIEWK GRNYDVDDWD AGDVQNRCLS SATSCIYGVA EAAILAAGYS PAVGFIHTGK PRSFVYDVAD IFKFETVVPI AFRIASEKHT NYERAVRLAC RDAFRQTRLL KRIIPSIEEM LSAGGIDLPS AAKDTLPPAI PNERGIGDVG HRA
|
| |