Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA0651 |
Symbol | cas1 |
ID | 3102489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 685018 |
End bp | 686052 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637169862 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_113164 |
Protein GI | 53804988 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.280265 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTGC TGCAAAACAC CCTCTACGTC ACCACGCCCG AGGCTTATCT ACGCCTGGAG GGCGAGACCG TGTGCGTGAT GATCGAGGAG CAAAAACGCC TGCAGGTCCC GCTGCATCAT CTATCCGCCT TCGTGCTGTT CGATCACGTC ATGCTCAGCC CTGCCTTGCT TGGGCGCTGC GCCGAGGATG GCCGATCGGT CGTATGGCTC GATCGGGCGG GCCGGTTCAG GGCGCGGCTG GAAGGGCCGG TGAACGGCAA CATACTTCTG CGCCAGGCGC AATTCCGCGC GGCGGAAGAC GGTGCGCAAA CCCTGGGCCT GGCTCGCGCG GTATTGGCCG GCAAGCTGCG TAACAGCCGG CAGTTGTTGA TGCGGGGCGC GCGGGAAACG GACGACGTGG TCGAAAGAGA CGCTTTGGTG CGCGCCGCCA AGCTCATCGC CAATCAAGTG CGCAAGCTCC CTTTGGCGCA AGATCTCGAC ACCCTACGGG GCTTGGAAGG CGATGTGGCC AGGCTTTATT TCGAGGCCCT GCCCAAAGTC ATGAGGGCGA AGGCGCGGGC CGAGTTCCCT TTCGACTGCC GCAACCGGCG CCCGCCGCGC GACCGTTTCA ACGCATTGCT CTCTTTCCTG TATGCCCTGG TGTTGAGCGA CTGCCGGGCG GCCCTGGAAA CCGTCGGGCT CGATCCGCAA TTGGGCTTTC TCCATGCCGT GCGTCCCGGT CGTCCGGCCT TGGCGCTGGA TCTGTTGGAA GAATTCCGCG CCCCGCTGGC GGACCGGCTG GCCTTGACCC TCGTCAACCG CGGACAATTG CAGGCCAGCG ATTTCGACGA ACGGGAAGGC GGTGCCGTGC TGCTCAACGA CAAGGGCCGC AAGACCGTCA TCGCCGCCTA TCAAACCCGC AAGCAGGAAG CAATCACGCA TCCGCTGCTC AAACAAACCC TGCCGATCGG CCTGCTGCCT CATTGGCAAG CGCGCTTGCT GGCCCGCTAT CTGCGCCAGG ACGTCGCGCA TTACGTGCCT TATCTGCACC GCTGA
|
Protein sequence | MTVLQNTLYV TTPEAYLRLE GETVCVMIEE QKRLQVPLHH LSAFVLFDHV MLSPALLGRC AEDGRSVVWL DRAGRFRARL EGPVNGNILL RQAQFRAAED GAQTLGLARA VLAGKLRNSR QLLMRGARET DDVVERDALV RAAKLIANQV RKLPLAQDLD TLRGLEGDVA RLYFEALPKV MRAKARAEFP FDCRNRRPPR DRFNALLSFL YALVLSDCRA ALETVGLDPQ LGFLHAVRPG RPALALDLLE EFRAPLADRL ALTLVNRGQL QASDFDEREG GAVLLNDKGR KTVIAAYQTR KQEAITHPLL KQTLPIGLLP HWQARLLARY LRQDVAHYVP YLHR
|
| |