Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlab_0852 |
Symbol | |
ID | 4795831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | + |
Start bp | 837855 |
End bp | 838889 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640099515 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001030290 |
Protein GI | 124485674 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.749632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAAC TACGTAATGT TTTGTATGTG ACAAATCCCA AGTCCTATCT TTCGCGGGAC GGGGAAAGTA TCGTTGTATC CGTTGAAAAC CAGGAACTTG CCAGAGTCCC GATCCGTAAT CTGGAGGGGG TCGTTTGTTT CGGGTATATG GGGGCAAGTC CGGGGATGAT GGCGCTCTGC ACGGAAAATG ATGTGGGCAT GTGTTTTGTT TCGCCTTATG GAAAATTTAT GGCACGGATC GGTGGAAATG TTTCGGGAAA TGTGCTGCTC CGGAAACGTC AGTACGCCGT TTCGGATGAT GAGGAAGCAT CGAAAAAGAT CGCGGCATAC TGCATTCTCG GGAAACTGAT GAACTGCCGG ACGGTCCTTC AGCGGTTTTC CCGCGATTAT CCGGATATGG TGTCGAGGGA GTTCGAGCAG AATTTCAAAC GGCTGTCTGA GGGGATCCTC CAAATCCGAG CTGGGACCTG TGGTTCGCTG AATGAGCTGA GAGGCTTTGA GGGGATCCTG TCTAAGTATT ATTTTCACAG TTTCAATGAT CTGATCCTGT CAACGGAACC GGAGTTCTCG TTTGAAAACA GGTCAAGGCG TCCTCCTCTG GATCGGGTGA ATGCATTGCT TTCATTTTCC TATACGCTGA TCGCGGCTGA TTGTGCGTCT GCTCTGGAGT CGGTGGGTCT CGATCCGCAG GTCGGTTTTT TGCACCGGGT CCGGCCTGGA AGACCAAGTC TGGCTCTCGA TCTGATGGAG GAGTTTCGCC CGTATCTTGG CGATAGGTTT GTGCTGAGTC TCATCAATAA TCGGGTGGTG AAAGCCGATG ATTTCGCGGT GAAGGAGAAC GGAGCGGTTT TGCTGACGGA TGATGCCCGA AAGACGGTCC TGCAGGCATG GCAGAAGCGA AAGAAGGAGG AGGTCATGCA TGGGTATCTG GAGGAGAAGA TGCCTGTGGG TCTTTTGCCC TATGCCCAGG CGATGCTTCT CGCCCGTTTT CTCCGGGGAG ATATCGACGG GTATCCGCCG TTTGTGGTGA GGTGA
|
Protein sequence | MRKLRNVLYV TNPKSYLSRD GESIVVSVEN QELARVPIRN LEGVVCFGYM GASPGMMALC TENDVGMCFV SPYGKFMARI GGNVSGNVLL RKRQYAVSDD EEASKKIAAY CILGKLMNCR TVLQRFSRDY PDMVSREFEQ NFKRLSEGIL QIRAGTCGSL NELRGFEGIL SKYYFHSFND LILSTEPEFS FENRSRRPPL DRVNALLSFS YTLIAADCAS ALESVGLDPQ VGFLHRVRPG RPSLALDLME EFRPYLGDRF VLSLINNRVV KADDFAVKEN GAVLLTDDAR KTVLQAWQKR KKEEVMHGYL EEKMPVGLLP YAQAMLLARF LRGDIDGYPP FVVR
|
| |