Gene Information |
Locus tag | Mpal_1607 |
Symbol | |
ID | 7272149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 1653483 |
End bp | 1655087 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643570220 |
Product | CRISPR-associated protein, Cse1 family |
Protein accession | YP_002466642 |
Protein GI | 219852210 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 |
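The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1653483
end_bp = 1655087

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1605, matches "Gene Length | 1605 bp"
print(protein_length)  # 534, matches "Protein Length | 534 aa"
```

Under those assumptions the computed values agree with the reported Gene Length and Protein Length.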
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.982851 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAACC TGATCGAACA GGCATGGATT CCAGTGATCC GAAAAGATGG AGAGCGATCA ACGATCGCCC CCTGGGAACT GACTAGCGAC TATCAGGAGA ATCCTATCGT CGAACTGGAT GCACCGCGGC CGGATTTCAA TGGCGCATTG GTCCAGTTTC TGATCGGTAT CGTCCAGACA GAGCTCCCTC CAACGAATCC CGTGACATGG AAGCGGATGT TCCGGAGACC TCCTGAACCT GCAGATCTGA AAGCGTCGTT CAGTACACAT ATAGAGGCGT TCAACCTTGA CGGGGACGGG CCACGGTTCA TGCAGGATCT GACCCTTGCG AAGGGGGAAG CACTCGCGAT CGATAAACTG CTGATCGAAA GGCCGGGAGA GCAGACCGTC AAGAAAAACA CCGATCATTT CCTTAAACGA GGAGGGATCG ATCACCTCTG TATGACCTGT GCCGCAATGG CACTCTTCAC CTTGCAGACC AATGCCCCAT CTGGAGGAAG GGGGCATCGG ACCTCATTGC GGGGGGGAGG ACCCCTGACC ACTCTTGTCA CCGGAAGAAC ACTCTGGGAG ACTGTCTGGC TGAATGTGAT CTCACCTCAG GAACTGGAAC GTTACGGCAA CAGTGCCCTG ACCAGTGCAG CCGATATCTT TCCCTGGATG GGGGAGACCA GGACCAGCAA CAACAATGAG ATCACGACAC CCCAGGATGT GAATCCTGCC CAGATGTTCT GGGGAATGCC GCGGAGGATC CGACTCGACC TCGATGGAAA ACCAGAACCC GGTGAATGTG ATCTCTGTGG AAAAACCACC GAAAGACAGG TCAGTACGTT TTCTGCGAAG GATAGCGGTG TCAATTACAA GGGTGGATGG TGCCATGTGC TCTCTCCATA TTCGACCAAC CCCAAGGGAG AACTGCTGGC CAAGCATGCC CAGCCCGGTG GAGTCACCTA TCGGAACTGG CTGGGACTGG TCCAGAACGA TTCACAGAAC AACAGCCAGC CGGCCGCAGT GGTCTCACTC TTCCGGGAAC AGCGTCAGCT GGGACTCAAT GGGTTTCAAC CACACCTCTG GGCCTTCGGA TATGACATGG ATAACATGAA AGCACGCTGC TGGTACGAAG GGAAGATGCC GCTCCATCAT ATCGACGAGG GACTCTTGCC CGGGTATGAA GAAGAGATCG CACGCCTGGT CAGAACTGCC GGCCTGATCG GATTCAGTGT CCGGACGTCC ATCAAAAAGG CACTCTTCTC CCGGCCAGAG GATGCTACCG GGGATCTCTC GTTCATCGAT GCCCGGTTCT GGCAGGACAC CGAGCCGGCA TTTCATAAAA CGCTCGATGA ACTCGCCACC CTGCTGAAGG ATGGGGGCGA TAGAACTACA TTGAAGTTGA ACTGGCTGAA GTCTCTCAGA GATGAAGGGA AGCGGCTCTT CGATGACTAT TCCCAGGCTG ATCTGATTGA TCAGACCGAT CCCAAGAGGG TTGCCCTGGC CTGGCGGGAT CTCCAACGGT TTACTTCAAG ATTCAATAAA AAGGTCCGCG AAACGCTCGA CCTTCCTATT GAGGCAAAAC CGGATGAGGC GGATATCCCT GATGCTGGCG TATGA
|
Protein sequence | MLNLIEQAWI PVIRKDGERS TIAPWELTSD YQENPIVELD APRPDFNGAL VQFLIGIVQT ELPPTNPVTW KRMFRRPPEP ADLKASFSTH IEAFNLDGDG PRFMQDLTLA KGEALAIDKL LIERPGEQTV KKNTDHFLKR GGIDHLCMTC AAMALFTLQT NAPSGGRGHR TSLRGGGPLT TLVTGRTLWE TVWLNVISPQ ELERYGNSAL TSAADIFPWM GETRTSNNNE ITTPQDVNPA QMFWGMPRRI RLDLDGKPEP GECDLCGKTT ERQVSTFSAK DSGVNYKGGW CHVLSPYSTN PKGELLAKHA QPGGVTYRNW LGLVQNDSQN NSQPAAVVSL FREQRQLGLN GFQPHLWAFG YDMDNMKARC WYEGKMPLHH IDEGLLPGYE EEIARLVRTA GLIGFSVRTS IKKALFSRPE DATGDLSFID ARFWQDTEPA FHKTLDELAT LLKDGGDRTT LKLNWLKSLR DEGKRLFDDY SQADLIDQTD PKRVALAWRD LQRFTSRFNK KVRETLDLPI EAKPDEADIP DAGV
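The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the Python below strips that spacing and translates codon by codon. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG and ends with TGA, although the record lists the strand as "-"). Translation table 11 assigns the same amino acid to each codon as the standard code and differs in which codons may act as start codons. This gene starts with ATG, so the sketch does no special start-codon handling. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; the codon-to-amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence in the record.
print(translate("ATGTTAAACC TGATCGAACA GGCATGGATT"))  # MLNLIEQAWI
print(translate("GATGCTGGCG TATGA"))                  # DAGV
```

The two fragments reproduce the first block (MLNLIEQAWI) and the final residues (DAGV) of the protein sequence listed in the record. Passing the full gene sequence to `translate` would allow the same comparison over all 534 residues.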