Gene Mpal_1607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1607 
Symbol 
ID7272149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1653483 
End bp1655087 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content55% 
IMG OID643570220 
ProductCRISPR-associated protein, Cse1 family 
Protein accessionYP_002466642 
Protein GI219852210 
COG category 
COG ID 
TIGRFAM ID[TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.982851 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAACC TGATCGAACA GGCATGGATT CCAGTGATCC GAAAAGATGG AGAGCGATCA 
ACGATCGCCC CCTGGGAACT GACTAGCGAC TATCAGGAGA ATCCTATCGT CGAACTGGAT
GCACCGCGGC CGGATTTCAA TGGCGCATTG GTCCAGTTTC TGATCGGTAT CGTCCAGACA
GAGCTCCCTC CAACGAATCC CGTGACATGG AAGCGGATGT TCCGGAGACC TCCTGAACCT
GCAGATCTGA AAGCGTCGTT CAGTACACAT ATAGAGGCGT TCAACCTTGA CGGGGACGGG
CCACGGTTCA TGCAGGATCT GACCCTTGCG AAGGGGGAAG CACTCGCGAT CGATAAACTG
CTGATCGAAA GGCCGGGAGA GCAGACCGTC AAGAAAAACA CCGATCATTT CCTTAAACGA
GGAGGGATCG ATCACCTCTG TATGACCTGT GCCGCAATGG CACTCTTCAC CTTGCAGACC
AATGCCCCAT CTGGAGGAAG GGGGCATCGG ACCTCATTGC GGGGGGGAGG ACCCCTGACC
ACTCTTGTCA CCGGAAGAAC ACTCTGGGAG ACTGTCTGGC TGAATGTGAT CTCACCTCAG
GAACTGGAAC GTTACGGCAA CAGTGCCCTG ACCAGTGCAG CCGATATCTT TCCCTGGATG
GGGGAGACCA GGACCAGCAA CAACAATGAG ATCACGACAC CCCAGGATGT GAATCCTGCC
CAGATGTTCT GGGGAATGCC GCGGAGGATC CGACTCGACC TCGATGGAAA ACCAGAACCC
GGTGAATGTG ATCTCTGTGG AAAAACCACC GAAAGACAGG TCAGTACGTT TTCTGCGAAG
GATAGCGGTG TCAATTACAA GGGTGGATGG TGCCATGTGC TCTCTCCATA TTCGACCAAC
CCCAAGGGAG AACTGCTGGC CAAGCATGCC CAGCCCGGTG GAGTCACCTA TCGGAACTGG
CTGGGACTGG TCCAGAACGA TTCACAGAAC AACAGCCAGC CGGCCGCAGT GGTCTCACTC
TTCCGGGAAC AGCGTCAGCT GGGACTCAAT GGGTTTCAAC CACACCTCTG GGCCTTCGGA
TATGACATGG ATAACATGAA AGCACGCTGC TGGTACGAAG GGAAGATGCC GCTCCATCAT
ATCGACGAGG GACTCTTGCC CGGGTATGAA GAAGAGATCG CACGCCTGGT CAGAACTGCC
GGCCTGATCG GATTCAGTGT CCGGACGTCC ATCAAAAAGG CACTCTTCTC CCGGCCAGAG
GATGCTACCG GGGATCTCTC GTTCATCGAT GCCCGGTTCT GGCAGGACAC CGAGCCGGCA
TTTCATAAAA CGCTCGATGA ACTCGCCACC CTGCTGAAGG ATGGGGGCGA TAGAACTACA
TTGAAGTTGA ACTGGCTGAA GTCTCTCAGA GATGAAGGGA AGCGGCTCTT CGATGACTAT
TCCCAGGCTG ATCTGATTGA TCAGACCGAT CCCAAGAGGG TTGCCCTGGC CTGGCGGGAT
CTCCAACGGT TTACTTCAAG ATTCAATAAA AAGGTCCGCG AAACGCTCGA CCTTCCTATT
GAGGCAAAAC CGGATGAGGC GGATATCCCT GATGCTGGCG TATGA
 
Protein sequence
MLNLIEQAWI PVIRKDGERS TIAPWELTSD YQENPIVELD APRPDFNGAL VQFLIGIVQT 
ELPPTNPVTW KRMFRRPPEP ADLKASFSTH IEAFNLDGDG PRFMQDLTLA KGEALAIDKL
LIERPGEQTV KKNTDHFLKR GGIDHLCMTC AAMALFTLQT NAPSGGRGHR TSLRGGGPLT
TLVTGRTLWE TVWLNVISPQ ELERYGNSAL TSAADIFPWM GETRTSNNNE ITTPQDVNPA
QMFWGMPRRI RLDLDGKPEP GECDLCGKTT ERQVSTFSAK DSGVNYKGGW CHVLSPYSTN
PKGELLAKHA QPGGVTYRNW LGLVQNDSQN NSQPAAVVSL FREQRQLGLN GFQPHLWAFG
YDMDNMKARC WYEGKMPLHH IDEGLLPGYE EEIARLVRTA GLIGFSVRTS IKKALFSRPE
DATGDLSFID ARFWQDTEPA FHKTLDELAT LLKDGGDRTT LKLNWLKSLR DEGKRLFDDY
SQADLIDQTD PKRVALAWRD LQRFTSRFNK KVRETLDLPI EAKPDEADIP DAGV