Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1608 |
Symbol | |
ID | 7272150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 1655225 |
End bp | 1657963 |
Gene Length | 2739 bp |
Protein Length | 912 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643570221 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_002466643 |
Protein GI | 219852211 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.152444 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.436385 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACCG GATCTGACAA TTATTATTCC TACTGGGGAA AGGCACAGAG GGAGGGTGAA CCAGAGAAGG GGGCCGGTTA TCACCTTCTC GTGTACCACT GCCTCGATGT TGCAGCGGTG GGACAGAAAT TATTAGAACA AGATCCTTTG CTGATGCACA GGTTCACAGC CCTGACCGGA TTCAACGACG ATCAGGTCCT TCACCTCATT CCATTTCTCC TCGCACTCCA CGATCTCGGG AAATTTTCAG ACCGCTTCCA GAACCTGAAG CCCGATCTGA TGGAACTGCT GCAGGGGAAT AGAGGCTCAC GACGCTACCC CATACGCCAT GACTCAATGG CTCTCTATCT CTTTGAACAG GATATCCGGG ATCTGGCATG GAAGAAAAAC TGGTTCTGGA TGGATCATGA TCTTCCCTAC GCTGAAGATG AATGGTTTGA AATCTTTGCT CCCCTGTTCC GAGCGGTCAG CGGTCATCAT GGAGAACCCC CCCAGCATGA CGAAACTGTG AGCGTGGATT CCCTGTTTCT GGACGGAGAC CGGGCTGCAG CCCGGGACTT TGCTGTTCAG GCTGCCACGC TTCTTTTGGG CTCCGGCTCC TATCCGCCAA TCGAATATTC TGAGGGCTGG TTCGAACAGT TCTCCACCGC ATCATGGCTC CTCGCCGGCC TGGTGACTTT GAGTGACTGG CTTGGATCGA ACCGGACATT CTTTCCATTC CAGACAGAAG CCATGGACCT GGCTGTGTAC TGGAAGAATT TCGCCCAGAA GGGTGCAGAG GATGCTGTTA GGGATGCCGG GGTGCTGCCG GCCACGGTCT CATCGAACTG TGGGATGGCC GGCCTCTTTC CACCAGATCT GCCTCCCAAA ATTACCAGTC CTTCCCCCAT TCAGACGTAC CTCTCATCCT GTTCACTTAC CCCGGAACCA CGCCTCACTA TCATCGAAGA GACGACCGGG GGTGGAAAGA CTGAAGCAGC TCTGGTCCTC GCTCACCGAC TGATGAACTA TGGATGCGGA GAGGGGATCT TTATGGGTCT GCCCACCATG GCGACTGCGG ATTCCCTGTA TGGCAGAATT GCACAGACCT ACCAGCGGAT GTATACGGGC GATCAGCGGG CATCTGTTGT ACTGGCCACC AGTGCGAGAG ATCTATCAGA TCTCTTTAAA AAATCGGTGC TGCCACCAGG ACAGTGGAGC CATGAACAGT ACAAGCCCGG AGAGGAGACT GCATCTGCTA CCTGTACAAT CTGGCTTGCC CAGAACCGAA AGAAAGCGCT GCTCGCTCAG GTGGGTGTCG GAACCATCGA CCAGGCATTA ATGGCTGTAC TTCCATTTCG TCACCAGTCT CTGCGGCTGC TCGGATTGGC CCGAAATATT CTGATCGTCG ATGAGGTACA TGCGTATGAT CCGTACATGC ACACAGTCCT CTGCGGATTG CTCAAGTTTC AAGCAGCATT TGGAGGCAGC GCGATCCTCC TATCAGCAAC CCTGACCATG GCACAGAGAC AGGATCTGAC CAGTGCCTTC TGTGCAGGAC TCGGACGGAG AGCAAAAACT CTGACCGATG AGACGTATCC CCTGATCACC ATGGCTACGG CCCAGGATGT AACGGAGACT CCAATCGATT CATCTGCTGC CAGGGTTCGA ACGGTCAGGG TGCAGATGGT CGATGATTCT GCTGATGTCC TAGAACAGAT CGTCAGAGCT GCTGGTGATG GCCGATGTGT CTGCTGGGTC AGAAACACCA TCGACGATGC GATCAACGCC TTCAATGAAC TCAACACGAG GCTGGAGAGC AGACAGGTGC TGCTCTTTCA TGCCAGGTTT GCCCTGGGTG ATCGTCTGGA TATCGAGAAA AAGGTACTGG ACACTTTTGG AAAAGAGAGT CTGGATCCAA TCAGACGGGG GATGGTCCTG GTCGCCACCC AGGTGGTGGA ACAGTCACTT GATCTCGACT TCGATCTGAT GATAACCGAT CTCGCTCCAA TGGACCTGAT CATTCAGCGG GCCGGCAGGC TTCACCGTCA TCCGCGAGGA GATCGGGGGG AGGCCGTAAT GGTAGTGCTG GCTCCCCCTC TGACCCGTAC ACCAGATCCT GACTGGTATA AGCGGGTCTT TCCAAAGGGT GGATATGTCT ACCCCAACCA TGGGCAGCTC TGGCTGACGG CTCAACTGCT CGATACAAAG GGAAAGATTG TGATGCCTGA TGGGGCCAGA GATCTCATCG AAGGGGTATT TGGAGATGGT GCACAGATCA GGATTCCATT GTCACTCCTA CCGCGCGAGG TCCTGGTGGA CGGAAAAAAG CGGGGAGACA TTTCAATCGC ACGCCTGAAT ACCCTTGAAG TGTCCGATGG ATACCGGAGA ACCCCCACGC AGTGGGTGGA GGAGGCCCAC GTCTCCACCC GACTCGGTCA GCTCACCAGC ACCCTCGTAC TGGCCAGATG GGACGGCACC ACGCTGACGC CATGGTATTC CTCCAACCAG AACGCATGGG ACCTGAGTCA GGTCCATATT GCAGAGAAGA AGGTTGCCAG CGCAGCAGTG TTTGAAGGTG AACTCGGTGC GGCGGTGAAA CGATTAGAAG ACCAGATTCC GGGTCTTGGT AAATGGTTGA TTCTCGTTCC ACTCAGTTGT ACCAGTGCTG GTACATGGGA AGGGCCTGCC AGAAACGATG CGGACGAGAA TGTTACGGTG ATCTATGATC CTGTTTTGGG TTTCATGATG AAAAATTAG
|
Protein sequence | MKTGSDNYYS YWGKAQREGE PEKGAGYHLL VYHCLDVAAV GQKLLEQDPL LMHRFTALTG FNDDQVLHLI PFLLALHDLG KFSDRFQNLK PDLMELLQGN RGSRRYPIRH DSMALYLFEQ DIRDLAWKKN WFWMDHDLPY AEDEWFEIFA PLFRAVSGHH GEPPQHDETV SVDSLFLDGD RAAARDFAVQ AATLLLGSGS YPPIEYSEGW FEQFSTASWL LAGLVTLSDW LGSNRTFFPF QTEAMDLAVY WKNFAQKGAE DAVRDAGVLP ATVSSNCGMA GLFPPDLPPK ITSPSPIQTY LSSCSLTPEP RLTIIEETTG GGKTEAALVL AHRLMNYGCG EGIFMGLPTM ATADSLYGRI AQTYQRMYTG DQRASVVLAT SARDLSDLFK KSVLPPGQWS HEQYKPGEET ASATCTIWLA QNRKKALLAQ VGVGTIDQAL MAVLPFRHQS LRLLGLARNI LIVDEVHAYD PYMHTVLCGL LKFQAAFGGS AILLSATLTM AQRQDLTSAF CAGLGRRAKT LTDETYPLIT MATAQDVTET PIDSSAARVR TVRVQMVDDS ADVLEQIVRA AGDGRCVCWV RNTIDDAINA FNELNTRLES RQVLLFHARF ALGDRLDIEK KVLDTFGKES LDPIRRGMVL VATQVVEQSL DLDFDLMITD LAPMDLIIQR AGRLHRHPRG DRGEAVMVVL APPLTRTPDP DWYKRVFPKG GYVYPNHGQL WLTAQLLDTK GKIVMPDGAR DLIEGVFGDG AQIRIPLSLL PREVLVDGKK RGDISIARLN TLEVSDGYRR TPTQWVEEAH VSTRLGQLTS TLVLARWDGT TLTPWYSSNQ NAWDLSQVHI AEKKVASAAV FEGELGAAVK RLEDQIPGLG KWLILVPLSC TSAGTWEGPA RNDADENVTV IYDPVLGFMM KN
|
| |