Gene Information |
Locus tag | Caci_3909 |
Symbol | |
ID | 8335262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4432703 |
End bp | 4437325 |
Gene Length | 4623 bp |
Protein Length | 1540 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644957035 |
Product | CRISPR-associated protein, Cse1 family |
Protein accession | YP_003114638 |
Protein GI | 256393074 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3; [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 |
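
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the stop codon, neither of which the record states. The function names `feature_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
gene_len = feature_length(4432703, 4437325)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)
```

Under those assumptions the computed values agree with the Gene Length (4623 bp) and Protein Length (1540 aa) fields. The strand does not affect the arithmetic, because the coordinates are given on the replicon regardless of orientation.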
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0587749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0865606 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTCAATG TCGGCAGCAC GAGATGTTGG GGGGATGGTG GGTTGCGTAA CGCCGCAGAG GACCTGTCCG CAGCAACGCG ATCGGCATGG GCAAAGTCCG ACCCGGATTC AGGGCAAAGT CTGTCCCTGA TTCGGCATCT TGCGGACTCT GCGGCGATCG CTGAACACCT GTGGGACCAA TGGCTTCCTG ACCATGTTAA GAGCCTTATT GCTGAGGGCC TTCCGGAAGG GCTGGTCGAT GGCAGGACTC TGGCGGTATG GCTCGCAGGA ACCCACGACA TAGGGAAGCT AACGCCGGCA TTTGCCTGCC AGTGCGAACC GCTCGCACAG GCAATGCGTG AATGTGGCCT TGATATGCCG ACACGCACGC AGTTCGGCGA CGACCGACGG GTGGCGCCGC ATGGGCTCGC GGGCCAGGTA CTGCTGCGTG AGTGGCTCAT GGAGCGGCAT GGTTGGTCCG GACGATCCGC CGATGCGTTT ACGGTCATTG CGGGTGGACA CCACGGGGTG CCCCCGAGCT ATTCGCAGCT TCATGATCTT GATGCGTATC CGGAACTTCT TCGCACGCCT GGGGCAAGCG AGGGCATATG GAAGTCATCG CAGCATGAGC TGCTTGATGC GTGTGCTGTG ATGACCGGAG CGTCGAGTCG TCTGGCGCAC TGGCGCGGTT TGAGGCTTTC GCAGCAGGCT CAGGTACTGC TGACTGGTCT GGTCATCGTT GCTGACTGGA TCGCTAGCAA CACCGATCTG TTCCCTTATC CGGCTCTGGG AACGGGCGAG GCGGCGATCG ATCCTGGTAA GCGCGTTGAG TTGGCCTGGC GGGGCCTGGA GTTGCCAGCA CCGTGGGCGC CCAAATATCT GATGCCTGGC ATGCAGGGTT TGCTTGCTAG CCGGTTCGGG CTGCCGGCGG ACGCGCAGCT GCGTCCGGTC CAGCAGATGG CTGTGCAGTT GGCGTCTGCT AATGCCGCGC CAGGCCTGTT GGTCATCGAG GCGCCGATGG GGGAAGGGAA GACCGAAGCT GCGCTTCTTG CTGCGGAAAT CTTGGCGGCC CGGAGTGGCG CGGGTGGGGT ATTTCTGGCG CTGCCGACAC AGGCCACCAG CAATGCGATG TTTGCACGGG TGGTGAACTG GCTTCGCCAG GTGCCGCGCG AAGGCGTTGC CTCCGTTCAT TTGGCACATG GAAAGGCGGC GTTGGATGAC GCCTTCGCCT CGTTTCTGCG GGCTGCACCG AGGCTGACCT CTATTGACGC GGACGGGTAC GCCGGTGAGG CCAATGTTCG CCGCGACAGG CGAGCAGGCT CGGCGGATAT GGTGGCTCAT CAGTGGCTTC GTGGCCGCAA GAAGGGGATC TTGTCGCCGT TCGTTGTCGG CACGATCGAC CAGTTGCTGT TCACGGGCTT GAAGTCGCGG CATCTCGCGC TGCGACATCT GGCGGTGGCG GGCAAGGTCG TCGTCATCGA CGAGGTGCAC GCCTACGACG CCTACATGAG CGTGTATCTG GAGCGGGTGC TGTCCTGGCT CGGCGCCTAC CGGGTCCCGG TGGTGTTGTT GTCAGCGACG TTGCCGGCCG ACCGTCGGCA GGCTCTCGTC GAGGCCTACG GCGGCATCAC GTCGGAGGCG CTGCGGGATG CCCGTGAAGC GTACCCGGTC CTGACAGCTG TGACGATAGG TGCGCCGGCT CAGGCGGTCG GTACCGAGCC GGCCGAGGGT CGCCGCGTCG ACGTGAATGT GGAGGCGTTC GACGACGACT TGGGCCGACT TGCCGATCGT TTGGAGGCTG AGCTGGTCGA CGGGGGTTGC 
GCGCTGATCA TCCGCAACAC TGTCGGCCGG GTCTTGCAGA CGGCTCAGCA GCTGCGGGAA CGCTTCGGCG CCGGGCAGGT GACCGTGGCG CACTCGCGGT TCATTGACCT GGACCGGGCG CGTAAAGATG CCGATCTACT GGCTCGGTTC GGCCACGACG GTGCACGGCC ACGACGACAC ATCGTTGTGG CCAGCCAGGT CGCGGAACAA TCACTTGACA TTGACTTTGA TTTGCTGGTC ACCGATCTCG CACCCATCGA CCTCGTTCTC CAGAGGATGG GACGGGTGCA CCGACATCAT CGCGGCGGTC CGGAGCAATC GGAGCGTCCG CCCAGCCTAC GCACGGCTCG ATGTCTGGTG ACCGGGGTGG ATTGGGCGGG TATTCCGTCG GCACCGATAG CAGGTTCGGT GGCGGTCTAC GGGCTGCACC CGTTGCTGCG TAGCCTGGCT GTCCTGCAGC CGTACCTGAC AGGAAGCGCG CTGACGTTGC CGGGTGACAT CAATCCCCTG GTTCAGTGCG CGTATGCGCA GAGTTTTGTC GCACCGACGG GTTGGGGCGA GGCTATGGAC GCCGCGCAAG CCGAGCACAT GGCGCACATA GTGCAGCAGC GCGAGGGGGC CATGGCATTC TGCCTCGACG AGGTCCGCGG GCCGGGGCGA TCTCTCATCG GGTGGATCGA CGGCGGCGTT GGCGATGCGG ACGATACCCG GGCTGGTCGG GCGCAGGTTC GCGACAGCCC GGAGACGATC GAGGTGCTAG TCGTCCAACG AGGCAGCGAT GGTGTCCTGC GGACGCTGCC GTGGCTGGAT CGGGGCCGTG GAGGGCTCGA GCTGCCGACG GAAGCTGTGC CGCCGCCGCG CGCAGCCCGA GCGGCGGCCG CCAGCGCGCT GCGTCTGCCC GGGTTGTTCG CCAAACCGTG GATGTTCGAT CGGGTGCTTC GGGAACTTGA GCGCGAGTAT CACGAGGCCT GGCAGGCGAA GGAGAGCTCG TGGCTGCAAG GCGAGTTGCT TCTTGTGCTT GATGAGGAGT GCCGGACCGT CTTGGCCGGC TACGAGTTGT CCTACAACCC GGATGACGGT TTGGAGATGG TGATGCCTGG TGAACCGCAT GCCGCTGTAG TACGGGACAA GGAGGCCTCG GATGACAAGA CGGCTTCGTT CGATCTGACC TCAGCGCCGT GGCTCCCGGT GTTGTACGCC GACGGTATGC AGGGTGTGCT GTCGCTGCGA GATGTGTTCG CCCAGTCGAA CTTGATCCGC AGGTTGGTCG GGGATCTTCC GACCCAGGAC TTTGCGCTGC TACGTCTGCT ACTGGCCGTC CTATATGACG CGGTAGACGG GCCGCGCGAC GGTCAGGACT GGGAGGACCT GTGGACCTCG GATGACCCGT TCGCAGCGGT GCCAGCCTAT CTGGACAGCC ACAGGGAGCG GTTCGATCTG CTGCACCCTG CCACGCCTTT CTATCAAGTT CCCGGGCTGC AGACAGCGAA AGGTGAAGTA GGGCCGCTCA ACAAGATCGT GGCCGACGTG CCGGACGGTG ATCCGTTCCT CACCATGCGC ATGCCCGGTG TCGAACAGCT CAGTTTCGCT GAGGCCGCCA GATGGCTGGT TCACACACAA GCGTTCGACA CCTCGGGGAT CAAGTCGGGT GTCGTCGGCG ATCCCAAAGC GGTGAACGGC AAGCGGTATC CGCAAGGTGT CGCCTGGCTA GGCAACCTCG GCGGGGTATT CGCCGAAGGC GACACGCTGC GCCAGACCCT GCTGCTCAAC CTCATCCCTG CAGACACCAC GAATCTGCAG GTCACCTCCG 
CCCAAGATGT ACCCGCGTGG CGAGGTACAA ACGGCAGGGC CGGGAGCGAC CATGCTGACG CTGAACCCCG TGTTCCTGCT GGGCTACGCG ACTTGTATAC CTGGCAGTCG CGGCGTATCC GGCTGGAGTA TGACACCCGC GGCGTCACTG GGGCTGTGCT GACCTACGGC GACGAACTCA CTGCGCACAA CAAGCATGGC GTGGAACCGA TGACAGGCTG GCGGCGAAGC AAGCCCCAGG AGAAGAAACT CGGCCTATCC ACGGTCTACA TGCCGCAGCA GCATGATCCC ACGCGTGCCG CCTGGCGGGG GATCGAGTCT CTGTTGGCTG GGAGCGCAGG TAGCGGAAGC AGCCAGACTG GGGAACCGGC CTCCCACTAC CGCCCCAAGA TCGTGGATTG GCTCGGCGAG CTGGCCCACC ACGGCAACCT GCCAAGCCGC GGACTGATAC GAGTACGTAC ATCAGGCGCC GTGTACGGAA CACAACAGTC GATCATCGAC GAAGTGGTCA GCGATGAACT GACTATGGCC GTCGTGCTTC TCCACGAAGA CGACCCCCGC TTCGGTAAAG CCGCCGTCAC AGCAGTCAAG GACGCGGACT CCGCAGTTGC CGCTCTAGGC GACCTGGCCA GCGACCTTGC CCGGGCCGCG GGACTCGACC CGGAACCAGA ACGCGTCACC GCGCGGGACC GAGCGTTCGG GGCCCTTGAC GGCCCGTACC GCCGTTGGCT GCTGGACCTG GGCAACAGCA CTGACCCGGC CGCTATGCGC GCCGTGTGGC AGGGGCGGGT CTACGACATC ATCGCAGTGC AGGGCCAGAT GCTCTTGGAC TCAGCTGGCT CGGCAGCCGC ACAGGGCCGG ATGGTGAAGA CCACCCGCGG GGAGCGCTGG ATGGACGATT CCTTGGCTGA CTTGTACTTC AAAGGCCGCA TCGCCAAGGC GCTCAGTAGC CGTCTCGGTA AGAAGCCAAC CGATCCCGGC GAACCCGTAG GCATACAGGA GGACCCGGCA TGA
|
Protein sequence | MFNVGSTRCW GDGGLRNAAE DLSAATRSAW AKSDPDSGQS LSLIRHLADS AAIAEHLWDQ WLPDHVKSLI AEGLPEGLVD GRTLAVWLAG THDIGKLTPA FACQCEPLAQ AMRECGLDMP TRTQFGDDRR VAPHGLAGQV LLREWLMERH GWSGRSADAF TVIAGGHHGV PPSYSQLHDL DAYPELLRTP GASEGIWKSS QHELLDACAV MTGASSRLAH WRGLRLSQQA QVLLTGLVIV ADWIASNTDL FPYPALGTGE AAIDPGKRVE LAWRGLELPA PWAPKYLMPG MQGLLASRFG LPADAQLRPV QQMAVQLASA NAAPGLLVIE APMGEGKTEA ALLAAEILAA RSGAGGVFLA LPTQATSNAM FARVVNWLRQ VPREGVASVH LAHGKAALDD AFASFLRAAP RLTSIDADGY AGEANVRRDR RAGSADMVAH QWLRGRKKGI LSPFVVGTID QLLFTGLKSR HLALRHLAVA GKVVVIDEVH AYDAYMSVYL ERVLSWLGAY RVPVVLLSAT LPADRRQALV EAYGGITSEA LRDAREAYPV LTAVTIGAPA QAVGTEPAEG RRVDVNVEAF DDDLGRLADR LEAELVDGGC ALIIRNTVGR VLQTAQQLRE RFGAGQVTVA HSRFIDLDRA RKDADLLARF GHDGARPRRH IVVASQVAEQ SLDIDFDLLV TDLAPIDLVL QRMGRVHRHH RGGPEQSERP PSLRTARCLV TGVDWAGIPS APIAGSVAVY GLHPLLRSLA VLQPYLTGSA LTLPGDINPL VQCAYAQSFV APTGWGEAMD AAQAEHMAHI VQQREGAMAF CLDEVRGPGR SLIGWIDGGV GDADDTRAGR AQVRDSPETI EVLVVQRGSD GVLRTLPWLD RGRGGLELPT EAVPPPRAAR AAAASALRLP GLFAKPWMFD RVLRELEREY HEAWQAKESS WLQGELLLVL DEECRTVLAG YELSYNPDDG LEMVMPGEPH AAVVRDKEAS DDKTASFDLT SAPWLPVLYA DGMQGVLSLR DVFAQSNLIR RLVGDLPTQD FALLRLLLAV LYDAVDGPRD GQDWEDLWTS DDPFAAVPAY LDSHRERFDL LHPATPFYQV PGLQTAKGEV GPLNKIVADV PDGDPFLTMR MPGVEQLSFA EAARWLVHTQ AFDTSGIKSG VVGDPKAVNG KRYPQGVAWL GNLGGVFAEG DTLRQTLLLN LIPADTTNLQ VTSAQDVPAW RGTNGRAGSD HADAEPRVPA GLRDLYTWQS RRIRLEYDTR GVTGAVLTYG DELTAHNKHG VEPMTGWRRS KPQEKKLGLS TVYMPQQHDP TRAAWRGIES LLAGSAGSGS SQTGEPASHY RPKIVDWLGE LAHHGNLPSR GLIRVRTSGA VYGTQQSIID EVVSDELTMA VVLLHEDDPR FGKAAVTAVK DADSAVAALG DLASDLARAA GLDPEPERVT ARDRAFGALD GPYRRWLLDL GNSTDPAAMR AVWQGRVYDI IAVQGQMLLD SAGSAAAQGR MVKTTRGERW MDDSLADLYF KGRIAKALSS RLGKKPTDPG EPVGIQEDPA
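
The sequences above are printed in space-separated groups of ten characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the input is only the first three groups of the gene sequence, so the result is not expected to match the 64% reported for the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the display whitespace from a grouped sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base groups of the gene sequence in this record (illustration only).
example = "GTGTTCAATG TCGGCAGCAC GAGATGTTGG"
seq = clean_sequence(example)
print(len(seq), round(gc_fraction(seq), 3))
```

The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is an accepted alternative start codon and is read as methionine at the initiation position.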
|
| |