Gene Information |
Locus tag | CPR_0922 |
Symbol | cas3 |
ID | 4205732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1052341 |
End bp | 1054551 |
Gene Length | 2211 bp |
Protein Length | 736 aa |
Translation table | 11 |
GC content | 25% |
IMG OID | 642565480 |
Product | CRISPR-associated helicase Cas3 domain-containing protein |
Protein accession | YP_698246 |
Protein GI | 110802774 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3; [TIGR01596] CRISPR-associated endonuclease Cas3-HD |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.013511 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGATTTAA GTAAGTTTAC TGCAAAATTA TATAAAATTA TTAAAGAAGA TAAAATTATT AAAGAAAATA AAACTGTTAG AGAGCACACA AATGATTTAT TAAATAACTT AGAATTTTTA AATGATTTAG GATATATAAA TAAAGAATCT ATATATAATT TAACTAAAAT TGCATGTGAA TATCATGATT ATGGAAAGGT TAATAGGGAG TTCCAAAATA GAATAAAAAA TGAAACAAAA TTTAATGAGA AAAAAGAGAT AGCACACAAT GTGCTTTCTC TTTACTTTAT AAACCCAGAT GACTTTGAAG AGGAAAAAGA TTACTATAGA GTATGTTTTG CTGTATTATA TCATCATTAT TATTGTGATG GACTTAAAGT TTTAAGTGAA AAAACTGATT TGATAGAAAA TTTATTAAGT GAATTTGAAA CTTATGAACT TGATGAGTTT ACTCCTATGG AAATAGGTGA TATAAAGAGT GATGATGAAG CTATTTTAAT AAAAGGCTTC TTAAATAAAT GTGATTTTAG TGCAAGTGGA AATACTGAAA TAGAGTATAA AAATAATTTC TTAGAAGAAA GTTTAGAGAG ACTATTAAAT TCTTGGAAGA TAGATAATAA AGAGGCTAAA TGGAATGAAC TTCAACAGTT TGCTAAGAAT AATAGAGATA ATAATATAAT GGTTGTTGCC CAAACTGGAA TGGGTAAAAC TGAAGCAGGT TTGCATTGGA TTGGAAATAA TAAAGGTTTT TTTATTCTTC CATTAAAGAC AGCCATAAAT GCTATATATG ATAGAGTATC TAATGGTATA GTAAAAGAAA ATAACAATGA AGAAAAGCAT AAAGTTGCAT TAATTCATTC AGATTCTTTA TCATATTACA TTTCTCAAAA TGAAGATACA AACATAGTAT TAGATCATCA TAGAGAAGGA AAACAACTAT CTATTCCTAT TAGCATAGCA ACCTTAGATC AAATATTTGA TTTTGTATTT AAGTACAAGG GCTATGAAGT GAAATTAGCT ACTTTATCAT ATTCAAAAGT GGTAATAGAT GAAATACAGA TGTATAGCGC TGATTTATTA GCCTACTTAA TATTAGGAAT AAAAACCATA ATAAGGCTAG GAGGAAAAGT TGCAATTTTA ACAGCTACCT TAGCTCCCTT TGTTAAAGAT TTATTATTAG AAGATAATAA TAAATTAGGC TTTGTAGAAG GTACATTTGT AAATGAATTA AAAAGGCATA ATTTAAAAAT ATATGATGAC AAGATAAATG CAGATGTAAT TTATAATAAG TATATTGATA ATAAAGAGAA GAAAATAAGC AATAAAATAT TAGTTGTATG CAATACGATA AAAAAGGCTC AAGAAATTTA CAATGAACTA AAAGATAAAA ATATAGATAA TTTAAATATT TTACATAGTA AATTTATAAA AAGAGACAGA GCTTATAAAG AAGAGCAAAT ATTAGAGTTT GGGAAAACTA ATAATATTGG AGATGGCATA TGGATATCAA CTCAGATTGT AGAAGCTAGT TTAGATATTG ATTTTGACTA TTTATTTACA GAACTATCAG ATATAAATGC TTTATTCCAA AGACTTGGTA GATGTAATAG AAAAGGGGTT AAGCCTGCTT ATGACTATAA CTGTTATGTT TTCTTACAAA TAGAGAGAAA TTTATTAACT AATGGAGATA AGGGGTTTAT AGATAAAAAA ATATATGAAT TATCAAAAGA AGCATTAAGA AACGTAGATG GTCTTTTATC AGAAGAGGAA AAAGTAAATA TTATAAATAG AACATTAACC 
ACTGAGAATA TAAAGGGAAG TGATTATTGG TCAAAATACA CAGAATTTTT TGATTATGTT AGTGGACTTA ATCCATATGA AGTAGACAAG AAAGATGTTA AACTTAGAAA TATAATATCT TTTGATATAA TACCTCAATG TATTTATGAG AAAAATGAAT CAGAAATTGA AAGCTTACTT GAAATTATAA ATGATGAAAA AACAGATAAA ATAAATAGAA TAAAATCCAT AGATAAAATA AAAAGTTTTA CAGTATCAGT TGGGCAATAT GATATTGAAC TTTTAAAAAA TAAACTTATT ATGGAGTTAG AAGTAGGAAA GTATGAAAGA ATTCCAGTAT ATAAATGCAA TTATTCAGAG TTAGGATTTA CTAGAAACAC TAAAGAAGAT GTATATGATA ATTTCATATA A
Protein sequence | MDLSKFTAKL YKIIKEDKII KENKTVREHT NDLLNNLEFL NDLGYINKES IYNLTKIACE YHDYGKVNRE FQNRIKNETK FNEKKEIAHN VLSLYFINPD DFEEEKDYYR VCFAVLYHHY YCDGLKVLSE KTDLIENLLS EFETYELDEF TPMEIGDIKS DDEAILIKGF LNKCDFSASG NTEIEYKNNF LEESLERLLN SWKIDNKEAK WNELQQFAKN NRDNNIMVVA QTGMGKTEAG LHWIGNNKGF FILPLKTAIN AIYDRVSNGI VKENNNEEKH KVALIHSDSL SYYISQNEDT NIVLDHHREG KQLSIPISIA TLDQIFDFVF KYKGYEVKLA TLSYSKVVID EIQMYSADLL AYLILGIKTI IRLGGKVAIL TATLAPFVKD LLLEDNNKLG FVEGTFVNEL KRHNLKIYDD KINADVIYNK YIDNKEKKIS NKILVVCNTI KKAQEIYNEL KDKNIDNLNI LHSKFIKRDR AYKEEQILEF GKTNNIGDGI WISTQIVEAS LDIDFDYLFT ELSDINALFQ RLGRCNRKGV KPAYDYNCYV FLQIERNLLT NGDKGFIDKK IYELSKEALR NVDGLLSEEE KVNIINRTLT TENIKGSDYW SKYTEFFDYV SGLNPYEVDK KDVKLRNIIS FDIIPQCIYE KNESEIESLL EIINDEKTDK INRIKSIDKI KSFTVSVGQY DIELLKNKLI MELEVGKYER IPVYKCNYSE LGFTRNTKED VYDNFI
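The length fields in this record follow from its coordinates: with 1-based inclusive positions, gene length is End bp minus Start bp plus one, and for an unspliced CDS the protein length is the codon count minus the stop codon. The sketch below is one way to cross-check those fields and to estimate GC content from a pasted sequence. The function names (`gene_length`, `expected_protein_length`, `gc_percent`) are hypothetical helpers written for this illustration, not part of any database tool, and the sketch assumes the CDS ends with a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Values copied from the Gene Information table above.
    length = gene_length(1052341, 1054551)
    print(length)                           # 2211, matching "Gene Length"
    print(expected_protein_length(length))  # 736, matching "Protein Length"
    # Paste the full "Gene sequence" field here to compare with "GC content".
    print(gc_percent("ATGGATTTAA GTAAGTTTAC"))
```

The GC helper strips whitespace first because the sequence field is laid out in space-separated blocks of ten bases.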