Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1971 |
Symbol | |
ID | 5705218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2268336 |
End bp | 2271125 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641271476 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_001536847 |
Protein GI | 159037594 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.685894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGGGTCG ACCATGACGG GGTCGATGCT GGGATCTGGG GCAAGTCTAA CCATCTCCCG AGACCGTATC CGTTGCTCTG GCATTTGATT GATACCGCGG CCGTTGCTGG TGCGTTGTGG GATCGCTATC TGGCCCCGAA CCAACGGAGG GTCATCGCGG AGGGGCTGGC CACCGACGAC AGTCACGCTA AATCGTTGAT CATGTTTTGG GCGGGATTAC ACGATGTCGG CAAGGCAACC CCAGGGTTTC AGAAGAAGGA CGGGAGCGCG TTCGATGCGC TGCGGAGTGC CGGATCGTAT GAGGATGTTG TAGGCGATGG CTCGTTGAGG CACGACACCG CTGGTCAGTT GGCCATGCCG GATCTTCTTG CTCCTTTGGG GTATCCGCCG TTGGTCAGGC GGGTCAGTGA CCGCTTGGCC TACCGCGTCG CTCAGATCGT CGGCGGCCAT CACGGGCGAT TCCAGCAAAC CAGTGGTCCC CATGGTGGCC GGTTGCAGCT GGGCGAGGGC GGCTGGTCGA AGCAGCGCGC CGCCCTAGTG CGGGTGGTGC ACGAGGCCGT GGGTTCACCC GAGCGGCCGC CGGCGATAGG TGCCGCGGCC AGTGTGCTGG TCACGGGGGT CGTGATCCTC GCGGACTGGC TGGCCAGCCA GGAGCACTTC TTGCGTCAGC AGTCTTCACG AGTGCCGCAG TACACGGGCG CCGAAGCTGT GTCGACTCAT CTCGCAGTCT TATCCGCGCC CGCTGCCGCC CTGCTCGACG ACGCCGGCTT GGGGTCACCG TCGTATCGGG CGTTGGGTTT CGAGGATCTG TTCCCGCACC GTCCGAACGC CCTTCAGCGG TCGATCATTG AGGAGTTGCT GCCGGTGGTC GACAGTCCGG GGCTGTTGGT CGTGACGGCG GCTACGGGGG ACGGGAAGAC CGAAGCCGCT TTGGTGGCTG CCAAAAGGTT GGCCGAGGTA TGCGGGGCGC ACGGGTTCTT CTTCGCTTTG CCGACGATGG CGACTTCCGA CGAGATGTAT CGCCGGGTAC GAGCGTTTGC CGCCCGGCTT GCCGCCGGTC CGGCGCCGGT CACGCTGCTG CACAGCATGT CGTGGTTGAA CGCTGAGTAC GAGGCTCGGG CGGGTGTCGA CCCCGGTGAT GCGGCTGAGG TGGTGTCCGA TGAACCCAGT GGTGATGTGG TGGCACCGGA GTGGCTGCGC GGCCCCAAGC GGGGGCTGCT GGCGCCCATG GCGGTGGGTA CGGTGGACCA GGCGTTGCTT GCCGTGCTGA CCACCAAGCA CAACGCGCTG CGTCTGCTCG CTCTCTCGGG CAAGGTGTTC GTCGTTGATG AGGCGCATGC GTACGACGAC TTTACGACCT CCCTGCTGTG TGGCCTGTTG AACTGGTTGG GGGCCTACGG ATGCCCGGTC GTCCTGCTGT CGGCGACGTT GCCCTCGTCG CAGGCGGCGA CTCTTGTAGG GGCGTACGAG GAGGGTGCTG GCGTCGGGCC GACCAACACG TCTGTGCCCT ATCCGGGTTG GACGTTCGTG CCGGCGATCC AGGGGCGTGA CACGGTTTCG ATCTCACCAA CGATGCGGGA GGCCCTGGTC TCGGGTAGGA CCGTGGAGTT CACGCTCGAC GTACGCCCTG TTCGTCACCT CGTCGGGTCG GAGAGCGACC CAGCGGATCG TAGGGCCGTG GTGCGGCAGG TGCTGGCGCC GGTGGCCGAG CAGGGTGGGT GCGCTGCCGT GGTCTGCAAC ACGGTCAACG ACGCCCAGCA GACATACCGC GTCGTTCGTG ACTGGGCGTC CGGGGGAGTG GATGTGGTGC TGCTGCACTC CCGTTTCCCG GCTCGGCGCC GGGAGGAGAT TACGACGCAG ATCATCGGTC GGCTGGGGAA GGACGGCCAT CGCCCTAGAT CGATGATCGT GGTTGCTACT CAGGTGCTAG AGCAGTCACT TGACCTGGAC TTCGACCTGT TGGTGACGGA TCTGGCGCCG ATGGCGCAGC TGTTGCAGCG TGCCGGCCGG TGTCATCGAC ACCGCAGGAC GAGGCCGCCT TGGGCCGCGC AACCGCGGGT GGTGGTTCTC GACCCGCGGG GCCCGGACGG TGTCGAGCAT GTCAGGCCGG CGGCTTGGGG AGACGTGTAC GCGCGGTACC TGCTGCGGGC TACGCACCTT CGGCTGGCGG GGACCGACAG GATCGCCGTT CCGCACGCGG TGCAGAACCA AGTGGAAGCG GTGTATTCGC CACAGTCAGT CGACCACGAT CTCGTGCTAA GCGCCGAATA CGACGAGTAC CGCGCCGAGG CGATGGCCCT ACGTGGGATG GCTGAACTCG GTGTCATTCC TGATCCTGCT CAGGTGGCCG ACCTGGCAGC CCTGAGTCTT ACCGAAACGC CGGAGTGGCG TGCGTCAACC CGCCTGGGAG CTGAGTCAGA GCGGGTGCTC TGCTGCTACC TCGACGCTGG CGGCAGGCAG TGGCTCGACC CCGAGCGGCG CATCCTTCTA CCGCAGCGCG GGGCGAGACC CGACGGCCGC TTCTCCAGCC AAGAGGTACG GATGATCCTC CAGGAGACGA TCCCGCTCCG AACCGACATC CTCATGGGAT ATGACCCACC CGCCATCTTG CCAGCATCCT GGAGCAACAA CGCCTGGCTG AAGCGTCTCC GGCCGCTGTG GCTCCCGATC CTCCCCGACG GGCCTGGGTC GGCCGCTTGG GCTCGTCGAT CAGTGCACCT TGACCCCGAA CTCGGGCTGG TAGTTCTCTC CTCTTCGGAA GGACGGCAGC CTGGTGTGGT GAGTGAGTGA
|
Protein sequence | MGVDHDGVDA GIWGKSNHLP RPYPLLWHLI DTAAVAGALW DRYLAPNQRR VIAEGLATDD SHAKSLIMFW AGLHDVGKAT PGFQKKDGSA FDALRSAGSY EDVVGDGSLR HDTAGQLAMP DLLAPLGYPP LVRRVSDRLA YRVAQIVGGH HGRFQQTSGP HGGRLQLGEG GWSKQRAALV RVVHEAVGSP ERPPAIGAAA SVLVTGVVIL ADWLASQEHF LRQQSSRVPQ YTGAEAVSTH LAVLSAPAAA LLDDAGLGSP SYRALGFEDL FPHRPNALQR SIIEELLPVV DSPGLLVVTA ATGDGKTEAA LVAAKRLAEV CGAHGFFFAL PTMATSDEMY RRVRAFAARL AAGPAPVTLL HSMSWLNAEY EARAGVDPGD AAEVVSDEPS GDVVAPEWLR GPKRGLLAPM AVGTVDQALL AVLTTKHNAL RLLALSGKVF VVDEAHAYDD FTTSLLCGLL NWLGAYGCPV VLLSATLPSS QAATLVGAYE EGAGVGPTNT SVPYPGWTFV PAIQGRDTVS ISPTMREALV SGRTVEFTLD VRPVRHLVGS ESDPADRRAV VRQVLAPVAE QGGCAAVVCN TVNDAQQTYR VVRDWASGGV DVVLLHSRFP ARRREEITTQ IIGRLGKDGH RPRSMIVVAT QVLEQSLDLD FDLLVTDLAP MAQLLQRAGR CHRHRRTRPP WAAQPRVVVL DPRGPDGVEH VRPAAWGDVY ARYLLRATHL RLAGTDRIAV PHAVQNQVEA VYSPQSVDHD LVLSAEYDEY RAEAMALRGM AELGVIPDPA QVADLAALSL TETPEWRAST RLGAESERVL CCYLDAGGRQ WLDPERRILL PQRGARPDGR FSSQEVRMIL QETIPLRTDI LMGYDPPAIL PASWSNNAWL KRLRPLWLPI LPDGPGSAAW ARRSVHLDPE LGLVVLSSSE GRQPGVVSE
|
| |