Gene Sare_1971 details


Gene Information

Locus tag: Sare_1971
Symbol:
ID: 5705218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2268336
End bp: 2271125
Gene Length: 2790 bp
Protein Length: 929 aa
Translation table: 11
GC content: 66%
IMG OID: 641271476
Product: CRISPR-associated helicase Cas3
Protein accession: YP_001536847
Protein GI: 159037594
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR01587] CRISPR-associated helicase Cas3
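
The coordinates and lengths in this table can be cross-checked against each other. The sketch below assumes the start and end positions are inclusive, 1-based coordinates and that the gene length includes a single terminal stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2268336
END_BP = 2271125


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "2790 929"
```

Both results agree with the Gene Length (2790 bp) and Protein Length (929 aa) fields.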


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.685894
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGGGGTCG ACCATGACGG GGTCGATGCT GGGATCTGGG GCAAGTCTAA CCATCTCCCG 
AGACCGTATC CGTTGCTCTG GCATTTGATT GATACCGCGG CCGTTGCTGG TGCGTTGTGG
GATCGCTATC TGGCCCCGAA CCAACGGAGG GTCATCGCGG AGGGGCTGGC CACCGACGAC
AGTCACGCTA AATCGTTGAT CATGTTTTGG GCGGGATTAC ACGATGTCGG CAAGGCAACC
CCAGGGTTTC AGAAGAAGGA CGGGAGCGCG TTCGATGCGC TGCGGAGTGC CGGATCGTAT
GAGGATGTTG TAGGCGATGG CTCGTTGAGG CACGACACCG CTGGTCAGTT GGCCATGCCG
GATCTTCTTG CTCCTTTGGG GTATCCGCCG TTGGTCAGGC GGGTCAGTGA CCGCTTGGCC
TACCGCGTCG CTCAGATCGT CGGCGGCCAT CACGGGCGAT TCCAGCAAAC CAGTGGTCCC
CATGGTGGCC GGTTGCAGCT GGGCGAGGGC GGCTGGTCGA AGCAGCGCGC CGCCCTAGTG
CGGGTGGTGC ACGAGGCCGT GGGTTCACCC GAGCGGCCGC CGGCGATAGG TGCCGCGGCC
AGTGTGCTGG TCACGGGGGT CGTGATCCTC GCGGACTGGC TGGCCAGCCA GGAGCACTTC
TTGCGTCAGC AGTCTTCACG AGTGCCGCAG TACACGGGCG CCGAAGCTGT GTCGACTCAT
CTCGCAGTCT TATCCGCGCC CGCTGCCGCC CTGCTCGACG ACGCCGGCTT GGGGTCACCG
TCGTATCGGG CGTTGGGTTT CGAGGATCTG TTCCCGCACC GTCCGAACGC CCTTCAGCGG
TCGATCATTG AGGAGTTGCT GCCGGTGGTC GACAGTCCGG GGCTGTTGGT CGTGACGGCG
GCTACGGGGG ACGGGAAGAC CGAAGCCGCT TTGGTGGCTG CCAAAAGGTT GGCCGAGGTA
TGCGGGGCGC ACGGGTTCTT CTTCGCTTTG CCGACGATGG CGACTTCCGA CGAGATGTAT
CGCCGGGTAC GAGCGTTTGC CGCCCGGCTT GCCGCCGGTC CGGCGCCGGT CACGCTGCTG
CACAGCATGT CGTGGTTGAA CGCTGAGTAC GAGGCTCGGG CGGGTGTCGA CCCCGGTGAT
GCGGCTGAGG TGGTGTCCGA TGAACCCAGT GGTGATGTGG TGGCACCGGA GTGGCTGCGC
GGCCCCAAGC GGGGGCTGCT GGCGCCCATG GCGGTGGGTA CGGTGGACCA GGCGTTGCTT
GCCGTGCTGA CCACCAAGCA CAACGCGCTG CGTCTGCTCG CTCTCTCGGG CAAGGTGTTC
GTCGTTGATG AGGCGCATGC GTACGACGAC TTTACGACCT CCCTGCTGTG TGGCCTGTTG
AACTGGTTGG GGGCCTACGG ATGCCCGGTC GTCCTGCTGT CGGCGACGTT GCCCTCGTCG
CAGGCGGCGA CTCTTGTAGG GGCGTACGAG GAGGGTGCTG GCGTCGGGCC GACCAACACG
TCTGTGCCCT ATCCGGGTTG GACGTTCGTG CCGGCGATCC AGGGGCGTGA CACGGTTTCG
ATCTCACCAA CGATGCGGGA GGCCCTGGTC TCGGGTAGGA CCGTGGAGTT CACGCTCGAC
GTACGCCCTG TTCGTCACCT CGTCGGGTCG GAGAGCGACC CAGCGGATCG TAGGGCCGTG
GTGCGGCAGG TGCTGGCGCC GGTGGCCGAG CAGGGTGGGT GCGCTGCCGT GGTCTGCAAC
ACGGTCAACG ACGCCCAGCA GACATACCGC GTCGTTCGTG ACTGGGCGTC CGGGGGAGTG
GATGTGGTGC TGCTGCACTC CCGTTTCCCG GCTCGGCGCC GGGAGGAGAT TACGACGCAG
ATCATCGGTC GGCTGGGGAA GGACGGCCAT CGCCCTAGAT CGATGATCGT GGTTGCTACT
CAGGTGCTAG AGCAGTCACT TGACCTGGAC TTCGACCTGT TGGTGACGGA TCTGGCGCCG
ATGGCGCAGC TGTTGCAGCG TGCCGGCCGG TGTCATCGAC ACCGCAGGAC GAGGCCGCCT
TGGGCCGCGC AACCGCGGGT GGTGGTTCTC GACCCGCGGG GCCCGGACGG TGTCGAGCAT
GTCAGGCCGG CGGCTTGGGG AGACGTGTAC GCGCGGTACC TGCTGCGGGC TACGCACCTT
CGGCTGGCGG GGACCGACAG GATCGCCGTT CCGCACGCGG TGCAGAACCA AGTGGAAGCG
GTGTATTCGC CACAGTCAGT CGACCACGAT CTCGTGCTAA GCGCCGAATA CGACGAGTAC
CGCGCCGAGG CGATGGCCCT ACGTGGGATG GCTGAACTCG GTGTCATTCC TGATCCTGCT
CAGGTGGCCG ACCTGGCAGC CCTGAGTCTT ACCGAAACGC CGGAGTGGCG TGCGTCAACC
CGCCTGGGAG CTGAGTCAGA GCGGGTGCTC TGCTGCTACC TCGACGCTGG CGGCAGGCAG
TGGCTCGACC CCGAGCGGCG CATCCTTCTA CCGCAGCGCG GGGCGAGACC CGACGGCCGC
TTCTCCAGCC AAGAGGTACG GATGATCCTC CAGGAGACGA TCCCGCTCCG AACCGACATC
CTCATGGGAT ATGACCCACC CGCCATCTTG CCAGCATCCT GGAGCAACAA CGCCTGGCTG
AAGCGTCTCC GGCCGCTGTG GCTCCCGATC CTCCCCGACG GGCCTGGGTC GGCCGCTTGG
GCTCGTCGAT CAGTGCACCT TGACCCCGAA CTCGGGCTGG TAGTTCTCTC CTCTTCGGAA
GGACGGCAGC CTGGTGTGGT GAGTGAGTGA
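
The gene sequence is laid out in blocks of 10 bases, 60 per line, so the spacing has to be stripped before any computation. The following is a minimal sketch of that cleanup and of a GC-fraction calculation like the one behind the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring layout whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied with its original block spacing.
first_line = "TTGGGGGTCG ACCATGACGG GGTCGATGCT GGGATCTGGG GCAAGTCTAA CCATCTCCCG"
print(len(clean_sequence(first_line)))  # prints 60
print(f"{100 * gc_fraction(first_line):.0f}% GC in the first line")
```

Applying `gc_fraction` to all sequence lines joined together gives the whole-gene value to compare with the GC content field.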
 
Protein sequence
MGVDHDGVDA GIWGKSNHLP RPYPLLWHLI DTAAVAGALW DRYLAPNQRR VIAEGLATDD 
SHAKSLIMFW AGLHDVGKAT PGFQKKDGSA FDALRSAGSY EDVVGDGSLR HDTAGQLAMP
DLLAPLGYPP LVRRVSDRLA YRVAQIVGGH HGRFQQTSGP HGGRLQLGEG GWSKQRAALV
RVVHEAVGSP ERPPAIGAAA SVLVTGVVIL ADWLASQEHF LRQQSSRVPQ YTGAEAVSTH
LAVLSAPAAA LLDDAGLGSP SYRALGFEDL FPHRPNALQR SIIEELLPVV DSPGLLVVTA
ATGDGKTEAA LVAAKRLAEV CGAHGFFFAL PTMATSDEMY RRVRAFAARL AAGPAPVTLL
HSMSWLNAEY EARAGVDPGD AAEVVSDEPS GDVVAPEWLR GPKRGLLAPM AVGTVDQALL
AVLTTKHNAL RLLALSGKVF VVDEAHAYDD FTTSLLCGLL NWLGAYGCPV VLLSATLPSS
QAATLVGAYE EGAGVGPTNT SVPYPGWTFV PAIQGRDTVS ISPTMREALV SGRTVEFTLD
VRPVRHLVGS ESDPADRRAV VRQVLAPVAE QGGCAAVVCN TVNDAQQTYR VVRDWASGGV
DVVLLHSRFP ARRREEITTQ IIGRLGKDGH RPRSMIVVAT QVLEQSLDLD FDLLVTDLAP
MAQLLQRAGR CHRHRRTRPP WAAQPRVVVL DPRGPDGVEH VRPAAWGDVY ARYLLRATHL
RLAGTDRIAV PHAVQNQVEA VYSPQSVDHD LVLSAEYDEY RAEAMALRGM AELGVIPDPA
QVADLAALSL TETPEWRAST RLGAESERVL CCYLDAGGRQ WLDPERRILL PQRGARPDGR
FSSQEVRMIL QETIPLRTDI LMGYDPPAIL PASWSNNAWL KRLRPLWLPI LPDGPGSAAW
ARRSVHLDPE LGLVVLSSSE GRQPGVVSE
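
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this with a hand-built codon table. It assumes the standard amino acid assignments and the start codon set usually listed for table 11, and `translate_cds` is a hypothetical helper, not part of any library.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed start codon set for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text: str) -> str:
    """Translate a coding sequence; a recognised first codon is read as M."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 18 bases of the gene sequence, with the original block spacing.
print(translate_cds("TTGGGGGTCG ACCATGAC"))  # prints "MGVDHD"
```

The result matches the first six residues of the protein sequence. Translating the last line of the gene sequence the same way gives GRQPGVVSE, which matches the end of the protein and stops at the terminal TGA.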