Gene Sare_2572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2572 
Symbol 
ID5706144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2926471 
End bp2929431 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content70% 
IMG OID641272035 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_001537405 
Protein GI159038152 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00323142 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCCGATT GGTGCGACCT GAGCGAGGAT GCCCGTCTCG TCTGGGGTAA GACGAACCGT 
GACCGGGGCC TGGTCTGGCT GCCGCTGTTC CGGCATCTCG CCGACAGCGC CGACGTGGCC
GGGCTCTTGT GGGATGTGTG GCTGCCGGTC ACGGTCCGGC AGCGCATCGC CGGGTCTTTT
CCGGGCGGCT TCGACGACGG ACGTGTCCTG GTTCGGTGGC TCGCTGGTGT TCACGACATC
GGAAAGGCCA CGCCCGCTTT CACCTGGCAG GTGGGGCCGT TGCGGGAGGC CATGCGGGAG
CACGGATTCA CCTTCGACCG CCAGGTCGAG GCGGACCGTG GCCGCGCGCC GCACGGGACG
GCCGGGCAGA TCGTGCTCAC GGACTGGCTG ACCGGCCGGC ACGGCTGGGA CCGGGCCTCG
GCCGGGGGGT ACGCGGTGGT CGTGGGCGGT CATCACGGGG TACCGCCGAC CGACGCTGAT
CTGACGGAGA TTCGTGCCCG GCCGTACCTG CTCGGCGCGG GTGGGATCTG GCCTGCAGTG
CAGGATGAAC TGCTCGGTTG GCTGACCCGT CGTGTCGCTG CCACGGACCG GTTGGCCGTA
TGGCGGGAGG TGATCCTGCC GCAGCCGGTG CAGGTGTTGT TGACCGCGAT CGTCATCGTC
GCCGACTGGA TCGCCAGCAA CGAACAGTTC TTCCCGTACG GCTTCCGGCA GGAAGATGCC
TCCGATCGGC TTCAACAGGC GTGGGAGGAG TTGGATCTGC CGACCCCGTG GCGGGCCGTC
GACACCACCG GCGTCGACGC GGCGAGGTTC CTCGCCCAAC GGTTCAAGAT GCCGCCCGGC
GCTCAGCCGT ACCCGGTCCA GGCGGCGGTG CTGGAGCAGG CCCGTGCGAT GCCGCTGCCG
GGGCTGCTGA TCGTCGAGGC ACCGATGGGA GAGGGCAAAA CCGAGGCCGC CCTGGCGGCC
GTGGAAGTCC TCGCCGGGCG AAGCGGGGCG GGCGGGTGCT TCGTGGCGCT GCCCACCAGG
GCGACAAGTG ACGCGATGTT CAGCCGGGTG CTCGGCTGGC TGAGCCGGCT GCCGGACGCC
GACGTCGGTC GGGGTGCCCG AGAGGCGGCG CTGGCCCACG GCAAGGCGAT CCTCAACGAC
GAGTATTCCC GCCTCTATCG CGGAATGTTG CCCAGCGCGA TCGGGGTGGA CGAGGGTGGC
ACGGCTGTCG CCGCGCACGG CTGGCTGGCA GGGCGGAAGC GGAAGATGCT GTCCAGTTTC
GTGGTCGGCA CCATCGACCA GTTGCTCTTT GCCGCGTTGC AGGCTCGTCA TGTGGCACTG
CGGCACCTCG GACTGGCCGG GAAGGTGGTC GTCGTCGACG AGGCCCACGC CTACGACGTC
TACATGAGCC AGTATCTCGA CCGTGCTCTC GAGTGGCTGG GCGCGCACGG CGTTCCGGTG
GTGGTTCTCT CCGCGACCCT CCCGGCGCAT CGGCGCGCCG ACATGATGCA GGCGTACGAC
GTCGGGCGGT TTGGCCCCCG TCGAGGCGCC CGTCGCCGGC GCCGGTTCCG TGCGGGCAGC
GACGAGGCTA CCGACGATGA CCAGGTGCTC CGTGACGAGC GGCGGTATCC GCTACTCAGC
GTCTCAGGTG TGGACCGGGT GCCGACCACG GTGGGATGTG CGGCGTCCGG TCGAAGCTTC
GACGTACGGC TGGAACGCAG CGACGACGAC CTGGCCGCCC TCGCCGACCG GCTCCGGACC
GACCTCACCG ACGGAGGCTG CGTCCTGGTC GTCCGTAACA CCGTCGCCCG GGTGCTCGAG
ACGGCCGACG AATTGCGACG GCTACTCGGC CCAAAAATTC CGGTCAGCGT GGCGCATTCC
CGTTTCATGG CCGCTGATCG GGCGGCCAAG GACACCTGGC TACGGGACAC CTTCGGCCCG
CCCGAGCATC TCACCGAGTG GGGGCGGGCG CGGCCGGCCT GCCACGTCGT CGTCGCCAGC
CAGGTCGCCG AACAGTCCCT GGACATCGAC TTCGACCTGC TGGTGACCGA CCTGGCACCG
GTGGACCTGA TCCTGCAACG ACTCGGCCGC CTGCATCGGC ACCGGCGTGC CGTTCGCCCC
GGCAGGCTGG CACAGCCGCG GTGCCTGGTG ACCGGGGTCG ACTGGGCGAC TGTCCCACCC
GCCCCGGTAC CTGGCTCGGT CACGGTGTAT CAGCACCACG CCCTGCTACG GGCCGCGGCC
GTCCTGCTGC CGTATCTCGA CGGCGACCGG CCCCTGTACC TGCCCCAGGA CATCGCACCC
CTGGTGCAAA CCGCGTACGG CAGAGGGTCG GTGGGTCCGC CGGGGTGGCA GTCCACGTTG
GCCGAGGCGG CGGTCCAGGC CGACGCCCGC GCCGAGGTGG CACGCAAACG CGCGGATACC
TTCCGGGTCG GGCCGGTCCA GGCGCCGGGG GTGAGCCTGG TGGGATGGCT CGACGCCAGC
GTCGGCGACA GCGAGGCCAC GAACGGCAAC GAGTGGCGAG GGCGGGCCCA CGTCCGCGAC
GACGGGGCTG AGGCGTTGGA GGTGCTGATC CTTGTTCGCC GGGGCGAGAA GCTGGTGACC
CCACCGTGGC TCGACCAGGG CGGCGACGTC GAGGTCCCGA CCGACTTCGA GCCCTCGCGG
TCGGTGGCCC GCACGATGCT GCGATGTGCC CTGCCGCTGC CGCAAGCGCT CACCGCGCAC
GGGGGCGTAG ACCGGATCAT TGCCGAGTTG GAGCGGCGGA ACACGTTCCC GGCCTGGGAG
AAAGACCGAC TACTTGGTGG GGAGCTGATC CTCGACCTCG ACGAGCAGGG GCGAAGTCGA
CTCACGGATT TCGTCCTGAC CTACGACCGG GACACCGGGC TACGGGCTAC GGGCTACGGG
CTACGGGCTA CGGGCTACGG GCTACGGGCT ACGGGCCAGC AAGATCGACC ATTACGCTGC
CCACGGGCAA CTATCCGATA G
 
Protein sequence
MADWCDLSED ARLVWGKTNR DRGLVWLPLF RHLADSADVA GLLWDVWLPV TVRQRIAGSF 
PGGFDDGRVL VRWLAGVHDI GKATPAFTWQ VGPLREAMRE HGFTFDRQVE ADRGRAPHGT
AGQIVLTDWL TGRHGWDRAS AGGYAVVVGG HHGVPPTDAD LTEIRARPYL LGAGGIWPAV
QDELLGWLTR RVAATDRLAV WREVILPQPV QVLLTAIVIV ADWIASNEQF FPYGFRQEDA
SDRLQQAWEE LDLPTPWRAV DTTGVDAARF LAQRFKMPPG AQPYPVQAAV LEQARAMPLP
GLLIVEAPMG EGKTEAALAA VEVLAGRSGA GGCFVALPTR ATSDAMFSRV LGWLSRLPDA
DVGRGAREAA LAHGKAILND EYSRLYRGML PSAIGVDEGG TAVAAHGWLA GRKRKMLSSF
VVGTIDQLLF AALQARHVAL RHLGLAGKVV VVDEAHAYDV YMSQYLDRAL EWLGAHGVPV
VVLSATLPAH RRADMMQAYD VGRFGPRRGA RRRRRFRAGS DEATDDDQVL RDERRYPLLS
VSGVDRVPTT VGCAASGRSF DVRLERSDDD LAALADRLRT DLTDGGCVLV VRNTVARVLE
TADELRRLLG PKIPVSVAHS RFMAADRAAK DTWLRDTFGP PEHLTEWGRA RPACHVVVAS
QVAEQSLDID FDLLVTDLAP VDLILQRLGR LHRHRRAVRP GRLAQPRCLV TGVDWATVPP
APVPGSVTVY QHHALLRAAA VLLPYLDGDR PLYLPQDIAP LVQTAYGRGS VGPPGWQSTL
AEAAVQADAR AEVARKRADT FRVGPVQAPG VSLVGWLDAS VGDSEATNGN EWRGRAHVRD
DGAEALEVLI LVRRGEKLVT PPWLDQGGDV EVPTDFEPSR SVARTMLRCA LPLPQALTAH
GGVDRIIAEL ERRNTFPAWE KDRLLGGELI LDLDEQGRSR LTDFVLTYDR DTGLRATGYG
LRATGYGLRA TGQQDRPLRC PRATIR