Gene Sare_1972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1972 
Symbol 
ID5707875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2271293 
End bp2272780 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content67% 
IMG OID641271477 
ProductCRISPR-associated Cse1 family protein 
Protein accessionYP_001536848 
Protein GI159037595 
COG category 
COG ID 
TIGRFAM ID[TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.355127 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAAT CTTTCGACCT GACAGACCAG CCGTGGATAC CGGTGGTTGC CAAGAGTGAG 
TTGGAGCTGG TCGGCCTCCG TGAGCTTTTC GTTCGTGCAG CGGAGTTCGA TGACCTGGCT
GTTCCGGTGC CGCCGGCGGC CTCCGGGTTG TGGCGAATTC TGTACGCGAT CACTGCACGG
GTGACGGGCC TGGACATGCT GCGCGGGCCA CAGTGGCGGC AGCGGCAGGA GCGGCTGCTT
GACCAGGGCG GGTTCGCCGC AGGCGACGTC GATGCCTACT TTGCGAAATA CTCAGACCGC
TTTGACCTGT TCGGTGCGCT TCGTCCGTGG ATGCAGGATC CCCGGCTTGC TGTCGAGTGT
CCGAAGTCCT CCGGGGTCAA CAAGCTGGTG TTCGACCGTC CAGCGGGCAA CTCTCAGGTG
TGGTTCGGCC ATCATACGGA TGCCGACGCG GTGGCTCTGG CTCCGGGGGA GGCGGCGTGG
TATCTGATCG CGCAGCTGTA TTACGGCGCA TCTGGGCGGT GCAGCAGCCG TGAGGTGGCC
GGGCAGAAGT TCGCCAACAG CAACGCCGGC CCGCTGCGGG GGGTCATGTC TTATCACCCG
CTCGGGGAGA ACCTGTTCGA GTCGCTGGTC GTGGGGGTGC CCCCAGGGGT GTCATCAGGG
CAGGACGAGG GGCTTGACCT GTGTCCGTGG GAGCGCGATG AGTTGCCGGA TCCGCTCGGT
GCGCCATGGT CGGTGTCGTG GCCGTGTGGC GCGTTGACGG GCCGTGCGCG GCATGCTGTG
CTGCTGGTTC CAGACGCCGC CGGTGAGGCC GTGTCGGATG CCTACGTCAC CTGGGCGTGG
CGGCTTCCCG GGGCGGCGTC CCCTGATCCG TACGTGGTGC GTCGGCAGAA CAAGGAGGGC
GGCTGGTACC AGCTGCCGGC CGACGACTCC CGGGCTCTGT GGCGGGATGT TGACGCCCTG
CTGGGCGGCA ACACCGAGGT CAAGACGCAC CGTCCCGACA TCATGGCTGT CGCCGCCGAC
CTCGGTCTCG ACGGACGGGT GCGCGCGTAT GGCTTCGATC AGGACGGGCA GGCCAAGGAT
CGGCAGTGGT TCATCGCACT GACTCCACCA GTTCTTGGTT GGTTGAGTGA GCGCGACCCG
GTGACCGCCG ACGGCGTGGC CCTGTTGACC CGGGCGGCGG AGTCGATCGG CCGGCGTGTC
GGGGCCGCAC TGCGACAGGC CTGGCGGGAG TTGGTTAGCG TCAAGGACCG AGAAGGCCCG
TGGGCTCATA CGGGTGAGGC GTACTACTGG ACGCGAGCCG AGGCGGTGTT CTGGGAGCAC
GTGCGAGACG GCCGTTTCGC CGAGGGCGGG CGGGCTTTCG CGCGGTTAGG GCACGAGGCT
ATCGACCACG CCGCGGATGG CGATGCCAGC TCACCGCGGC TGGTGCGGGC CACGCAGACA
GCGCACCGGC TGTTGACAAC TCCGTTGAGG AAGGGATCGG GCGCGTGA
 
Protein sequence
MIKSFDLTDQ PWIPVVAKSE LELVGLRELF VRAAEFDDLA VPVPPAASGL WRILYAITAR 
VTGLDMLRGP QWRQRQERLL DQGGFAAGDV DAYFAKYSDR FDLFGALRPW MQDPRLAVEC
PKSSGVNKLV FDRPAGNSQV WFGHHTDADA VALAPGEAAW YLIAQLYYGA SGRCSSREVA
GQKFANSNAG PLRGVMSYHP LGENLFESLV VGVPPGVSSG QDEGLDLCPW ERDELPDPLG
APWSVSWPCG ALTGRARHAV LLVPDAAGEA VSDAYVTWAW RLPGAASPDP YVVRRQNKEG
GWYQLPADDS RALWRDVDAL LGGNTEVKTH RPDIMAVAAD LGLDGRVRAY GFDQDGQAKD
RQWFIALTPP VLGWLSERDP VTADGVALLT RAAESIGRRV GAALRQAWRE LVSVKDREGP
WAHTGEAYYW TRAEAVFWEH VRDGRFAEGG RAFARLGHEA IDHAADGDAS SPRLVRATQT
AHRLLTTPLR KGSGA