Gene Strop_0999 details

Gene Information

Locus tag: Strop_0999
Symbol:
ID: 5057445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 1118388
End bp: 1119368
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 65%
IMG OID: 640473269
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_001157852
Protein GI: 145593555
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype
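
The start, end, gene length and protein length reported for Strop_0999 agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes 1-based, inclusive coordinates and a single terminal stop codon that counts toward the gene length but not the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based coordinates with both ends inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1118388, 1119368)
print(length)                  # 981
print(protein_length(length))  # 326
```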


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.614939
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTGCCA GCGGGCGCAG CTATTGGCTG ACCGAACCGT GCCGAATCCG ACGCGAAGAC 
AACAGCATCC GGATCGAACG CGCCGATGGA CAACCTGTTC GCATTCCGAT CACCGACATT
CGCGACCTTG TGCTCTTCGA CAACGCCGAC ATCAACACCG CCGCGGTATC GCTACTCAGC
CGGCACGGAG TCACCGTGCA CCTACTTGAC CACTACGGCA ACTATGCTGG CGCGCTGACT
CCAGCCGACG ACATGTCCTC CGCACACGTC GTCCGCGCCC AGGTGGCCCT GACAGGCAAC
CCTCAGGCCC GACTCGCTGT CGCGCAGGCC CTCGTCCGGG CGACCGCGGT CAACGTAGCC
TGGGCCCTGG GCACGGACCT GCTCGATGGG CCACTCGAAC GACTTCCCGC CCAAATCGGT
GCCAGCACCT CATCCGGAGA CCTGATGGGA GTCGAAGGTA ACTTCCGGCG AACCGCGTGG
GGAGTGCTCG ATACCCTGCT ACCGCCCTGG CTCCGGCTTG ACGGACGCAC CCGTCGCCCA
CCCAGTAATG CCGGCAACGC GTTCATCAGC TACCTCAATG CCATCACCTA CGCTCGGGTT
CTCACCGCGA TTCGCTGTAC GCCGCTGCAC CCGGCGATCG GCTTCCTGCA CGCCGACACC
GACCGGCGCC GAAACACCCT CGCCCTTGAC CTCGCCGAAC CGTTCAAGCC GCTGCTCGCC
GAACGACTGC TCCGCCGAGC AGCCGCGCAG CGAACCCTGA CCGCTGCAGA CTTCGTCAGC
GACGTCCGTA GCGCGTCCCT CAGCCAGGCC GGACGGAAAA AGATTGCTGT CATGGTCCGC
GAAGAACTGG CCACCACCGT CCAGCATCGG CAACTCCGGC GAAAGGTGTC CTACGAGGAG
TTGATCCACC TGGAGGCCCT CAAGCTCGTA CGACTATGCC TCGAAGGCAC GACCTACAAG
CCCTTCCGGC CCTGGTGGTA G
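
The GC content reported for this gene is the share of G and C bases in the sequence above. The sketch below shows one way such a figure could be computed from the sequence as displayed, in blocks of ten separated by spaces. The helper name `gc_content` is hypothetical, and the database's own rounding is not assumed.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence as displayed on this page (60 of 981 bases).
first_line = "GTGAGTGCCA GCGGGCGCAG CTATTGGCTG ACCGAACCGT GCCGAATCCG ACGCGAAGAC"
print(f"{gc_content(first_line):.0%}")
```

Passing the full 981-base sequence instead of `first_line` gives the gene-wide value.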
 
Protein sequence
MSASGRSYWL TEPCRIRRED NSIRIERADG QPVRIPITDI RDLVLFDNAD INTAAVSLLS 
RHGVTVHLLD HYGNYAGALT PADDMSSAHV VRAQVALTGN PQARLAVAQA LVRATAVNVA
WALGTDLLDG PLERLPAQIG ASTSSGDLMG VEGNFRRTAW GVLDTLLPPW LRLDGRTRRP
PSNAGNAFIS YLNAITYARV LTAIRCTPLH PAIGFLHADT DRRRNTLALD LAEPFKPLLA
ERLLRRAAAQ RTLTAADFVS DVRSASLSQA GRKKIAVMVR EELATTVQHR QLRRKVSYEE
LIHLEALKLV RLCLEGTTYK PFRPWW
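
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The gene begins with GTG, which table 11 allows as an initiation codon and which is read as methionine at the first position. The sketch below is a minimal, dependency-free illustration. `translate_cds` is a hypothetical helper, and handling the start codon only at position 0 is a simplifying assumption.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for table 11, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11; each is read as Met when it starts a CDS.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with Met
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
print(translate_cds("GTGAGTGCCA GCGGGCGCAG CTATTGGCTG"))  # MSASGRSYWL
# Last 21 bases, ending in the TAG stop codon.
print(translate_cds("CCCTTCCGGC CCTGGTGGTA G"))           # PFRPWW
```

Both fragments reproduce the corresponding ends of the protein sequence listed above.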