Gene Rcas_3303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3303 
Symbol 
ID5540801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4295005 
End bp4296021 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content63% 
IMG OID640895421 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001433372 
Protein GI156743243 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.597515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.540685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCTCA TCGTAGACCA ATTCGGCGTA TTTATCTCAA AGCACCAGGG GCGCATTCGC 
GTCATGAAAG AGAAGGAACG TCTATCGGAA GTGCCGATCA TGCACCTGGA ACAAATCCTC
ATTTGCAGCG ACGGTGTCGG TCTCAGCAGC GACGTGGTGC GTGCCTGCGC CGAAGAAGGG
ATTCCGATCC ACTTCCTCAA CAGCGCCAAC GGCGGCGACT ATGGCACGTT CGTGCATAGC
GGCATCACCG GCATGGCGCT CACCCGCCGT GCTCAATTGC GAGCCGGTGA CGACGAGCGC
GGATTGCGTC TGGCGCAGGC GTTCGCCAGC GGCAAAATCC AGAGCCAGGC GAACATGCTG
CGCTACGCCG CCAAAAATCG CAAAGAGAAC GACCCGGACC TGCACAACGA CCTGATGCGC
ACCGCCACCG AAATCCTCGA CGCCCTCCCG CCGCTGCGCG CCGTGCGCGG CGTCCTTACC
GACGAAACCC GCGCTGCGCT GATGGGCTTC GAGGGAATGG CCGGCGCGCG CTACTGGACA
GCCGTGGCGC GCATCATTCC CGACGACCTC GGTTGGCCCG GACGCGAAAC GCGCGGTGCG
CGCGACCGCT TCAATCAGGC GCTCAACTAC GGCTACGGCG TTCTCCAGTC TCAGGTGCGC
ACTGCCCTGA TCCTTGCCGG GTTGGACCCC AATGCCGGGT TCCTCCACGC CGACCGACCG
GGCAAGCCGA GCCTGACCCT TGACCTGATC GAAGAGTTTC GCCAGGCAGT CGTCGACCGC
ACCCTCATCG GGCTGGTCAA CCGCCAGTTC GAGATTGTCC AGCGCGACGA CGGACTGCTC
GACGAGGACA CCCGCAAACG CATCGCCGAG AAAATCCTCG AACGCCTGAA CAGCACTGAG
CTCTACGAAG GCAAGCGTCA GCCGCTCCGT CACATTCTCC AATGCCAGGC GCGCCACATC
GCCACCTTTG TGCGCGGCGA ACGTCCCACC TATGAACCGT TTGTGATGGG GTGGTGA
 
Protein sequence
MHLIVDQFGV FISKHQGRIR VMKEKERLSE VPIMHLEQIL ICSDGVGLSS DVVRACAEEG 
IPIHFLNSAN GGDYGTFVHS GITGMALTRR AQLRAGDDER GLRLAQAFAS GKIQSQANML
RYAAKNRKEN DPDLHNDLMR TATEILDALP PLRAVRGVLT DETRAALMGF EGMAGARYWT
AVARIIPDDL GWPGRETRGA RDRFNQALNY GYGVLQSQVR TALILAGLDP NAGFLHADRP
GKPSLTLDLI EEFRQAVVDR TLIGLVNRQF EIVQRDDGLL DEDTRKRIAE KILERLNSTE
LYEGKRQPLR HILQCQARHI ATFVRGERPT YEPFVMGW