Gene Rcas_2676 details


Gene Information

Locus tag: Rcas_2676
Symbol:
ID: 5540158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3458387
End bp: 3459385
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 64%
IMG OID: 640894798
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_001432765
Protein GI: 156742636
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
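
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both assumptions are consistent with the stated 999 bp and 332 aa, but the record does not say so explicitly. The function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 3458387
END_BP = 3459385


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the terminal stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 999
print(protein_length(length_bp))  # 332
```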


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAACCC TTTACATCCA GGAACAGGGC GTGATGGTGC GCAAACGCGA TAATCAGGTG 
CTGGTCACCA GAGACGGTCA GACGCTCCAC GATGTTCCGC TGGCGAAGAT TGACCAGGTG
GTACTGATGG GGCGTGGTGT GCAGATCTCG ACAGCGCTGC TCATCGACCT GCTCGAACGC
GGCATTCCGG TCACGCTCAC CAATCAGCAC GGTAGTCGCC ACTACGCCAC GCTCACGGCA
GGACCGTCGC GGTTCGGCGA TCTGCGCACC GGGCAGATGC AGTACGTCAA CACTCCTTCA
CGCGCGCTGG AACTGGCGCG CGCGATTGTG ATCGTCAAGC TGACGAATCA GCGCCGCCTC
CTGGCGACGA CCGGCTGGCC CGCTGCGGCA TCCGCCATGC AGCAGATCGA CGCAGCGCTG
ACAGCGGCGT CTCAGGCGCA GAATGTGGAC ATATTGCGCG GGCATGAAGG CGCTGCCGCC
GCCGCCTACT TCGGCGCCTG GCGCGCATCG CTGCCGCCTG CCTGGGGATT CGGCGGGCGC
GCCTTCTACC CGCCGCCCGA CCCGATCAAT GCCATGCTGT CGTTCGGCTA CACCCTGGCG
CTCCATGATG TCATCACCGC CGTGCAGATC ACGGGTCTCG ACCCTTACCT GGGCACATTC
CACGTCATCG AAACCGGTCG CCCATCACTG GCGCTCGATC TGCTGGAAGA GTTCCGCCCG
GTGATCGTCG ACCGCATGGT GCTCGACATC GTGCGCACCA ATGCCATCGG ACGCGAGCGC
TTTCACCGTC CGCAGGAACG ACCCGAAGCG GTCTACCTCG ATGCTGAAGG GCGCGCCTTC
CTTGTGCAGC GGTACGAAAC GCTTCTCCAG ACGAAGGTGC GGTTGCCCGG CGGCGAGCAG
ACGCCGATGC GCCGGGTCAT CCTGCTGCAG GCGCAGGCGA TCGCGCGCGT GCTGCGCGGC
GAACAGGAGC GATATACCGG ATTCAGTCTC AATTCTTGA
 
Protein sequence
MPTLYIQEQG VMVRKRDNQV LVTRDGQTLH DVPLAKIDQV VLMGRGVQIS TALLIDLLER 
GIPVTLTNQH GSRHYATLTA GPSRFGDLRT GQMQYVNTPS RALELARAIV IVKLTNQRRL
LATTGWPAAA SAMQQIDAAL TAASQAQNVD ILRGHEGAAA AAYFGAWRAS LPPAWGFGGR
AFYPPPDPIN AMLSFGYTLA LHDVITAVQI TGLDPYLGTF HVIETGRPSL ALDLLEEFRP
VIVDRMVLDI VRTNAIGRER FHRPQERPEA VYLDAEGRAF LVQRYETLLQ TKVRLPGGEQ
TPMRRVILLQ AQAIARVLRG EQERYTGFSL NS
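
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how the blocked gene sequence could be cleaned, its GC content computed, and its translation checked against the protein sequence. All helper names are hypothetical. The codon table is the standard one; translation table 11 assigns the same amino acids to codons and differs only in the permitted start codons. Only the first and last rows of the gene sequence are used here to keep the example short.

```python
# First and last rows of the gene sequence, copied from the record.
FIRST_ROW = "ATGCCAACCC TTTACATCCA GGAACAGGGC GTGATGGTGC GCAAACGCGA TAATCAGGTG"
LAST_ROW = "GAACAGGAGC GATATACCGG ATTCAGTCTC AATTCTTGA"

# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINOS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINOS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino = CODON_TABLE[seq[pos:pos + 3]]
        if amino == "*":
            break
        residues.append(amino)
    return "".join(residues)


print(translate(clean(FIRST_ROW)))  # MPTLYIQEQGVMVRKRDNQV
print(translate(clean(LAST_ROW)))   # EQERYTGFSLNS
```

Passing the full cleaned gene sequence to `gc_percent` and `translate` would allow the GC content and protein sequence listed in the record to be compared against the computed values.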