Gene RoseRS_3012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3012 
Symbol 
ID5209980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3791391 
End bp3792389 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content61% 
IMG OID640596604 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001277326 
Protein GI148657121 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0160933 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACCC TTTACATCCA GGAACAGGGC GTGATGGTGC GCAAACGTGA TAATCAGGTG 
CTGATCACGA AAGACGGGCA GACGCTGAGC GAAGTGCCAC TGGCGAAGAT TGACCAGGTT
GTGCTGATGG GGCGGGGGGT GCAACTTTCG ACGGCGCTGC TGATCGATCT GCTTGAACGC
GGCATTCCGG TGACGTTCAC CAATCAGCAC GGCAGCCGCC ATTACGCCAC ACTGACTGCC
GGACCATCAC GCTTCGGCGA TCTGCGCATA CGCCAGATGC AGTTTGTCGG TGCACCGGAT
CGCGCCCTGC GCCTGGCAAA GGACATTGTG AGCGCCAAAC TGACCAATCA GCGTCGTTTG
CTGGCAGCGA CCGGGTGGCC AGCCGCAGCG ACGGCGATCG CGCAGATCGA TGCAGCGCTG
ACTGCGGCAG CAAACGCGCC GCATGTGGAC ATGCTGCGCG GGCATGAAGG CGCAGCGGCG
GCGGCATACT TCGGCGCGTG GCGCGCATCG CTTCCGCCGG TGTGGGGATT CGGCGGGCGT
GCCTTCTACC CGCCGCCCGA CCCGATCAAC GCCATGCTCT CATTCGGGTA TACGCTGGCG
CTCCATGATG TCATTACCGC TGTGCAGATC ACCGGTCTCG ATACGTACCT GGGCGTGTTT
CATGTGATCG AGCCTGGTCG TCCATCACTG GCGCTCGACC TGCTGGAGGA GTTTCGCCCG
TTGATCGTCG ACCGGTTGGT AATCGATCTG GTGCGTACCA ATGCGATTGG TCGCGAACAT
TTTCACCATC CGCAGGAACG ACCGGATGCC GTATACCTCG ATGATGTCGG GCGTACACTG
CTGGTGCAGC GGTATGAATC GATGCTTCAG ACAAAGGTAC GGTTGCCTGG CGGCGAGCAG
ACGCCGTTGC GACGGGTGAT CCTGCTGCAG GCGCAGGCGA TTGCGCGTAT CGTTCGCGGT
GAGCAGGAAC AGTACACAGG ATTCAGTCTG AATAACTGA
 
Protein sequence
MPTLYIQEQG VMVRKRDNQV LITKDGQTLS EVPLAKIDQV VLMGRGVQLS TALLIDLLER 
GIPVTFTNQH GSRHYATLTA GPSRFGDLRI RQMQFVGAPD RALRLAKDIV SAKLTNQRRL
LAATGWPAAA TAIAQIDAAL TAAANAPHVD MLRGHEGAAA AAYFGAWRAS LPPVWGFGGR
AFYPPPDPIN AMLSFGYTLA LHDVITAVQI TGLDTYLGVF HVIEPGRPSL ALDLLEEFRP
LIVDRLVIDL VRTNAIGREH FHHPQERPDA VYLDDVGRTL LVQRYESMLQ TKVRLPGGEQ
TPLRRVILLQ AQAIARIVRG EQEQYTGFSL NN