Gene RoseRS_0645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0645 
Symbol 
ID5207583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp798411 
End bp799358 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content54% 
IMG OID640594262 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001275015 
Protein GI148654810 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.354682 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.348803 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGTGC ACAATCTACA AATCTTGCCC AAGGTCAGCG ATAGCTGGAC CTACCTGTAC 
GTCGAGCATG CCATCATTGA GCAGGATGAC AAAGCAATCG CGATTCTCAA CAAGGAGGGC
AAAACTCCCG TACCTTGCGC CACGCTCTCG CTCCTCATGC TGGGTCCCGG CATCAGCATC
ACCCATCAGG CCATCAAGAC ACTGGCAGAA AACGGGTGCA TGGTGGCCTG GGTAGGAGAA
GAAGGCGTTC GTTTTTATGC AGTTGGCATG GGAGAAACCA GATCAGCGGC CAACACACTG
CGTCAGGCAG CAATGCACAG CGATCCGGAT CTGCGTTTGC GAATTGTACG GCGCATGTAC
GAAATGCGCT TTCCTGAAAA GCTCGATCCC GGTCTCACCA TTAAGCAGAT TCGAGGGAAA
GAGGGAGCGC GTGTTCGAGA CACATATGCG CGGTGGAGCC GTGAGACCGG CGTCAAGTGG
GATGGTCGAT TCTACAAACA GAATGACTGG CGGCGCACCG AACCGATTAA TCGGGCTATT
TCGGCAGCCA ACAGTTGTTT GTACGGGATC GTTCATGCTG CGATTGTCGC TGCAGGCTAC
TCACCTGCGC TCGGATTCAT TCATACCGGC AAGATGCTCT CGTTCGTCTA CGATGTCGCC
GATCTTTACA AAACGGACAT CGCCATTCCG GCAGCTTTTC GCTGCACAGC AGCCGGTGAG
AGTCGACTAG AGAGTCGAGT GCGACATTTG TGTCGTGATC TGATCCGTGA GCAGCGCATG
CTGGAACGCA TTGTCGATGA TCTCCACAGA ATCTTTGACA TCTCAACGCT CGATCAGCGC
GAGTCGGAGT TGTTTGATCG ATATTATGCC CGCCCTGGCA ACCTGTGGGA TCCGGAAGAA
GGGGAGGTTG CTGGCGGCAT CAATTACAGC GAAGAGGAAG TTTCATGA
 
Protein sequence
MPVHNLQILP KVSDSWTYLY VEHAIIEQDD KAIAILNKEG KTPVPCATLS LLMLGPGISI 
THQAIKTLAE NGCMVAWVGE EGVRFYAVGM GETRSAANTL RQAAMHSDPD LRLRIVRRMY
EMRFPEKLDP GLTIKQIRGK EGARVRDTYA RWSRETGVKW DGRFYKQNDW RRTEPINRAI
SAANSCLYGI VHAAIVAAGY SPALGFIHTG KMLSFVYDVA DLYKTDIAIP AAFRCTAAGE
SRLESRVRHL CRDLIREQRM LERIVDDLHR IFDISTLDQR ESELFDRYYA RPGNLWDPEE
GEVAGGINYS EEEVS