Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_0645 |
Symbol | |
ID | 5207583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 798411 |
End bp | 799358 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640594262 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001275015 |
Protein GI | 148654810 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.354682 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.348803 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGTGC ACAATCTACA AATCTTGCCC AAGGTCAGCG ATAGCTGGAC CTACCTGTAC GTCGAGCATG CCATCATTGA GCAGGATGAC AAAGCAATCG CGATTCTCAA CAAGGAGGGC AAAACTCCCG TACCTTGCGC CACGCTCTCG CTCCTCATGC TGGGTCCCGG CATCAGCATC ACCCATCAGG CCATCAAGAC ACTGGCAGAA AACGGGTGCA TGGTGGCCTG GGTAGGAGAA GAAGGCGTTC GTTTTTATGC AGTTGGCATG GGAGAAACCA GATCAGCGGC CAACACACTG CGTCAGGCAG CAATGCACAG CGATCCGGAT CTGCGTTTGC GAATTGTACG GCGCATGTAC GAAATGCGCT TTCCTGAAAA GCTCGATCCC GGTCTCACCA TTAAGCAGAT TCGAGGGAAA GAGGGAGCGC GTGTTCGAGA CACATATGCG CGGTGGAGCC GTGAGACCGG CGTCAAGTGG GATGGTCGAT TCTACAAACA GAATGACTGG CGGCGCACCG AACCGATTAA TCGGGCTATT TCGGCAGCCA ACAGTTGTTT GTACGGGATC GTTCATGCTG CGATTGTCGC TGCAGGCTAC TCACCTGCGC TCGGATTCAT TCATACCGGC AAGATGCTCT CGTTCGTCTA CGATGTCGCC GATCTTTACA AAACGGACAT CGCCATTCCG GCAGCTTTTC GCTGCACAGC AGCCGGTGAG AGTCGACTAG AGAGTCGAGT GCGACATTTG TGTCGTGATC TGATCCGTGA GCAGCGCATG CTGGAACGCA TTGTCGATGA TCTCCACAGA ATCTTTGACA TCTCAACGCT CGATCAGCGC GAGTCGGAGT TGTTTGATCG ATATTATGCC CGCCCTGGCA ACCTGTGGGA TCCGGAAGAA GGGGAGGTTG CTGGCGGCAT CAATTACAGC GAAGAGGAAG TTTCATGA
|
Protein sequence | MPVHNLQILP KVSDSWTYLY VEHAIIEQDD KAIAILNKEG KTPVPCATLS LLMLGPGISI THQAIKTLAE NGCMVAWVGE EGVRFYAVGM GETRSAANTL RQAAMHSDPD LRLRIVRRMY EMRFPEKLDP GLTIKQIRGK EGARVRDTYA RWSRETGVKW DGRFYKQNDW RRTEPINRAI SAANSCLYGI VHAAIVAAGY SPALGFIHTG KMLSFVYDVA DLYKTDIAIP AAFRCTAAGE SRLESRVRHL CRDLIREQRM LERIVDDLHR IFDISTLDQR ESELFDRYYA RPGNLWDPEE GEVAGGINYS EEEVS
|
| |