Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_3119 |
Symbol | |
ID | 7311713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3653330 |
End bp | 3654226 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643610022 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002507390 |
Protein GI | 220930481 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.679902 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGATGGA GAAATATAAT AGTTTCAAAT CCAACTAAAC TGAAATTAAA ACAGAATAAC TTATGGGTTG AACAATCAGA TGGCTTTAGT ATACCGATTG ATGATATAAA TACAATAGTA CTTGATAGTG CGGATGTTAC GATTACATCC GCACTATTAT CAAAATTGGC AGAAGAAGAC ATTGCTTTGT ATTCTTGTGA TGGGAAGCAC ACACCGAATG GAGTACTTCT TCCATTCAGT TGTCATAGTA GACAATACAA AATTGTAAAA ACTCAAATAA ATCTTTCAGC ACCTTTTAAA AAAAGGTGCT GGCAAAGAGT TGTTCAACAG AAAATAGAAA ATCAGGCCTT TTGCTTAAAT ATTCTAGAAT TAAAAGGAAG AGATGAATTA ATAAATCTAT CTAAGAGTGT TCTATCTGGT GATTCAACTA ATGTAGAGGC TCATGCTGCA AAATATTATT TCTCTGTTCT ATTCACAAAC TTCAAAAGGG GTATGCAGGA TAACACAAAC TATGCATTAA ACTATGGCTA TTCAATATTA AGGGGAGCTG TAGCCAGAAC CATAGCATCG TATGGATTTA TCCCTTCTAT TGGAATACAT CATAGAAGCG AATTGAATAA TTTTAATCTT GCTGATGACT TTATCGAACC GTTCAGACCA ATTGTTGATA TGTGGGTAAA ACAAAATATA AATGAGGATA CACTTTTAAC ACCTAAACAT AAGTTAAATC TTATAAGTTT GTTGGGTTAC GAATGTGTCT TTGAGGGAAA AATAATATCT ATAAGGTCTG CAATCGAAAA GGTGATTTCA AGTTTTTCAA GTTCTTGTGC AAAGAACGAT TATAGTTTAT TGAAATTACC TGAAATAATA CCATTAGAGG TACATGCAAA TGAGTGA
|
Protein sequence | MGWRNIIVSN PTKLKLKQNN LWVEQSDGFS IPIDDINTIV LDSADVTITS ALLSKLAEED IALYSCDGKH TPNGVLLPFS CHSRQYKIVK TQINLSAPFK KRCWQRVVQQ KIENQAFCLN ILELKGRDEL INLSKSVLSG DSTNVEAHAA KYYFSVLFTN FKRGMQDNTN YALNYGYSIL RGAVARTIAS YGFIPSIGIH HRSELNNFNL ADDFIEPFRP IVDMWVKQNI NEDTLLTPKH KLNLISLLGY ECVFEGKIIS IRSAIEKVIS SFSSSCAKND YSLLKLPEII PLEVHANE
|
| |