Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_1950 |
Symbol | |
ID | 4484920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 2210483 |
End bp | 2211370 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639730742 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_873708 |
Protein GI | 117929157 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGGAC CGTGGCGTGT CGTGGATCTG TCCGAACTGT CCGGTGAGGT GCATGCGGCT CAAGGGGCAC TTCTTGTCGG TGACGAGCGG GTGCCGCTGG TTGACGTCGC GATGATGCTT ACGGGGCCGT ACGTCTCCCT GCACGGCAGC GTTATTGACC GCGCTGCGGC GTTCGGGGTA GGCGTGGTGC ACTGCGACTG GCGGGGTGTT CCGGTAGCCG CTACATTGCC GTGGTCGACT CACAACCGGG TGGCGGCTCG TCATCGCGCG CAGGCGGAGC TTTCGTTGCC TAGGCAAAAG AACGCATGGA TGAATATCGT GAAGACAAAG ATCCGCAATC AGGCCGCTGT GCTACGGGCG CTTCGCCGAG ACGGTGTGGC GCAACTGGAG CGACTCGCGG CGCAGGTTCG ATCAGGTGAT GCAAGCAATG CTGAAGGGGC TGCCGCGCGC GTGTATTGGG CTCGCTTGTT TCAGGACAAG CACTTTCGTC GCGTTCCGCG AGCACGTGAC GTTGTCAACG GCCTCCTAGA CTACGGCTAT GCGATCTTAC GTGGTTGTTG CCTTCGCGCG GTGGTCGGTG CGGGACTCGC GCCGTCCCTC GGCCTTTGGC ACCGGCGCCA CGATAATCCG TTTACGCTGG TTGACGATCT TATCGAACCA TTCCGACCTG CGGTGGACAA GACGGTCATA GAGATCGTCA CTGCGGGCGC ATCGGGTCTT GACCGTCCCA CTAAGCGCCT TCTTGTAGCG GTGCTTGATC ACCAATTTGA TGCGAGCGGA GCGACCGTGG GAACAGCCGT GGAGCGGTTT GCCCAGCAGG TCGGTCGGTA CGTCGAGGGC GAGATACGAA GTCTGAGACC ACCCGCCATG GAGCTGTCGC ATGCTTAA
|
Protein sequence | MTGPWRVVDL SELSGEVHAA QGALLVGDER VPLVDVAMML TGPYVSLHGS VIDRAAAFGV GVVHCDWRGV PVAATLPWST HNRVAARHRA QAELSLPRQK NAWMNIVKTK IRNQAAVLRA LRRDGVAQLE RLAAQVRSGD ASNAEGAAAR VYWARLFQDK HFRRVPRARD VVNGLLDYGY AILRGCCLRA VVGAGLAPSL GLWHRRHDNP FTLVDDLIEP FRPAVDKTVI EIVTAGASGL DRPTKRLLVA VLDHQFDASG ATVGTAVERF AQQVGRYVEG EIRSLRPPAM ELSHA
|
| |