Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2894 |
Symbol | cas1 |
ID | 5592173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2896088 |
End bp | 2897005 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640922011 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001459522 |
Protein GI | 157162204 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTGGC TTCCTCTTAA CCCCATTCCA CTCAAAGATC GCGTCTCCAT GATCTTTCTG CAATATGGGC AGATCGATGT AATAGATGGC GCGTTTGTAC TTATCGACAA GACAGGGATC CGCACTCATA TTCCTGTTGG CTCGGTTGCC TGCATCATGC TGGAACCTGG TACACGGGTT TCGCATGCAG CTGTACGCTT GGCTGCGCAA GTTGGAACAT TGTTGGTATG GGTGGGGGAA GCGGGCGTTC GTGTTTATGC TTCTGGTCAG CCTGGTGGTG CGCGTTCAGA TAAGCTGCTC TACCAGGCAA AACTTGCTCT GGATGAAGAT TTGCGTCTGA AGGTAGTACG CAAAATGTTT GAACTTCGGT TTGGTGAACC TGCGCCCGCC CGACGCTCCG TAGAACAACT CAGAGGTATA GAAGGAAGCC GTGTGCGAGC AACTTATGCA CTTCTGGCGA AGCAATACGG GGTGACATGG AATGGACGTC GTTACGATCC GAAAGACTGG GAAAAGGGCG ATACGGTCAA CCAATGCATT AGCGCTGCAA CTTCCTGTTT ATACGGCGTA ACTGAAGCGG CGATACTTGC AGCTGGTTAT GCACCAGCTA TTGGGTTTGT GCATACAGGA AAGCCTCTTT CCTTTGTTTA CGATATCGCG GACATCATTA AATTTGACAC TGTTGTACCG AAAGCTTTTG AGATAGCGCG TTGTAATCCT GGTGAGCCGG ACCGGGAAGT CCGTTTGGCG TGCAGGGATA TTTTTCGCAG TAGTAAAACA TTAGCCAAAT TGATTCCGCT TATAGAGGAC GTGCTTGCTG CTGGAGAAAT ACAACCGCCG GCCCCACCTG AAGATGCACA GCCTGTTGCC ATTCCGCTTC CTGTTTCACT GGGAGATGCA GGCCATCGGA GTAGCTGA
|
Protein sequence | MAWLPLNPIP LKDRVSMIFL QYGQIDVIDG AFVLIDKTGI RTHIPVGSVA CIMLEPGTRV SHAAVRLAAQ VGTLLVWVGE AGVRVYASGQ PGGARSDKLL YQAKLALDED LRLKVVRKMF ELRFGEPAPA RRSVEQLRGI EGSRVRATYA LLAKQYGVTW NGRRYDPKDW EKGDTVNQCI SAATSCLYGV TEAAILAAGY APAIGFVHTG KPLSFVYDIA DIIKFDTVVP KAFEIARCNP GEPDREVRLA CRDIFRSSKT LAKLIPLIED VLAAGEIQPP APPEDAQPVA IPLPVSLGDA GHRSS
|
| |