Gene Information |
Locus tag | EcE24377A_3057 |
Symbol | cas1 |
ID | 5589688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 3057621 |
End bp | 3058541 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640926701 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001464077 |
Protein GI | 157155405 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
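The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; both assumptions are consistent with the values in this record, but neither is stated by it. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3057621
end_bp = 3058541

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of the record.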
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.82457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGACGTTTG TACCACTGAG CCCGATCCCA TTAAAAGATC GCACCACCAT GATCTTCCTC CAGTACGGTC AGATCGACGT ACTGGACGGC GCTTTCGTGC TGATCGACAA AACAGGGATC CGCACACATA TTCCGGTAGG ATCGGTCGCC TGCATTATGC TCGAACCGGG AACGAGAGTT TCCCACGCGG CGGTGCATCT GGCCGCCACG GTGGGAACAC TGCTGGTCTG GGTCGGTGAA GCGGGCGTTC GCGTTTACTC ATCCGGACAA CCCGGAGGGG CGCGGGCAGA TAAATTACTC TACCAGGCAA AGCTGGCTTT AACGGAAGAT CTACGCCTGA AGGTGGTGCG CAAAATGTAT GAATTACGTT TTCGTGAGCC ACCGCCAGCT CGCCGTTCAG TGGAGCAGCT ACGTGGAATT GAGGGATCTC GCGTTCGCCA GACGTATGCA TTACTGGCGA AACAATATGG TGTGAAATGG AATGGTCGCA AATACGATCC TAAAGACTGG GAAAAAGGCG ATGTTGTGAA TCGCTGCATC AGTGCTGCTA CATCATGTCT GTACGGTATA TCTGAAGCGG CAGTATTAGC CGCGGGATAT GCGCCCGCTA TTGGGTTTAT CCATAGTGGC AAACCGCTTT CATTTGTTTA TGACATAGCC GATATCATTA AATTTGATTC GGTTGTACCA AAGGCATTTG AAATAGCGGC GAGGCAACCC GCAGAACCTG ATAAAGAAGT CAGATTAGCC TGTCGTGATA TTTTCCGTAG CACTAAGTTA ACGGGCAAAT TAATACCGTT AATTGAGGAA GTCCTTGCCG CAGGTGAAAT TGAACCGCCA CAACCTGCGC CGGATATGTT ACCGCCAGCC ATCCCCGAAC CTGAAACGTT GGGCGATAGC GGTCATCGAG GACGCGGCTG A
Protein sequence | MTFVPLSPIP LKDRTTMIFL QYGQIDVLDG AFVLIDKTGI RTHIPVGSVA CIMLEPGTRV SHAAVHLAAT VGTLLVWVGE AGVRVYSSGQ PGGARADKLL YQAKLALTED LRLKVVRKMY ELRFREPPPA RRSVEQLRGI EGSRVRQTYA LLAKQYGVKW NGRKYDPKDW EKGDVVNRCI SAATSCLYGI SEAAVLAAGY APAIGFIHSG KPLSFVYDIA DIIKFDSVVP KAFEIAARQP AEPDKEVRLA CRDIFRSTKL TGKLIPLIEE VLAAGEIEPP QPAPDMLPPA IPEPETLGDS GHRGRG
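The GC content field is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows. The function name `gc_percent` is hypothetical, and the sketch assumes the sequence is given in the space-separated 10-base groups used in this record. The demo input is only the first two groups of the gene sequence, so its result is not the GC content of the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    # Remove the spaces that separate the 10-base groups.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base groups of the gene sequence, used only as a small demo.
demo = "GTGACGTTTG TACCACTGAG"
print(round(gc_percent(demo), 1))
```

Passing the full Gene sequence field to the same function gives a value that can be compared with the GC content reported in the record.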