Gene Information |
Locus tag | Haur_2234 |
Symbol | |
ID | 5734121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2840994 |
End bp | 2842046 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279375 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001545002 |
Protein GI | 159898755 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
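The coordinate and length fields in the table above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration, not part of any database tooling.

```python
# Values copied from the Gene Information table.
START_BP = 2840994
END_BP = 2842046


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1053
print(protein_length(length_bp))  # 350
```

Both results agree with the Gene Length (1053 bp) and Protein Length (350 aa) rows.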
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.982085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACAC TCTACCTTTC GGAGCAATAT AGCATTGTCA AACGCGAAGG CGAGGCCTTG CGCGTCGAGA TTCCCGAAGA TCAACAACTT GGTCGCCAAC GTCAGGTTGT GCGAGTACCA TTAAACGTGA TTGAGCGGGT GGTAGTGCAG GGCGAAATCA CCCTAACTGC CTCGGCATTA GCCTGCTTAT TGGAGCGACG CATTTGCACC CATTTTTTGA GCTACAGCGG ACGTTCCCAA GGAGCACTAA CGCCTGATCC GACGCGTAAT GCAAGCCTGC GTTTAGCTCA ATATGCCGCG CATACCAGCA TCCAACATCG ATTTAGCCTT GCACGAACCT TTGTCGATGG GAAATTGCGC AATTTACGCA CCCAAATTTT GCGTTTCAAT CGTTCGCAGC GTGAGCCAAC TCTGACCCAA GCGATCGAGC GTTTACGCGA TGCCCATCGC GATCTCCATG GATTAAGCAT TCCAGAGTAT GTTGACCCGC TTGATCGCAT GCATGGAATG GGCCAGATTT TGGGCTGCGA AGGGCAAGGA AGCGCCGCCT ACTGGGATTG TTGGGGAATG TTGCTCAATC AGCCGTGGGA GTGGCATGGC CGTCGTCGTC GCCCACCGCC TGATCCAGTC AATGCCCTGT TATCGTATGG CTACGTGATT CTGACCAGTC AAGTTTTGAG CCAATTAGCG ATTGTGGGCT TTGATCCCTA CATCGGCTTT TTGCATCAAT CGAGTTTTGG CAAACCAGCC TTAGCACTTG ATCTCATGGA AGAATTTCGC CCAGTGATCG TTGATTCAGT AGTTTTGACC GTGCTTAACA CCAAAATTCT GAACCAGCAG CATTTTCAAC GTGAGCCTGG GAGCGTGCAA CTAAGCAAAG AAGGCCGTAA ACTCTTTCTG ACCAAGCTCG AAGAACGCTT CAGTAGTGAA ATCCAACACC CAATTTTTGG CTATCGGGTG AGCTATCGAC GCTGCATCGA ACTCCAAGCG CGGCTGCTTG CCAAAGCCCT GATGGGCGAG ATTCAGCACT ATATTCCATT TCTCGTGAGG TGA
|
Protein sequence | MQTLYLSEQY SIVKREGEAL RVEIPEDQQL GRQRQVVRVP LNVIERVVVQ GEITLTASAL ACLLERRICT HFLSYSGRSQ GALTPDPTRN ASLRLAQYAA HTSIQHRFSL ARTFVDGKLR NLRTQILRFN RSQREPTLTQ AIERLRDAHR DLHGLSIPEY VDPLDRMHGM GQILGCEGQG SAAYWDCWGM LLNQPWEWHG RRRRPPPDPV NALLSYGYVI LTSQVLSQLA IVGFDPYIGF LHQSSFGKPA LALDLMEEFR PVIVDSVVLT VLNTKILNQQ HFQREPGSVQ LSKEGRKLFL TKLEERFSSE IQHPIFGYRV SYRRCIELQA RLLAKALMGE IQHYIPFLVR
|
| |
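The relationship between the Gene sequence and Protein sequence rows can be illustrated with a minimal translation sketch. It uses only the first 30 nucleotides of the gene sequence, copied from the record, and builds the codon table by hand. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the start codons it permits), so the standard assignments are used here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and chosen for this example only.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 nucleotides of the Gene sequence row.
PREFIX = "ATGCAAACAC TCTACCTTTC GGAGCAATAT"
print(translate(PREFIX))  # MQTLYLSEQY
```

The output matches the first ten residues of the Protein sequence row. Passing the full gene sequence to `translate` and `gc_fraction` would be one way to compare against the Protein sequence and GC content fields of the record.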