Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5081 |
Symbol | |
ID | 5737039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 99827 |
End bp | 100840 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641282246 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001547837 |
Protein GI | 159901591 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTAA TTGTCCATGA ACGGGGCACC TTTATTCAAA AACATCAAGG TCGTTTGCGG GTCATGCGTG AAAAAGAGCG TTTGGCTGAA GTTCCATTAT TAATTCTTGA TCACGTTATT ATCGAATCGT ATGGTGTTGG AATTTCATCC GATGCAGTGC GAGCGTGTGC TGAGCATGGC ATCCCGATTC ATTTTTTAAG TAGTACAGGT ATTGCCTATG CTTCGCTCTA TAGTGCCGGA TTAACGGGTA CGGTGCAAAC ACGCCGTGCC CAATTACAAG CTTTTGAAAA TGAGCGCGGT GCATGGCTAG CCCGTGCATT TGTCTGTGGC AAATTAGAAA ATCAGCATAA TTTGCTCCGA ATGATGGCCA AATATCGCAA AACGGCTGAT CCTGCTTGTT TTCAGCGGGT TCAGCCAATT ATCGCCGAAA TGCGTGATCA TATTATCGAA GCTGAGCGGG TTATGCCGCA GCAACTTGAG CACATTCGGC CTTCACTGTT GAGTATCGAA GGTCGCGGCG CGGCTCGTTA TTGGTTTGGT GTGCGTGAAT TATTGCTCTG TGATTTAGAT TGGCCTGGGC GTGAAACCCA AGGAGCACGT GATCCGCTCA ATAGTGCCTT GAATTATGGC TATGGCATTT TGTATAGCCA AATCGAGCGT TGCCTCGTTC TGGCTGGCCT TGATCCCTAT GGTGGCTTTA TGCACACTGA TCGGCCTGGT AAACCATCAT TAGTGCTCGA TTTGATTGAA GAATTTCGCC AAACCGTGGT TGATCGTACC ATTTTGGGTT TGGTCAATCG CAAAATGACG ATTGAGCAAG ATGAAACTGG CCGATTAAGC GACCATACAC GCGAGATGAT TCGTGAGCGC CTATTTAAGC GTTTGGAAGC GAGTGAGCCA TATGAGACCA AACGGGTGAG TTTGCGGGTA ATTATGCAAT CTCAAGCTCG CCATCTCGCA ACATTTGTGC GCGGCGATCG CGACACCTAC ACACCATTTA TTGCCTCGTG GTAG
|
Protein sequence | MELIVHERGT FIQKHQGRLR VMREKERLAE VPLLILDHVI IESYGVGISS DAVRACAEHG IPIHFLSSTG IAYASLYSAG LTGTVQTRRA QLQAFENERG AWLARAFVCG KLENQHNLLR MMAKYRKTAD PACFQRVQPI IAEMRDHIIE AERVMPQQLE HIRPSLLSIE GRGAARYWFG VRELLLCDLD WPGRETQGAR DPLNSALNYG YGILYSQIER CLVLAGLDPY GGFMHTDRPG KPSLVLDLIE EFRQTVVDRT ILGLVNRKMT IEQDETGRLS DHTREMIRER LFKRLEASEP YETKRVSLRV IMQSQARHLA TFVRGDRDTY TPFIASW
|
| |