Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2386 |
Symbol | |
ID | 5734267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3039256 |
End bp | 3040257 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279527 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001545154 |
Protein GI | 159898907 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAACTC TCTATTTAAA CGAGCAAGGC ACCCGTTTGG GCAAAAAAGA TGAGCGCTTG ATCATCCTAC GTGGTCAAGA GTTGATCAAC GATATTCCAG TAATCAAAGT TGATCGAGTC ATTGTGATGG GTCAAGGGGT GCAGGTATCG CATGCCGCAA TTGTATTTCT GGCCCAACGA GGAATTCCCT TAATTTTTAC GACCCAATCG GGTGGCTCAC AGAAGGCCAT GGTTTCGGCG GGCTTGGGTA ATAATGCGGC TTTGCGCCTC GCCCAATGCC GCATTGTCGA TAACCCCCAT TTGGCGGTTC CCTTGGTGCA GGCGATTGTA GTTGGCAAAG TTGCCAATCA AATTCAACTG TTGGAGCGTT ATGGCAGCGA TTGGGGTGGG ATGGGGCTAC GTGCCAAACA AACCATGCAG CATGTCATTC AACAAACGCA ACACATGCCC GATATCGAGC AATTGCGTGG CCTCGAAGGA GCTGGGGCAG CGGCCTATTG GGGCACATGG AGTGCTGTTT TCAAAACTGC CTGGGGTTTT GCGGGCCGCG CTTATCGCCC AACCCCCGAC CCATTGAATG CCTTGTTGAG TTTTGGCTAC ACACTGCTAC TCAACGATTT GATGACTGCC GTGCAAGCCC TCAGCTTTGA TCCCTATCTC GGCGTGTTTC ATACTGTGCA GTTTGGGCGA CCCTCGTTGG CGCTCGATCT CGAGGAGGAA TTTCGGCCAT GCATCGTTGA TCGTATGGTG TTGGATGTGC TTGATGCTGG TTTATTGCAA ATGAGCAATT TCAGCCGCAC TGAAAAAGGC TTTTTGCTCA ACGATCGGGC GCGTAAAAGC TTTATTCAAG CCTATGAGCA ACGCATGCAA ACCCCGATTC GCTATCAAGG CACTGGTAAC AACGAGCCAA TGCGACGGGT ACTTTTACTA CAAACTCAGC ATTTAGCGCG GGTTCTGCAA GGCGAAGAGC CGCGCTATCA GCCCTATGTT TGGCGTGATT GA
|
Protein sequence | MPTLYLNEQG TRLGKKDERL IILRGQELIN DIPVIKVDRV IVMGQGVQVS HAAIVFLAQR GIPLIFTTQS GGSQKAMVSA GLGNNAALRL AQCRIVDNPH LAVPLVQAIV VGKVANQIQL LERYGSDWGG MGLRAKQTMQ HVIQQTQHMP DIEQLRGLEG AGAAAYWGTW SAVFKTAWGF AGRAYRPTPD PLNALLSFGY TLLLNDLMTA VQALSFDPYL GVFHTVQFGR PSLALDLEEE FRPCIVDRMV LDVLDAGLLQ MSNFSRTEKG FLLNDRARKS FIQAYEQRMQ TPIRYQGTGN NEPMRRVLLL QTQHLARVLQ GEEPRYQPYV WRD
|
| |