Gene Information |
Locus tag | Haur_2475 |
Symbol | |
ID | 5734356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3165630 |
End bp | 3167825 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279615 |
Product | CRISPR-associated Csm1 family protein |
Protein accession | YP_001545241 |
Protein GI | 159898994 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02578] CRISPR-associated protein, Csm1 family |
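The coordinate and length fields in this table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp rows of this record.
length = gene_length_bp(3165630, 3167825)
print(length)                     # 2196
print(protein_length_aa(length))  # 731
```

Under these assumptions both results agree with the Gene Length and Protein Length rows above.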
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATC CCCCAGTCAT ACGGGCATTG AAGAGTATGA CAAACTCGGT CAAGGCTTGG GATACTACGC CACTGCATTC GATCTTCAAC CAAATTGCTC CATCAGCCGA GCCAAACACC CAAACTTCAA CGCCCTACTT TCTCCAAGCG CAACGTTTTA GCCTCACCGA ATCTGCGCTT TTTCCAATCA CCGAACGTCC GCCAGTTTCA GCCGATTTAC AAACTCAATT CAATCAAGCG ATTGAGCAAT GCCATGCCGA GCCAAGCATT CATGTGGTCC AGATGCTGAG CTTGTTTCAC GATTATGCTT GGGCCGTCAG CTACCAGCCA ACTGCTGGCG AGACAGCCGA TCCGGTTGTA TCGCTCTACG ATTATGCTCG CACCAAAGCC GCGTTGGCCG CCGCTGGTGA GCAAACCATG TTAGTTGGTG GCGATCTTTC GGGCGTGCAG GATTTTATCT ACACCATCGC GGCCAATCGT GCCGCCAAAA GCCTGCGCGG TCGCTCATTC TATTTGCAAA TGCTGACCGA TGCCTTGGCT GGTTGGGTGT TGCAGCAAGC AGGCATGCCC AGCACTAATT TGCTCTATAG CGGCGGTGGG CGATTTTATG TGATTGTGCC CGCCGCTTGT TACGAACAAT TGGCGCAGTG GCGACGGGCG CTTGGCCAAT TTTTGCTGAA TGTCCACGAT GGCGAGTTGT ATATCGCCTT GGGTGGCGCG ACCATCGCCG ATGCTGCGCA CAATTTTGAG GCCTTATTTC GCGCCGTCAA CGACCAAGTG ACCGCTGATA AACGTCGCCG TTTTGCAACC CTCGATAATC AAGAGCTGCA AGCCAAATTG TTCACGCCAC GCGGCCATCG CGGCAACGAG GATGACCGCT GCGCAACCTG TGGTTATATG GGGCCGAGCA ATCAATTTGA AGCCGATGAA GATGATAGCA AGATCTGCCG GCTCTGCGAA AGCACGATTA AGCTGGCATT CAGCTTGCAC GATGCCCAAT TTCTTTGCAT CAGCCAGCCA CAGCCCAATC TCTCTGCGGT CAAAGGCCGC ACAAACGCAA AGGATATTTT GGCGGGCTTG GGTTTGCAGG CAGAAATTTG TTTCGATCAG CAGGAATTGA TCGATCATCT CAATCGCAAT TCAGCACAAT CAATCCATGT GCAGATTATT CGGCCCTTTG CGGGCGATTT AGCGATGCTG AGCCAACTTC GTGCCAATTA TCGCCAGCAT GTCTTTAGCA TACGCCCAAT CGTCAATGTA ACACCTAAAG CTCCTAACGG TGAGGTCAAA TCGTTTGATC AGTTGGCCAA AGCGAGTCGT GGCATCAAGC GTTTTGGGGT GCTGCGCATG GATGTTGATG ATCTTGGCGA TATTTTTGGC TATAGCCTTG CCAAAGCCTC GTTGGCCCGT ATTTCAAGCC TGAGTGCCGC CTTTTCGCGC TTTTTCGAGG GCTGGGTTGG CGAAATTTGT CGTGACCAGA ATATTGCCGC CGCAATCTAC CAGCCGGAGC AAGCGAGCAA TCAAGCAATC AGCCCTGAAC AAATTTACAG CGTCTATTCG GGCGGCGACG ATTTGTTTCT GGTTGGCAGT TGGGATGTGC TGGCCCATGT CGCCAATCGA ATTCAACACG ATCTTCAGCG CTACACGGGC TATAACCAAT TAATCCATGT TTCGGCGGGA TTGACCCTGC ACACCGAAAA ATTCCCGTTA TATCAAGCGG CCAAACTAGC CCATCATGCG CTTGATCAGG CCAAAGATGC TGCGCCACGC CAAGCAATTC GCAAAAACGC CCTAGATTTC 
CTTGATCAAA CGATTGCCTG GGAAGCCTAC CCCGCCTTGA TCAACTGGCA TCAACGTTTG TGGCGGCTCT ATCAAGGCGA GCATGGCATG GTGCGTTCGC TGCTGCAAGT GCTGATGGAG CTGTATAGCC AATATAACGA GCATAGCCAG CAACGTCAAA AATCGGGCAA AAAACACACC GCCTATGGCC CATGGATTTG GCGTGGCAAA TATCAACTGG CCCGCATTCG CCAACGCTAT GAGAACCATC AAGAATTACA AAAACTCTTG CGAGATATCG ATGAAGATTT ATTTACGGGG TTCGATGATC CGCATCGGAT CAGTTTGCGC ACAATCGAGC AGCTTGGCTT AGCCGCTCGT TGGACACAAT TATTAATTCG TGAGCAAGGA GATTAA
|
Protein sequence | MSDPPVIRAL KSMTNSVKAW DTTPLHSIFN QIAPSAEPNT QTSTPYFLQA QRFSLTESAL FPITERPPVS ADLQTQFNQA IEQCHAEPSI HVVQMLSLFH DYAWAVSYQP TAGETADPVV SLYDYARTKA ALAAAGEQTM LVGGDLSGVQ DFIYTIAANR AAKSLRGRSF YLQMLTDALA GWVLQQAGMP STNLLYSGGG RFYVIVPAAC YEQLAQWRRA LGQFLLNVHD GELYIALGGA TIADAAHNFE ALFRAVNDQV TADKRRRFAT LDNQELQAKL FTPRGHRGNE DDRCATCGYM GPSNQFEADE DDSKICRLCE STIKLAFSLH DAQFLCISQP QPNLSAVKGR TNAKDILAGL GLQAEICFDQ QELIDHLNRN SAQSIHVQII RPFAGDLAML SQLRANYRQH VFSIRPIVNV TPKAPNGEVK SFDQLAKASR GIKRFGVLRM DVDDLGDIFG YSLAKASLAR ISSLSAAFSR FFEGWVGEIC RDQNIAAAIY QPEQASNQAI SPEQIYSVYS GGDDLFLVGS WDVLAHVANR IQHDLQRYTG YNQLIHVSAG LTLHTEKFPL YQAAKLAHHA LDQAKDAAPR QAIRKNALDF LDQTIAWEAY PALINWHQRL WRLYQGEHGM VRSLLQVLME LYSQYNEHSQ QRQKSGKKHT AYGPWIWRGK YQLARIRQRY ENHQELQKLL RDIDEDLFTG FDDPHRISLR TIEQLGLAAR WTQLLIREQG D
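The sequences above are displayed in space-separated blocks of ten characters, so they usually need to be normalized before any computation. The following is a minimal sketch of that step plus a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the demo input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a sequence shown in blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence, used only as a short demo input.
fragment = clean_sequence("ATGAGTGATC CCCCAGTCAT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.5
```

Applying the same two functions to the complete gene sequence would give a value that can be compared against the GC content row. The fragment result shown here says nothing about the full-length figure.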