Gene Haur_2475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2475 
Symbol 
ID5734356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3165630 
End bp3167825 
Gene Length2196 bp 
Protein Length731 aa 
Translation table11 
GC content51% 
IMG OID641279615 
ProductCRISPR-associated Csm1 family protein 
Protein accessionYP_001545241 
Protein GI159898994 
COG category[R] General function prediction only 
COG ID[COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) 
TIGRFAM ID[TIGR02578] CRISPR-associated protein, Csm1 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATC CCCCAGTCAT ACGGGCATTG AAGAGTATGA CAAACTCGGT CAAGGCTTGG 
GATACTACGC CACTGCATTC GATCTTCAAC CAAATTGCTC CATCAGCCGA GCCAAACACC
CAAACTTCAA CGCCCTACTT TCTCCAAGCG CAACGTTTTA GCCTCACCGA ATCTGCGCTT
TTTCCAATCA CCGAACGTCC GCCAGTTTCA GCCGATTTAC AAACTCAATT CAATCAAGCG
ATTGAGCAAT GCCATGCCGA GCCAAGCATT CATGTGGTCC AGATGCTGAG CTTGTTTCAC
GATTATGCTT GGGCCGTCAG CTACCAGCCA ACTGCTGGCG AGACAGCCGA TCCGGTTGTA
TCGCTCTACG ATTATGCTCG CACCAAAGCC GCGTTGGCCG CCGCTGGTGA GCAAACCATG
TTAGTTGGTG GCGATCTTTC GGGCGTGCAG GATTTTATCT ACACCATCGC GGCCAATCGT
GCCGCCAAAA GCCTGCGCGG TCGCTCATTC TATTTGCAAA TGCTGACCGA TGCCTTGGCT
GGTTGGGTGT TGCAGCAAGC AGGCATGCCC AGCACTAATT TGCTCTATAG CGGCGGTGGG
CGATTTTATG TGATTGTGCC CGCCGCTTGT TACGAACAAT TGGCGCAGTG GCGACGGGCG
CTTGGCCAAT TTTTGCTGAA TGTCCACGAT GGCGAGTTGT ATATCGCCTT GGGTGGCGCG
ACCATCGCCG ATGCTGCGCA CAATTTTGAG GCCTTATTTC GCGCCGTCAA CGACCAAGTG
ACCGCTGATA AACGTCGCCG TTTTGCAACC CTCGATAATC AAGAGCTGCA AGCCAAATTG
TTCACGCCAC GCGGCCATCG CGGCAACGAG GATGACCGCT GCGCAACCTG TGGTTATATG
GGGCCGAGCA ATCAATTTGA AGCCGATGAA GATGATAGCA AGATCTGCCG GCTCTGCGAA
AGCACGATTA AGCTGGCATT CAGCTTGCAC GATGCCCAAT TTCTTTGCAT CAGCCAGCCA
CAGCCCAATC TCTCTGCGGT CAAAGGCCGC ACAAACGCAA AGGATATTTT GGCGGGCTTG
GGTTTGCAGG CAGAAATTTG TTTCGATCAG CAGGAATTGA TCGATCATCT CAATCGCAAT
TCAGCACAAT CAATCCATGT GCAGATTATT CGGCCCTTTG CGGGCGATTT AGCGATGCTG
AGCCAACTTC GTGCCAATTA TCGCCAGCAT GTCTTTAGCA TACGCCCAAT CGTCAATGTA
ACACCTAAAG CTCCTAACGG TGAGGTCAAA TCGTTTGATC AGTTGGCCAA AGCGAGTCGT
GGCATCAAGC GTTTTGGGGT GCTGCGCATG GATGTTGATG ATCTTGGCGA TATTTTTGGC
TATAGCCTTG CCAAAGCCTC GTTGGCCCGT ATTTCAAGCC TGAGTGCCGC CTTTTCGCGC
TTTTTCGAGG GCTGGGTTGG CGAAATTTGT CGTGACCAGA ATATTGCCGC CGCAATCTAC
CAGCCGGAGC AAGCGAGCAA TCAAGCAATC AGCCCTGAAC AAATTTACAG CGTCTATTCG
GGCGGCGACG ATTTGTTTCT GGTTGGCAGT TGGGATGTGC TGGCCCATGT CGCCAATCGA
ATTCAACACG ATCTTCAGCG CTACACGGGC TATAACCAAT TAATCCATGT TTCGGCGGGA
TTGACCCTGC ACACCGAAAA ATTCCCGTTA TATCAAGCGG CCAAACTAGC CCATCATGCG
CTTGATCAGG CCAAAGATGC TGCGCCACGC CAAGCAATTC GCAAAAACGC CCTAGATTTC
CTTGATCAAA CGATTGCCTG GGAAGCCTAC CCCGCCTTGA TCAACTGGCA TCAACGTTTG
TGGCGGCTCT ATCAAGGCGA GCATGGCATG GTGCGTTCGC TGCTGCAAGT GCTGATGGAG
CTGTATAGCC AATATAACGA GCATAGCCAG CAACGTCAAA AATCGGGCAA AAAACACACC
GCCTATGGCC CATGGATTTG GCGTGGCAAA TATCAACTGG CCCGCATTCG CCAACGCTAT
GAGAACCATC AAGAATTACA AAAACTCTTG CGAGATATCG ATGAAGATTT ATTTACGGGG
TTCGATGATC CGCATCGGAT CAGTTTGCGC ACAATCGAGC AGCTTGGCTT AGCCGCTCGT
TGGACACAAT TATTAATTCG TGAGCAAGGA GATTAA
 
Protein sequence
MSDPPVIRAL KSMTNSVKAW DTTPLHSIFN QIAPSAEPNT QTSTPYFLQA QRFSLTESAL 
FPITERPPVS ADLQTQFNQA IEQCHAEPSI HVVQMLSLFH DYAWAVSYQP TAGETADPVV
SLYDYARTKA ALAAAGEQTM LVGGDLSGVQ DFIYTIAANR AAKSLRGRSF YLQMLTDALA
GWVLQQAGMP STNLLYSGGG RFYVIVPAAC YEQLAQWRRA LGQFLLNVHD GELYIALGGA
TIADAAHNFE ALFRAVNDQV TADKRRRFAT LDNQELQAKL FTPRGHRGNE DDRCATCGYM
GPSNQFEADE DDSKICRLCE STIKLAFSLH DAQFLCISQP QPNLSAVKGR TNAKDILAGL
GLQAEICFDQ QELIDHLNRN SAQSIHVQII RPFAGDLAML SQLRANYRQH VFSIRPIVNV
TPKAPNGEVK SFDQLAKASR GIKRFGVLRM DVDDLGDIFG YSLAKASLAR ISSLSAAFSR
FFEGWVGEIC RDQNIAAAIY QPEQASNQAI SPEQIYSVYS GGDDLFLVGS WDVLAHVANR
IQHDLQRYTG YNQLIHVSAG LTLHTEKFPL YQAAKLAHHA LDQAKDAAPR QAIRKNALDF
LDQTIAWEAY PALINWHQRL WRLYQGEHGM VRSLLQVLME LYSQYNEHSQ QRQKSGKKHT
AYGPWIWRGK YQLARIRQRY ENHQELQKLL RDIDEDLFTG FDDPHRISLR TIEQLGLAAR
WTQLLIREQG D