Gene Haur_0579 details

Gene Information

Locus tag: Haur_0579
Symbol:
ID: 5732300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 665931
End bp: 667817
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 44%
IMG OID: 641277706
Product: CRISPR-associated RAMP Crm2 family protein
Protein accession: YP_001543355
Protein GI: 159897108
COG category: [R] General function prediction only
COG ID: [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs)
TIGRFAM ID: [TIGR02577] CRISPR-associated protein, Crm2 family
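
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon, and the function name `check_cds_lengths` is hypothetical, not part of any database API.

```python
def check_cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1      # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
print(check_cds_lengths(665931, 667817))  # (1887, 628)
```

Both results agree with the Gene Length (1887 bp) and Protein Length (628 aa) fields.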


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.638626
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCGAT CGCTGTTGTT GATCGCCCTA GGGCCAGTTC AGGAATTTAT CGAGCAGGCT 
CGGCGCACCC GCGATGTCTG GTTTGGTTCG TGGGTGCTCA GTCATCTTGC CAAAAAGGTC
GCCCAAACCC TGCCACGCGA AGGCTTGATC TTTCCAACTG GCGATAGTTT GCCCTCGCCC
GAAGCCGAAA ATAGCGATCC AAGTACTGAA CGAATCAGCT ATAGTAACCA AATTTTGGTG
ATCGTGCCCG ATGCCCAAGC AGCCATTACC CACGCAAAGG CAGCCTTTGA TCAAGAAATT
GACCAATTCG TAAACATTCT GCTGAAAACT AAACAGATCA ATTGGCTCGA AGGCCTGTCG
GAAGAGCAGC GTGAACAGCA GCTTATTAAA CAATTAAAAG ATTTATTTGA GTTTCAGTAT
GTATATGTTG ATTTACCGGA TGACCAGCAC TATGCGCAAC AACGCAAAAA GCTTTACCAG
CTCTATGCAG CACGTAAAAA TACCCGCAAC TTTGGCTTAA TTGATTGGCA ACAAGACCCC
CAAAATATGA AGCGCTCGGA TAAATCGTCG CTCGATGGCT GGCACACGCT AATAGCCAAT
GGAGCAAAAT CGGGCGAAAA TCTCTCAGCC GTCGATTTAC TCAAGCGTAT GCTCAAACCA
CAAGATTTAG ACCAAGCCTA TCCAAAAGGT TTAGACGGAT TCCCAAGTAC TTCGCATATG
GCTGCGCTGG CATTTGCTGA ACATGTTAAT CAAAGCACTT TAGCTCGCCA AGGTTGGGAA
CGCTATATCA ATAATCTACG TAGCACAGGC TATCGCATCA CTGAATATTT ACCACGAGTT
CAGCGCTTTG ATCGAATTGC CCTCTTTCCA AGCGAAAATA CTAACCGCGA TTTTTTTGAT
GGTTCATTGG TTTTTGAGTC GCGCTTAGGG GAAGATTTCG CCAATACTGA TTTGAATACT
GTAATTATTA AGCAAGCACG CGGCTATTTG GAAGATTTTA TAACTGCTTA TGCCAAACAA
CGCCCAAATC CCTATTATGC GATTATTGCT GCCGATGGCG ATAAAATGGG CAATTTAATT
GATTCGATTA GCGATGTACA AATTCATCGC GATTTATCCA AGGTTGTTAC AACATTTGCT
AAAGATGCAG CTCAAATTCT AGAAAATGAT TACCAAGCAG CAGTCATTTA TGCTGGTGGC
GATGATGTAC TAGCGCTTCT TCCATTACAT ACACTTTTAG CAGCTACCAA AAAAATCAGC
GATCTCTTTA TCGAATATAT GCAACCACTA GCCGAGCAAC AGGCTGTTAG CACACCAACA
CTCTCGGTTG GGGTGGCAAT TGTGCATCAT CTTGAAGCCT TGCAGGATTC AATTCAATTG
GCGCGGAAGG CCGAGAAGCT AGCCAAAAAG CCGCGTAATG CCTTAGCAAT TATTTTGAGC
AAACGTAATG GCTCCGATAA AACCATTGTT GGTCAGTGGG ATACAGAATT TTATCAACAC
TTAACCAAAT TAGTTGAGCT GCATTGCCAA CAAACAATTC CTGATGGCTT TATTTTTGAG
CTTGATGAGC TACTCAAGCG CTTTGACTTT GGGCTTATCA ATGATCGAAC AGAACTTCAA
CAAACGCTCA AGATTATTGA ACATGAAACG CTGCGAATTT TGAAGCGCAA ACAAATTCAG
TCTGCAGGCG AAGCTACATT TTTGGCTGCT GATGTGATTA ATCTGCTCCA AAACATTATT
GGCTATTGTT CGAGTCATTC ATCAAACTCA GATCAAGCAG ACCAATATCT GCGCGATTTT
ATCGCCATGA ATTTAGTGGC ACGGGAGTTA GCCAATGTTC AGGCCTTATT TGAAAAGACT
CAGCCCCAAG GAGTACCTCA ATTATGA
 
Protein sequence
MNRSLLLIAL GPVQEFIEQA RRTRDVWFGS WVLSHLAKKV AQTLPREGLI FPTGDSLPSP 
EAENSDPSTE RISYSNQILV IVPDAQAAIT HAKAAFDQEI DQFVNILLKT KQINWLEGLS
EEQREQQLIK QLKDLFEFQY VYVDLPDDQH YAQQRKKLYQ LYAARKNTRN FGLIDWQQDP
QNMKRSDKSS LDGWHTLIAN GAKSGENLSA VDLLKRMLKP QDLDQAYPKG LDGFPSTSHM
AALAFAEHVN QSTLARQGWE RYINNLRSTG YRITEYLPRV QRFDRIALFP SENTNRDFFD
GSLVFESRLG EDFANTDLNT VIIKQARGYL EDFITAYAKQ RPNPYYAIIA ADGDKMGNLI
DSISDVQIHR DLSKVVTTFA KDAAQILEND YQAAVIYAGG DDVLALLPLH TLLAATKKIS
DLFIEYMQPL AEQQAVSTPT LSVGVAIVHH LEALQDSIQL ARKAEKLAKK PRNALAIILS
KRNGSDKTIV GQWDTEFYQH LTKLVELHCQ QTIPDGFIFE LDELLKRFDF GLINDRTELQ
QTLKIIEHET LRILKRKQIQ SAGEATFLAA DVINLLQNII GYCSSHSSNS DQADQYLRDF
IAMNLVAREL ANVQALFEKT QPQGVPQL
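
The sequences above are laid out in space-separated blocks of ten characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch, not the database's own tooling: it assumes plain A/C/G/T input, uses the codon-to-amino-acid assignments of the standard genetic code (which translation table 11 shares, differing only in permitted start codons), and all helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are those of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate frame 1 codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAATCGAT CGCTGTTGTT GATCGCCCTA GGGCCAGTTC AGGAATTTAT CGAGCAGGCT"
print(translate(first_line))  # MNRSLLLIALGPVQEFIEQA
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.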