Gene Information |
Locus tag | Haur_0700 |
Symbol | |
ID | 5732601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 804133 |
End bp | 805983 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641277830 |
Product | CRISPR-associated RAMP Crm2 family protein |
Protein accession | YP_001543476 |
Protein GI | 159897229 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02577] CRISPR-associated protein, Crm2 family |
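The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical, chosen for illustration.

```python
# Values copied from the record above.
START_BP = 804133
END_BP = 805983
GENE_LENGTH_BP = 1851
PROTEIN_LENGTH_AA = 616


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 804133 to 805983 covers 1851 bp, and 1851 bp corresponds to 617 codons, that is 616 amino acids plus one stop codon, which matches the listed gene and protein lengths.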
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00738894 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCTCATT TATTACTAAT CGCGATTGGC CCAGTTCAAG ATTTCATCGC CAGTGCTCGT CGCACCCGCG ACCTCTGGTT TGGTTCGTGG CTACTCAGCG AATTAGCCAA AGCAGCGGCA AAAAGCATTT ATCAAGCTAA AGGCAATTTG ATCTTTCCTT ATCCAACCAA TCCAGCCCAA GATTTAGCGC CAGAAAGTGC CTTCAATGTG CCTAACAAAA TTTTGGCCAA GGTAGAGGGC GATATTAAAA AGATTGCTGA CACAGCTAAA CAAGCCTTGG AGCAACGTTT AAATCAGATT CGTGATGGCG CATTTGATCA AGTTAAAGGA ACGTTCTACG ATGAACAAGC TGCTGATCAA ATCGCCGCAT TGGTTGAATT CAATTGGGTT GCCCTGCCCT TAGGGACTGA TTATGCGAAA ACTCGCGAGC AGCTTGAGCA CTTAATGGCA GCCCGCAAAA ATACCCGTAA TTTTAGCGAG GTGACGTGGG GCACAAACGC GCCAAAATCC TCAATCGATG GCGAACGTGA ATCGGTAATT GATGAAAAAT ACTATCCCCC TGCTGGGATG GCGAATACCT CGCCAGAATA CGCCAAAATC GTGGCTAGGC TCTATCGTGA TTATGGCATC AACGTCGGTG AACGTTTATC AGGCGTTGAT ATGCTCAAAC GGCATGGTCA AAAAGGTATC GATTCAAGGT TTCCAAGTAC CTCACATATG GCGGCAATTC CGATGTTCAA CTTTTTAAAT AAATATGATG AGGCAAATGC CAATATCAGT CTTATTGATC CATGGAATAG ATACATTAAT TTTTTCAATA CAAATTCCAC TCTGATAAAA TATATTGACA ATGAAAGAAC CCCAAAATCA CAAAAGAATT TCGGTAATAT AGATGGATCA CTCTTATTTG CTGAGCGATT TAACGACACA TTAGAAGGTA ATGATCTATC CGATGCTCAA AACGCATTGG AAGCATTCTA TACCGCTTTA GGTGTTAAAA AACGTCCCAT TCCCTACTAT GCAATCCTCC ATGCTGATGG CGATGCGATG GGTAGTGTGA TTAATAACCA AGAGGTAATT GAAAAACATC AGGAAATATC AAAAGCCTTA GCAAAATTTG CAGATAATAA ACGTGACATC AATCCTTTGA ATGATTCTGA AGAGCGATCT GTTGCAACCA TCGTCGCAAA ACATCAAGGA GCCTTAGTCT ATTCGGGCGG CGATGATGTT TTAGCATTTT TACCGTTGCA TACTGCATTG GAATGTGCCG ATGCGTTGGC TAAAAGGTTT CAAACGCTTT TAAGTAACTT TAAAGATAAA ACAGATCGTT CGCCAACGCT CTCGGTTGGC ATTGCAATTG TCCATCACCT TGAGCCATTA TCCGATGCCT TAGATTTGGC ACGCGCTGCT GAGAAAAAAG CCAAAGGCTT TGTAGGCAAA AATGCGCTGG CAATCAACCT CAGCAAACGT GGCGGTGCAG ATCGAATTAT CGTTGGCTCA TGGATTGAAA GCAAATATTC ATTTTACAAT CGACTTAAAA AATTAATTGA ATGGCATCAA ACCGATCAAA TCCCCCTTGG TTTTGCCTTT GAACTGCACG ATTTATCTAT TCGCCTAAAA GACTTGCCCG CCAACATTTT TCACGCCGAA GCCAAACGAA TCATCGAGCG CAAGCATACA AGCAACGGAA TAAAGGTCTC GTCGGCGATT GCTACAGAAT TATTAGCTAT GATTAACGAT TTGGTTCCAG AAACGCTGAC CCAAGCCCAC ATAATGATCC AACAATTTGC AGATGAAGTA 
ATTATTGCAG AATTTATTGC CGACGCACAA AAGCTCGCCA ATGGTAATTA A
Protein sequence | MSHLLLIAIG PVQDFIASAR RTRDLWFGSW LLSELAKAAA KSIYQAKGNL IFPYPTNPAQ DLAPESAFNV PNKILAKVEG DIKKIADTAK QALEQRLNQI RDGAFDQVKG TFYDEQAADQ IAALVEFNWV ALPLGTDYAK TREQLEHLMA ARKNTRNFSE VTWGTNAPKS SIDGERESVI DEKYYPPAGM ANTSPEYAKI VARLYRDYGI NVGERLSGVD MLKRHGQKGI DSRFPSTSHM AAIPMFNFLN KYDEANANIS LIDPWNRYIN FFNTNSTLIK YIDNERTPKS QKNFGNIDGS LLFAERFNDT LEGNDLSDAQ NALEAFYTAL GVKKRPIPYY AILHADGDAM GSVINNQEVI EKHQEISKAL AKFADNKRDI NPLNDSEERS VATIVAKHQG ALVYSGGDDV LAFLPLHTAL ECADALAKRF QTLLSNFKDK TDRSPTLSVG IAIVHHLEPL SDALDLARAA EKKAKGFVGK NALAINLSKR GGADRIIVGS WIESKYSFYN RLKKLIEWHQ TDQIPLGFAF ELHDLSIRLK DLPANIFHAE AKRIIERKHT SNGIKVSSAI ATELLAMIND LVPETLTQAH IMIQQFADEV IIAEFIADAQ KLANGN