Gene Information |
Locus tag | Hhal_0628 |
Symbol | |
ID | 4709442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 706750 |
End bp | 707901 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639855092 |
Product | CRISPR-associated Cmr3 family protein |
Protein accession | YP_001002215 |
Protein GI | 121997428 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1769] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR01888] CRISPR-associated protein, Cmr3 family |
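
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it matches the listed Gene Length) and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(706750, 707901)
print(length, protein_length(length))  # prints: 1152 383
```

Both values agree with the Gene Length and Protein Length fields in this record.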
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.75883 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGAGT ACCGCTACAT CGAGCCCCAG GACGTCCTCT TCTTCCGCGG CAACCGCCTC TTTGGCGAGC CCGGCAACGC CGGCGCCGCC CTCATGCCGC CCTGGCCCTC GGTCTTCGCC GGGGCCCTGC GCAGCGCCAT GCTCAGCGCC GCCGGCGCGG ACCCGGCGCA GCTGCGCAGC GGCGAGCTGC CTGCCCCGCT CGACACGGTG CTCGGCACCC CCGAGGAGCC GGGGACCTTC ACCCTCACCG GCGTGACGCT GGCTCGCCGG CAGTCGTCCG GCACCGCCGA GCCCCTCCAC CCCCTGCCGG CGGACCTGAG CGTCGAGCGT GATGAGGCCA CAGGCGAGTG CGCGGTCCAT CGCCTGACGC CGCAGCCACT GCCCGCCGGC GTCGCCAGCA GCCAACCCCT GGAGCGCCTG CCGGTGCTCC GGCGCAGCGA CCGCGGTAAG CCGGCCGCCG GCTACTGGCT CACCCATAGC GGCTGGCAGC GCTACTGCCA GGGCGAGCCC CCACCGGCTG AGGCCCTCGT TCACCGCTCG GCGATCTGGA GCAGCGACCC GCGCCTGGGC ATTGCCCTGA AGCCCGAGCA GCGCACGGCG GCCGAGAGCC AGCTCTACAC CACCGAGGGC ATCCGGCTGT GCGAGGGATA CGGCTTTCTC GCCGCCATCG CCGGCTCCGA CCCGACCAGC CTGCCTAAGC AGGCAACCCT GCGCCTCGGC GGCGACGGCC GCGGCGCGTT CATGAGCGCC GTCGCAGCCC CGGTCGAAAC CTCGCCGGAG CCGGCCGCCA TCGAGGCAGA GGGCCGCCTA CGGATCGTAC TCACCACGCC CGGCATCTTC CCGGGGGGGT GGCAACTCCC CGGGCTCGAC GCCAATGGCC TCTGGCACTA TCCCGGCGGC CGGGCGCGGC TGGTCGCCGC TGCTGTGCCC CGCGGCCAGG TCATCAGCGG CTGGGATCTG GCTCACCACC GGCCCAAACC CGCCCAACGT GCGGCCCCAG CCGGGACCGT CTACTGGCTC GAGGCGCTTG AGGGCGGGCT CAAGCCGCTT CGCAAGCTTG CCGAATCCGG TCTCTGGGGC ATGACCCCGG AGAATGAAGA CCCAGCACGG CGAGCCGAGG GCTTCAACCG CTTCGCCATC GCCAACGCCT GA
|
Protein sequence | MAEYRYIEPQ DVLFFRGNRL FGEPGNAGAA LMPPWPSVFA GALRSAMLSA AGADPAQLRS GELPAPLDTV LGTPEEPGTF TLTGVTLARR QSSGTAEPLH PLPADLSVER DEATGECAVH RLTPQPLPAG VASSQPLERL PVLRRSDRGK PAAGYWLTHS GWQRYCQGEP PPAEALVHRS AIWSSDPRLG IALKPEQRTA AESQLYTTEG IRLCEGYGFL AAIAGSDPTS LPKQATLRLG GDGRGAFMSA VAAPVETSPE PAAIEAEGRL RIVLTTPGIF PGGWQLPGLD ANGLWHYPGG RARLVAAAVP RGQVISGWDL AHHRPKPAQR AAPAGTVYWL EALEGGLKPL RKLAESGLWG MTPENEDPAR RAEGFNRFAI ANA
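
The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be stripped before any computation. As a minimal sketch, the GC content field could be recomputed from the gene sequence as shown below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used only as a small example.
example = "ATGGCCGAGT ACCGCTACAT"
print(len(clean_sequence(example)), gc_percent(example))
```

Passing the full Gene sequence value to `gc_percent` gives a figure that can be compared with the GC content field after rounding.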
|
| |