Gene Information |
Locus tag | Hhal_0619 |
Symbol | |
ID | 4711443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 697188 |
End bp | 698150 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639855083 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001002206 |
Protein GI | 121997419 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
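The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 697188
end_bp = 698150
reported_gene_length = 963      # bp
reported_protein_length = 320   # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed lengths agree with the reported 963 bp and 320 aa.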
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0887113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGCACCC TCTACATCGA CCGGCGCCGG ACGCGCCTGG AGCTCGCGCA CAAGGCGCTC ACCATTCGGG AACCGGAGGC CCAGCCCCGC TCGGTGCCGC TGAGCCTCAT CGACCGACTG ATCGTCATTG GCCAGGTCGA GCTGAGCAGC GGCGTGCTCA CTACTCTCGC CGAGAGCGGC GTCAGCCTGG TCTTCATGCC GAGCCGTGGA CAGCGGCGCA GCGCCTTCCT CCGCAGCGAG GGCCATGGCG ATGCCGTCCG CCGCCTCGGC CAGTACAGGC TCATCCACCT CGAGGCTGAG CGCCAGGCCT GGGCGCGCCG CCTCGTGCGA CTGCGTCTGG CCGGGCAGCA GCGGCTCCTC GCGAGTGCGC TATACCGGCG TCCTGATCAA CGCCAGCCGC TCACGGCTGC CCACCGCGAG ATCGAGGCGG CCCAGGCGAC CGTGCGCCGC GAGGCGCCCG CCGGTGAGCA ACTGCGGGGG CAGGAGGGTA CGGCCGCGGC GGCCTTCTTC CGCGGCTACG GCGCTCTCTT CGCCGAAGCG CTAGGCTTCT CCGGGCGAAA TCGCCGGCCA CCCCGGGATC CCGTCAACGC CGTCCTCTCG CTCGGCTACA CCCTCGCGCA CGGCGATGCA CTGCGGGCCG TCACCGCTGC CGGCCTCGAT CCGGCCATCG GCGTACTGCA CGAGCCTGCC TGGGGGCGAG ACTCCTTGGC CTGCGATCTC ACGGAGATCG CCCGGGCCCG GGTGGAGCGG CTGACCTGGG AGCTATTCGC GAGCGAGACG CTCCAGCGCA CGGACTTCAC CAACAGCACC GAGGGCGTAC GACTAGGCAA GGCTGCACGG CAGACCTTCT TCGGCTGCTG GGAACGCCAT GCCGGGCTCC ATCGACGCTG GCAGCGCCGC GCCGCTCAGG CCCTAGCCGC CGAGTGCGCC CACCACGGCG CCCAAACTAT TCCCGAGGCG TAG
|
Protein sequence | MGTLYIDRRR TRLELAHKAL TIREPEAQPR SVPLSLIDRL IVIGQVELSS GVLTTLAESG VSLVFMPSRG QRRSAFLRSE GHGDAVRRLG QYRLIHLEAE RQAWARRLVR LRLAGQQRLL ASALYRRPDQ RQPLTAAHRE IEAAQATVRR EAPAGEQLRG QEGTAAAAFF RGYGALFAEA LGFSGRNRRP PRDPVNAVLS LGYTLAHGDA LRAVTAAGLD PAIGVLHEPA WGRDSLACDL TEIARARVER LTWELFASET LQRTDFTNST EGVRLGKAAR QTFFGCWERH AGLHRRWQRR AAQALAAECA HHGAQTIPEA
|
| |
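The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which GTG is an accepted alternative start codon that is read as methionine at the initiation position. The sketch below illustrates this on short fragments of the sequences above. The function name `translate_cds` and the helper constants are hypothetical, and the code is a simplified illustration rather than the tool used to produce this record.

```python
# Codon table in the compact TCAG ordering; the sense-codon and stop-codon
# assignments of table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons listed by NCBI for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a coding sequence given in coding orientation (hypothetical helper)."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":       # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons and last four codons of the gene sequence above.
n_terminus = translate_cds("GTGGGCACCC TCTACATCGA C")
c_terminus = translate_cds("CCCGAGGCGTAG")
print(n_terminus, c_terminus)
```

The two fragments translate to `MGTLYID` and `PEA`, which match the first seven and last three residues of the listed protein sequence.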