Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_3578 |
Symbol | |
ID | 7402493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012030 |
Strand | + |
Start bp | 332259 |
End bp | 333254 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643710116 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002567682 |
Protein GI | 222481446 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCCA CCGAAGGGAT GTTCGACGAG TCAGTGGTCT ACGTCACCAA GCAAGGAAGC CAGGTTGGCA CCGAGGGCGG TCGAATCACC GTCTGGGATG TCGACGGCGA CGAGGGTGAG TTAGCCTCGT TCCCGACCGA GAAGCTCGAT ACGATCAACG TCTTCGGCGG GGTGAACTTC TCGACACCGT TCGTCGCCGA GGCCAACCGT CACGGGATCA TTCTGAACTA CTTCACCCAG AATGGAAAGT ATCGGGGGAG CTTCGTACCT GAAAAGAACA CCATCGCGGA GGTCCGGCGA GCCCAGTATG ACCTCGACGA GACTGCGGAG ATCGACATCG CGGCAGATAT GATCGCCGCC AAGATCCGAA ACGCTCGGAC GCTGCTCTCG CGGAAGGGCG TCCACGGGAC GGAGCTGCTC AAGGATCTCG GTGTGCGGGC GACGACAGTA GCTACGAAGG ACGGCCTCCG TGGTGTTGAA GGAGAAGCCG CCGAGCGCTA CTTCAACCGT CTCGATGAGA CACTCACCGA TGGCTGGACC TTCGAGAAGC GGACCAAGCG ACCGCCAGAG GACCACATTA ACTCATTGCT ATCACTGACC TACGTGTTTA TGAAAAACGA AGTGCTGAGC GCGTTACGGC AGTACAATCT TGATCCATTC TTGGGTGTGC TACATGCGGA TCGGCATGGC CGACCCTCGC TGGCACTCGA TCTCCAGGAG GAGTTCAGAC CGATCTTCTG TGATGCGTTC GTGACACGGT TGGTTAATCG CGGTGTCATC ACCCACGATG AGTTCACTCA GGACAATCAT TTGGCCGACG ATGCATTTCA GACCTACTGC TCAAAGTTCG ACGAGTTCAT GCAAGAGGAG TTCACCCATC CGTACTTTGA GTACACTGTG ACTCGGCGTA AGGCAGTGCG ACAGCAGGCG ATTCTCTTAC GGAAAGCAAT CACTGGCGAG TTGGATGAAT ATCATGCGCT AACTTTTTCA AAATGA
|
Protein sequence | MKATEGMFDE SVVYVTKQGS QVGTEGGRIT VWDVDGDEGE LASFPTEKLD TINVFGGVNF STPFVAEANR HGIILNYFTQ NGKYRGSFVP EKNTIAEVRR AQYDLDETAE IDIAADMIAA KIRNARTLLS RKGVHGTELL KDLGVRATTV ATKDGLRGVE GEAAERYFNR LDETLTDGWT FEKRTKRPPE DHINSLLSLT YVFMKNEVLS ALRQYNLDPF LGVLHADRHG RPSLALDLQE EFRPIFCDAF VTRLVNRGVI THDEFTQDNH LADDAFQTYC SKFDEFMQEE FTHPYFEYTV TRRKAVRQQA ILLRKAITGE LDEYHALTFS K
|
| |