Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_3332 |
Symbol | |
ID | 7402188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012030 |
Strand | - |
Start bp | 82881 |
End bp | 85034 |
Gene Length | 2154 bp |
Protein Length | 717 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643709884 |
Product | CRISPR-associated protein, Csh1 family |
Protein accession | YP_002567450 |
Protein GI | 222481214 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02591] CRISPR-associated protein, Csh1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGCC CGACACGGGA AGACTTCGCA GCAGCCATCG ATGACTACTG GCATGGTCGA CCACCGACCA GCCTCGAAGA CGTGATGTCA CTGTACGGTG TACTTGCAGT TGCCGAATCC GGAGGCTCGC TGTACGGTAC GCCAAGTACG CTTGACCCCT TTATCGACGA CGGGCGTTTG GTCGTTATTG ATCTCGATTT GACTGGCGAC GAGCCAACAT ACGAACAAGG AGGCGCAGGT ATCAGCGTCG ATACCCTCCG CACCGATGAT ATAGCAAAGC TCGCGTACTC ACACAAGTCC TCCGGCCGGG GAGCGAAGTA CAGTCTCACG CAGATCGGAT CGAAAAACGG AAACGACGCC TCGGGTGTTG CTGGGACGAT CCTGAGGCGT GTTCGTTCAT GGACTAATCA AGATAGCGTC CAGAGTATTA CTGGTGACAA CGGACACCCT GATGCGTGGG TTATCGAAAC GTTAGACAAG GTATTCACGA AAGACGGCGA GACACTCGAA CGAATTCAAT CAGATATAGA GTCACTACTA CCCACAGACG ATTCAGTCCC GACTGTACTG ACTGTCCGCT GTCGGATCAA CGACACAGAT CTCAGCACTA CTGACAATGA TGACACAGAC TGGTTTTGGC CCGCCGATCT CGATGTACTC AACCGAGCAA TGCGACGCTA CGCGACGGCC AACGCCACTG ACAAGAATAT TAGTTCTGGC CCTGCTTCCG AAGGCGAGGC AGTCGGTGTT GTGACCGATC AGGTGGGTCG TGTAGTCGGG ACACCTGAAA GCCCGCTGGA AGTTTTTTCA GTCAAACATC CGGATGCTCA GCCCGGCCTC CGACGTGACC AATCGTGGCG AAACTATCCG GTGAGTGCCG ACGTTGCGAT GCTGTTCAGT AAAGGTCAGG ACCTTATCGA GCGCTGTGTG CTTCGCCGAG GCGGGGTTGA GACGTACACA CTCCCGTACT TCGCTGGCGA ACTCACCGAA TTAAAAGCTG AAACACTATA TCGGGCGATC CATTCACTCG ATTCAGAAAG CGAGTACACG GACAGTTCAC ATGCCCCGAT GGCGAGAGTT ACCTACGAGA TCCAAGAAAA CGACGATCCA GCAGTTCGGG CACTGAAAGA CGAAATCCGA TTCTATGTCG TCTCATTACC AATCAGTGAC GACAAAAACA TCATCGCCGA AGAACCCTCG GCGGGTATCT ATTGGGTCTC CGAACTCGCA AAGTCGCTTG TCGACACGGT CCAAGGGCCG ACACTCTCGC CCGAATCGGG AGGATTCAGA ACCTACGATA ACTGGGACCT CCTTGATTTT CCCGACGAGA CGGGTCAGGC TCGAAACGTC GCGTTCGGCA AAATCGTCGG CCACACATTC ACCGACGCCG TGTTTGCCTA CCGTGACGAG GAGGGGGACG ACTTCCGACG AATCGTCGAC CACCGACTCA TCGCCGGAGT ACCACTTGAC GCCTCGATGC TCTTTTCAGA GTACATGCAG CGCTGGGGTG ACGAGTTCGA TGGCGACGAC CCGGTTCCAC AACAGGTCAT CGCCCAACAG CTCGTGCATC TCGAATCGCT GTCGCGTGCC GGACTCCTAA CAGGACTCGA CGCGGCGGTC GAACCAACGG ACACAGAGAC TATGACAGCA ATCGAAGACA CAGACACTGA CCTTTCGAAC CTCGCGGCGA TCCGTAAGTA CCGACTCGAA TCGTTCCTCG ACCGACCCAT GTTCACCGAG AACGCCGAGC GAAAGGCAAC CGCCCTCGCT GGCGTTCTCG TCGGACAAAT AAGTTGGCAT CAAGCAGACG AGCGCGATCT CGGTCGACCG CTCGACTCGA AAACGCGTGG CGATCAGCTC ACGAAAAACG GGCTCGAACA AGCGGTGAAA ACAGCCTTAG AAAAGGCAAA AGTGTACGCA CACGACTCAA AACAGTACGC TGACCGAGAT ATTCTGTTCC CTGAAACAGT CGACCTGCTT CTTGAAACGA CGGAGCAGAT GCCGACCAAA TGGGAGATCG ACAAACGTGA TCTCCAGTTT GCCTACGTGC TCGGACACGC CCACGGTCGA CGGTCGATGC CGAACGCGTT CGATCTGTAC AAAAAAGACG ACGCAAGCGA GGAGAGCGAA GCAGCCGAAT CGCCCGCAAA CTGA
|
Protein sequence | MSGPTREDFA AAIDDYWHGR PPTSLEDVMS LYGVLAVAES GGSLYGTPST LDPFIDDGRL VVIDLDLTGD EPTYEQGGAG ISVDTLRTDD IAKLAYSHKS SGRGAKYSLT QIGSKNGNDA SGVAGTILRR VRSWTNQDSV QSITGDNGHP DAWVIETLDK VFTKDGETLE RIQSDIESLL PTDDSVPTVL TVRCRINDTD LSTTDNDDTD WFWPADLDVL NRAMRRYATA NATDKNISSG PASEGEAVGV VTDQVGRVVG TPESPLEVFS VKHPDAQPGL RRDQSWRNYP VSADVAMLFS KGQDLIERCV LRRGGVETYT LPYFAGELTE LKAETLYRAI HSLDSESEYT DSSHAPMARV TYEIQENDDP AVRALKDEIR FYVVSLPISD DKNIIAEEPS AGIYWVSELA KSLVDTVQGP TLSPESGGFR TYDNWDLLDF PDETGQARNV AFGKIVGHTF TDAVFAYRDE EGDDFRRIVD HRLIAGVPLD ASMLFSEYMQ RWGDEFDGDD PVPQQVIAQQ LVHLESLSRA GLLTGLDAAV EPTDTETMTA IEDTDTDLSN LAAIRKYRLE SFLDRPMFTE NAERKATALA GVLVGQISWH QADERDLGRP LDSKTRGDQL TKNGLEQAVK TALEKAKVYA HDSKQYADRD ILFPETVDLL LETTEQMPTK WEIDKRDLQF AYVLGHAHGR RSMPNAFDLY KKDDASEESE AAESPAN
|
| |