Gene Information |
Locus tag | TRQ2_1009 |
Symbol | |
ID | 6092439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1053866 |
End bp | 1056019 |
Gene Length | 2154 bp |
Protein Length | 717 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 642488205 |
Product | CRISPR-associated Csm1 family protein |
Protein accession | YP_001739042 |
Protein GI | 170288804 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02578] CRISPR-associated protein, Csm1 family |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTGAAAGACA GAGAAGAGCT TGTTGTTGGG GCTCTCCTGC ACGACATAGG AAAGGTCGTA AGAAGGGCGG GGGATGACAG AAGGCATCAG ATTGCCGGAT ACGATTTTAC AAACAAAGTG AAGAAATTCG CTGTAATTCA GGACTACATT CACTACCATC ATGAAAAGGA TCTTCTGAAA AAAAGTTTGG AAAACGAAAA GGTTTGGTAC GTGTGCTTCG CAGACAACCT CTCGAGCAAA GAAAGAATGA CTGAAGGTCA GAAGTTTGAA GAACTTCGAA GAATGGACAA CCTCCTCTCA AAGATCCCGG AGGGGGAATC TTCCAGAAAC GTCACTTACT TTCCTGCGAA ACCAGCCAAC GAAGTGGTGG AAGCGGTAAA GGACATGAAA GAAGATCAGA AAACCTACGA GGATCTCTAC AGGAGATTTG TCGAAGACGC TCAGAAGATT TCTCCCACAC CAGATGATGT GAATTTTCTC ACTTACAAGT ACTTCTCGTT CATTCCTCAG GAAACAAGAG TGGAAGGAGA CATGGATATA TCGCTCTACG ACCATCTGAA GGTCACGGCT ATGCTTGCTC TCTCGCTCTA TGACTACGCA AAGGAAAACG ACCTCAAATT TGAATCCTAC CAGGAGATGA AATCACACTT TGAGAATTCC AACGTGAAAC CCTTCCTACT TGTTGGGGGA GATGTCTCCG GGATACAGAA TTTTATTGCC AACGTCTCTT CAAAAGGAGC TCTCAGATCC TACAGAGGAA GGAGTTTCTT CATAGAAATT CTCCAGGAAG TCGTTGTGGA TGAGATTCTT GATAAAACAG GCTTTTACAG GACAAACGTT CACTTCATAG GAGGAGGACA TTTCTACCTT GTCCTTTCCA ACACAGAGAA GGTGAAGAAA GCCCTCGAAG AGATCAGAAA CGAGCTGAAC GAATGGTTCA GAAATAGGGG TCTTTCACTC CACCTTGTGA TCGAATCTGT TGAATTTTCG GTGAAAGACG TGGAAGACAT GTCCAAGGTT TTCAAGAAGA TCGGTGAAAA GTTGAACGAA AGAAAATACA GAATGTACAC AGAAAAAGAC CTCGAAGCGA TCTTTCCCGA TGATTTGAAT CTGATCCAGG AGAAGGGAAA CCACACCTGC AAAATCTGTG GAAACAGAGT AGACAGGCTC TTTTCCATTC GAGAAGGAGA AGAAGAAATC GCCTGTGACT TCTGCAAGGA AATGTACGAG CTTGGAAGAG AACTTCTTGA AGAGTCTCAC GTGTATCTTG CTGAAAGGAA GAATGGAAAG TTCGAGATTT TTAAGAGAAA ATTCGATTTC TCGAGAGAAC CGGGAGAGGG TTTCAGCTAC AAGTTGAGAA GGATATACGA ATTCTCGGAA AAAGAAAAGA ACGTCAGAAG AATACAGGTG GTGACGTATT TCAAAGAACA GGAGTTCGAG AAAATCGCAG AAAAAGCACC TGGCAAAAAA ATAGCGAGCC TCCTTGTTGA CGTTGACAAC CTTGGAAAGA TCTTTCTCAA AGGTTTAAAG AAAAAGACTC TTTCCAGATA CAGCACCCTC TCAAGGCTCA TGAGCTTTTT CTTCAAAGAA AGAGTAGAGA GTATTGTTGA AGGAAAGAAC GTTATGGTGA TTTATTCCGG CGGAGACGAT CTCTATCTGG TTGGCGGATG GAACGATGTT CTGGATGTGG CAAAAGAGTT GAGAGAGGCG TTTGGAAGAT TCACAACGAA CGACTTCATG ACGTTCTCCG CGGGATACGT GATCACCGAT GAAAAGACAA GCATGAGCCT AATAAGAGAA 
ATGTCTGAAA GAGCCGAAAG CGCTGCCAAG AAATCCGGGA AGAACAGCAT AGCATTTTCG AACAGAAACT ACTATGCGGT AAAGTGGAAC ACCTTCTTCG AAATGTACAA CTTTTATCAA GAGTTGAAGG AAATAGCAGA CAAAGTGGAC AGAAGTGTCA TTAGAAAGGC TTTGAATCTC ACAAGGGAAG AGTCTCCTCT GAACAAAGCC TTCCTCGCCT ACATAGAAGC AAGGGAGAAC AAAGACGAAG ACAAAAGAGT GGCAAATCTC ATGAGAGAGA ACATAGATCA CCTCGGTGAA AACGCCTTGA ATGTAATCCT CCAGTTTGTG GATCTTCTCT CAAGAAAAAG CTGA
|
Protein sequence | MKDREELVVG ALLHDIGKVV RRAGDDRRHQ IAGYDFTNKV KKFAVIQDYI HYHHEKDLLK KSLENEKVWY VCFADNLSSK ERMTEGQKFE ELRRMDNLLS KIPEGESSRN VTYFPAKPAN EVVEAVKDMK EDQKTYEDLY RRFVEDAQKI SPTPDDVNFL TYKYFSFIPQ ETRVEGDMDI SLYDHLKVTA MLALSLYDYA KENDLKFESY QEMKSHFENS NVKPFLLVGG DVSGIQNFIA NVSSKGALRS YRGRSFFIEI LQEVVVDEIL DKTGFYRTNV HFIGGGHFYL VLSNTEKVKK ALEEIRNELN EWFRNRGLSL HLVIESVEFS VKDVEDMSKV FKKIGEKLNE RKYRMYTEKD LEAIFPDDLN LIQEKGNHTC KICGNRVDRL FSIREGEEEI ACDFCKEMYE LGRELLEESH VYLAERKNGK FEIFKRKFDF SREPGEGFSY KLRRIYEFSE KEKNVRRIQV VTYFKEQEFE KIAEKAPGKK IASLLVDVDN LGKIFLKGLK KKTLSRYSTL SRLMSFFFKE RVESIVEGKN VMVIYSGGDD LYLVGGWNDV LDVAKELREA FGRFTTNDFM TFSAGYVITD EKTSMSLIRE MSERAESAAK KSGKNSIAFS NRNYYAVKWN TFFEMYNFYQ ELKEIADKVD RSVIRKALNL TREESPLNKA FLAYIEAREN KDEDKRVANL MRENIDHLGE NALNVILQFV DLLSRKS
|
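The fields in this record can be cross-checked against one another. The sketch below is illustrative only and is not part of the database entry: it assumes the start and end coordinates are 1-based and inclusive, and that the protein is a straight translation of the gene sequence under translation table 11, which uses the standard amino-acid assignments and permits TTG as an initiator codon read as methionine. The function name `translate_cds` and its `starts_at_initiator` parameter are hypothetical names chosen for this example.

```python
# Fields copied from the record above.
START_BP, END_BP = 1053866, 1056019
GENE_LENGTH_BP, PROTEIN_LENGTH_AA = 2154, 717

# Standard amino-acid assignments (shared by translation table 11),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate_cds(dna: str, starts_at_initiator: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace between 10-base groups.

    Assumption: when starts_at_initiator is True, the first codon is the
    start codon and is reported as M regardless of its usual meaning
    (this is how TTG at the start of this gene yields M).
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if protein and starts_at_initiator:
        protein[0] = "M"
    return "".join(protein).rstrip("*")  # omit the terminal stop codon


# Coordinate and length consistency (assumes inclusive 1-based coordinates).
print(END_BP - START_BP + 1 == GENE_LENGTH_BP)        # True
print(GENE_LENGTH_BP // 3 - 1 == PROTEIN_LENGTH_AA)   # True (one stop codon)

# First 30 bases of the gene sequence in the record.
print(translate_cds("TTGAAAGACA GAGAAGAGCT TGTTGTTGGG"))  # MKDREELVVG
```

Passing the full gene sequence from the record to `translate_cds` should, under the same assumptions, reproduce the listed protein sequence; the last 24 bases can be checked in isolation with `starts_at_initiator=False`.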