Gene Information |
Locus tag | Teth514_2317 |
Symbol | |
ID | 5875848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | - |
Start bp | 2336540 |
End bp | 2337532 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 641542662 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001663914 |
Protein GI | 167040929 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
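The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2336540
END_BP = 2337532


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 993 330, matching Gene Length and Protein Length
```

Under these assumptions the coordinates, the 993 bp gene length, and the 330 aa protein length are mutually consistent.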
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CGATTTATAT TTTTTCAGAT GGGGAATTAA AAAGAAAAGA TAATACTCTA TTTTTTGAAG GAGAAAATGG AAGAAAATTT ATACCAGTAG AAAATACTTC TGAAATAATG GTTTTTGGAG AAGTAAGCCT TAACAAAAGA CTTCTTGAGT TTTTAACACA ATCAGAGATT ATACTTCATT TTTTCAATCA TTATGGATAT TATGTAGGGT CTTATTATCC AAGAGAACAT TTAAACTCAG GCTATATGAT ATTAAGACAA GCTGAGCACT ATAATGATGG AAGTAAAAGG CTTTATCTTG CTCAAAAATT TGTCGAAGGA GCTTATAAGA ATATAAGGCA AGTTTTGAAA TATTATTCAA ATAGAGGCAA AGATTTGGAA GATGTCATTT ATTCCATAGA AAAATTAGGG GAGAGCGTTG ATTCAACTTC TACAATAAAT GAATTAATGG CAATAGAAGG TAATATTAGG GAATATTATT ATAAGGCTTT TGATGAGATA ATTCAAAACC CAGATTTTAA GTTTGACTTT AGAAGCAAAA GACCACCTCA GAATTTTTTG AATACATTGA TAAGCTTTGG AAATTCTTTG ATGTATACGA CAACTTTAAG TGAAATATAC AAAACCCACT TAGACCCGAG AATAGGTTTT TTGCATGCGA CCAATTTTAG ACGTTTTTCT TTAAATCTTG ATGTTTCAGA GATTTTTAAG CCTATCATTG TAGATAGGAC TATATTTACC TTGCTTAGCA AAAAAATGGT TACTAAAGAA GACTTTGAAG AAGATGCAGA AGGATTATTA CTTAAAGAAA AAGGGAAAAA AGTATTTGTG CAGGAATTTG AAGATAAGCT TGCTACAACC ATTAAACACA GGACTCTTTC TAACAATGTT TCTTATAGAA GACTTATAAG ATTAGAGTTG TATAAATTGG AAAAACACTT GATTGAAGAA GAACAGTACA AGCCTTTTAT TGCACAATGG TAA
|
Protein sequence | MKKTIYIFSD GELKRKDNTL FFEGENGRKF IPVENTSEIM VFGEVSLNKR LLEFLTQSEI ILHFFNHYGY YVGSYYPREH LNSGYMILRQ AEHYNDGSKR LYLAQKFVEG AYKNIRQVLK YYSNRGKDLE DVIYSIEKLG ESVDSTSTIN ELMAIEGNIR EYYYKAFDEI IQNPDFKFDF RSKRPPQNFL NTLISFGNSL MYTTTLSEIY KTHLDPRIGF LHATNFRRFS LNLDVSEIFK PIIVDRTIFT LLSKKMVTKE DFEEDAEGLL LKEKGKKVFV QEFEDKLATT IKHRTLSNNV SYRRLIRLEL YKLEKHLIEE EQYKPFIAQW
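The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to strip the 10-base grouping, compute GC content, and translate the sequence. It is an illustration under stated assumptions, not the method used to build this record: the helper names are hypothetical, only the first 30 bases are embedded, and alternative start codons permitted by translation table 11 are not handled (not needed here, since the sequence starts with ATG). Table 11 assigns the same amino acids to codons as the standard code, which is what the lookup below encodes.

```python
# Codon lookup in the conventional TCAG ordering (same amino acid
# assignments as the standard code, which table 11 shares).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
FRAGMENT = "ATGAAAAAAA CGATTTATAT TTTTTCAGAT"
print(translate(FRAGMENT))  # MKKTIYIFSD, the first 10 residues of the protein
print(f"GC content of fragment: {gc_content(FRAGMENT):.1%}")
```

Passing the full gene sequence to `gc_content` and `translate` instead of the fragment would allow comparison with the GC content and protein sequence fields of the record.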