Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3218 |
Symbol | |
ID | 4809520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3816440 |
End bp | 3817435 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640108652 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001039606 |
Protein GI | 125975696 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000280385 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTGA TTCTGAATAC ACCCGGACTT TACCTTTGTA AAAGGGGAGA GTGCTTTCAG ATTCAAAGCG AGAATGAAAA ACGGGAAATA GCTGCAACTA AAGTTGACCA AATAATGATA ACCACTCATG CAGCTCTTAC AACTGATGCA ATTGAACTGG CTCTTGAATA CAATATTGAC ATAATATTTT TAAAAAACAC GGGACAACCA ATGGGGCGTG TATGGCATTC AAAACTTGGA AGTATCAGTA CAATTAGAAG AAAACAATTA TTCCTCCAGG ATAGCCCGCT TGGGCTTCAA CTTGTAAAAG AATGGATCTT GGAGAAAATG GATAATCAAA TACGGCTGTT AAAGAAACTG GAAGTTAACC GCAGAGATGA TGAAAAACGT GCTATAATCA GGGATACCAT AGAAAAGATT GAAAAGCAAA AAGCTAACAT TATGTCGATT AACAATAAAG AAACAGTGAA CAATGTAAGA AATATGCTTC TTGGTTATGA AGGAACTGCC GGAAGAGTAT ATTTTGAAAC CCTTGGCAAG TTGATTCCTG AGAAATATGC TTTTGAAGCG AGAAGCCGGA ATCCTGCGAA AGATCCTTTC AACTGTATGC TCAACTATTC CTACGGTATT TTATATTCCA GCGTTGAAAA AGCCTGCATA ATTGCAGGAT TAGACCCATA CATTGGCATT ATGCATACCG ACAATTATAA TAAGAAAGCT CTTGTATATG ATATGGTTGA AATGTACAGA GGATATATGG ATGAAATAGT TTTCAGGCTG TTTAGCACAA AGAAAGTTCA AGACGATTTT TTTGACAAGA TTGAAGATGG TTACTATCTC AATAAAGAAG GAAAGCAACT GCTAATATCC GAGTATAACA AAGAGCTGGA AGTCAAAATG AATTACAGAG GAAGAAGAAT AGAATTTGCC AACATAATCC AGTACGACTG CCATCAGATT GCAAATCGCA TACTGAAGGA GGATATACCA TGTTGA
|
Protein sequence | MKLILNTPGL YLCKRGECFQ IQSENEKREI AATKVDQIMI TTHAALTTDA IELALEYNID IIFLKNTGQP MGRVWHSKLG SISTIRRKQL FLQDSPLGLQ LVKEWILEKM DNQIRLLKKL EVNRRDDEKR AIIRDTIEKI EKQKANIMSI NNKETVNNVR NMLLGYEGTA GRVYFETLGK LIPEKYAFEA RSRNPAKDPF NCMLNYSYGI LYSSVEKACI IAGLDPYIGI MHTDNYNKKA LVYDMVEMYR GYMDEIVFRL FSTKKVQDDF FDKIEDGYYL NKEGKQLLIS EYNKELEVKM NYRGRRIEFA NIIQYDCHQI ANRILKEDIP C
|
| |