Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0578 |
Symbol | |
ID | 6164759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 528753 |
End bp | 529664 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641667731 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001793966 |
Protein GI | 171185047 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0010415 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0764761 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGTGG TAGTTAAGGA GCACGGGGTG TCGCTGGGCT ACAGCAGGTG GGCCCTGGTG GTGAGGAGGA GGGGCGGGGC CGCCGAGAGG ATCCCCATAC ACCAAGTGGA CAGGCTGTGG ATCCTCACTG GGGGCGTCTC CATCTCGTCT AGGCTAGTGA GGGCCCTGGC GAGGAGCTTC GTAGATGTGG TGTTTTTCGA CGGCAGAGGC AACCCCGCGG CTAGGCTGTT TCCGCCTGAG GCAAACGGCA CGGTTGCACA CCGGCGGGCT CAGTACGAGG CCTACCTAAA CGGCAGGGGG CTCGAGCTGG CCAAGCTGGT GGTGTATGGA AAGATCGTGA ACCAAGCGGC GGCGCTTAGG AGAGCCGGCT TGTGGAGGAG GGAGCTGTAC CAAGAGCTCG CGGGGGCTGC GTCTAGGGTG GCAGAGGCGG CGGCCGCGGT CCCCCGGTGC GGAGACCCAC AGTGCGTCCT CGGCCACGAG GGGCGCGCCG CGGCTGAGTA CTGGGCCGCC CTCTCAAAGG CCTTTGGGAC CCCCACGAGA GATCCAAACG CCTCCGACCC CTTCAACCTC GCGCTTAACT ACGGCTATGG GATACTCCGC TACGCCGTGT GGAGACAGGC AGTTATCCAC GGCCTCGACC CCTACGCCGG CTACCTTCAC GTGGATAAGT CGGGGAGGCC CTCCCTAGTG CTGGACCTAA TGGAGGAGTT CAGACCCCAC ATAGACCTCA TGGTGCTCAA GGCCAAGCCC TCCGCAGACT GGCTAGAGGG CGGGGTCTTG AAGCGGGAGG CCAGGGCGGC GCTGGTGGAG AAGTGGCTCG AGATGAGGCT CGAGCCCACC ATAGCGAGGC AGGTGGGGCT GGCGGTGGCC CACCTAGAGG GCAGGGGGGT ATACACGCCG CATAAGCTAT GA
|
Protein sequence | MEVVVKEHGV SLGYSRWALV VRRRGGAAER IPIHQVDRLW ILTGGVSISS RLVRALARSF VDVVFFDGRG NPAARLFPPE ANGTVAHRRA QYEAYLNGRG LELAKLVVYG KIVNQAAALR RAGLWRRELY QELAGAASRV AEAAAAVPRC GDPQCVLGHE GRAAAEYWAA LSKAFGTPTR DPNASDPFNL ALNYGYGILR YAVWRQAVIH GLDPYAGYLH VDKSGRPSLV LDLMEEFRPH IDLMVLKAKP SADWLEGGVL KREARAALVE KWLEMRLEPT IARQVGLAVA HLEGRGVYTP HKL
|
| |