Gene Information |
Locus tag | Tneu_1140 |
Symbol | |
ID | 6165863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 1028728 |
End bp | 1029600 |
Gene Length | 873 bp |
Protein Length | 290 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641668291 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001794516 |
Protein GI | 171185597 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
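
The coordinate and length fields above are linked by simple arithmetic, which makes them easy to cross-check. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length (both assumptions are consistent with the values listed but are not stated by the record). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Tneu_1140.
length = gene_length(1028728, 1029600)
print(length)                  # 873
print(protein_length(length))  # 290
```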
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.511238 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00000356789 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCCGCAT ACGGCGCCAG GATAAGGGCT AGGAAGGGCC TCCTCCTCGT GGAGACAAAG GAGGGCGCCA GGGAGTACCC CCTACACGAG GTAGACGAGG TCCTCCTACT CACCGGCGGC ATATCCATAA CGACGAGGGC GCTCAGGGCC CTCCTCGCCG CCGGGGCCAC AGTCGCCGTC TTCAGCCCCC GCGGGGAGCC CCTGGGCATA TTCATGAAGC CCATCGGAGA CGCCACGGGG GCCAAGAGGA GGTGCCAGTA CAAGGCGGCG GAGGACGGCA GAGGGCTACA GTACGCCAAG AGCTGGGTCT TCAAGAAGAT GCTGGGCCAG AGAGACAACA TCAAGGCCTG GCGCCGCCGC CTAAGAGGCT ACAGCCAATA CGCCGAGTCC CTAGCCAAGG CCCTACAGGC GCTGAGAGAC GCCGCCTCCC CCCACGCTGT CTTGGAGGCC GAGGCGGCGG CCGCCGAGGC CTACTGGGCC GCCTACAGGG AGGTCACGGG GTTCCCCGGC AGAGACCAGG AGGGGAGAGA CCCCGTCAAC GCCGGCCTAA ACTACGGCTA CGGGATCTTG AAGGCCCTGG TCTACAAATC CCTGATCCTC GCCGGGCTGG ACCCATACGT CGGCTTCCTC CACGTAGACA AATCCGGGAG GCCCTCCCTA GCGCTGGACT TCATGGAGCA GTGGAGGCCC CGCGTCGACG CCGTCGTGGC CAAGATGGCG GACAAGCTGG AGTCCGAGGG CGGCCTACTC ACCCGCCGGT CCCGCCTGGA GCTGGCCGCC GCCGTCCTGG AGGAGCTCCA CGCCGCCAAG AGGCCCCTCT CCGCCGAGAT CCACAGAGAG GCCAGAGCTC TGGCGCGCTC CATATGTACA TAA
Protein sequence | MAAYGARIRA RKGLLLVETK EGAREYPLHE VDEVLLLTGG ISITTRALRA LLAAGATVAV FSPRGEPLGI FMKPIGDATG AKRRCQYKAA EDGRGLQYAK SWVFKKMLGQ RDNIKAWRRR LRGYSQYAES LAKALQALRD AASPHAVLEA EAAAAEAYWA AYREVTGFPG RDQEGRDPVN AGLNYGYGIL KALVYKSLIL AGLDPYVGFL HVDKSGRPSL ALDFMEQWRP RVDAVVAKMA DKLESEGGLL TRRSRLELAA AVLEELHAAK RPLSAEIHRE ARALARSICT
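
The protein sequence can be reproduced from the gene sequence with translation table 11, as listed in the record. The sketch below is one way to do that in plain Python, not the method the database itself uses. It relies on two facts about NCBI table 11: its amino acid assignments match the standard code, and GTG is an accepted start codon that is read as methionine in the first position. The names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical. Whitespace is stripped because the record prints sequences in blocks of ten characters.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the 10-character block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence above.
print(translate_cds("GTGGCCGCAT ACGGCGCCAG GATAAGGGCT"))  # MAAYGARIRA
```

Passing the full gene sequence from the record to `translate_cds` in the same way gives a string that can be compared against the listed protein sequence once its spaces are removed.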