Gene Information |
Locus tag | Tpen_1321 |
Symbol | |
ID | 4602001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1273626 |
End bp | 1274609 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639774096 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_920721 |
Protein GI | 119720226 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
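The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Protein length implied by an unspliced CDS that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1273626, 1274609)
print(length)                              # 984
print(expected_protein_length_aa(length))  # 327
```

Both results agree with the Gene Length (984 bp) and Protein Length (327 aa) fields listed in the record.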
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00529939 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTTGC TCGTGCTACG CGGCGTAGAC GAGGTGACCG TGAGTAGCAG GAGCACGGTG GTGATCAAGT CCGGGAACAG GGTGTTCGAG CGGGCTCTCA GGGACGTCGA CGCAGTCCTC GTCGTGGGCT CTGGGATCAA GATATCGTCC TCCCTCCCGC CCGTGTTGGC GCTCCACGGC ATACCTCTCT CGATACTCGC GAAGGGGCAC GTCGCTGTGC TACTGAACCC TGTCGGGACG AAGTATAACA ACTACAGGGC CCTCCAGTAC ACGTTGCCGA AGAACAAGGC GCTCGCAATC GCGCTCGAAT ACCTCAAGTC CAGGGTGCGC GGAATGGCGA GCATAATCAG GAACCGCGGG GGAAGGCTCC CCGCGCTCCC CGAGCCTCCC GACCCAGCGC TCTACGAAGA CCCGGCGAGG CTCGAATCGG ACATAAGATC CTGGGAGGCC GCCGCCTCGA ACACCCTCTG GGACGAGGTC TTCAAGCTAC TGGACCCATC CGCGGCCAGG GAGCTCAGAG AGAGATACGG CTTCGCGGGG AGGAAGCCCG GGCACCCGGA CCCCCTCAAC AAGGCTATCT CCGCCATGTA CGCAGTCCTC TACACGCTCT CGACGAAAGC ACTCGTAGCC GCCGGGCTAG ACCCCACCTA CGGCTTCCTG CACAGAACCC AGTACAGCGT GCCGCTAGCG TTCGACTACG CCGAAGCCTT CAAGCCCTTA GCAGTGGAAG CCGCGCTGGA CCTCGTAAAC GAGGAAGGCC TCCCAACGCT GAGCGAAGAC GGCGACCTCG ACAAAGACTC CCTCAACAAG GCCATGAAGA GGCTCTACAG GTACCTCTCA GCCAAGCACA GGGAAACCGG GAAAACACCC TACCAGCAGA TACACCTGAA GGCATTCTGC CTCGCAAAAC ACCTGGAAGG CAAGTGTAGC AGAGAAAAAC TCGCCTTCAC GTGGGACAAA AAGCGCTACA TAATACACGA GTAA
|
Protein sequence | MRLLVLRGVD EVTVSSRSTV VIKSGNRVFE RALRDVDAVL VVGSGIKISS SLPPVLALHG IPLSILAKGH VAVLLNPVGT KYNNYRALQY TLPKNKALAI ALEYLKSRVR GMASIIRNRG GRLPALPEPP DPALYEDPAR LESDIRSWEA AASNTLWDEV FKLLDPSAAR ELRERYGFAG RKPGHPDPLN KAISAMYAVL YTLSTKALVA AGLDPTYGFL HRTQYSVPLA FDYAEAFKPL AVEAALDLVN EEGLPTLSED GDLDKDSLNK AMKRLYRYLS AKHRETGKTP YQQIHLKAFC LAKHLEGKCS REKLAFTWDK KRYIIHE
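The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The sketch below shows one way to strip the spacing and compute a GC fraction. The helper names are hypothetical, and the example string is only the first three blocks of the gene sequence, not the full 984 bp.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence, copied from the record above.
fragment = clean_sequence("ATGAGGTTGC TCGTGCTACG CGGCGTAGAC")
print(len(fragment))              # 30
print(fragment.startswith("ATG")) # True
print(gc_fraction(fragment))      # 0.6
```

Running the same functions on the full gene sequence gives a value that can be compared with the GC content field (60%) in the record.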