Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tfu_1587 |
Symbol | |
ID | 3581630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | - |
Start bp | 1829376 |
End bp | 1830374 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637685282 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_289646 |
Protein GI | 72161989 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0375026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAACC CCCGTAAGGC GCTAGCCCGT CCGACACTGG CGATGCTGCC CCGGGTATCG GACGGCCTCT CCTTCCTCTA CGTGGATGTC TGCCGAATAG TGCAGACCGA CACCGGCGTA TGCGCGGAGG TCGAAACCGA AACCGGAAGA ATACATCGAG TCCCAATCCC CACCGCTTCA CTCGCATGCG TCCTACTGGG ACCGGGAACC TCAATCACCA GTCCTGCCAT GGCGACTTTC ATGCGCCACA ACACAACGGT CGTAACCTGC GGGGCGGGAG GCATCCTCAA CTACGGAAGC TTTCCCGCCC CTAACCGCAC CACGAAATGG ATCGACCGGC AGGCACGCGC CTACTCCGAC GACAGACGCC GACGAGACGT CGCCGTGCGG ATGTACGAGA TGCGCTTCGG AGAAGAACCT CCTCCCGGTG CGTCCATCGA AAGGCTGCGC CAGCTTGAAG GGGCACGGAT GAAAGCCCTC TACCGAAGCC TGGCGGCTAA GAACAGGGTG AAACCATTCA AACGGAACTA CAACCCGCAT GACTGGGATG ACCAAGACCC TGTCAACAAG GCGCTTTCGG CAAGCAACGC TGCCCTTTAC GGGGTGGTGC ACTCGGTACT GGCGCACCTG GGCTGCCACC CCGCGCTCGG GTTCATCCAC TCCGGAAAAC AAGACGCGTT CGTCTACGAC ATCGCCGATC TCTATAAAGC ACGGACCACT ATCCCGCTCG CGTTCTCCTT GAGCAGGACA GCCAACCCGG AGCAAGAAGC GCGCCTGCGG CTTCGTCGAG ACCTAAAGCT GTACCGGCTG ATTCCGCAAA TTGTGCGGGA TGTGCAGACC CTCCTCTCCT TAGACGACCC TGAGGAGGCC GTTTCGGAAG AGGAGCCGCC GAGCTCGGGA GGGCCGTGGC AAGTCGTTGA CTTGTGGGAC CCGGTGGTCG GGGCTGTCTC CGGCGGTGTG AACTATGCCA ATCACATCGC GGACAGCGAG GAGCCCTGA
|
Protein sequence | MDNPRKALAR PTLAMLPRVS DGLSFLYVDV CRIVQTDTGV CAEVETETGR IHRVPIPTAS LACVLLGPGT SITSPAMATF MRHNTTVVTC GAGGILNYGS FPAPNRTTKW IDRQARAYSD DRRRRDVAVR MYEMRFGEEP PPGASIERLR QLEGARMKAL YRSLAAKNRV KPFKRNYNPH DWDDQDPVNK ALSASNAALY GVVHSVLAHL GCHPALGFIH SGKQDAFVYD IADLYKARTT IPLAFSLSRT ANPEQEARLR LRRDLKLYRL IPQIVRDVQT LLSLDDPEEA VSEEEPPSSG GPWQVVDLWD PVVGAVSGGV NYANHIADSE EP
|
| |