Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dd703_0734 |
Symbol | |
ID | 8088007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya dadantii Ech703 |
Kingdom | Bacteria |
Replicon accession | NC_012880 |
Strand | - |
Start bp | 835354 |
End bp | 836355 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644834806 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002986367 |
Protein GI | 242238186 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAACG TATTCAGCCC GTCGGATTTA AAAACCATTC TGCACTCCAA ACGCGCCAAT ATTTATTACC TCCAGCATTG CCGTATTTTG GTGAATGGCG GACGGGTGGA ATATGTCACT GAAGAAGGAA ATCAGTCGCT GTACTGGAAT ATCCCTATCG CTAATACCAG CGTGGTAATG CTCGGCACCG GCACCTCGGT GACGCAGGCG GCGATGCGGG AATTTGCTCG CGCCGGGGTG ATGGTCGGGT TTTGCGGCAG TGGAGGTACG CCATTGTTCG CCGCCAACGA GGCCGAAGTG GCGGTATCGT GGCTGTCGCC GCAAAGTGAA TACCGCCCTA CTGAGTATTT GCAGGATTGG GTCAGCTTCT GGTTTAACGA ACAGCAGCGG CTGGCGGCGG CGATTGCCTT TCAACAGGTG CGCATTGGGC AGATTCGTCA GCACTGGCTG GGTGGGCGAC TGGCGCGTGA ATCACGTTTC ACCATCAAAC CCGAACATGT GGAAGCGTTG CTTAACCGCT ATCAGCAGGG ACTGGTCGAC TGCCGCACCA GTAACGATGT GCTGGTACAG GAAGCGATGA TGACCAAAGC GTTATATCGG CTGGCGGCCA ACGCCGTGAG CTACGGTGAT TTTACCCGCG CCAAACGCGG CGGCGGCACC GATTTGGCGA ACCGTTTTCT CGACCACGGC AACTATCTGG CTTACGGGCT GGCGGCAGTA GCATTGTGGG TGCTGGGCCT GCCGCATGGC CTGGCGGTGC TGCACGGCAA GACCCGACGC GGCGGGCTGG TGTTCGATGT GGCGGATTTG ATCAAAGACG CGCTGATTTT GCCGCAGGCG TTTATCGCCG CGATGGAAGG CGAAGACGAG CAGGATTTCC GCCAGCGTTG CCTGACGGCG TTTCGACAGG CCGAGGCGTT GGATGTAATG ATCGACAGCC TGCAACAGGT GGCTCAGCAA TTAAGCCAGG TGGCGAAAAC CGGCAGCCAG GTGGCGCGAT GA
|
Protein sequence | MDNVFSPSDL KTILHSKRAN IYYLQHCRIL VNGGRVEYVT EEGNQSLYWN IPIANTSVVM LGTGTSVTQA AMREFARAGV MVGFCGSGGT PLFAANEAEV AVSWLSPQSE YRPTEYLQDW VSFWFNEQQR LAAAIAFQQV RIGQIRQHWL GGRLARESRF TIKPEHVEAL LNRYQQGLVD CRTSNDVLVQ EAMMTKALYR LAANAVSYGD FTRAKRGGGT DLANRFLDHG NYLAYGLAAV ALWVLGLPHG LAVLHGKTRR GGLVFDVADL IKDALILPQA FIAAMEGEDE QDFRQRCLTA FRQAEALDVM IDSLQQVAQQ LSQVAKTGSQ VAR
|
| |