Gene Information |
Locus tag | Dd1591_0696 |
Symbol | |
ID | 8119739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya zeae Ech1591 |
Kingdom | Bacteria |
Replicon accession | NC_012912 |
Strand | + |
Start bp | 801338 |
End bp | 802339 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644851084 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003003056 |
Protein GI | 251788335 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype |
|
|
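The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the reported protein length excludes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 801338
end_bp = 802339

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon
# that is not counted in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1002 bp) and Protein Length (333 aa).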
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.101778 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAACG TATTCAGCCC GTCGGATTTA AAAACCATTC TGCATTCCAA ACGCGCCAAT ATTTATTACC TCCAGCATTG CCGTATTTTG GTCAATGGCG GACGGGTGGA ATATGTCACC GAAGAGGGGA ATCAGTCGTT GTACTGGAAT ATCCCGATTG CCAATACCAG CGTGGTGATG CTCGGCACCG GCACTTCGGT CACGCAGGCG GCGATGCGGG AATTTGCCCG CGCCGGGGTG ATGGTCGGGT TTTGTGGTGG TGGCGGTACG CCGCTGTTCG CCGCCAATGA GGCCGAAGTG GCGGTGTCGT GGCTGTCGCC GCAGAGCGAA TACCGCCCCA CCGGCTATTT GCAGGATTGG GTCAGCTTCT GGTTTAACGA AGAGCAGCGG CTGGCGGCGG CGGTTGCCTT CCAGCAGGTG CGCATCGGCC AGATTCGCCA GCACTGGCTG GGCGGGCGGC TGGCGCGCGA GTCGCGTTTT GCTATCAAAC CCGAGCATGT GGAAGCACTG CTTAACCGCT ATCAGCAGGG GCTGACGGCG TGTCGCACCA GTAACGACGT ATTGGTGCAG GAAGCGATGA TGACCAAAGC GCTGTACCGG TTGGCGGCCA ACGCGGTGAG TTACGGTGAT TTTACCCGCG CCAAACGCGG CGGCGGCACC GACATGGCGA ACCGTTTTCT CGACCACGGC AACTATCTGG CTTACGGTCT GGCGGCGGTG GCGCTGTGGG TGTTGGGATT GCCGCACGGG CTGGCGGTGC TGCACGGCAA AACCCGCCGT GGCGGGCTGG TGTTCGATGT GGCGGACCTG ATTAAAGACG CGCTGATTCT GCCGCAGGCG TTTATCGCCG CGATGGAAGG GGAAGACGAG CAGGAATTTC GCCAGCGCTG CCTGACGTCG TTTCGTCAGG CCGAGGCGCT GGACGTGATG ATCGACAGCC TGCAACAGGT GGCGCAGCAA TTAAGCCAGG TGGCGAAAAC CGGGAGTCGG GGGGCGCAAT GA
|
Protein sequence | MDNVFSPSDL KTILHSKRAN IYYLQHCRIL VNGGRVEYVT EEGNQSLYWN IPIANTSVVM LGTGTSVTQA AMREFARAGV MVGFCGGGGT PLFAANEAEV AVSWLSPQSE YRPTGYLQDW VSFWFNEEQR LAAAVAFQQV RIGQIRQHWL GGRLARESRF AIKPEHVEAL LNRYQQGLTA CRTSNDVLVQ EAMMTKALYR LAANAVSYGD FTRAKRGGGT DMANRFLDHG NYLAYGLAAV ALWVLGLPHG LAVLHGKTRR GGLVFDVADL IKDALILPQA FIAAMEGEDE QEFRQRCLTS FRQAEALDVM IDSLQQVAQQ LSQVAKTGSR GAQ
|
| |
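The gene and protein sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips that spacing and translates the first three blocks of the gene sequence. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code for elongation codons. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only a 30-base prefix is used here for brevity.

```python
# Compact codon table: bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# First three display blocks of the gene sequence listed above.
prefix = clean("ATGGATAACG TATTCAGCCC GTCGGATTTA")
peptide = translate(prefix)
print(peptide)
```

The translated prefix matches the first block of the listed protein sequence, `MDNVFSPSDL`. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field in the table.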