Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YPK_1644 |
Symbol | |
ID | 6089631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis YPIII |
Kingdom | Bacteria |
Replicon accession | NC_010465 |
Strand | + |
Start bp | 1824498 |
End bp | 1825478 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641596714 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001720390 |
Protein GI | 170023885 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0746674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAACG CTATTCATTC CTCTGATTTG AAAACGATCC TGCATTCAAA ACGATCCAAT ATTTACTATT TAGAATATTG CCGCGTATTG GTTAATGGTG GGCGAGTTGA ATATGTCACC GATGAAGGTA AACAATCCCT TTACTGGAAT ATCCCCATAG CGAACACCAC CGTTATTATG CTGGGAACCG GGACTTCGGT GACTCAGGCT GCTATGCGTG AGTTTGCCCG AGCCGGGGTC TTAGTCGGTT TTTGTGGCGG GGGTGGGACG CCGCTTTTTG CGGCTAATGA CGTAGAGGTC AATGTCTCGT GGCTCACTGC ACAAAGCGAA TACCGGCCAA CCGAGTATCT GCACGATTGG GTCAGTTTCT GGTTCGATGA TGAAAAAAGA CTGGCAGCAG CAGTGGCTTT CCAGCGCATC AGGATCGCCC AAATTCAACA ACATTGGCTC AGCAGTCACA TACAGCGCGA ATCTCTTTTT CCGGTTAATC ACGATCAATT ATTATTTATC CTCAGCCGTT TTGAGCAAAA TTTAGCGAAT TGTCTCACCA GTAATGACCT TATGGTTCAG GAAGCGGTAT TAACAAAGGC GCTCTATAAA CTGGCTGCTA ATACAGTGAA TTACGGCGAT TTCACCCGCG CTAAACGCGG TGGGGGCATC GATCTAGCTA ATCGTTTTCT CGATCACGGA AATTATCTCG CCTATGGCTT AGCTGCGACG GCGACATGGG TTATTGGCTT ACCCCATGGT CTGTCTGTTT TACACGGTAA GACCCGGCGT GGTGGTTTGG TCTTTGATGT GGCCGATTTA ATTAAAGATG CGCTAGTGCT ACCGCAGGCA TTTATTGCCG CCATGCAGGG AGAAGAAGAG CAAGAATTTC GTCAGCGCTG CATTAGCGGG TTTCAACGAA CCGAAGCGCT GGATGTGATG ATTGATGGAA TAAAAGAAAC GGCAGCGTTA TGTAGCCAGG TTCCGCGATG A
|
Protein sequence | MENAIHSSDL KTILHSKRSN IYYLEYCRVL VNGGRVEYVT DEGKQSLYWN IPIANTTVIM LGTGTSVTQA AMREFARAGV LVGFCGGGGT PLFAANDVEV NVSWLTAQSE YRPTEYLHDW VSFWFDDEKR LAAAVAFQRI RIAQIQQHWL SSHIQRESLF PVNHDQLLFI LSRFEQNLAN CLTSNDLMVQ EAVLTKALYK LAANTVNYGD FTRAKRGGGI DLANRFLDHG NYLAYGLAAT ATWVIGLPHG LSVLHGKTRR GGLVFDVADL IKDALVLPQA FIAAMQGEEE QEFRQRCISG FQRTEALDVM IDGIKETAAL CSQVPR
|
| |