Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2676 |
Symbol | cas |
ID | 5136197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 2834914 |
End bp | 2835789 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640534124 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_001218554 |
Protein GI | 147674033 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAGG GAGAGCGGCT GTTTGTTAAA GTAACGCGTG AATCCTTGCC CCAAGTGAAA GATAAATATC CCTTTATTTA TCTTGAACGA GGACGGCTGG AAATTGACGA CAGTAGTGTC AAGTGGATTG ATGCCGATGC CAACATAGTC CCGATTCCAG TAGCAACGAT TAACACATTG CTACTAGGTC CGGGAACCAG TGTAACTCAT GACGCAATCA AAACCGCAAC TGCGGCAAAC TGTTCCATCA GTTGGGTGGG GGAGGATAGT TTACTGTTTT ATGCAGCTGG TTTTCTTCCC ACTGCAGACA CAAGAAATCT AAAGAAGCAG ATGTTGTTAG CGGCCAGCGA AGAGAGTGCA TTGAAAGTGG CACGCATGAT GTTCGCTAAG CGTTTTCCTG ATGCAGACCT TGAAGACAAA ACCCTAAAAA GCATGATGGG GATGGAAGGA AACCGTGTAA AAGCGCTCTA TCAACAAAAG GCCGAGCAGT ATGGTGTCGG CTGGAGAGGG CGACAGTTTA CGCCAGGTAA GTTTTCTTTA AGTGATCTTA CCAATCAAAT CATGACGGCT AGCAATGCTG CCTTGTATGG CATTTTGTGT TCAGCTATCC ACTCAATGGG CTACTCACCA CATATTGGAT TTATCCATTC GGGCAGCCCA TTACCTTTTG TGTATGACAT GGCAGATCTT TACAAAGAAC ACCTTTGTAT TGATTTAGCT TTCTCACTTA CACGAGATAT GGCAGGTCAC TATGACAAAC ACAAGGTCTC TGATGCGTTT CGAAAACGGG TTATCAGCAT GGATCTGTTA CAACAAGTCT CCTCGGATAT CAATGAGTTG ATGGGAGGGG GAAATGCTCG TCGTACTAGC AAATGA
|
Protein sequence | MAKGERLFVK VTRESLPQVK DKYPFIYLER GRLEIDDSSV KWIDADANIV PIPVATINTL LLGPGTSVTH DAIKTATAAN CSISWVGEDS LLFYAAGFLP TADTRNLKKQ MLLAASEESA LKVARMMFAK RFPDADLEDK TLKSMMGMEG NRVKALYQQK AEQYGVGWRG RQFTPGKFSL SDLTNQIMTA SNAALYGILC SAIHSMGYSP HIGFIHSGSP LPFVYDMADL YKEHLCIDLA FSLTRDMAGH YDKHKVSDAF RKRVISMDLL QQVSSDINEL MGGGNARRTS K
|
| |