Gene Information |
Locus tag | HMPREF0424_0765 |
Symbol | cas2 |
ID | 8709646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 868034 |
End bp | 868975 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 646482867 |
Product | CRISPR-associated endonuclease Cas1, ECOLI subtype |
Protein accession | YP_003373989 |
Protein GI | 283783235 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
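The coordinate and length fields in the Gene Information table are consistent with each other if the start and end positions are read as 1-based, inclusive coordinates and the CDS is assumed to end in a stop codon that is not counted in the protein length. The sketch below checks that arithmetic; the function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows above.
length_bp = gene_length(868034, 868975)
print(length_bp, protein_length(length_bp))  # prints "942 313"
```

Under these assumptions the result matches the recorded Gene Length (942 bp) and Protein Length (313 aa).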
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0747275 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGA AATTCGGTGC CAAAAAAGCT GAAATACCAG AGTTTCCACG TATAAGCGAC AGAGTGAGTT TTATATACGT AGAGCATGCG AAAATTAATC GTCTTGATAG TGCTGTTACA GTCTTTGATG CAAATGGAAC TATAAGAGTT CCTGCTGCAA TGATTGGCGT TTTACTTTTA GGTCCTGGCA CTGAGATTAC TCACAGAGCT ATGGAATTAC TTGGAGATGT TGGCGCAAGC ATAGTTTGGG TTGGCGAGCA TGGTGTTCGC AATTACGCTC ATGGTAGAGC ATTGTCAAGA AGCTCAAGAT TATTGGAAAA ACAATCAAAA CTTGTTACTA ATAGTAGATC AAGATTGAAC GTTGCGAGAA AAATGTATCA AATGCGTTTC CCTAATGAAA ACGTATCGTC ATATACTTTG CAACAATTGC GAGGGAGAGA GGGCGCTCGT GTTAGACATT TATACAGGGA AATGTCTAAC AAATACAATG TCCAATGGAA TGGTCGCGAT TATAAAGTCA ATGATTTTGA AAGTGGCACT GTTGTTAATA AAGCATTGTC TGTAGGTAAT GTATGCCTTT ACGGTTTAGT ACATAGCATA ATATCTGCTT TAGGTTTAGC ACCTGGATTA GGTTTCGTGC ATACTGGACA TGATCTTTCT CTTGTATACG ATATTGCAGA CTTATATAAA GCTGAATTAA CTATTCCTGC ATCTTTTGAA ATTGCTGCTC GGTGTGAATC CGACGATGAC ATTGAACAGT TAATGCGTTT GAAGATGCGC GATTGCTTCG CAAATTGCAA TATAATGTCT CGTATTGTTA ATGATATACA AAATCTTCTT GAAATCCCGA TTGACGACCA AATTACTGTT GATGTAATAC ATCTTTGGGA CGATAAGGAA CTTCTAGTAG CTTCTGGCGT AAATTACAGT GAGGTGAATT GA
|
Protein sequence | MKKKFGAKKA EIPEFPRISD RVSFIYVEHA KINRLDSAVT VFDANGTIRV PAAMIGVLLL GPGTEITHRA MELLGDVGAS IVWVGEHGVR NYAHGRALSR SSRLLEKQSK LVTNSRSRLN VARKMYQMRF PNENVSSYTL QQLRGREGAR VRHLYREMSN KYNVQWNGRD YKVNDFESGT VVNKALSVGN VCLYGLVHSI ISALGLAPGL GFVHTGHDLS LVYDIADLYK AELTIPASFE IAARCESDDD IEQLMRLKMR DCFANCNIMS RIVNDIQNLL EIPIDDQITV DVIHLWDDKE LLVASGVNYS EVN
|
| |
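The sequences above are printed in space-separated blocks of ten characters. Before computing anything such as the GC content reported in the Gene Information table, the blocks have to be joined. A minimal sketch, assuming plain A/C/G/T input with no ambiguity codes; `ungroup` and `gc_fraction` are hypothetical helper names, and the short strings passed in are toy inputs rather than the full record.

```python
def ungroup(seq: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# Toy input in the same block layout as the Gene sequence row.
print(ungroup("ggcc aatt"))      # prints "GGCCAATT"
print(gc_fraction("GGCC AATT"))  # prints "0.5"
```

Passing the full Gene sequence string to `gc_fraction` and rounding to a whole percent is one way to compare against the GC content field, although how the database rounds that value is not stated in the record.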