Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_0766 |
Symbol | cas1 |
ID | 8709011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 868977 |
End bp | 870056 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 646482868 |
Product | CRISPR-associated endoribonuclease Cas2 |
Protein accession | YP_003373990 |
Protein GI | 283783236 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2176] DNA polymerase III, alpha subunit (gram-positive type) |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family [TIGR01873] CRISPR-associated endoribonuclease Cas2, E. coli subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0695815 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTTAA CTGTGATTAC GATGACGAAC TGTCCACTAT CTTTAAGAGG TGATTTAACT AAGTGGATGC AAGAAATAGC ATCAGGCGTT TACGTCGGCA ATTTTAATAG TCGAGTCCGA GAAGAATTAT GGAAGCGTAT AGAAGATAGT GTTGGCAATG GTGCTGTAAC AATGAGCTTT TCTTCTAGAA ATGAAATTGG CTATGATTTT AAAACTATCC ATTCGCATCG TGAAGTGGTT TATTCAGATG GATTGCCTCT AGTACGTATT CCAACAGTTG ATACACTAGA AAACGATATT AAACATGGAT TTAGCGATGC AGCACATTTT CATAATGCAA AGAAATTTTC AAATATAAGG CGTAAAAATA TTAATACTCC TTCTAGCACA ATTGAAAATA ATCTTGCTAA TGAAAATATT AGTTATCTAA GTTATGATAA TGCTTCTCAA ACTGCAAAAT TTGCTACGGA TAATGAAGAG ATAACTTGTA CCGATTACTG GAAGGATTTT ATAGATAAAA TAGATAAAAA TTTTGTTGTA ATAGATTGCG AAACTACAGG TTTAAATAGC AAAGTTGATA AAATCATTGA AATTGGGGCC GTTAAGTTCA TTGATGGAAA AATTGAAGAA TTCCAGAAAT TAATAAATAT TAATTGCGAT AGCGTTAGTA ATTTTAACAA AAATAATTTT ATTCCTAATG AAATTATTGA GTTAACTGGT ATTACATCCA ACATGCTTGA ATCTTATGGT GTCAGATTAG ATTTAGCATT AAAAGAATTT ATTAATTTTG TCGAAAGATT ACCTGTACTT GGATATAACG TACAATTTGA TATGTCATTT TTAAATTCTG CTTTATCTTC ATTAGATGAA AGATTTGATC ATTCTCTTTC AATTAACAAA ATTTTTGATA TTTCTCGTTT TGTAAAGAAA GAAAAGAAAT TTTTAAAAAA CTACCAATTA AAAACAGTGC TGCACGAATA CAACATTGCT GAATCTGTTC CACATAGAGC TCTTCTGGAT GCTAGACTAA CAGCGCATCT TGTTTTTAAT TTGACAGAAT TGTTAAAATT TCTTGCATAA
|
Protein sequence | MPLTVITMTN CPLSLRGDLT KWMQEIASGV YVGNFNSRVR EELWKRIEDS VGNGAVTMSF SSRNEIGYDF KTIHSHREVV YSDGLPLVRI PTVDTLENDI KHGFSDAAHF HNAKKFSNIR RKNINTPSST IENNLANENI SYLSYDNASQ TAKFATDNEE ITCTDYWKDF IDKIDKNFVV IDCETTGLNS KVDKIIEIGA VKFIDGKIEE FQKLININCD SVSNFNKNNF IPNEIIELTG ITSNMLESYG VRLDLALKEF INFVERLPVL GYNVQFDMSF LNSALSSLDE RFDHSLSINK IFDISRFVKK EKKFLKNYQL KTVLHEYNIA ESVPHRALLD ARLTAHLVFN LTELLKFLA
|
| |