Gene Information |
Locus tag | Plav_0098 |
Symbol | |
ID | 5453585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | - |
Start bp | 104851 |
End bp | 105786 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640875657 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001411378 |
Protein GI | 154250554 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype |
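
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(104851, 105786)
print(length)                     # 936, matching "Gene Length"
print(protein_length_aa(length))  # 311, matching "Protein Length"
```

Under those assumptions, both values agree with the record: 936 bp for the gene and 311 aa for the protein.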
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.181677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.510072 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGTA AAACCGTCGA GTTCGCAACG CCGGGGACGC GCCTTTCCGT TGCCAACAAG CAGCTGGTAA TCGAGCGACC CGATCTTCCG AAGGCAACGC TGCCGATTGA GGATCTGGGC GTCGTCATTG TCGACGACTT GAGGGCGACA TATACGCAAG CGGTTTTTAT CGAACTGCTT GAAGCCGGTG CAACTGTCAT GGTGACAGGG CGGGACCACC TGCCCGCCGG AATGATGCTC CCGCTTGATG CGCACCATAT ACAAACGGAG CGGCACCGGG CGCAGGTGGA AGCGAGCGAA CCGACGAAAA AGCGCGCTTG GCAGGCATTG ATCCGAAGCA AGATTGCGCA ACAGGGCATA GTTCTTGCTC ATTTCACAGG CGAGCATGGC GGCCTTCTTC CAATGGCGCG GCGCGTCCGC TCCGGCGATC CCGACAATCT GGAGGCGCAA GCCGCACAAC GTTATTGGCC CCGCCTGTTC GGAAAAGATT TTCGGCGAGA CCGCGATCTG GAAGGCGTCA ATGCCTTGCT GAATTATGGT TATGCAGTTG TCCGCGCCGC GACGGCACGC GCGACTGTTG CGGCCGGGTT AATTCCGTCA CTTGGCGTAT TCCATCGAAA CCGCGCGAAT CCGTTTTGCC TTGCCGACGA TCTTCTGGAG CCTTACCGGC CCTATGTGGA TTGGCGGGTC CGATTGCTTG CCAATCAAAT GGGCGAAGAG GCGCCAAGCC TCGATGATCG TGACACGCGG GCGGCGCTGC TTTCCATTTT CAATGAGACT GTTCTTGTTG GGGGCAGGCG AATGCCGCTC TTGCTTGCAC TTCACGCAAG CGCCGCTTCG CTGTGCCGCG CGTTGACGGG AGGTGAAGCG GCACTGGCAT TGCCCGAAGG CATGCCGCTT GCGCCCGATC TTCTCGACAA TGATGGAGAG GGATGA
|
Protein sequence | MIRKTVEFAT PGTRLSVANK QLVIERPDLP KATLPIEDLG VVIVDDLRAT YTQAVFIELL EAGATVMVTG RDHLPAGMML PLDAHHIQTE RHRAQVEASE PTKKRAWQAL IRSKIAQQGI VLAHFTGEHG GLLPMARRVR SGDPDNLEAQ AAQRYWPRLF GKDFRRDRDL EGVNALLNYG YAVVRAATAR ATVAAGLIPS LGVFHRNRAN PFCLADDLLE PYRPYVDWRV RLLANQMGEE APSLDDRDTR AALLSIFNET VLVGGRRMPL LLALHASAAS LCRALTGGEA ALALPEGMPL APDLLDNDGE G
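
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, with hypothetical helper names, of how the gene sequence could be joined, its GC percentage computed for comparison with the "GC content" field, and its final codon checked against the stop codons of translation table 11 (TAA, TAG, TGA). It assumes the sequence is given 5' to 3' in coding orientation, which is consistent with the listed sequence starting with ATG and ending with TGA.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# Usage with a short illustrative fragment; paste the full "Gene sequence"
# field in place of this string to check the whole record.
fragment = clean_sequence("ATGATCCGTA AAACCGTCGA")
print(len(fragment), round(gc_percent(fragment), 1))
```

Applied to the full gene sequence, the cleaned length should equal the 936 bp reported under Gene Length, and the rounded GC percentage can be compared with the reported 59%.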