Gene Plav_0098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_0098 
Symbol 
ID5453585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp104851 
End bp105786 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content59% 
IMG OID640875657 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001411378 
Protein GI154250554 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.181677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.510072 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGTA AAACCGTCGA GTTCGCAACG CCGGGGACGC GCCTTTCCGT TGCCAACAAG 
CAGCTGGTAA TCGAGCGACC CGATCTTCCG AAGGCAACGC TGCCGATTGA GGATCTGGGC
GTCGTCATTG TCGACGACTT GAGGGCGACA TATACGCAAG CGGTTTTTAT CGAACTGCTT
GAAGCCGGTG CAACTGTCAT GGTGACAGGG CGGGACCACC TGCCCGCCGG AATGATGCTC
CCGCTTGATG CGCACCATAT ACAAACGGAG CGGCACCGGG CGCAGGTGGA AGCGAGCGAA
CCGACGAAAA AGCGCGCTTG GCAGGCATTG ATCCGAAGCA AGATTGCGCA ACAGGGCATA
GTTCTTGCTC ATTTCACAGG CGAGCATGGC GGCCTTCTTC CAATGGCGCG GCGCGTCCGC
TCCGGCGATC CCGACAATCT GGAGGCGCAA GCCGCACAAC GTTATTGGCC CCGCCTGTTC
GGAAAAGATT TTCGGCGAGA CCGCGATCTG GAAGGCGTCA ATGCCTTGCT GAATTATGGT
TATGCAGTTG TCCGCGCCGC GACGGCACGC GCGACTGTTG CGGCCGGGTT AATTCCGTCA
CTTGGCGTAT TCCATCGAAA CCGCGCGAAT CCGTTTTGCC TTGCCGACGA TCTTCTGGAG
CCTTACCGGC CCTATGTGGA TTGGCGGGTC CGATTGCTTG CCAATCAAAT GGGCGAAGAG
GCGCCAAGCC TCGATGATCG TGACACGCGG GCGGCGCTGC TTTCCATTTT CAATGAGACT
GTTCTTGTTG GGGGCAGGCG AATGCCGCTC TTGCTTGCAC TTCACGCAAG CGCCGCTTCG
CTGTGCCGCG CGTTGACGGG AGGTGAAGCG GCACTGGCAT TGCCCGAAGG CATGCCGCTT
GCGCCCGATC TTCTCGACAA TGATGGAGAG GGATGA
 
Protein sequence
MIRKTVEFAT PGTRLSVANK QLVIERPDLP KATLPIEDLG VVIVDDLRAT YTQAVFIELL 
EAGATVMVTG RDHLPAGMML PLDAHHIQTE RHRAQVEASE PTKKRAWQAL IRSKIAQQGI
VLAHFTGEHG GLLPMARRVR SGDPDNLEAQ AAQRYWPRLF GKDFRRDRDL EGVNALLNYG
YAVVRAATAR ATVAAGLIPS LGVFHRNRAN PFCLADDLLE PYRPYVDWRV RLLANQMGEE
APSLDDRDTR AALLSIFNET VLVGGRRMPL LLALHASAAS LCRALTGGEA ALALPEGMPL
APDLLDNDGE G