Gene Plav_0099 details

Gene Information

Locus tag: Plav_0099
Symbol:
ID: 5454634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 105795
End bp: 108908
Gene Length: 3114 bp
Protein Length: 1037 aa
Translation table: 11
GC content: 56%
IMG OID: 640875658
Product: CRISPR-associated endonuclease Csn1 family protein
Protein accession: YP_001411379
Protein GI: 154250555
COG category: [S] Function unknown
COG ID: [COG3513] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01865] CRISPR-associated protein, Csn1 family
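
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
START_BP = 105795
END_BP = 108908
STATED_GENE_LENGTH = 3114      # bp
STATED_PROTEIN_LENGTH = 1037   # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == STATED_GENE_LENGTH)
    print(protein_length(STATED_GENE_LENGTH) == STATED_PROTEIN_LENGTH)
```

Under those assumptions both checks agree with the stated values: 108908 - 105795 + 1 = 3114 bp, and 3114 / 3 - 1 = 1037 aa.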


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.28892
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.129983
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCGGA TTTTCGGCTT CGATATTGGC ACTACATCCA TCGGATTTTC GGTTATTGAC 
TACAGTTCAA CGCAATCGGC CGGGAATATT CAGAGACTTG GCGTCCGGAT ATTTCCGGAA
GCCCGCGATC CGGACGGGAC CCCTCTAAAC CAGCAACGCA GGCAAAAGCG GATGATGCGC
CGTCAGTTGC GGCGGCGGCG GATACGGAGG AAGGCGCTCA ACGAGACACT GCATGAAGCG
GGCTTTCTGC CCGCCTATGG TTCTGCCGAT TGGCCAGTTG TCATGGCGGA TGAACCTTAC
GAGTTGCGCA GGCGAGGTCT GGAAGAAGGC TTGAGCGCAT ATGAGTTCGG GCGAGCGATC
TACCATCTTG CGCAGCACCG CCATTTCAAG GGCCGCGAAC TTGAAGAAAG CGATACTCCC
GACCCTGACG TCGATGACGA AAAGGAGGCG GCGAACGAGC GGGCAGCAAC CCTGAAGGCT
CTCAAGAACG AGCAAACCAC ATTGGGCGCG TGGCTTGCAC GCCGACCGCC ATCGGACCGC
AAACGCGGAA TCCACGCGCA TCGCAATGTA GTTGCGGAAG AGTTCGAAAG ACTTTGGGAA
GTTCAATCGA AGTTTCATCC GGCGCTCAAA AGCGAAGAAA TGAGGGCAAG GATTAGCGAC
ACGATCTTTG CCCAGCGGCC GGTATTCTGG CGCAAGAACA CTTTGGGCGA ATGCCGCTTC
ATGCCGGGCG AGCCTCTATG TCCGAAAGGC TCCTGGCTTT CGCAACAGCG ACGGATGCTG
GAAAAGCTGA ACAATCTCGC GATAGCAGGT GGCAATGCCC GGCCGCTGGA CGCCGAAGAG
AGGGATGCAA TTCTCTCCAA GCTTCAGCAA CAAGCCTCCA TGTCGTGGCC GGGCGTCCGA
AGCGCATTGA AGGCGCTGTA TAAACAGCGG GGCGAACCCG GCGCGGAGAA AAGCCTCAAA
TTCAATCTGG AACTGGGTGG CGAGAGCAAA CTCCTTGGCA ACGCACTCGA AGCCAAGCTA
GCCGACATGT TCGGGCCAGA CTGGCCTGCG CATCCTCGCA AGCAGGAAAT CCGCCATGCC
GTCCATGAGC GCCTATGGGC GGCGGACTAT GGCGAAACGC CAGACAAGAA GCGCGTGATT
ATCCTCTCGG AGAAAGACAG GAAGGCGCAT AGGGAAGCCG CTGCAAATTC GTTTGTGGCC
GACTTTGGGA TTACCGGGGA GCAAGCAGCC CAACTGCAGG CGCTCAAGCT GCCGACCGGG
TGGGAACCCT ATTCAATCCC GGCCCTGAAT TTGTTTCTTG CAGAACTTGA AAAGGGTGAG
AGATTTGGCG CCCTTGTAAA TGGCCCGGAT TGGGAAGGTT GGCGGCGCAC CAACTTCCCC
CATCGCAATC AGCCAACCGG CGAAATTCTA GACAAACTGC CAAGCCCGGC CTCGAAAGAA
GAACGCGAGC GGATTTCCCA ACTCCGAAAT CCAACGGTTG TCCGCACTCA AAACGAACTC
CGAAAAGTCG TGAATAATCT TATCGGGCTT TACGGGAAGC CCGACCGAAT TCGCATTGAA
GTCGGCCGCG ACGTCGGAAA GTCGAAGCGG GAGCGCGAGG AAATTCAGAG CGGGATTCGC
CGCAACGAAA AACAGCGAAA GAAAGCGACT GAAGACTTAA TCAAGAATGG CATCGCTAAT
CCGTCTCGCG ACGACGTTGA GAAATGGATT CTTTGGAAGG AAGGACAGGA ACGCTGCCCA
TATACTGGCG ACCAAATTGG CTTCAATGCG CTGTTTCGCG AAGGCCGGTA TGAAGTTGAA
CATATATGGC CCCGTTCACG CTCCTTCGAT AACAGCCCGC GCAACAAAAC GCTTTGCCGA
AAGGATGTGA ACATCGAGAA AGGCAACCGG ATGCCCTTCG AGGCATTCGG CCACGATGAA
GACCGCTGGT CCGCTATCCA AATACGCCTG CAAGGCATGG TGTCTGCGAA GGGTGGAACC
GGCATGTCAC CCGGCAAGGT AAAGCGATTT CTTGCCAAGA CTATGCCCGA GGATTTTGCA
GCCCGCCAAT TGAATGACAC TCGCTATGCG GCAAAGCAAA TTCTTGCGCA GCTCAAACGC
TTGTGGCCCG ACATGGGCCC CGAAGCGCCT GTCAAAGTCG AGGCCGTCAC CGGCCAAGTG
ACCGCGCAAC TTCGGAAGTT GTGGACGCTC AACAACATAC TTGCGGACGA CGGGGAAAAG
ACACGAGCGG ATCACCGGCA TCATGCAATT GATGCCTTGA CGGTTGCCTG CACTCATCCG
GGCATGACCA ACAAGCTTTC ACGCTATTGG CAGTTGAGAG ACGATCCGCG CGCAGAAAAG
CCGGCGCTTA CGCCTCCTTG GGACACCATC CGCGCAGATG CAGAGAAAGC CGTGAGCGAG
ATAGTAGTGT CTCATCGCGT CCGGAAGAAA GTTTCCGGCC CGCTCCACAA GGAAACGACA
TATGGCGACA CAGGAACCGA CATAAAAACC AAGAGCGGAA CCTATCGTCA GTTTGTCACG
CGCAAGAAAA TCGAATCTCT ATCGAAAGGC GAACTTGACG AAATACGCGA TCCGCGCATC
AAGGAGATTG TTGCGGCGCA TGTTGCCGGG CGCGGCGGCG ATCCCAAAAA GGCGTTTCCG
CCATATCCAT GCGTATCGCC CGGCGGACCG GAGATTCGAA AGGTGAGACT CACGTCAAAG
CAACAGCTGA ACCTCATGGC CCAAACTGGC AATGGATATG CCGACCTTGG CTCCAATCAT
CATATAGCGA TCTATCGCTT GCCGGACGGG AAAGCTGATT TTGAGATTGT GAGTTTGTTC
GATGCCTCTC GGCGACTGGC CCAAAGAAAT CCCATCGTTC AAAGAACGCG GGCTGATGGG
GCGAGCTTTG TCATGTCTCT GGCCGCCGGC GAAGCAATAA TGATTCCGGA AGGAAGCAAA
AAGGGAATTT GGATCGTTCA GGGTGTTTGG GCCAGCGGCC AAGTAGTTTT GGAGCGCGAC
ACTGATGCTG ACCATTCAAC TACCACCAGA CCCATGCCAA ACCCGATACT AAAGGATGAC
GCAAAGAAAG TTTCAATTGA CCCAATCGGT CGAGTTCGGC CATCGAACGA CTGA
 
Protein sequence
MERIFGFDIG TTSIGFSVID YSSTQSAGNI QRLGVRIFPE ARDPDGTPLN QQRRQKRMMR 
RQLRRRRIRR KALNETLHEA GFLPAYGSAD WPVVMADEPY ELRRRGLEEG LSAYEFGRAI
YHLAQHRHFK GRELEESDTP DPDVDDEKEA ANERAATLKA LKNEQTTLGA WLARRPPSDR
KRGIHAHRNV VAEEFERLWE VQSKFHPALK SEEMRARISD TIFAQRPVFW RKNTLGECRF
MPGEPLCPKG SWLSQQRRML EKLNNLAIAG GNARPLDAEE RDAILSKLQQ QASMSWPGVR
SALKALYKQR GEPGAEKSLK FNLELGGESK LLGNALEAKL ADMFGPDWPA HPRKQEIRHA
VHERLWAADY GETPDKKRVI ILSEKDRKAH REAAANSFVA DFGITGEQAA QLQALKLPTG
WEPYSIPALN LFLAELEKGE RFGALVNGPD WEGWRRTNFP HRNQPTGEIL DKLPSPASKE
ERERISQLRN PTVVRTQNEL RKVVNNLIGL YGKPDRIRIE VGRDVGKSKR EREEIQSGIR
RNEKQRKKAT EDLIKNGIAN PSRDDVEKWI LWKEGQERCP YTGDQIGFNA LFREGRYEVE
HIWPRSRSFD NSPRNKTLCR KDVNIEKGNR MPFEAFGHDE DRWSAIQIRL QGMVSAKGGT
GMSPGKVKRF LAKTMPEDFA ARQLNDTRYA AKQILAQLKR LWPDMGPEAP VKVEAVTGQV
TAQLRKLWTL NNILADDGEK TRADHRHHAI DALTVACTHP GMTNKLSRYW QLRDDPRAEK
PALTPPWDTI RADAEKAVSE IVVSHRVRKK VSGPLHKETT YGDTGTDIKT KSGTYRQFVT
RKKIESLSKG ELDEIRDPRI KEIVAAHVAG RGGDPKKAFP PYPCVSPGGP EIRKVRLTSK
QQLNLMAQTG NGYADLGSNH HIAIYRLPDG KADFEIVSLF DASRRLAQRN PIVQRTRADG
ASFVMSLAAG EAIMIPEGSK KGIWIVQGVW ASGQVVLERD TDADHSTTTR PMPNPILKDD
AKKVSIDPIG RVRPSND
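
The gene and protein sequences above are printed in space-separated blocks of ten. A minimal sketch of how one might strip that layout, translate the nucleotide sequence, and compute its GC percentage is given below. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and not part of any database tooling. The codon table is the standard one, whose amino-acid assignments translation table 11 shares. The sketch does not handle alternative start codons, which would be read as Met at the first position.

```python
# Standard codon table in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First two display blocks of the gene sequence above.
    head = clean("ATGGAGCGGA TTTTCGGCTT")
    print(translate(head))  # MERIFG
```

The first six codons translate to `MERIFG`, matching the start of the protein sequence. Passing the whole cleaned gene sequence to `translate` and dropping the final `*` (stop) should, under the assumptions above, reproduce the listed protein.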