Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_0099 |
Symbol | |
ID | 5454634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | - |
Start bp | 105795 |
End bp | 108908 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640875658 |
Product | CRISPR-associated endonuclease Csn1 family protein |
Protein accession | YP_001411379 |
Protein GI | 154250555 |
COG category | [S] Function unknown |
COG ID | [COG3513] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01865] CRISPR-associated protein, Csn1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.28892 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.129983 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCGGA TTTTCGGCTT CGATATTGGC ACTACATCCA TCGGATTTTC GGTTATTGAC TACAGTTCAA CGCAATCGGC CGGGAATATT CAGAGACTTG GCGTCCGGAT ATTTCCGGAA GCCCGCGATC CGGACGGGAC CCCTCTAAAC CAGCAACGCA GGCAAAAGCG GATGATGCGC CGTCAGTTGC GGCGGCGGCG GATACGGAGG AAGGCGCTCA ACGAGACACT GCATGAAGCG GGCTTTCTGC CCGCCTATGG TTCTGCCGAT TGGCCAGTTG TCATGGCGGA TGAACCTTAC GAGTTGCGCA GGCGAGGTCT GGAAGAAGGC TTGAGCGCAT ATGAGTTCGG GCGAGCGATC TACCATCTTG CGCAGCACCG CCATTTCAAG GGCCGCGAAC TTGAAGAAAG CGATACTCCC GACCCTGACG TCGATGACGA AAAGGAGGCG GCGAACGAGC GGGCAGCAAC CCTGAAGGCT CTCAAGAACG AGCAAACCAC ATTGGGCGCG TGGCTTGCAC GCCGACCGCC ATCGGACCGC AAACGCGGAA TCCACGCGCA TCGCAATGTA GTTGCGGAAG AGTTCGAAAG ACTTTGGGAA GTTCAATCGA AGTTTCATCC GGCGCTCAAA AGCGAAGAAA TGAGGGCAAG GATTAGCGAC ACGATCTTTG CCCAGCGGCC GGTATTCTGG CGCAAGAACA CTTTGGGCGA ATGCCGCTTC ATGCCGGGCG AGCCTCTATG TCCGAAAGGC TCCTGGCTTT CGCAACAGCG ACGGATGCTG GAAAAGCTGA ACAATCTCGC GATAGCAGGT GGCAATGCCC GGCCGCTGGA CGCCGAAGAG AGGGATGCAA TTCTCTCCAA GCTTCAGCAA CAAGCCTCCA TGTCGTGGCC GGGCGTCCGA AGCGCATTGA AGGCGCTGTA TAAACAGCGG GGCGAACCCG GCGCGGAGAA AAGCCTCAAA TTCAATCTGG AACTGGGTGG CGAGAGCAAA CTCCTTGGCA ACGCACTCGA AGCCAAGCTA GCCGACATGT TCGGGCCAGA CTGGCCTGCG CATCCTCGCA AGCAGGAAAT CCGCCATGCC GTCCATGAGC GCCTATGGGC GGCGGACTAT GGCGAAACGC CAGACAAGAA GCGCGTGATT ATCCTCTCGG AGAAAGACAG GAAGGCGCAT AGGGAAGCCG CTGCAAATTC GTTTGTGGCC GACTTTGGGA TTACCGGGGA GCAAGCAGCC CAACTGCAGG CGCTCAAGCT GCCGACCGGG TGGGAACCCT ATTCAATCCC GGCCCTGAAT TTGTTTCTTG CAGAACTTGA AAAGGGTGAG AGATTTGGCG CCCTTGTAAA TGGCCCGGAT TGGGAAGGTT GGCGGCGCAC CAACTTCCCC CATCGCAATC AGCCAACCGG CGAAATTCTA GACAAACTGC CAAGCCCGGC CTCGAAAGAA GAACGCGAGC GGATTTCCCA ACTCCGAAAT CCAACGGTTG TCCGCACTCA AAACGAACTC CGAAAAGTCG TGAATAATCT TATCGGGCTT TACGGGAAGC CCGACCGAAT TCGCATTGAA GTCGGCCGCG ACGTCGGAAA GTCGAAGCGG GAGCGCGAGG AAATTCAGAG CGGGATTCGC CGCAACGAAA AACAGCGAAA GAAAGCGACT GAAGACTTAA TCAAGAATGG CATCGCTAAT CCGTCTCGCG ACGACGTTGA GAAATGGATT CTTTGGAAGG AAGGACAGGA ACGCTGCCCA TATACTGGCG ACCAAATTGG CTTCAATGCG CTGTTTCGCG AAGGCCGGTA TGAAGTTGAA CATATATGGC CCCGTTCACG CTCCTTCGAT AACAGCCCGC GCAACAAAAC GCTTTGCCGA AAGGATGTGA ACATCGAGAA AGGCAACCGG ATGCCCTTCG AGGCATTCGG CCACGATGAA GACCGCTGGT CCGCTATCCA AATACGCCTG CAAGGCATGG TGTCTGCGAA GGGTGGAACC GGCATGTCAC CCGGCAAGGT AAAGCGATTT CTTGCCAAGA CTATGCCCGA GGATTTTGCA GCCCGCCAAT TGAATGACAC TCGCTATGCG GCAAAGCAAA TTCTTGCGCA GCTCAAACGC TTGTGGCCCG ACATGGGCCC CGAAGCGCCT GTCAAAGTCG AGGCCGTCAC CGGCCAAGTG ACCGCGCAAC TTCGGAAGTT GTGGACGCTC AACAACATAC TTGCGGACGA CGGGGAAAAG ACACGAGCGG ATCACCGGCA TCATGCAATT GATGCCTTGA CGGTTGCCTG CACTCATCCG GGCATGACCA ACAAGCTTTC ACGCTATTGG CAGTTGAGAG ACGATCCGCG CGCAGAAAAG CCGGCGCTTA CGCCTCCTTG GGACACCATC CGCGCAGATG CAGAGAAAGC CGTGAGCGAG ATAGTAGTGT CTCATCGCGT CCGGAAGAAA GTTTCCGGCC CGCTCCACAA GGAAACGACA TATGGCGACA CAGGAACCGA CATAAAAACC AAGAGCGGAA CCTATCGTCA GTTTGTCACG CGCAAGAAAA TCGAATCTCT ATCGAAAGGC GAACTTGACG AAATACGCGA TCCGCGCATC AAGGAGATTG TTGCGGCGCA TGTTGCCGGG CGCGGCGGCG ATCCCAAAAA GGCGTTTCCG CCATATCCAT GCGTATCGCC CGGCGGACCG GAGATTCGAA AGGTGAGACT CACGTCAAAG CAACAGCTGA ACCTCATGGC CCAAACTGGC AATGGATATG CCGACCTTGG CTCCAATCAT CATATAGCGA TCTATCGCTT GCCGGACGGG AAAGCTGATT TTGAGATTGT GAGTTTGTTC GATGCCTCTC GGCGACTGGC CCAAAGAAAT CCCATCGTTC AAAGAACGCG GGCTGATGGG GCGAGCTTTG TCATGTCTCT GGCCGCCGGC GAAGCAATAA TGATTCCGGA AGGAAGCAAA AAGGGAATTT GGATCGTTCA GGGTGTTTGG GCCAGCGGCC AAGTAGTTTT GGAGCGCGAC ACTGATGCTG ACCATTCAAC TACCACCAGA CCCATGCCAA ACCCGATACT AAAGGATGAC GCAAAGAAAG TTTCAATTGA CCCAATCGGT CGAGTTCGGC CATCGAACGA CTGA
|
Protein sequence | MERIFGFDIG TTSIGFSVID YSSTQSAGNI QRLGVRIFPE ARDPDGTPLN QQRRQKRMMR RQLRRRRIRR KALNETLHEA GFLPAYGSAD WPVVMADEPY ELRRRGLEEG LSAYEFGRAI YHLAQHRHFK GRELEESDTP DPDVDDEKEA ANERAATLKA LKNEQTTLGA WLARRPPSDR KRGIHAHRNV VAEEFERLWE VQSKFHPALK SEEMRARISD TIFAQRPVFW RKNTLGECRF MPGEPLCPKG SWLSQQRRML EKLNNLAIAG GNARPLDAEE RDAILSKLQQ QASMSWPGVR SALKALYKQR GEPGAEKSLK FNLELGGESK LLGNALEAKL ADMFGPDWPA HPRKQEIRHA VHERLWAADY GETPDKKRVI ILSEKDRKAH REAAANSFVA DFGITGEQAA QLQALKLPTG WEPYSIPALN LFLAELEKGE RFGALVNGPD WEGWRRTNFP HRNQPTGEIL DKLPSPASKE ERERISQLRN PTVVRTQNEL RKVVNNLIGL YGKPDRIRIE VGRDVGKSKR EREEIQSGIR RNEKQRKKAT EDLIKNGIAN PSRDDVEKWI LWKEGQERCP YTGDQIGFNA LFREGRYEVE HIWPRSRSFD NSPRNKTLCR KDVNIEKGNR MPFEAFGHDE DRWSAIQIRL QGMVSAKGGT GMSPGKVKRF LAKTMPEDFA ARQLNDTRYA AKQILAQLKR LWPDMGPEAP VKVEAVTGQV TAQLRKLWTL NNILADDGEK TRADHRHHAI DALTVACTHP GMTNKLSRYW QLRDDPRAEK PALTPPWDTI RADAEKAVSE IVVSHRVRKK VSGPLHKETT YGDTGTDIKT KSGTYRQFVT RKKIESLSKG ELDEIRDPRI KEIVAAHVAG RGGDPKKAFP PYPCVSPGGP EIRKVRLTSK QQLNLMAQTG NGYADLGSNH HIAIYRLPDG KADFEIVSLF DASRRLAQRN PIVQRTRADG ASFVMSLAAG EAIMIPEGSK KGIWIVQGVW ASGQVVLERD TDADHSTTTR PMPNPILKDD AKKVSIDPIG RVRPSND
|
| |