Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_1230 |
Symbol | |
ID | 4695198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | + |
Start bp | 1365979 |
End bp | 1369185 |
Gene Length | 3207 bp |
Protein Length | 1068 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639849004 |
Product | CRISPR-associated endonuclease Csn1 family protein |
Protein accession | YP_996018 |
Protein GI | 121608211 |
COG category | [S] Function unknown |
COG ID | [COG3513] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01865] CRISPR-associated protein, Csn1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGG CCGTCGCTTA TCGCCTGGCG CTGGATCTGG GATCGACTTC GCTGGGATGG GCTATTTTTC GGCTGAATGA GGCGCGTGAG CCTACGGCCA TCATCAGGGC GGGGGTTCGC ATCTTCAGCG ATGGTCGCAA TGCCAACAGC GAGCCGCTGG CAGTCCACCG CCGCGTGGCC CGCGCCATGC GCCGGCGCCG GGACCGTTTG CTCAAGCGCA AAAAACGCAT GCACGATCAA CTGGTGCAGC ATGGCTTTTT TCCTGCCGAG GTGGCCGAGC GCAAGGAACT GGAGCGGCTG AACCCCTACC AGTTGCGCGC CAAGGGCTTG CATGAGGCCC TGACCCCGGG CGAGTTTGCC CGGGCGCTGT TCCACATCAA CCAGCGCCGG GGTTTCAAAA GCAACCGCAA GACCGATCGC AAGGACAATG ACAGTGGCGC GCTCAAGCAA GCCATTAGCG AACTGCGCAA GCGCATCCAG GACAGCGACT GCGCCACGGC AGGCGAGTGG TTCTGGAAGG AGCGCATGCA GCAAAAGCCC GAGGGCGTGC GCGGCCAATG GGTGCGGGCA CGCTACCGTA AAACGCCTTC CACCACTGAC GAGGGTAAAA AGCGCATCGG CTATGACCTG TATGTGGACC GCGCCATGGT TGAACAGGAG TTTGATGCGC TGTGGGCCGC GCAAGCGGCG TTGCAGCCTG GCTTGTTCAC CGAAGCGGCG CGTGCGGAAT TGAAAGACAC GCTGCTGTAC CAACGCGATC TGCGCCCTGT CAAGCCTGGC CGCTGCACGC TGCTGCCTGC AGAAGAACGC GCCCCGCTGG CCTTGCCCAG CACGCAGCGC TTTCGCATCC TGCAAGAGGT GAACAATCTG CGTCTCCTGG ACGAAGCACT GCGCGAAGTG CCACTGAACC TGGCGCAGCG TGATGCAGTG GTGAGCGCGC TGGAGGGCAA GAAAGAACTG AGCTTTGCGG CAATACGCAA GTTGCTCACG CTCTCGGGCA AGTTCAACCT GGAAGACGAA AAGCGCGCCG AACTCAAAGG CAACGCCACC AGCGTCATAC TGGCCCGCAA GGACTTGTTT GGCGACGCTT GGGCGGGGTT TGACGCAGCC GTGCAAGACG AGATCGTTTG GCGACTGGTC AGTCAGGAGA GCGAGGGCGC CTTGATCGCC TGGTTGCAGC AACACACGGG GGTGGATGGG GTGTGCGCTG AGGCCATCGT CAACACCCGT TTGCCCGATG GCTATGGCCG CCTGAGCCGC AAGGCCTTGG AGCGCATCGT GCCTGCGTTG CAGCGCGAGG TTTGCACCTA CGACAAGGCC GTGCAGGCAG CCGGTTTTGC GCACCATAGC GACCTGGGTT TTGACTTTGA TTACGCCGAG GATGAGGTGC AGCAGGTGGG CGAACACACC ATCGCATCAA CTGGCGAGGT GCAAGCGCAG TACGCTTTCA AGCAACTGCC GTACTACGGC AAGGCCCTGC AGCGCCATGT GGCCTTTGGC AGTGGCGATG CCAAAGACCA TGAGGAAAAG TGCTACGGCA AGATCGCCAA CCCCACGGTA CACATCGGGC TGAACCAGGT GCGCACCGTG GTCAATGCGC TGATCCGCCG CTATGGTCAC CCCACCGAAG TGGTGGTGGA GCTGGCGCGT GACTTGAAGC AAAGTCGCGA GCAAAAGCAG CAGACGCAGC GCGAACAGGC TGACAACCAG AAGCGCAACG AGGACATTCG TAAGCGCATT GCTCCAATCC TGGAAACCAG TCCCGAGCGT GTGCGCGACC GTGATATCAA AAAATGGATC TTGTGGGAAG AACTGAACAA AAAGGATATC GCTGACCGTC ACTGTCCCTA CAGCGGCGAA CGGATCAGCG CCACTATGCT ACTTAGCGAA GCGGTGGAAA TCGAGCATAT CCTGCCGTTT TCGAGAACGC TGGACGACAG CCTGAACAAC CGCACCGTGG CCATGCGCCG GGCCAATCGC ATCAAGGGCG ACCGCACCCC TTGGGAGGCG CGGGCTGACT TTGAAGCCCA GGGTTGGCGC TATGAAGCCA TCTTGCAACG CGCCGAGGGC ATGCCGCCGC GCAAGCGCTA CCGATTTGCG GAAGATGGCT ACCAGCGCTG GTTGGGCAAG GACCGGAACT TTCTGGCCCG TGCGCTCAAT GACACGAGCT ACCTCTCGCG GCTGGCTGCC AACTATTTGC GGCTGGTGTG TCCGCAAGGG GTGCGGGTGA TTCCGGGGCA GATGACGGAC AAGCTGCGCG GCAAGTTCGG GTTGAACTCT GTGCTGGGTT TGGACGGCAA AAAAAACCGC AACGACCACC GCCATCATGC GGTCGATGCC TGCGTGATCG GCGTGACCGA CCAGGGCCTG ATGCAGCGCT TTGCCAACGC CAGCAAACAA GCCCGTGAAA ATGGCTTGAC CCGGCTGGTG CAAGACATGC TCTTGCCTTG GTGGCCCAGC TACTACGACC ATGTGGAGCG TGCAGTGCGC CATATCCGGG TCAGTCACCG GCCAGACCAT GGCTTTGAGG GGGCGATGAT GAAAGGGACC GCCCACGGCA TTCGCGAGGA CGGCATCCGC GAGGACGGCA GAATCAAGCA ACGCCCCAAG GCCAAAGGTA GCGCGGACCA CAAGACCATC ACCCTCATTC CTATTGACGA GCCTCGCCAA CTCGCCCGCC ATGGCGTGGA TGCAGAAGGC AAGCCCCTGC CGTACAAGGG CTATGCGAGT GGTAGCAACT ATTGCATCGA GATCACGAAG AACGGCAAGG GCAAGTGGGA GGGGCAGGTG ATCTCGACGT TTGATGCCTA TCGCATCAAG GCCGCTGCCG ATGCGGCGGC GCGTGCCGGG CAGGTGATCT CGACGGTTGA AGCCGATTCC ATCGTGCGGA AATCAGGCTG GGAGCGTCTG CGCGGCGCGC AGAGCCAAAA CGGGCAGCCG CTGGTGATGC GCTTGGTGAT TGGGGATAGT GTCAGGATGG AAGTCGATGG GCGCGATGAG GTGATGCGTG TTGTGAAAAT GAGCGGGAAA GAGATGGTTT TTGCGCCTGT GCGTGAAGCA AACGTGGATA AACGCAATAA CATGCCGGAC GAGCAGGATC CATTTACTTA CACCTACAAA CGTGCTGACC AACTGCGCAA AGCCAAGGCC CGCCAAGTCA CCATCTCCCC CATAGGCGAA CTGCGCGACC CAGGCTTCAA AGGCTGA
|
Protein sequence | MNKAVAYRLA LDLGSTSLGW AIFRLNEARE PTAIIRAGVR IFSDGRNANS EPLAVHRRVA RAMRRRRDRL LKRKKRMHDQ LVQHGFFPAE VAERKELERL NPYQLRAKGL HEALTPGEFA RALFHINQRR GFKSNRKTDR KDNDSGALKQ AISELRKRIQ DSDCATAGEW FWKERMQQKP EGVRGQWVRA RYRKTPSTTD EGKKRIGYDL YVDRAMVEQE FDALWAAQAA LQPGLFTEAA RAELKDTLLY QRDLRPVKPG RCTLLPAEER APLALPSTQR FRILQEVNNL RLLDEALREV PLNLAQRDAV VSALEGKKEL SFAAIRKLLT LSGKFNLEDE KRAELKGNAT SVILARKDLF GDAWAGFDAA VQDEIVWRLV SQESEGALIA WLQQHTGVDG VCAEAIVNTR LPDGYGRLSR KALERIVPAL QREVCTYDKA VQAAGFAHHS DLGFDFDYAE DEVQQVGEHT IASTGEVQAQ YAFKQLPYYG KALQRHVAFG SGDAKDHEEK CYGKIANPTV HIGLNQVRTV VNALIRRYGH PTEVVVELAR DLKQSREQKQ QTQREQADNQ KRNEDIRKRI APILETSPER VRDRDIKKWI LWEELNKKDI ADRHCPYSGE RISATMLLSE AVEIEHILPF SRTLDDSLNN RTVAMRRANR IKGDRTPWEA RADFEAQGWR YEAILQRAEG MPPRKRYRFA EDGYQRWLGK DRNFLARALN DTSYLSRLAA NYLRLVCPQG VRVIPGQMTD KLRGKFGLNS VLGLDGKKNR NDHRHHAVDA CVIGVTDQGL MQRFANASKQ ARENGLTRLV QDMLLPWWPS YYDHVERAVR HIRVSHRPDH GFEGAMMKGT AHGIREDGIR EDGRIKQRPK AKGSADHKTI TLIPIDEPRQ LARHGVDAEG KPLPYKGYAS GSNYCIEITK NGKGKWEGQV ISTFDAYRIK AAADAAARAG QVISTVEADS IVRKSGWERL RGAQSQNGQP LVMRLVIGDS VRMEVDGRDE VMRVVKMSGK EMVFAPVREA NVDKRNNMPD EQDPFTYTYK RADQLRKAKA RQVTISPIGE LRDPGFKG
|
| |