Gene Veis_1230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_1230 
Symbol 
ID4695198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp1365979 
End bp1369185 
Gene Length3207 bp 
Protein Length1068 aa 
Translation table11 
GC content61% 
IMG OID639849004 
ProductCRISPR-associated endonuclease Csn1 family protein 
Protein accessionYP_996018 
Protein GI121608211 
COG category[S] Function unknown 
COG ID[COG3513] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01865] CRISPR-associated protein, Csn1 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGG CCGTCGCTTA TCGCCTGGCG CTGGATCTGG GATCGACTTC GCTGGGATGG 
GCTATTTTTC GGCTGAATGA GGCGCGTGAG CCTACGGCCA TCATCAGGGC GGGGGTTCGC
ATCTTCAGCG ATGGTCGCAA TGCCAACAGC GAGCCGCTGG CAGTCCACCG CCGCGTGGCC
CGCGCCATGC GCCGGCGCCG GGACCGTTTG CTCAAGCGCA AAAAACGCAT GCACGATCAA
CTGGTGCAGC ATGGCTTTTT TCCTGCCGAG GTGGCCGAGC GCAAGGAACT GGAGCGGCTG
AACCCCTACC AGTTGCGCGC CAAGGGCTTG CATGAGGCCC TGACCCCGGG CGAGTTTGCC
CGGGCGCTGT TCCACATCAA CCAGCGCCGG GGTTTCAAAA GCAACCGCAA GACCGATCGC
AAGGACAATG ACAGTGGCGC GCTCAAGCAA GCCATTAGCG AACTGCGCAA GCGCATCCAG
GACAGCGACT GCGCCACGGC AGGCGAGTGG TTCTGGAAGG AGCGCATGCA GCAAAAGCCC
GAGGGCGTGC GCGGCCAATG GGTGCGGGCA CGCTACCGTA AAACGCCTTC CACCACTGAC
GAGGGTAAAA AGCGCATCGG CTATGACCTG TATGTGGACC GCGCCATGGT TGAACAGGAG
TTTGATGCGC TGTGGGCCGC GCAAGCGGCG TTGCAGCCTG GCTTGTTCAC CGAAGCGGCG
CGTGCGGAAT TGAAAGACAC GCTGCTGTAC CAACGCGATC TGCGCCCTGT CAAGCCTGGC
CGCTGCACGC TGCTGCCTGC AGAAGAACGC GCCCCGCTGG CCTTGCCCAG CACGCAGCGC
TTTCGCATCC TGCAAGAGGT GAACAATCTG CGTCTCCTGG ACGAAGCACT GCGCGAAGTG
CCACTGAACC TGGCGCAGCG TGATGCAGTG GTGAGCGCGC TGGAGGGCAA GAAAGAACTG
AGCTTTGCGG CAATACGCAA GTTGCTCACG CTCTCGGGCA AGTTCAACCT GGAAGACGAA
AAGCGCGCCG AACTCAAAGG CAACGCCACC AGCGTCATAC TGGCCCGCAA GGACTTGTTT
GGCGACGCTT GGGCGGGGTT TGACGCAGCC GTGCAAGACG AGATCGTTTG GCGACTGGTC
AGTCAGGAGA GCGAGGGCGC CTTGATCGCC TGGTTGCAGC AACACACGGG GGTGGATGGG
GTGTGCGCTG AGGCCATCGT CAACACCCGT TTGCCCGATG GCTATGGCCG CCTGAGCCGC
AAGGCCTTGG AGCGCATCGT GCCTGCGTTG CAGCGCGAGG TTTGCACCTA CGACAAGGCC
GTGCAGGCAG CCGGTTTTGC GCACCATAGC GACCTGGGTT TTGACTTTGA TTACGCCGAG
GATGAGGTGC AGCAGGTGGG CGAACACACC ATCGCATCAA CTGGCGAGGT GCAAGCGCAG
TACGCTTTCA AGCAACTGCC GTACTACGGC AAGGCCCTGC AGCGCCATGT GGCCTTTGGC
AGTGGCGATG CCAAAGACCA TGAGGAAAAG TGCTACGGCA AGATCGCCAA CCCCACGGTA
CACATCGGGC TGAACCAGGT GCGCACCGTG GTCAATGCGC TGATCCGCCG CTATGGTCAC
CCCACCGAAG TGGTGGTGGA GCTGGCGCGT GACTTGAAGC AAAGTCGCGA GCAAAAGCAG
CAGACGCAGC GCGAACAGGC TGACAACCAG AAGCGCAACG AGGACATTCG TAAGCGCATT
GCTCCAATCC TGGAAACCAG TCCCGAGCGT GTGCGCGACC GTGATATCAA AAAATGGATC
TTGTGGGAAG AACTGAACAA AAAGGATATC GCTGACCGTC ACTGTCCCTA CAGCGGCGAA
CGGATCAGCG CCACTATGCT ACTTAGCGAA GCGGTGGAAA TCGAGCATAT CCTGCCGTTT
TCGAGAACGC TGGACGACAG CCTGAACAAC CGCACCGTGG CCATGCGCCG GGCCAATCGC
ATCAAGGGCG ACCGCACCCC TTGGGAGGCG CGGGCTGACT TTGAAGCCCA GGGTTGGCGC
TATGAAGCCA TCTTGCAACG CGCCGAGGGC ATGCCGCCGC GCAAGCGCTA CCGATTTGCG
GAAGATGGCT ACCAGCGCTG GTTGGGCAAG GACCGGAACT TTCTGGCCCG TGCGCTCAAT
GACACGAGCT ACCTCTCGCG GCTGGCTGCC AACTATTTGC GGCTGGTGTG TCCGCAAGGG
GTGCGGGTGA TTCCGGGGCA GATGACGGAC AAGCTGCGCG GCAAGTTCGG GTTGAACTCT
GTGCTGGGTT TGGACGGCAA AAAAAACCGC AACGACCACC GCCATCATGC GGTCGATGCC
TGCGTGATCG GCGTGACCGA CCAGGGCCTG ATGCAGCGCT TTGCCAACGC CAGCAAACAA
GCCCGTGAAA ATGGCTTGAC CCGGCTGGTG CAAGACATGC TCTTGCCTTG GTGGCCCAGC
TACTACGACC ATGTGGAGCG TGCAGTGCGC CATATCCGGG TCAGTCACCG GCCAGACCAT
GGCTTTGAGG GGGCGATGAT GAAAGGGACC GCCCACGGCA TTCGCGAGGA CGGCATCCGC
GAGGACGGCA GAATCAAGCA ACGCCCCAAG GCCAAAGGTA GCGCGGACCA CAAGACCATC
ACCCTCATTC CTATTGACGA GCCTCGCCAA CTCGCCCGCC ATGGCGTGGA TGCAGAAGGC
AAGCCCCTGC CGTACAAGGG CTATGCGAGT GGTAGCAACT ATTGCATCGA GATCACGAAG
AACGGCAAGG GCAAGTGGGA GGGGCAGGTG ATCTCGACGT TTGATGCCTA TCGCATCAAG
GCCGCTGCCG ATGCGGCGGC GCGTGCCGGG CAGGTGATCT CGACGGTTGA AGCCGATTCC
ATCGTGCGGA AATCAGGCTG GGAGCGTCTG CGCGGCGCGC AGAGCCAAAA CGGGCAGCCG
CTGGTGATGC GCTTGGTGAT TGGGGATAGT GTCAGGATGG AAGTCGATGG GCGCGATGAG
GTGATGCGTG TTGTGAAAAT GAGCGGGAAA GAGATGGTTT TTGCGCCTGT GCGTGAAGCA
AACGTGGATA AACGCAATAA CATGCCGGAC GAGCAGGATC CATTTACTTA CACCTACAAA
CGTGCTGACC AACTGCGCAA AGCCAAGGCC CGCCAAGTCA CCATCTCCCC CATAGGCGAA
CTGCGCGACC CAGGCTTCAA AGGCTGA
 
Protein sequence
MNKAVAYRLA LDLGSTSLGW AIFRLNEARE PTAIIRAGVR IFSDGRNANS EPLAVHRRVA 
RAMRRRRDRL LKRKKRMHDQ LVQHGFFPAE VAERKELERL NPYQLRAKGL HEALTPGEFA
RALFHINQRR GFKSNRKTDR KDNDSGALKQ AISELRKRIQ DSDCATAGEW FWKERMQQKP
EGVRGQWVRA RYRKTPSTTD EGKKRIGYDL YVDRAMVEQE FDALWAAQAA LQPGLFTEAA
RAELKDTLLY QRDLRPVKPG RCTLLPAEER APLALPSTQR FRILQEVNNL RLLDEALREV
PLNLAQRDAV VSALEGKKEL SFAAIRKLLT LSGKFNLEDE KRAELKGNAT SVILARKDLF
GDAWAGFDAA VQDEIVWRLV SQESEGALIA WLQQHTGVDG VCAEAIVNTR LPDGYGRLSR
KALERIVPAL QREVCTYDKA VQAAGFAHHS DLGFDFDYAE DEVQQVGEHT IASTGEVQAQ
YAFKQLPYYG KALQRHVAFG SGDAKDHEEK CYGKIANPTV HIGLNQVRTV VNALIRRYGH
PTEVVVELAR DLKQSREQKQ QTQREQADNQ KRNEDIRKRI APILETSPER VRDRDIKKWI
LWEELNKKDI ADRHCPYSGE RISATMLLSE AVEIEHILPF SRTLDDSLNN RTVAMRRANR
IKGDRTPWEA RADFEAQGWR YEAILQRAEG MPPRKRYRFA EDGYQRWLGK DRNFLARALN
DTSYLSRLAA NYLRLVCPQG VRVIPGQMTD KLRGKFGLNS VLGLDGKKNR NDHRHHAVDA
CVIGVTDQGL MQRFANASKQ ARENGLTRLV QDMLLPWWPS YYDHVERAVR HIRVSHRPDH
GFEGAMMKGT AHGIREDGIR EDGRIKQRPK AKGSADHKTI TLIPIDEPRQ LARHGVDAEG
KPLPYKGYAS GSNYCIEITK NGKGKWEGQV ISTFDAYRIK AAADAAARAG QVISTVEADS
IVRKSGWERL RGAQSQNGQP LVMRLVIGDS VRMEVDGRDE VMRVVKMSGK EMVFAPVREA
NVDKRNNMPD EQDPFTYTYK RADQLRKAKA RQVTISPIGE LRDPGFKG