Gene Veis_1568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_1568 
Symbol 
ID4694933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp1754362 
End bp1757310 
Gene Length2949 bp 
Protein Length982 aa 
Translation table11 
GC content63% 
IMG OID639849332 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_996345 
Protein GI121608538 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0336191 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value2.49551e-05 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAGCAAAC CCCTGAAAAT CAGCGAAGCG GGCACCGTGC AGTTCCCGAT GGTGCGCCAC 
GCGGCGGAAA TCGGCTGGAC GACCATCACG CCCGACGATG CCCGCGCCAA GCGCGGCGGC
GATGCGGGCA CGTTCTTCCG CGACGTGCTG GAAGCCAAGC TCGCGGCCTT CAACCCGTGG
CTGACGGCGG ATGCCATCCG CTCCATCGTG GAAACGCTGG ACGCGCTGCC GCCCGGCATC
GAAGGCAACC GCGAACTGCT GGGCTGGCTG CGCGGCGAAC GCCAGTGGTA CGACGAGGCG
GAAAAGCGCC ACCGAGCGGT GACGCTGATC GACTTCGATC ACCCGGCGGA AAACGCCTTC
CACGTCACTT GGGAATGGAA GATCAAACCG CCCGCGCGGC CCAAGGGCAA CCGCGCCGAC
GTGATGTTCG TGGTCAACGG CGTGCCGGTG GTCATCGTCG AGCACAAGAA TCCGAAGGAC
GGCGACGCCA TCGAGCGCGC TATCAAGCAA CTGCGCCGCT ACGAAATGGA AACACCGGAA
CTGCTGGCGA CTGCGCAGTT GTTCAACGTC ACCCATCTGC TCGATTACTG GTACGGCGTG
ACCTGGAACG CCACCCGCCG CGACATGGCG CGCTGGAAGC AGGCCCCGGA GGAAAGTTAC
CGCTTCGCGG TGCAGGCCTT CTTTGAGCCG ACCGAGTTCC TGCGCACCTT GCAGCACTGG
ATTCTGTTCT ACGTGCAGGA CGGCGAAACG CGCAAATCCG TGCTGCGCCA GCACCAGCGC
CGTGCCATCG ACGCCATCCT TGCACGCTGC GCCGACCCGG CCAAGACGCG CGGGCTGGTC
TGGCATACAC AGGGTTCGGG CAAGACCTTC ACCCTGCTCA CCGCCGCGCG GCAGATTCTG
GAAGACAAGG CGCGTTTCAA GAATGCCACC GTGCTGCTGG TGGTGGACCG TACCGAACTC
GAAGGCCAGT TGAAGGGCTG GGTCGAGCGC CTGCTGGGCG AGATGCAGGC GCAGGACATC
GCGGTCAAGC GCGCCAGCAA CAAGGCCGAA CTGCAAGCCC TGCTGGACGC GGATTTTCGC
GGCCTCATCA TCTCGATGAT CCACAAGTTC GACGAGGTGA AGAAAGACAG TTGCACGCGT
GACAACGTCT ATGTGTTCAT CGACGAGGCG CACCGCTCGG TGGCGAAAGA CTTGGGCACC
TACCTGATGG CGGCATTGCC CAAGTCCACC ATCATCGGCT TTACTGGCAC GCCGATTTCG
CGCAGCGCGC AGGGCGAAGG CACGTTCAAG ATTTTCGGCA CGCAGGACGA ACACGGCTAT
CTCGACAAGT ATTCGATTGC CGAAAGCATC ACCGACGAAA CCACGCTGCC CATCAAGCAC
ATGATGGCCC CCAGCGAAAT GACCGTGCCT GCCGAACGGC TGGACAAGGA ATTCTTCGCG
CTGGCGGAAA GCGAAGGCGT CAGCGACGTG GAGGAACTCA ACAAGGTGCT CGACCGCGCC
GTGGGCCTGC GCACCTTCCT CACCGCCGAC GAGCGCATCG AGAAGGTGGC GGCTTTCGTC
GCCGAACACT TTAAGGAAAA CGTGCTACCG CTGGGCTACA AGGCGTTTGT CGTGGCGGTG
AACCGCGAAG CCTGCGCCAA GTACAAACAG GCGCTGGACA AGCTGCTGCC GCCGGAATGG
ACGGTGCCGG TCTATACGCA GAACGCGGCG GATGCGATTG ATCGCCCGCT GGTGGCGAAG
CTACAGCTTT CCGACGAGGC CGAGGAACAG GCGCGGCTGA TGTTCAAGAA GCCCGCCGAA
AATCCGAAGA TTCTGATCGT CACCGACAAG CTGCTCACCG GCTACGACGC ACCGCTGCTG
TATTGCCTCT ACCTCGACAA GCCGATGCGC GACCATGTGC TGCTGCAATC CATCGCGCGC
GTGAACCGGC CTTATGTGGA TGCCAACGGC GTGCGCAAGC GCGTCGGGCT GGTGCTGGAT
TTCGTCGGCG TGCTGCGCGA ATTGAAGAAG GCGTTGACCT TCGACTCCAG CGACGTGGGC
GGCGTGATCG AGGACTTGGA CGTGCTGTTG CAGGATTTTC TGCAACGCAT TGCGCAGGCG
AAACAGGAAT ACCTTGAGGC CGATGCGGAG GGTGCGCCGG ACGAGCGGCT GGAAAAGCTG
GTGTTCGGTC GCTTCCTGAC ACCGGAAGCG CGCAAGACCT TCTTCGAGGC GTACAAGGAA
ATCGAAGCGT TGTGGGAAAT CCTCTCGCCC ACGCCGGAAC TGCGCGACCA CATCGCCAGC
TACAAGCAGT TGAGCCAGCT TTACGCGGCG GTGCGCAACG CCTATGCGGA GAAGGTCGGC
TTCGTCGCCG ATCTGGCCTA CAAGACCCGG CGGCTGATCG AGGAAAGCGC CGAGCAGCAA
GGCTTGGGAC GGTTGACCAA GAGCGTGACA TTCGACGTGG CGACCCTGCA ATCGTTGCGC
GGCGACAAGG GTTCGGACGA AGGCAAGGTG TTCAACCTCG TGCGCGGCTT GCAGCAGGAG
ATCGACCAGG ACGCCGCCGC CGCGCCCGTG TTGCAGCCAC TGAAAGACCG GGCCGAGCGC
ATCCTGAAGG ATTTGGAGGA ACGCAAGACC ACGGGCCTGG CCGCGATGGA TCAATTGGCG
GCGCTGGCGG CGGAGAAAGA AGCCGCGATG AAGGCGGCAC GCGACAGCGG CCTGTCGGCA
CGCGCGTTCG GCGTGTTCTG GGTGCTGCGC GAGGATGCAG CGGTGAAGGC CGCAAGCCTC
GATGCAATGG CGCTGGCGAA GGACATCGAG GAACTGCTGG GCCGCTTCCC CAATGCCGCG
GTCAACCCCG ACGAACGACG GCGACTGCGC GCGGCCATCT ACAAGCCCTT GCTCGGCCTG
CCGCCAGAAG AACGCACCCG CGCCGTCGAT CTGGTGTTCA AGATGCTTCT GGTGGAGGCG
GATGAATGA
 
Protein sequence
MSKPLKISEA GTVQFPMVRH AAEIGWTTIT PDDARAKRGG DAGTFFRDVL EAKLAAFNPW 
LTADAIRSIV ETLDALPPGI EGNRELLGWL RGERQWYDEA EKRHRAVTLI DFDHPAENAF
HVTWEWKIKP PARPKGNRAD VMFVVNGVPV VIVEHKNPKD GDAIERAIKQ LRRYEMETPE
LLATAQLFNV THLLDYWYGV TWNATRRDMA RWKQAPEESY RFAVQAFFEP TEFLRTLQHW
ILFYVQDGET RKSVLRQHQR RAIDAILARC ADPAKTRGLV WHTQGSGKTF TLLTAARQIL
EDKARFKNAT VLLVVDRTEL EGQLKGWVER LLGEMQAQDI AVKRASNKAE LQALLDADFR
GLIISMIHKF DEVKKDSCTR DNVYVFIDEA HRSVAKDLGT YLMAALPKST IIGFTGTPIS
RSAQGEGTFK IFGTQDEHGY LDKYSIAESI TDETTLPIKH MMAPSEMTVP AERLDKEFFA
LAESEGVSDV EELNKVLDRA VGLRTFLTAD ERIEKVAAFV AEHFKENVLP LGYKAFVVAV
NREACAKYKQ ALDKLLPPEW TVPVYTQNAA DAIDRPLVAK LQLSDEAEEQ ARLMFKKPAE
NPKILIVTDK LLTGYDAPLL YCLYLDKPMR DHVLLQSIAR VNRPYVDANG VRKRVGLVLD
FVGVLRELKK ALTFDSSDVG GVIEDLDVLL QDFLQRIAQA KQEYLEADAE GAPDERLEKL
VFGRFLTPEA RKTFFEAYKE IEALWEILSP TPELRDHIAS YKQLSQLYAA VRNAYAEKVG
FVADLAYKTR RLIEESAEQQ GLGRLTKSVT FDVATLQSLR GDKGSDEGKV FNLVRGLQQE
IDQDAAAAPV LQPLKDRAER ILKDLEERKT TGLAAMDQLA ALAAEKEAAM KAARDSGLSA
RAFGVFWVLR EDAAVKAASL DAMALAKDIE ELLGRFPNAA VNPDERRRLR AAIYKPLLGL
PPEERTRAVD LVFKMLLVEA DE