Gene Veis_3131 details


Gene Information

Locus tag: Veis_3131
Symbol:
ID: 4692839
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 3479380
End bp: 3482562
Gene Length: 3183 bp
Protein Length: 1060 aa
Translation table: 11
GC content: 63%
IMG OID: 639850889
Product: HsdR family type I site-specific deoxyribonuclease
Protein accession: YP_997880
Protein GI: 121610073
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
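
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is the stop codon; the variable names are chosen here for illustration and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3479380
end_bp = 3482562
gene_length_bp = 3183
protein_length_aa = 1060

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop
# codon and contributes no amino acid to the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 3183 0 1060
```

Under these assumptions the span matches the stated gene length, and 1061 codons minus the stop codon give the stated 1060 aa.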


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.789076
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0879634
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTTTT TGGCCGAAGC CGACATTGAG CAAGCCCTGC TGACGCAGTT GCGCAGACTA 
GGCTACTCCA TCGCATCCGA CGATGTCATT GGTCCGGACG GCAACGCGCC GGAACGCGAA
AGCCACGATG TCGTCGTGCT GCATAAGCGG CTGGAAACCG CTGTGCTGCG GCTCAATCCC
GCTTTGCCGC CGGAAGCGCG GGCAGATGCC ATTCGCAAAC TGACTCAATC GGAATTCCCC
GCCCTGCTGG AAGAAAATCG TCGCATCCAC ACCCTGCTGA CCGAGGGCGT GGACGTTGAA
TACTACGGCG ACGATGGTGT ACTGACGGCG GGCAAGGTTC GGTTGCTGGA TTTCGATACA
CCGGAAAACA ACGACTGGTT GGCCGTGCAG CAGTTCGTGG TCATCAACGG CCAAGTCAAC
CGCAGGCCGG ATGTGGTGCT GTTCGTCAAC GGCCTGCCGC TGGCGGTCAT CGAACTCAAA
GCGCCGGGCA GCGCGGGCGC GCATCTGGCG GGCGCGTTCA ACCAGTTGCA GACTTACAAG
CAGCAGATTC CGGCGCTGTT TCACACCAAT GCGCTGCTGG TCACTTCGGA CGGCATTGCC
GCGCGCGTGG GGTCGCTGTC CGCCGATCTG GAACGCTTCA TGCCGTGGCG CACCACGGAT
GGCAGCGCGA TTGCGCCCAA GGGTTCGCCG GAGTTGGCAA CGCTGATCGA GGGCGTGTTC
GCGCCGCAAC GCTTCCTCGA CCTGCTACGG TATTTCACCG TATTCGGCGA GACCGGCTCG
GGTCTTACGA AGATCGTGGC GGGCTATCAC CAGTTCCACG CGGTCAATCG TGCCGTCGAA
TCCACCATTC GCGCATCAAG CCAATGGCAG GGCGTGCGCG AGGAACCGGC CTCGTATGGC
CTGCCCAGCG TGGCGATGCA AGCCCGTGGC GACCGGCGCG CGGGCGTGAT CTGGCACACG
CAAGGCTCCG GCAAAAGCCT GCTGATGGCC TTTTACGCCG GGCGGCTGGT GCAGCACCCG
GCGATGGAGA ACCCGACGCT CGTCGTGCTG ACCGACCGCA ACGATCTGGA CGACCAGTTG
TTCTCCACCT TCTCGATGTG CCGCGACCTG ATCCGGCAGA AACCGGTGCA GGCCGAAGGC
CGCGAGCATT TGCAACAACT GCTGGCGCGC GCTTCCGGCG GCGTGATCTT CACTACCTTG
CAGAAGTTTG GCGAGGTAGC CGAGCCGCTG ACCACGCGGC GCAATGTGGT CGTCATCGCC
GACGAAGCAC ACCGTTCGCA ATACGGTTTC CGCGCGAAGG TGGATGCCAA GACTGGCGAG
GTGAGCTACG GCTTCGCCAA ATACCTGCGC GATGCGCTGC CCAATGCCTC GTTCATCGGC
TTTACGGGCA CGCCCATCGA GGCGACGGAC GTGAACACCC TGGCGGTGTT CGGCAACTAC
ATCGACGTGT ACGACATCAG CCGCGCGGTG GAAGACGGCG CGACCGTGCC GATCTACTAC
GAATCGAGGC TGGCGCGCAT CGAACTCGAC GAGGACGAAA AGCCCAAGCT CAACGCCGAA
GTCGAGGCGT TGACCGAGGA CGAGGCCGTC ACCGAGCAGG AAAAGCTGAA GGCGAAATGG
GCAACGGTCG AAAAACTGGT CGGCAGTGAC AAACGCCTCG CGCTGGTGGC GCAGGATTTG
GTGGCGCATT TCGAGGACAG GCTGACCGCG CTGGACGGCA AGGCCATGGT GGTGTGCATG
AGCCGCCGTA TCGCCGTAGC GCTATACGAT GAAATCATCA AGCTGCGCCC GGACTGGCAC
AGCACGGACG ATGCCGCAGG CGCGGTGAAA ATCGTGATGA CCGGTGCGGC CAGCGACCCA
CCCGCTTGGC AGCAGCACAT CGGCAACAAG GCGCGGCGCG ATCTGCTGGC CAAGCGTGCG
CGTGACGCGA AAGACCCGCT GAAACTGGTC ATCGTGCGTG ATATGTGGCT GACCGGCTTT
GACGCGCCGT GCATGCACAC GATGTATGTC GATAAACCGA TGCAGGGTCA CGGGCTGATG
CAGGCCATCG CGCGCGTGAA CCGCGTGTTC CTCGACAAGC CTGCTGGGTT GATCGTGGAC
TACATCGGCA TCGCGCAAAG CCTGAAATCC GCGCTGGCGC AGTATTCGGC GGGTGACCGA
GCCAACACCG GTGTGGACGA AGCGCAGGCC GTGGCGGTGA TGCAGGAGAA GTACGAAGTA
GTGCGCGACA TGTACCACGG CTTCGACTAC GCCACCGCCT TGAACGGCAC GCCGCAGCAG
CGGCTGGCGA TGATGGCTGG TGCCATCGAG TGGATACTCG ACAAGCAGCA GCAATGGACG
GCTGCGGAAA CCACGGATGA AGGCAAGAAG AACGCCCAGC GCCGCTATCA GGATGCGGTG
CTGGCCTTGT CGAAGGCGTA CTCGCTGGCG TCGGCTTCGG ACGAGGCGCG CGGCATCCGC
GAGGAGATTG GCTTCTTCCA GGCCATCCGT GCGGCGCTGG TGAAGTCGGC GACAGGCTCG
GGCGTCACCT CGCAGGAGCG CGACTTCGCT ATCCAGCAGA TCGTCAGCCG CGCAGTGATT
TCGACCGAGA TCGTGGACAT CCTCAAGGCG GCGGGCATCC AAAGCCCGGA CCTCTCCATC
CTGTCCGACG AGTTCCTCGC CGAAGTGCAG CAGATGCAGA AGAAGAACCT TGCGCTGGAA
GCCTTGCGCA AGCTCATCAA CGACGGCATC CGCTCGCGCA GCAAGGCCAA CATCGTGCAG
ACACGCGCGT TCTCGGAGCG GCTGGAAGAT GCCGTGGCGC GCTACCACGC CAATGCCATC
ACCACCGCCG AAGTGTTGCA GGAACTGATT CAGTTGGCGA AGGACATCCG CGCGGCGCGG
CAGCGTGGCG AAGAAGCGGG CCTGTCCGAT GAGGAAATCG CCTTCTACGA CGCGCTGGCC
GAAAACGAAA GTGCGGTGCA GGCAATGGGC GACGACAAGC TCAAGGTCAT TGCCCACGAA
TTGCTGGGCA GCCTGAAAGG GAACGTGTCC GTGGATTGGG CGCATCGCGA TTCGGCGCGC
GCCCGGATGC GGGTGCTGGT CAAGCGCATT CTGCGCAAAT ACGGTTATCC GCCGGATTTG
CAGGATGCGG CGGTGCAGAC CGTGCTGCAG CAGGCCGAGG CGCTCTCGGC TGGATGGATT
TAG
 
Protein sequence
MAFLAEADIE QALLTQLRRL GYSIASDDVI GPDGNAPERE SHDVVVLHKR LETAVLRLNP 
ALPPEARADA IRKLTQSEFP ALLEENRRIH TLLTEGVDVE YYGDDGVLTA GKVRLLDFDT
PENNDWLAVQ QFVVINGQVN RRPDVVLFVN GLPLAVIELK APGSAGAHLA GAFNQLQTYK
QQIPALFHTN ALLVTSDGIA ARVGSLSADL ERFMPWRTTD GSAIAPKGSP ELATLIEGVF
APQRFLDLLR YFTVFGETGS GLTKIVAGYH QFHAVNRAVE STIRASSQWQ GVREEPASYG
LPSVAMQARG DRRAGVIWHT QGSGKSLLMA FYAGRLVQHP AMENPTLVVL TDRNDLDDQL
FSTFSMCRDL IRQKPVQAEG REHLQQLLAR ASGGVIFTTL QKFGEVAEPL TTRRNVVVIA
DEAHRSQYGF RAKVDAKTGE VSYGFAKYLR DALPNASFIG FTGTPIEATD VNTLAVFGNY
IDVYDISRAV EDGATVPIYY ESRLARIELD EDEKPKLNAE VEALTEDEAV TEQEKLKAKW
ATVEKLVGSD KRLALVAQDL VAHFEDRLTA LDGKAMVVCM SRRIAVALYD EIIKLRPDWH
STDDAAGAVK IVMTGAASDP PAWQQHIGNK ARRDLLAKRA RDAKDPLKLV IVRDMWLTGF
DAPCMHTMYV DKPMQGHGLM QAIARVNRVF LDKPAGLIVD YIGIAQSLKS ALAQYSAGDR
ANTGVDEAQA VAVMQEKYEV VRDMYHGFDY ATALNGTPQQ RLAMMAGAIE WILDKQQQWT
AAETTDEGKK NAQRRYQDAV LALSKAYSLA SASDEARGIR EEIGFFQAIR AALVKSATGS
GVTSQERDFA IQQIVSRAVI STEIVDILKA AGIQSPDLSI LSDEFLAEVQ QMQKKNLALE
ALRKLINDGI RSRSKANIVQ TRAFSERLED AVARYHANAI TTAEVLQELI QLAKDIRAAR
QRGEEAGLSD EEIAFYDALA ENESAVQAMG DDKLKVIAHE LLGSLKGNVS VDWAHRDSAR
ARMRVLVKRI LRKYGYPPDL QDAAVQTVLQ QAEALSAGWI
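
The sequences above are printed in space-separated blocks of ten residues, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be normalized, its GC fraction computed, and its codons translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built by hand rather than taken from any library. Translation table 11 is assumed to share the standard amino acid assignments for elongation codons; alternative start codons such as GTG or TTG are not handled specially here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the record above.
first_row = "ATGGCGTTTT TGGCCGAAGC CGACATTGAG CAAGCCCTGC TGACGCAGTT GCGCAGACTA"
dna = clean_sequence(first_row)
print(translate(dna))  # MAFLAEADIEQALLTQLRRL
```

The translated first row agrees with the first twenty residues of the protein sequence listed above. Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the reported GC content.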