Gene Veis_3485 details


Gene Information

Locus tag: Veis_3485
Symbol:
ID: 4692917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 3852592
End bp: 3855651
Gene Length: 3060 bp
Protein Length: 1019 aa
Translation table: 11
GC content: 55%
IMG OID: 639851242
Product: type III restriction enzyme, res subunit
Protein accession: YP_998224
Protein GI: 121610417
COG category: [V] Defense mechanisms
COG ID: [COG3587] Restriction endonuclease
TIGRFAM ID:
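
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration, not part of the source record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information record.
START_BP = 3852592
END_BP = 3855651
GENE_LENGTH_BP = 3060
PROTEIN_LENGTH_AA = 1019


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 3855651 - 3852592 + 1 gives 3060 bp, and 3060 / 3 - 1 gives 1019 aa, matching the Gene Length and Protein Length fields.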


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.340393
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.439729
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTCA AATTCAAAAC CCAACCCTAC CAAACAGCAG CCGTGCAAGC CGTACTCGAC 
TGCTTCAAAG GCCAGCCCCC TGCCTCGGCG CCAGCCACCC GCTATCGCAT CGACCCGGGC
CGAGCCAAGA CGGGTAGCAC CGAAACCATC TTTGCGCAAG ACATGCAAGA CGGATTCAAA
AACGCCGATC TGATGCTCAC AGACGACGGC TTGCTGAAGA ACATCATCCA GGTTCAGCGC
CAGCAGAACC TGCCGCAGTC CGACGAGTTG ATCAAAACCA AGGTCAGTCG GCTCAATCTC
GACATCGAGA TGGAAACCGG CACCGGCAAG ACCTATTGCT ACATCAAGAC CATTTTCGAG
CTGAACAGGC AATATGGCTG GAGCAAGTTC ATCATCGTTG TGCCCAGCAT CGCCATCCGT
GAAGGTGTGG CGAAGTCGTT GGCGATGACG GCGGAGCATT TTCAGGAGAC CTACAACAGA
AAGGCGCGCT TTTTCATTTA CAACTCCAGG CAATTGCATC ATCTGGAGAG TTTTTCTTCA
GATGCCGGCA TCAATGTGAT GGTGATCAAT GTGCAGGCGT TTGCTGCACG CGGGGCGGAT
AACCGGCGTA TCTATGAAGA GTTGGACGAT TTCCAGTCCC GCAAGCCGAT TGATGTGATC
AGCGCCAACC GCCCCATCCT GATTCTGGAC GAGCCGCAGA AAATGGAAGG TGCGCAGACC
CAGGAGTCTC TGCAGGCATT CAAGGCGTTG ATGGCCTTGC GTTATTCGGC CACCCACAAG
ACCACGCACA ATAAAATCCA TCGCCTGGAC GCACTGGATG CCTATAACCA GAAACTGGTC
AAGAAGATCG CCGTGCGTGG CATTGCGGTG AAGGGTTTGG CGGGAACCAA TGCCTATCTG
TATTTGCAGT CCATCGAAAC TTCCACCAAG AAACCGCCCG AGGCGCGTGT CGAATTCGAG
TGCAAGCTCA AGAGTGGCGA GATCAGGCGC ATCGTGCGCA AGCTCTCCAG GGGCGATAAT
CTGTTCGACC TTTCGGATGA ACTTGGTCAG TACCATGATG GCTATGTGGT GTCGGATATC
AACGCCAATA CCGATACCTT GGGTTTCACC AATGGCGTGG AACTCTGCGC TGGCGAGGTT
CTTGGGGATG TGGACGAGGC GGCGCTGCGC CGCATTCAGA TTCGCGAGGC GATCAAGGCG
CACGTCGATA AGGAACGGGC GCTGTTCCAG CAAGGCATCA AGGTGCTGAC CTTGTTCTTT
ATCGATGAAG TGGCCAAGTA CCGCGATTAT TCTGCAGCCG ATGAGAAGGG CGATTATGCG
CGGATTTTCG AGGAGGAATA TCGCCAGTAT CTGAACGAGG TGCTGAGTCT GGATGAAATG
CCCTCCCAGT GTCCCTATAT CCAGTACCTG AAGGGCATGG ATGCCGCAGC AACGCATAGC
GGTTATTTTG CCATCGACAA AAAAACCAAG CGGCTTGCCG ATCCCGATAT GGACAAGCGC
GGCGAGAATG CGGGTTTGTC CAATGACGTG GATGCCTATG ATCTGATTTT GAGGGACAAG
GAGCGCTTGC TATCCTTCGC CGAGCCGGTG CGCTTCATCT TTTCGCACTC CGCCCTGCGT
GAGGGTTGGG ATAATCCGAA TGTGTTCGTG ATCTGTGCGC TCAAGCACAG CGACAATACC
GTGTCGCGGC GCCAGGAGGT GGGGCGCGGT CTGCGCTTGT CTGTCAACCA GAGCGGTGAC
CGGATGGATG ATCCCGCTAC CGTGCATGAG GTGAATGTGC TGACGGTGGT GGCCAGCGAG
AGTTACAAGG ATTTTGTGGC GGCGCTGCAG AAGGATATCA GCGAATCCTT GTCGGCGCGG
CCCAGGGTGG CTGATGAAGC CTGGTTCACC GGCAAACTGC TCAAGACCGC AAGCGGCGAT
GTCGAGGTCA CGCCGCAGTT GGCCAGGCAG ATTTACAAGT ACCTGCTCAA GAACGGTTAC
AGGGGCAATG CGAATCACAT CACGGAGGCT TATCACCAGG CCAGAAAGGA CGGCACCCCG
GCAATGCTGC CACCGGAATT GCAAGCCCAT GCCGATCAGG TGTACCAGTT GATCGATAGC
GTCTTTGATG ACAGCCAGTT GCCCAAGATC GGCGATGGTC GCCGTTCAAA GACAAATCCG
CTCAATGACA ACTTCGACAA GCAGGAGTTC AAGGAACTAT GGAACCGGAT CAATCGCAAG
GCGTTTTACA GCGTTGATTT TGATTCCGCC GAATTGGTCG ACAAGGCCAT CGCCGAACTG
GACAAGAATC TGCGCGTGCC GCCCCTGCAT TATCTGATTC AAACCGGCGA GCAGGCCGAC
CATGTGGGCC ATGACGAACT GGATAACCGC AGGGTCTTCG TGCTCAGCAG CAGCAAGACC
GAAAAGAGTG CGCATTCGAT TCATTCAAAG GTGAAGTACG ACCTGATCGG CAAAATCGCC
GACGGCACCC AACTGACCCG ACGCACGACG GCAGAGATTC TCAAGGGGAT CGATGCCGCC
GCATTCGCGC AGTTCTGCAC CAACCCGGAG AGTTTCATCA CCGAGGCCAT ACGGCTGATC
AACGAGCAGA AGGGGACGAT GCTCATCGAG CACCTGGCCT ATGATCTGGC CGAGGACAGG
TTTGATCTGG ATATTTTCAC GGTGGCGCAA TCCGGGCAAG ATTTCAGCAA GGCCGGGGAC
AAGCTCCGGC GGCATATCTA CGATTACGTG GTCACCGATT CCGGCATCGA GCGGGATTTT
GCGAAAGAAT TGGACACCTG CGCCGAAGTC GTCGTCTACG CCAAATTGCC ACGCGGCTTC
CATATCCCCA CCCCCGTGGG CAACTACACC CCCGACTGGG CCATTTCGTT CAAGGAGGGG
ATGGTCAAGC ATGTGTATTT CGTTGCCGAG ACCAAAGGGG ATATTTCATC CATGGTGCTC
AATAAAATTG AAGAAACCAA GACCGAGTGC GCGCGCAAGT TCTTCAATGG GATCAACTCT
GCGAATGTGA GGTATGACGT GGTGGACAGT TTTGGGAAGT TGATGGCGTT GGTGCAGTAG
 
Protein sequence
MKLKFKTQPY QTAAVQAVLD CFKGQPPASA PATRYRIDPG RAKTGSTETI FAQDMQDGFK 
NADLMLTDDG LLKNIIQVQR QQNLPQSDEL IKTKVSRLNL DIEMETGTGK TYCYIKTIFE
LNRQYGWSKF IIVVPSIAIR EGVAKSLAMT AEHFQETYNR KARFFIYNSR QLHHLESFSS
DAGINVMVIN VQAFAARGAD NRRIYEELDD FQSRKPIDVI SANRPILILD EPQKMEGAQT
QESLQAFKAL MALRYSATHK TTHNKIHRLD ALDAYNQKLV KKIAVRGIAV KGLAGTNAYL
YLQSIETSTK KPPEARVEFE CKLKSGEIRR IVRKLSRGDN LFDLSDELGQ YHDGYVVSDI
NANTDTLGFT NGVELCAGEV LGDVDEAALR RIQIREAIKA HVDKERALFQ QGIKVLTLFF
IDEVAKYRDY SAADEKGDYA RIFEEEYRQY LNEVLSLDEM PSQCPYIQYL KGMDAAATHS
GYFAIDKKTK RLADPDMDKR GENAGLSNDV DAYDLILRDK ERLLSFAEPV RFIFSHSALR
EGWDNPNVFV ICALKHSDNT VSRRQEVGRG LRLSVNQSGD RMDDPATVHE VNVLTVVASE
SYKDFVAALQ KDISESLSAR PRVADEAWFT GKLLKTASGD VEVTPQLARQ IYKYLLKNGY
RGNANHITEA YHQARKDGTP AMLPPELQAH ADQVYQLIDS VFDDSQLPKI GDGRRSKTNP
LNDNFDKQEF KELWNRINRK AFYSVDFDSA ELVDKAIAEL DKNLRVPPLH YLIQTGEQAD
HVGHDELDNR RVFVLSSSKT EKSAHSIHSK VKYDLIGKIA DGTQLTRRTT AEILKGIDAA
AFAQFCTNPE SFITEAIRLI NEQKGTMLIE HLAYDLAEDR FDLDIFTVAQ SGQDFSKAGD
KLRRHIYDYV VTDSGIERDF AKELDTCAEV VVYAKLPRGF HIPTPVGNYT PDWAISFKEG
MVKHVYFVAE TKGDISSMVL NKIEETKTEC ARKFFNGINS ANVRYDVVDS FGKLMALVQ
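
The sequences above are printed in blocks of 10 characters, so they need whitespace removed before any computation. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; the sketch does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here).

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked listing; uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X';
    an incomplete trailing codon is ignored.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    fragment = "ATGAAACTCA AATTCAAAAC CCAACCCTAC"
    print(translate(fragment))  # MKLKFKTQPY
```

Applied to the first 30 bases of the gene sequence, `translate` returns MKLKFKTQPY, which agrees with the first block of the protein sequence. Running `gc_fraction` on the full gene sequence would give a value to compare with the GC content field.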