Gene Information |
Locus tag | Veis_1568 |
Symbol | |
ID | 4694933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | + |
Start bp | 1754362 |
End bp | 1757310 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639849332 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_996345 |
Protein GI | 121608538 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
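The coordinate and length fields above are mutually consistent, which a short check can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the record states the gene is not spliced); the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record: Start bp 1754362, End bp 1757310.
length = gene_length(1754362, 1757310)
print(length, protein_length(length))  # 2949 982
```

The result matches the listed Gene Length (2949 bp) and Protein Length (982 aa).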
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0336191 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000249551 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
Sequence |
Gene sequence | ATGAGCAAAC CCCTGAAAAT CAGCGAAGCG GGCACCGTGC AGTTCCCGAT GGTGCGCCAC GCGGCGGAAA TCGGCTGGAC GACCATCACG CCCGACGATG CCCGCGCCAA GCGCGGCGGC GATGCGGGCA CGTTCTTCCG CGACGTGCTG GAAGCCAAGC TCGCGGCCTT CAACCCGTGG CTGACGGCGG ATGCCATCCG CTCCATCGTG GAAACGCTGG ACGCGCTGCC GCCCGGCATC GAAGGCAACC GCGAACTGCT GGGCTGGCTG CGCGGCGAAC GCCAGTGGTA CGACGAGGCG GAAAAGCGCC ACCGAGCGGT GACGCTGATC GACTTCGATC ACCCGGCGGA AAACGCCTTC CACGTCACTT GGGAATGGAA GATCAAACCG CCCGCGCGGC CCAAGGGCAA CCGCGCCGAC GTGATGTTCG TGGTCAACGG CGTGCCGGTG GTCATCGTCG AGCACAAGAA TCCGAAGGAC GGCGACGCCA TCGAGCGCGC TATCAAGCAA CTGCGCCGCT ACGAAATGGA AACACCGGAA CTGCTGGCGA CTGCGCAGTT GTTCAACGTC ACCCATCTGC TCGATTACTG GTACGGCGTG ACCTGGAACG CCACCCGCCG CGACATGGCG CGCTGGAAGC AGGCCCCGGA GGAAAGTTAC CGCTTCGCGG TGCAGGCCTT CTTTGAGCCG ACCGAGTTCC TGCGCACCTT GCAGCACTGG ATTCTGTTCT ACGTGCAGGA CGGCGAAACG CGCAAATCCG TGCTGCGCCA GCACCAGCGC CGTGCCATCG ACGCCATCCT TGCACGCTGC GCCGACCCGG CCAAGACGCG CGGGCTGGTC TGGCATACAC AGGGTTCGGG CAAGACCTTC ACCCTGCTCA CCGCCGCGCG GCAGATTCTG GAAGACAAGG CGCGTTTCAA GAATGCCACC GTGCTGCTGG TGGTGGACCG TACCGAACTC GAAGGCCAGT TGAAGGGCTG GGTCGAGCGC CTGCTGGGCG AGATGCAGGC GCAGGACATC GCGGTCAAGC GCGCCAGCAA CAAGGCCGAA CTGCAAGCCC TGCTGGACGC GGATTTTCGC GGCCTCATCA TCTCGATGAT CCACAAGTTC GACGAGGTGA AGAAAGACAG TTGCACGCGT GACAACGTCT ATGTGTTCAT CGACGAGGCG CACCGCTCGG TGGCGAAAGA CTTGGGCACC TACCTGATGG CGGCATTGCC CAAGTCCACC ATCATCGGCT TTACTGGCAC GCCGATTTCG CGCAGCGCGC AGGGCGAAGG CACGTTCAAG ATTTTCGGCA CGCAGGACGA ACACGGCTAT CTCGACAAGT ATTCGATTGC CGAAAGCATC ACCGACGAAA CCACGCTGCC CATCAAGCAC ATGATGGCCC CCAGCGAAAT GACCGTGCCT GCCGAACGGC TGGACAAGGA ATTCTTCGCG CTGGCGGAAA GCGAAGGCGT CAGCGACGTG GAGGAACTCA ACAAGGTGCT CGACCGCGCC GTGGGCCTGC GCACCTTCCT CACCGCCGAC GAGCGCATCG AGAAGGTGGC GGCTTTCGTC GCCGAACACT TTAAGGAAAA CGTGCTACCG CTGGGCTACA AGGCGTTTGT CGTGGCGGTG AACCGCGAAG CCTGCGCCAA GTACAAACAG GCGCTGGACA AGCTGCTGCC GCCGGAATGG ACGGTGCCGG TCTATACGCA GAACGCGGCG GATGCGATTG ATCGCCCGCT GGTGGCGAAG CTACAGCTTT CCGACGAGGC CGAGGAACAG GCGCGGCTGA TGTTCAAGAA GCCCGCCGAA 
AATCCGAAGA TTCTGATCGT CACCGACAAG CTGCTCACCG GCTACGACGC ACCGCTGCTG TATTGCCTCT ACCTCGACAA GCCGATGCGC GACCATGTGC TGCTGCAATC CATCGCGCGC GTGAACCGGC CTTATGTGGA TGCCAACGGC GTGCGCAAGC GCGTCGGGCT GGTGCTGGAT TTCGTCGGCG TGCTGCGCGA ATTGAAGAAG GCGTTGACCT TCGACTCCAG CGACGTGGGC GGCGTGATCG AGGACTTGGA CGTGCTGTTG CAGGATTTTC TGCAACGCAT TGCGCAGGCG AAACAGGAAT ACCTTGAGGC CGATGCGGAG GGTGCGCCGG ACGAGCGGCT GGAAAAGCTG GTGTTCGGTC GCTTCCTGAC ACCGGAAGCG CGCAAGACCT TCTTCGAGGC GTACAAGGAA ATCGAAGCGT TGTGGGAAAT CCTCTCGCCC ACGCCGGAAC TGCGCGACCA CATCGCCAGC TACAAGCAGT TGAGCCAGCT TTACGCGGCG GTGCGCAACG CCTATGCGGA GAAGGTCGGC TTCGTCGCCG ATCTGGCCTA CAAGACCCGG CGGCTGATCG AGGAAAGCGC CGAGCAGCAA GGCTTGGGAC GGTTGACCAA GAGCGTGACA TTCGACGTGG CGACCCTGCA ATCGTTGCGC GGCGACAAGG GTTCGGACGA AGGCAAGGTG TTCAACCTCG TGCGCGGCTT GCAGCAGGAG ATCGACCAGG ACGCCGCCGC CGCGCCCGTG TTGCAGCCAC TGAAAGACCG GGCCGAGCGC ATCCTGAAGG ATTTGGAGGA ACGCAAGACC ACGGGCCTGG CCGCGATGGA TCAATTGGCG GCGCTGGCGG CGGAGAAAGA AGCCGCGATG AAGGCGGCAC GCGACAGCGG CCTGTCGGCA CGCGCGTTCG GCGTGTTCTG GGTGCTGCGC GAGGATGCAG CGGTGAAGGC CGCAAGCCTC GATGCAATGG CGCTGGCGAA GGACATCGAG GAACTGCTGG GCCGCTTCCC CAATGCCGCG GTCAACCCCG ACGAACGACG GCGACTGCGC GCGGCCATCT ACAAGCCCTT GCTCGGCCTG CCGCCAGAAG AACGCACCCG CGCCGTCGAT CTGGTGTTCA AGATGCTTCT GGTGGAGGCG GATGAATGA
Protein sequence | MSKPLKISEA GTVQFPMVRH AAEIGWTTIT PDDARAKRGG DAGTFFRDVL EAKLAAFNPW LTADAIRSIV ETLDALPPGI EGNRELLGWL RGERQWYDEA EKRHRAVTLI DFDHPAENAF HVTWEWKIKP PARPKGNRAD VMFVVNGVPV VIVEHKNPKD GDAIERAIKQ LRRYEMETPE LLATAQLFNV THLLDYWYGV TWNATRRDMA RWKQAPEESY RFAVQAFFEP TEFLRTLQHW ILFYVQDGET RKSVLRQHQR RAIDAILARC ADPAKTRGLV WHTQGSGKTF TLLTAARQIL EDKARFKNAT VLLVVDRTEL EGQLKGWVER LLGEMQAQDI AVKRASNKAE LQALLDADFR GLIISMIHKF DEVKKDSCTR DNVYVFIDEA HRSVAKDLGT YLMAALPKST IIGFTGTPIS RSAQGEGTFK IFGTQDEHGY LDKYSIAESI TDETTLPIKH MMAPSEMTVP AERLDKEFFA LAESEGVSDV EELNKVLDRA VGLRTFLTAD ERIEKVAAFV AEHFKENVLP LGYKAFVVAV NREACAKYKQ ALDKLLPPEW TVPVYTQNAA DAIDRPLVAK LQLSDEAEEQ ARLMFKKPAE NPKILIVTDK LLTGYDAPLL YCLYLDKPMR DHVLLQSIAR VNRPYVDANG VRKRVGLVLD FVGVLRELKK ALTFDSSDVG GVIEDLDVLL QDFLQRIAQA KQEYLEADAE GAPDERLEKL VFGRFLTPEA RKTFFEAYKE IEALWEILSP TPELRDHIAS YKQLSQLYAA VRNAYAEKVG FVADLAYKTR RLIEESAEQQ GLGRLTKSVT FDVATLQSLR GDKGSDEGKV FNLVRGLQQE IDQDAAAAPV LQPLKDRAER ILKDLEERKT TGLAAMDQLA ALAAEKEAAM KAARDSGLSA RAFGVFWVLR EDAAVKAASL DAMALAKDIE ELLGRFPNAA VNPDERRRLR AAIYKPLLGL PPEERTRAVD LVFKMLLVEA DE
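The sequences above are displayed in space-separated groups of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two groups of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove display whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base groups of the gene sequence shown in the record.
example = "ATGAGCAAAC CCCTGAAAAT"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))  # 20 0.4
```

Applying the same two functions to the complete gene sequence would give a value to compare against the GC content field; the result for the full sequence is not computed here.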