Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_4168 |
Symbol | |
ID | 4695223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | + |
Start bp | 4584085 |
End bp | 4587147 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639851915 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_998891 |
Protein GI | 121611084 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0263776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAACC TATTTTTTGA GAAACCCATC CTCAACTCCC CGTACGGCTA TCCGTCGCAA CACTGGGAGC TGGACAAGAA TGGCCAGCCT ACGCAGCAAG TCATTGCCAG CCGGCGACGA GCTGAGTTCA TCACGCCCAT CCCCCAGCCT AGAAAGCGCA AAGGCCAGGC CAAGCAGGAG GCGCTTCTTT TCGACGAAGG CAAAGGCCTT TCGACGCGGG AGCAGCAATA CGACCATCAG GCCGTCATCA ATGCCGTACG GCACGAAGTC GACAAATGGC GCGCGTTGCC CGACTCTGCC GACTGGCGTG TGACGCCGGA AACTGCGCGG CTGCTACAGC ATTGGCGGCA CCACGATTTC AGCGGTGTGC GACCGTTCTT CTGCCAGATC GAGGCGGTGG AAACCGCCAT CTGGCTGACC GAGGTCGCGC CGCAACTTGG CAAGGCGGGC AAGACCTTTC TGGACCATCT GGAGCGCGCC AATCAGGATG CTAACCCGGG CCTTGCACGG CTGGCACTGA AACTGGCCAC GGGGGCCGGC AAGACCACTG TGATGGCCAT GCTGATTGCC TGGCAGACCA TCAATGCGGT GCGCAGACCC ACGAGCCGGC GCTTTACCCG CGGCTTTCTT GTCGTTGCTC CTGGGTTGAC CATCCGCGAC CGCTTGCGCG TCCTGCAGCC AAACGATCCC GACAGCTATT ACGCTAGCCG CGAGCTGGTA CCTGGCGACA TGCTGGCCGA TCTGGAGCGC GCCAAGATCG TCATCACCAA CTACCACGCG TTCAAGCGCC GCGAACGGGT GGAGTTGTCC AAAGGCGGCC GCGCCTTGCT GCAGGGACGC GGCGGCGAAG AACTCGACAC ACTGGAAACC GAAGGCCAGA TGCTCCAGCG GGTCATGCCC GACTTGATGG GTTTGAAAGA CGTGCTGGTC ATCAACGACG AGGCGCACCA CTGCTACCGT GAAAAACCCG AGGCATCGGA AGACGATGAC CTGAAAGGCG ACGAAAAGAA AGAAGCGGAA GAAAACAACG CCGCAGCCCG GCTCTGGATC TCCGGCCTCG AAGCCGTTCA ACGCAAACTG GGCCTGTCCC GCGTGTTCGA CCTGTCGGCT ACGCCTTTCT TCCTGCGCGG CTCAGGCTAC GCCGAAGGCA CGCTGTTCCC CTGGACCATG AGCGACTTCT CGCTGATGGA TGCCATCGAA TGCGGCATCG TCAAGCTGCC ACGTGTGCCC GTGGCCGACA ACATCCCCGG CGCTGAAATG CCGATGTTCC GCAACCTGTG GGAACACATC CGCGCCAAGA TGCCCAAGAA AGGTCGCGGC AAGGCCGAAG GGTTGAACCC GCTGGAGCTG CCCACGCAAC TGCAGACCGC GCTGGAAGCC CTGTACGGCC ACTATGAAAA GACCTTCGCC TTGTGGCAAG AGGCCCAGCT CGCGGTGCCG CCTTGTTTCA TCGTGGTCTG CAACAACACG GCCACGTCAA AGCTCGTGTA CGACTACATC TCGGGCTTCA CGCGGGCGGA TGGTGCCCAC GAAGCCGGCC GCCTGCCGCT GTTTCGCAAC CAGGACGACT ACGGCAACGC ACTGGGGCGT CCGCGCACCT TGCTGATCGA CAGCGAGCAG CTCGAATCCG GCGAAGCCCT GGACGACAGC TTTCGCGGCA TGGCCGCCGA TGAGATCGAC CGATTCCGCC GCGAAATCGT CGAGCGAACG GGGGACGCCC AAGCCGGCCA GAAGATCACC GACCAAGACT TGCTGCGCGA GGTGATGAAC ACCGTGGGCA AGCCGGGCCG GCTGGGGGAC TCCATCCGAT GCGTGGTCTC CGTCTCCATG CTGACCGAGG GCTGGGACAC CAACACCGTG ACCCACGTGC TCGGCGTGCG CGCCTTTGGC ACCCAGTTGC TTTGTGAGCA GGTGATTGGC CGCGCATTGC GCCGCCAGTC CTACGAGCTC AACGCAGAAG AGCGCTTCAA CGTCGAATAC GCCGACGTGC TCGGCATTCC CTTCGACTTC AACGCCGAGC CCGTGGTCGC GCCGCCGCAA AAGCCGCGTG AAACCATCCA GGTGAAAGCG CTGCGCGAAC GCGAGGCGTT GGAGATCCGC TTCCCGCGGG TACAGGGCTA TCGCGTGGAA CTTCCCGAGG AGCAGTTGCG TGCCGAGTTC ACCGACGACT CTCGCATGGA GCTCACGCCT GCGCTGGTGG GAGCCACCGA GACGCGCAAC TCCGGCATCA TCGGCGCGCA GGTGGACCTG AACCTCGTCC ACACGGGCGA CGTGCGCCCC TCGCAGGTGG TGTACGAGCT GACCTCCCAT CTGCTGCTGA CCCGCTTCCG CGATGCCGAC GGGCAGCCCC AGCTGCACCT GTTCGGCCAG CTCAAACGCA TTGCCCGCCA GTGGCTAGAG AGCTACCTCG ACTGCAAGGG CGGCACCTAC CCCGCCCAGC TCAAGTACAA GACCCTGGCC GACGATGCCT GTGAACGCAT CAACGCCGGC ATCACCCGTG CCTTTCTCGG TCAGCGCCCC ATCCAGGCCG TGCTTGATCC CTACAACCCG GTGGGCAGCA CACGGCATGT GAGCTTCAAC ACGTCCAAGA CAGACCGTTG GGACACCAGT GGCCTGGCAC AGGGTGGCCC GCGCTGCCAT GTCAACTGGG TCATTCTCGA CAGCGACTGG GAAGCCGAGT TCTGCCGCGT GGCCGAGTCG CACCCGCGCG TACGCGCTTA CATAAAGAAC CACAACCTGG GCCTGGAAGT GCCCTACCGC AAGGATGGCC AGGCGCACCG CTACCGCCCC GACTTCATCG TGCGCGTGGA TGACGGACAT GGTGAAGGTG ACTTGCTGAA CCTCGTGGTC GAGATCAAGG GCTATCGCGG TGAAGACGCC AAGATCAAGA AGGAAACCAT GCTCACCCAC TGGGTGCCCG GCGTGAACCG GCTCGGCACC CATGGCCGCT GGGCCTTTGC CGAGTTCGTG GACGTGTGGC AGATGCAGGA CGACTTCGCC CAGAAGGTGC AAGAGGCGTT CGACGCCATG ATCGAGCAGC ACAACCGAAG GGAGAACACA TGA
|
Protein sequence | MTNLFFEKPI LNSPYGYPSQ HWELDKNGQP TQQVIASRRR AEFITPIPQP RKRKGQAKQE ALLFDEGKGL STREQQYDHQ AVINAVRHEV DKWRALPDSA DWRVTPETAR LLQHWRHHDF SGVRPFFCQI EAVETAIWLT EVAPQLGKAG KTFLDHLERA NQDANPGLAR LALKLATGAG KTTVMAMLIA WQTINAVRRP TSRRFTRGFL VVAPGLTIRD RLRVLQPNDP DSYYASRELV PGDMLADLER AKIVITNYHA FKRRERVELS KGGRALLQGR GGEELDTLET EGQMLQRVMP DLMGLKDVLV INDEAHHCYR EKPEASEDDD LKGDEKKEAE ENNAAARLWI SGLEAVQRKL GLSRVFDLSA TPFFLRGSGY AEGTLFPWTM SDFSLMDAIE CGIVKLPRVP VADNIPGAEM PMFRNLWEHI RAKMPKKGRG KAEGLNPLEL PTQLQTALEA LYGHYEKTFA LWQEAQLAVP PCFIVVCNNT ATSKLVYDYI SGFTRADGAH EAGRLPLFRN QDDYGNALGR PRTLLIDSEQ LESGEALDDS FRGMAADEID RFRREIVERT GDAQAGQKIT DQDLLREVMN TVGKPGRLGD SIRCVVSVSM LTEGWDTNTV THVLGVRAFG TQLLCEQVIG RALRRQSYEL NAEERFNVEY ADVLGIPFDF NAEPVVAPPQ KPRETIQVKA LREREALEIR FPRVQGYRVE LPEEQLRAEF TDDSRMELTP ALVGATETRN SGIIGAQVDL NLVHTGDVRP SQVVYELTSH LLLTRFRDAD GQPQLHLFGQ LKRIARQWLE SYLDCKGGTY PAQLKYKTLA DDACERINAG ITRAFLGQRP IQAVLDPYNP VGSTRHVSFN TSKTDRWDTS GLAQGGPRCH VNWVILDSDW EAEFCRVAES HPRVRAYIKN HNLGLEVPYR KDGQAHRYRP DFIVRVDDGH GEGDLLNLVV EIKGYRGEDA KIKKETMLTH WVPGVNRLGT HGRWAFAEFV DVWQMQDDFA QKVQEAFDAM IEQHNRRENT
|
| |