Gene Information |
Locus tag | Elen_0929 |
Symbol | |
ID | 8415219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1132205 |
End bp | 1135387 |
Gene Length | 3183 bp |
Protein Length | 1060 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645023893 |
Product | type III restriction protein res subunit |
Protein accession | YP_003181290 |
Protein GI | 257790684 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
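
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (the record does not state the convention, but the listed gene length agrees with it) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record for Elen_0929.
length_bp = gene_length(1132205, 1135387)
print(length_bp)                  # 3183
print(protein_length(length_bp))  # 1060
```

Both results match the Gene Length (3183 bp) and Protein Length (1060 aa) fields listed above.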
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTCA AGTTCAAAGT ACAACGGTAC CAAACCGACG CCGCAGATGC GGTGACCTCG GTCTTCGAGG GGCAGCCCAA CCAGGGCGCC TCCGTCTACC TGCGCGATCT GGGCCGCAAG CCCAAGTGTG CGCAGGGCGA GATCGACTTC GGCGACGAGA GCACCGAGGG CTATGCCAAT GCGCCCGTCG CGCTTTCGGG CGCTGACATA CTGGCAAACG TGCGAGCGGT GCAGCGACGC AACCAGATCC CCGAGTCGGA GGAGCTTTTC TGCAGTATGG GAGCCTGCCA GCTGGACGTC GAGATGGAGA CGGGCACCGG CAAGACATAC GTCTACACCA AGACGATGCT CGAGCTCAAC CGCCTGTACG GCTGGTGCAA GTTCATCGTG GTGGTGCCAT CGGTCGCCAT CCGCGAGGGT GTGGCCAAGA GCCTGGAGAA CACCCAGGAG CACTTCTTCT CCCAATACCA CAAGAAGATC AGGTTCTTCA TATACGACTC GGACAACCTG ACCGAACTGG ATGCGTATTC CCAGTCGAGC GACGTCAACT GCATGGTCAT CAACATGCAG GCGTTCAACG CGTCGATGAA GGAGGGCGCC AAGAACAAGG CGGCGCGCAT CATCTTCGAC GAGCGCGACG AGTTCAGCAG CCGTAGGCCC ATCGACGTCA TAGCGGCGAA CCGCCCCATT GTCATCTGCG ATGAGCCCCA GAAGATGGGA AAGAAGGGGG GCGCGACCCA GAAGGGCATC GCCCGCTTCA ACCCGCTGTT CGTGCTGAAC TACTCGGCAA CTCACAAGGA GAAGCACGAC CTGGTGTATG CCTTGGACGC CTTGGACGCC TACAACCAGA AACTCGTCAA GCGCATCGAG GTGAAGGGCT TCGCCCTAAA GAACATGCGC GGAACCGACG GCTACCTGTA CCTGCGCGAC ATCGTGGTAT CCAAGAATCG CGCCCCCGAG GCCGTTATCG AGTTCAAGTG CATGGGGTCT GGCGGCAAGG TGCGCAAGAA GACCGCACGC TTTGGCGAGG GCGACAGCAT CTACGGCGCC AGCGGTTCGA CGAAGCTCGA AGCGTACCGC GGCTACACCA TCGCGAGCGG CAACGACGGC GTGGTCCCGC CCCAGGATGG GCGCCCAGGC TACGTGCGCT TCCTGGACGG CTCTATCGGG GACGACGGGC GCGTCTACAT CGGCGAGGTC TACGGCGACT CCGCCGCCGA CGACATGCAG CGCATACAGA TTCGCGAGAC GATCCTGAGC CACTTGCAAA AGGAGGAGGC GCTTTTCCGC CGCGGCATCA AGTGCCTGAG TCTTTTCTTC ATCGACCAGG TAGCGAAGTA CCGCGACCTC TCCGGAAACG GCGAAACAGT GGGCTACGGT AAGATCTTCG AGGAGGAGTA CGAGGCCATC GTCTCCGACA GGCTGGAGCA CCCGACGCAG GATGACATCC TCGACCCCTC GTATGCGGAG TACCTCGGGC GATTCGAGGC ATGCTCAGTG CACTCCGGAT ACTTCTCGGT GGACAAGAAG GGCAACGCCG TCGAGTCGAA GGCCGAGAGG AAGGCGGAGC GCGATGACGG CATCGGCATC AACGACGACG ACGCGAAACG CGGCTACGAC CTGATCCTGC GCGATAAGGA GCGCCTGCTT TCGTTCGATG AGCCGGTGCG CTTCATCTTC AGCCACTCGG CGCTGCGCGA GGGCTGGGAC AACCCGAACA TCTTCCAGAT CTGCACACTC AAAGAATCCG GCTCTGAGAC GAGCAAGCGC CAGGAGGTCG GCCGCGGCAT GCGCTTGGCG 
GTCGATCAAG ACGGCAACCG CCAGGACGCG GCGCTGCTGG GACCCGACGA GGTGCACCGC GTGAACTTGC TCACCGTGAT CGCGTCGGAG AGCTACGAGA CGTTCGTGAG GGATTTGCAG ACAGATATAA GCAAGAGCCT GCGCGATCGT CCGAAGAAGG TGGAGATGGA CCTGTTCAGC GGTCGCGACG TCGTGCTCGA CGGCGAGACG GTCTCCTTCA CCGAGGACGA GTCGAGGCGC GTATACAAAA CCCTCTACAA GTGCGACCTC ATCGATGACG ACGACAAGCC GACCGCCGAG TTCCGCAAGG CCGTGGAGGA CGGCACCTTC GTGGAGTACT TCGTCGCGAA GCTGCCCGAG GAAATCGCCG ACGCTGCCCA CGCCAAGGCG GTCGAGGCGC TGGTGAAGAG CGTCTACGAC GCGCACGCGC TCGACGGCAT GATCGGGCGC GCCCAGGAGA AGATCTCCGA GAACACGCTG ACCGACAACT TCGCAAGACG CGAGTTCAAG GAACTGTGGG CGCGCATCAA CCGAAAGCAC GCCTACACGG TGAGCTTCTC GGACGACGAG TTGAGGCGCA AGTCGATAGA GCGTATCAAC TGTGACCTGC GCGTGAGCAG GCTGCAGTAC ACCTTGACGG TCGGCGGGCA AAAGTCACGG GCAACGCGCG ACGAGGTGCA GGGCGGCTCG TCGTTCGGCC GCACGCAGAC CGAGACTCGC AACGTCGACG CCGGAACCGC GTCCGTGGGC GTGACCTACG ACCTCGTGGG CGAGGTGGCG CAGGCCGCTG CTATCACGCG CAGGAGCGCC GCGGCGATTC TCGCCGGCAT CGATGCCAAC GTGTTCGGGC TCTACCGCGT GAATCCCGAG GAGTTCATCA AGAAGACGGC GGCGCTCATC GTGTCGGAGA AGGCGACGAT GGTGGTGGAG CACATCAGCT ACCACGAGAT CGACGACGTC TACGATGAGG CTATCTTCAC TGAGCGCATG CCCGACAACG CGAGCAAGGC GTACGAGGCC AAGAAGAACA TCCAGCGCTT CGTGTTTCCC GACTCCGACG GCGAGCGCAG GTTCGCTGAG GACATGGACG CCGCCGCCGA GGTAGCGGTC TATGCCAAGC TGCCGCGCAC GTTCCAGATT CCCACGCCGG TGGGCAACTA CGCCCCAGAC TGGGCCATCG CCTTCAAGGA AGGCAGCGTG CGCCATGTGT TCTTCGTCGC CGAGACCAAG GGCACCGTGG ACACGCTGGA GCTCTCCGGT GTGGAGAACG CCAAGATCGC CTGCGCCAAG AAGCTTTTCA ACGAGATGAG CACGGGCGAC GTGCGCTACC ACAACGTGGC GACGTACGAG GACCTGCTCG AGGTGATGGG CAGGATGGGC TAG
|
Protein sequence | MKFKFKVQRY QTDAADAVTS VFEGQPNQGA SVYLRDLGRK PKCAQGEIDF GDESTEGYAN APVALSGADI LANVRAVQRR NQIPESEELF CSMGACQLDV EMETGTGKTY VYTKTMLELN RLYGWCKFIV VVPSVAIREG VAKSLENTQE HFFSQYHKKI RFFIYDSDNL TELDAYSQSS DVNCMVINMQ AFNASMKEGA KNKAARIIFD ERDEFSSRRP IDVIAANRPI VICDEPQKMG KKGGATQKGI ARFNPLFVLN YSATHKEKHD LVYALDALDA YNQKLVKRIE VKGFALKNMR GTDGYLYLRD IVVSKNRAPE AVIEFKCMGS GGKVRKKTAR FGEGDSIYGA SGSTKLEAYR GYTIASGNDG VVPPQDGRPG YVRFLDGSIG DDGRVYIGEV YGDSAADDMQ RIQIRETILS HLQKEEALFR RGIKCLSLFF IDQVAKYRDL SGNGETVGYG KIFEEEYEAI VSDRLEHPTQ DDILDPSYAE YLGRFEACSV HSGYFSVDKK GNAVESKAER KAERDDGIGI NDDDAKRGYD LILRDKERLL SFDEPVRFIF SHSALREGWD NPNIFQICTL KESGSETSKR QEVGRGMRLA VDQDGNRQDA ALLGPDEVHR VNLLTVIASE SYETFVRDLQ TDISKSLRDR PKKVEMDLFS GRDVVLDGET VSFTEDESRR VYKTLYKCDL IDDDDKPTAE FRKAVEDGTF VEYFVAKLPE EIADAAHAKA VEALVKSVYD AHALDGMIGR AQEKISENTL TDNFARREFK ELWARINRKH AYTVSFSDDE LRRKSIERIN CDLRVSRLQY TLTVGGQKSR ATRDEVQGGS SFGRTQTETR NVDAGTASVG VTYDLVGEVA QAAAITRRSA AAILAGIDAN VFGLYRVNPE EFIKKTAALI VSEKATMVVE HISYHEIDDV YDEAIFTERM PDNASKAYEA KKNIQRFVFP DSDGERRFAE DMDAAAEVAV YAKLPRTFQI PTPVGNYAPD WAIAFKEGSV RHVFFVAETK GTVDTLELSG VENAKIACAK KLFNEMSTGD VRYHNVATYE DLLEVMGRMG
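
As a rough cross-check of the two sequence fields, the sketch below translates the first 30 bases of the gene sequence and compares the result with the start of the protein sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons), so a standard codon table is used here. The helper name `translate` and the table construction are illustrative assumptions, not part of the record.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence as listed in the record.
first_30 = "ATGAAGTTCA AGTTCAAAGT ACAACGGTAC"
print(translate(first_30))  # MKFKFKVQRY
```

The output agrees with the first ten residues of the protein sequence. The same function can be applied to the full gene sequence once the two line-wrapped parts are joined.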