Gene Elen_0929 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0929 
Symbol 
ID8415219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1132205 
End bp1135387 
Gene Length3183 bp 
Protein Length1060 aa 
Translation table11 
GC content62% 
IMG OID645023893 
Producttype III restriction protein res subunit 
Protein accessionYP_003181290 
Protein GI257790684 
COG category[V] Defense mechanisms 
COG ID[COG3587] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCA AGTTCAAAGT ACAACGGTAC CAAACCGACG CCGCAGATGC GGTGACCTCG 
GTCTTCGAGG GGCAGCCCAA CCAGGGCGCC TCCGTCTACC TGCGCGATCT GGGCCGCAAG
CCCAAGTGTG CGCAGGGCGA GATCGACTTC GGCGACGAGA GCACCGAGGG CTATGCCAAT
GCGCCCGTCG CGCTTTCGGG CGCTGACATA CTGGCAAACG TGCGAGCGGT GCAGCGACGC
AACCAGATCC CCGAGTCGGA GGAGCTTTTC TGCAGTATGG GAGCCTGCCA GCTGGACGTC
GAGATGGAGA CGGGCACCGG CAAGACATAC GTCTACACCA AGACGATGCT CGAGCTCAAC
CGCCTGTACG GCTGGTGCAA GTTCATCGTG GTGGTGCCAT CGGTCGCCAT CCGCGAGGGT
GTGGCCAAGA GCCTGGAGAA CACCCAGGAG CACTTCTTCT CCCAATACCA CAAGAAGATC
AGGTTCTTCA TATACGACTC GGACAACCTG ACCGAACTGG ATGCGTATTC CCAGTCGAGC
GACGTCAACT GCATGGTCAT CAACATGCAG GCGTTCAACG CGTCGATGAA GGAGGGCGCC
AAGAACAAGG CGGCGCGCAT CATCTTCGAC GAGCGCGACG AGTTCAGCAG CCGTAGGCCC
ATCGACGTCA TAGCGGCGAA CCGCCCCATT GTCATCTGCG ATGAGCCCCA GAAGATGGGA
AAGAAGGGGG GCGCGACCCA GAAGGGCATC GCCCGCTTCA ACCCGCTGTT CGTGCTGAAC
TACTCGGCAA CTCACAAGGA GAAGCACGAC CTGGTGTATG CCTTGGACGC CTTGGACGCC
TACAACCAGA AACTCGTCAA GCGCATCGAG GTGAAGGGCT TCGCCCTAAA GAACATGCGC
GGAACCGACG GCTACCTGTA CCTGCGCGAC ATCGTGGTAT CCAAGAATCG CGCCCCCGAG
GCCGTTATCG AGTTCAAGTG CATGGGGTCT GGCGGCAAGG TGCGCAAGAA GACCGCACGC
TTTGGCGAGG GCGACAGCAT CTACGGCGCC AGCGGTTCGA CGAAGCTCGA AGCGTACCGC
GGCTACACCA TCGCGAGCGG CAACGACGGC GTGGTCCCGC CCCAGGATGG GCGCCCAGGC
TACGTGCGCT TCCTGGACGG CTCTATCGGG GACGACGGGC GCGTCTACAT CGGCGAGGTC
TACGGCGACT CCGCCGCCGA CGACATGCAG CGCATACAGA TTCGCGAGAC GATCCTGAGC
CACTTGCAAA AGGAGGAGGC GCTTTTCCGC CGCGGCATCA AGTGCCTGAG TCTTTTCTTC
ATCGACCAGG TAGCGAAGTA CCGCGACCTC TCCGGAAACG GCGAAACAGT GGGCTACGGT
AAGATCTTCG AGGAGGAGTA CGAGGCCATC GTCTCCGACA GGCTGGAGCA CCCGACGCAG
GATGACATCC TCGACCCCTC GTATGCGGAG TACCTCGGGC GATTCGAGGC ATGCTCAGTG
CACTCCGGAT ACTTCTCGGT GGACAAGAAG GGCAACGCCG TCGAGTCGAA GGCCGAGAGG
AAGGCGGAGC GCGATGACGG CATCGGCATC AACGACGACG ACGCGAAACG CGGCTACGAC
CTGATCCTGC GCGATAAGGA GCGCCTGCTT TCGTTCGATG AGCCGGTGCG CTTCATCTTC
AGCCACTCGG CGCTGCGCGA GGGCTGGGAC AACCCGAACA TCTTCCAGAT CTGCACACTC
AAAGAATCCG GCTCTGAGAC GAGCAAGCGC CAGGAGGTCG GCCGCGGCAT GCGCTTGGCG
GTCGATCAAG ACGGCAACCG CCAGGACGCG GCGCTGCTGG GACCCGACGA GGTGCACCGC
GTGAACTTGC TCACCGTGAT CGCGTCGGAG AGCTACGAGA CGTTCGTGAG GGATTTGCAG
ACAGATATAA GCAAGAGCCT GCGCGATCGT CCGAAGAAGG TGGAGATGGA CCTGTTCAGC
GGTCGCGACG TCGTGCTCGA CGGCGAGACG GTCTCCTTCA CCGAGGACGA GTCGAGGCGC
GTATACAAAA CCCTCTACAA GTGCGACCTC ATCGATGACG ACGACAAGCC GACCGCCGAG
TTCCGCAAGG CCGTGGAGGA CGGCACCTTC GTGGAGTACT TCGTCGCGAA GCTGCCCGAG
GAAATCGCCG ACGCTGCCCA CGCCAAGGCG GTCGAGGCGC TGGTGAAGAG CGTCTACGAC
GCGCACGCGC TCGACGGCAT GATCGGGCGC GCCCAGGAGA AGATCTCCGA GAACACGCTG
ACCGACAACT TCGCAAGACG CGAGTTCAAG GAACTGTGGG CGCGCATCAA CCGAAAGCAC
GCCTACACGG TGAGCTTCTC GGACGACGAG TTGAGGCGCA AGTCGATAGA GCGTATCAAC
TGTGACCTGC GCGTGAGCAG GCTGCAGTAC ACCTTGACGG TCGGCGGGCA AAAGTCACGG
GCAACGCGCG ACGAGGTGCA GGGCGGCTCG TCGTTCGGCC GCACGCAGAC CGAGACTCGC
AACGTCGACG CCGGAACCGC GTCCGTGGGC GTGACCTACG ACCTCGTGGG CGAGGTGGCG
CAGGCCGCTG CTATCACGCG CAGGAGCGCC GCGGCGATTC TCGCCGGCAT CGATGCCAAC
GTGTTCGGGC TCTACCGCGT GAATCCCGAG GAGTTCATCA AGAAGACGGC GGCGCTCATC
GTGTCGGAGA AGGCGACGAT GGTGGTGGAG CACATCAGCT ACCACGAGAT CGACGACGTC
TACGATGAGG CTATCTTCAC TGAGCGCATG CCCGACAACG CGAGCAAGGC GTACGAGGCC
AAGAAGAACA TCCAGCGCTT CGTGTTTCCC GACTCCGACG GCGAGCGCAG GTTCGCTGAG
GACATGGACG CCGCCGCCGA GGTAGCGGTC TATGCCAAGC TGCCGCGCAC GTTCCAGATT
CCCACGCCGG TGGGCAACTA CGCCCCAGAC TGGGCCATCG CCTTCAAGGA AGGCAGCGTG
CGCCATGTGT TCTTCGTCGC CGAGACCAAG GGCACCGTGG ACACGCTGGA GCTCTCCGGT
GTGGAGAACG CCAAGATCGC CTGCGCCAAG AAGCTTTTCA ACGAGATGAG CACGGGCGAC
GTGCGCTACC ACAACGTGGC GACGTACGAG GACCTGCTCG AGGTGATGGG CAGGATGGGC
TAG
 
Protein sequence
MKFKFKVQRY QTDAADAVTS VFEGQPNQGA SVYLRDLGRK PKCAQGEIDF GDESTEGYAN 
APVALSGADI LANVRAVQRR NQIPESEELF CSMGACQLDV EMETGTGKTY VYTKTMLELN
RLYGWCKFIV VVPSVAIREG VAKSLENTQE HFFSQYHKKI RFFIYDSDNL TELDAYSQSS
DVNCMVINMQ AFNASMKEGA KNKAARIIFD ERDEFSSRRP IDVIAANRPI VICDEPQKMG
KKGGATQKGI ARFNPLFVLN YSATHKEKHD LVYALDALDA YNQKLVKRIE VKGFALKNMR
GTDGYLYLRD IVVSKNRAPE AVIEFKCMGS GGKVRKKTAR FGEGDSIYGA SGSTKLEAYR
GYTIASGNDG VVPPQDGRPG YVRFLDGSIG DDGRVYIGEV YGDSAADDMQ RIQIRETILS
HLQKEEALFR RGIKCLSLFF IDQVAKYRDL SGNGETVGYG KIFEEEYEAI VSDRLEHPTQ
DDILDPSYAE YLGRFEACSV HSGYFSVDKK GNAVESKAER KAERDDGIGI NDDDAKRGYD
LILRDKERLL SFDEPVRFIF SHSALREGWD NPNIFQICTL KESGSETSKR QEVGRGMRLA
VDQDGNRQDA ALLGPDEVHR VNLLTVIASE SYETFVRDLQ TDISKSLRDR PKKVEMDLFS
GRDVVLDGET VSFTEDESRR VYKTLYKCDL IDDDDKPTAE FRKAVEDGTF VEYFVAKLPE
EIADAAHAKA VEALVKSVYD AHALDGMIGR AQEKISENTL TDNFARREFK ELWARINRKH
AYTVSFSDDE LRRKSIERIN CDLRVSRLQY TLTVGGQKSR ATRDEVQGGS SFGRTQTETR
NVDAGTASVG VTYDLVGEVA QAAAITRRSA AAILAGIDAN VFGLYRVNPE EFIKKTAALI
VSEKATMVVE HISYHEIDDV YDEAIFTERM PDNASKAYEA KKNIQRFVFP DSDGERRFAE
DMDAAAEVAV YAKLPRTFQI PTPVGNYAPD WAIAFKEGSV RHVFFVAETK GTVDTLELSG
VENAKIACAK KLFNEMSTGD VRYHNVATYE DLLEVMGRMG