Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0018 |
Symbol | |
ID | 8414297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 25115 |
End bp | 28177 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645022993 |
Product | hypothetical protein |
Protein accession | YP_003180401 |
Protein GI | 257789795 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.69615 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTACG CAGAGCAGTA CATCGCGCTC TGCCTTGGCG GCGCGGGCAG TGCTTCTGCC CCGGCGCCGG GTATCGTGTT GGACGGAACC GCGCCCTTCA CGCTCGATAT GATGGTGCGA GGCATTCCGG TGGAAAGCGC CGCATCGGTG CTGCACCAGG AAGGGGCGCT CGACGTCAGG CTGACGGCGA AGGGGTTTTC CTTCTGGCGC GAGGGGTTTG GCATCTTTTC CACTTCAAGC GACGGCGAAA CGTTTCAACA GGGCGAGTGG AACCACCTGT GCATCGCCTA CGAGCCGGGA ACGGTGCGCC TGTTCGTCAA CGGCGCGCTC GATTGCGTTG TGCAGAAGCC GTGCAAGGGA AGCGCGTGCC CGAAGCCGTT CGTCGTGGGA GCGGGCGTCA AGGGAGGGGT TCGTCAACTG CGGTTGTTCG ACCGCGCGTT CGGCGGGATG GAAGTGCAGG ATCTGCTGCT GATGGACTTT GCCGATATCC GGGCGTCCTC CTACGCGGGC TCGCTGGCGG CGTTCTACGA CTTCGGATGC AAGGCTCCTG TCGAGCGCGT GTCCGGCTCG ACTATCGCGC TGCAGGGCGA TGCGAAGATG CGCGCTCTGT TCCCTTCCGT TCAGCTGCGG GGCAGCGCGT ACCTGGCCAT CTCCAACGAA CCGGGGATTA ACCCTGCGGG GCGGCGCAAC GACGCCTATT CCATCCAGGC TTGGATCAGG CTCGAACCGT TCGACGGCCA AGACGCGTAT ACCGTGTTCG CGAACGGCGA CTTGTCGGAG GAGGCGGGCA TGTCGCTGTA CGTGGCGCGC GACGAAGCGA GCTGGCGCCT ATGCGCGCTG CGGGGCGACG AGGAGCCCAT GATTTCGAAG GGGCTCGTGC AACCGCAGCT TTGGACGAAC GTGTGCCTGA CGTATGACGG CCTCCAAACC CAATCGTTGT ACGTGGACGG CGTACTGGAC AGCCAGATTT CCACATGCCT GCCCATTTCA GACGTGCTCG AGGAGCCGAA ACTTCGCATC GGCGCCGACC TCTCGAACGG AAGCGACAAC GGCAAAGACT GTTTCTCGGG CGCCATCTCG CGCGTGGACG TATGGAACCG CGCGCTCACG GCCGAGGAGG TGAAAAGCTA CGCCGCCGAA GAGCCTTCGT TCGACGCGGA AGGGCTGCAG GCATCCTACG ATTTGAGCTT CGCCGACATC AACAACGCCG TGTCCAGCGA TCCCATAGGA TTGCGCAACG GCGTAGTGGT CGACGACGTC AGACAGGAGG CAGGTACGAC TCCGATGCCG ACTGCATGTC CGCCGAAGCC CGATCCGTTG AGCGACGAGG AGCTGCGGCG TTGCCGAGCC GCGTGCCTGA AGGGGAACGA CTCCTCTCCT CTACGCGTGA GCCGCTTGGA AAAGGATGGG TATGTGTGCT TCGTCGGCCA CTACCACGAC GGTTCGCAGA CCATCGCGTG CGCAAAGGAA GGCTACGACG AATGGACGCT GTGGTATATC GAACTCGTTC TGCTGCTGGT GGGCGGCGTG CTCACCGTGC TGGCAGGCGT GAGGATTGCC GGAGGCAATA AGATCACCAA CTTCATCGTA ACGAAGATCA TGCCAAACCC GGCGTTTCGC TCGCTGTTCT CGGGGCCGGT GTCCTTCAAA ACAATCATCA CGTTCTTCTA CCTTTTGAAG GCGAACGGGT TGCTGACACC GCTTTTGAAG GCCGCAATGA GCGGGCTGCG CTGGTTCAAA GTGGCCTGGT CGATTGCCGT GATGACAACT ATGGCTGTAG CTATTTGCAC GGGCATGGGT CTGATCTATT ACGCCGCAGC GTTTGCCGAC CTGGCCGTCA GCCTGATCGT TCACCTGGCC GACATGCCCG CTTCGGGCAC GTTGTTGCCG TGCGGAGTGA GCGCGTTGTT CTTCGATCAC CATGCGGTGA CGAGCACTGT TCCGCTGCCT ACGGGCGAAG CCGACGCCAT CGCGCTGGCT TGGAACGGGA CCCAGCTCGT GTCCAAGCCC GAGTGGGATA GCAGCAAAAG CGACCCGTGC GCCTACTGCA TCGAGGCGGT CAAGGGAAAG AAGATCACGA TCAAGGCGAA CCTCACGTGC TCCGACCCTT CATTGGCTTC CGTGAAAGTG CGTGCCGTCG ACAAGAGCCG ATCGACGTTG CTCGGCGATT CCGACGAGAT CGCGGTGACG TTCAGATACG GGCGGGCCTC GGGCGCGACT TTGGCGTTTC CTCGTCACGC GCTGGCAAAC AAGGGCGTGG GCAAGCACGA GCTGCAGCTG GAGTGGCAGT GCTACTATCA GGGCGGATGG AAGAAGATGT CCACTACGAA GCATGTAATG TATACGTTGC TGTCGTACCC GAACGAGCCG TGGCTCAGCC GCAACGGATC CTCCCAGTAT CCGTGGGTTT CGCTGCTCGA AAAGGCCTGC TCTTGGGCGT CGGGGAAGAA GACGCCCGCC GAAGCGGCGG GCACGATCGA GCGAAAGGTG AACGAAGGGC TGGGCCTCGA ATACGATACG TCGGGATGGG GGCGATCCTA CTACTGCACG AACACGGGCT ACTTCCTGTT GGGCAATTTC TTGAGGCAAA CCTCTTCTCT GGTCAACTGC ACGGACTGCG CGATCATCGT GACCACGTTC GCCAACGCGT TGGGCTGCGA CTTGCACGAA GCGCGCATGG AGGATCCTTC GCCGAGCAAC AAGCAGCAAT TCACGTTCTT GAAGGTGAAA TCGATCGGCA AGAAGGTCTG GCAAGATGGC AGGTTCACCT ATCATGAAGT GGCCGTATCC AGGAAAGCGG CGACGACGAA CAATCAAGAC CGTGCGGTGT ACGACGCATG TTGCACGCTC AACGGGTCTG ATACGCCCTC TTCGGCGAGC AAGCGAGATC CTGTGCTGTC GAACGGCATG AACTTCTCCG ACTTCGACGA TACCGAGCCT ATCCCGCGTA CGATCACGGC GCGATCCTCC TATCGGGAGC ATTTTGCAAC GAACGACGCG GCGGGTGTTG GAAGGTGTGC CTACGTTTGG TCGAGTGAGA CCCGTCGTCC GGCTATGCCG TAA
|
Protein sequence | MEYAEQYIAL CLGGAGSASA PAPGIVLDGT APFTLDMMVR GIPVESAASV LHQEGALDVR LTAKGFSFWR EGFGIFSTSS DGETFQQGEW NHLCIAYEPG TVRLFVNGAL DCVVQKPCKG SACPKPFVVG AGVKGGVRQL RLFDRAFGGM EVQDLLLMDF ADIRASSYAG SLAAFYDFGC KAPVERVSGS TIALQGDAKM RALFPSVQLR GSAYLAISNE PGINPAGRRN DAYSIQAWIR LEPFDGQDAY TVFANGDLSE EAGMSLYVAR DEASWRLCAL RGDEEPMISK GLVQPQLWTN VCLTYDGLQT QSLYVDGVLD SQISTCLPIS DVLEEPKLRI GADLSNGSDN GKDCFSGAIS RVDVWNRALT AEEVKSYAAE EPSFDAEGLQ ASYDLSFADI NNAVSSDPIG LRNGVVVDDV RQEAGTTPMP TACPPKPDPL SDEELRRCRA ACLKGNDSSP LRVSRLEKDG YVCFVGHYHD GSQTIACAKE GYDEWTLWYI ELVLLLVGGV LTVLAGVRIA GGNKITNFIV TKIMPNPAFR SLFSGPVSFK TIITFFYLLK ANGLLTPLLK AAMSGLRWFK VAWSIAVMTT MAVAICTGMG LIYYAAAFAD LAVSLIVHLA DMPASGTLLP CGVSALFFDH HAVTSTVPLP TGEADAIALA WNGTQLVSKP EWDSSKSDPC AYCIEAVKGK KITIKANLTC SDPSLASVKV RAVDKSRSTL LGDSDEIAVT FRYGRASGAT LAFPRHALAN KGVGKHELQL EWQCYYQGGW KKMSTTKHVM YTLLSYPNEP WLSRNGSSQY PWVSLLEKAC SWASGKKTPA EAAGTIERKV NEGLGLEYDT SGWGRSYYCT NTGYFLLGNF LRQTSSLVNC TDCAIIVTTF ANALGCDLHE ARMEDPSPSN KQQFTFLKVK SIGKKVWQDG RFTYHEVAVS RKAATTNNQD RAVYDACCTL NGSDTPSSAS KRDPVLSNGM NFSDFDDTEP IPRTITARSS YREHFATNDA AGVGRCAYVW SSETRRPAMP
|
| |