Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0028 |
Symbol | |
ID | 8414307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 36065 |
End bp | 38371 |
Gene Length | 2307 bp |
Protein Length | 768 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645023003 |
Product | hypothetical protein |
Protein accession | YP_003180411 |
Protein GI | 257789805 |
COG category | [S] Function unknown |
COG ID | [COG4717] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGGT CCTATCTCGA GCATATCAAG ATCGTCAGCT TCGGCGCGTT CTCAAACAAA GCGGTGGGTC CGTTCGCCCC GCACCTCAAC GTGGTGTACG GGCCGAACGA AGCCGGCAAG ACCACGCTCG CGTCGTTCGT GGGCGGCGTG CTGTTCGGCT GGGAAGAAGC GCGCGGCAGC CGCAACACCT ACAAGCCTGC GAATGCGGAG CGATCTGGAT CGCTGTTCTT CGCCCAGCGC GACGCGTCCT CCGGCGTTTC GTCCGCGCCG TCGGATGCGC CCGCCCTCGC CGCCCCGGAC GGGCGCGTCG CCGCCGTCGA AGCCGACGGT TCCCTCGCGG CGTCAGCTGC GCCGCTTCCC ACGCCTGAAC CCTCTCGCGC GCCCGAGCTC GAGCTCTCGC GCGTGCGCAA CAGCGACGGC TTGCAAGGCG ACGCATCCCT CGTGGCCGAC ATCGACAAAG AGACATTTCA GACGATGTTC TCCCTGACCA GCGACGAGCT GCGGACGCTG CGCAACACCA CCGACGTTAC GGCTAAGCTG CTCACCGCCG GTTCGGGAAC GGGCGCTTCG CCCGCTCACG CGCTGGCCAC CGTGCAGGAA AAGCTGGCTG AGTACACTTC GCGCGCCGCC GGCGTCGAGC ACTCCATCGC CAACCTCACG ACGCAGGAGA ACGAGCTGCG CGCCAAGATG ACGGCCGCCG CCGAGGAGGC CGAGCGCTTC AAGCGGCAGG ATAAGGAGTT CCGCGAGCTG GAACCCCAGC GCAACGAGCT TCTGGCGCGC CTCGATGCTC TGAACACGTC TATCGAGACG CTGACCGTCC AACGCTCGAA CATCGAGAAG CTGGATCACG AGATAGACTC GCTGACCGAC CAGGTAGCGT CGCTGTGCGA CGAGGAGGAC GTGCTGGTCA CAACCCATTG CGCCGCCGAC CGCAGCGTGT CTGAGCTGGT TGAGCTGCCA GCGTCGGAAG AGCGCGCCCT GCGCGACCAG ATCGACTCGC TGGCGGTCGA CGAGGCGAAA TGCGAGCACG CCGTCGATCT GGCGCAGGAC AACTTCGCAA CGTCGAAGGC CGCTTACGAG GCGCTGCTGG AAACCGACGA CGAACGCGCC CAACGCGAAC GCGCTCGCCG CCAGCGCAGC GTGCAGGTGG GGCTGTCCAT CGCGCTGCCG CTCATGTTCG TGTGCACGGG CTTGCCCCTG TTCATTCACG GCCGCGAGAT CACCAGCTTG TCGTTCACGG CGCTCGGCAT CGGTCTGGTG GTGTTCGCCA TCATGCTGGC GCTGGCCGCG ATGGTGATGC TGTTTCGCCC CAACAAGGAA GAGGACGCGA AGGAAGCGCG CAAGCAGGAC GCCCAGTGGG TGATGCTGCA AGACAAGAAG AAGCTGGAAG CGTGCCTGGA AAGCCAGGCA GCGTTCAGGA ACCGCGCCCG TGCCCAGCTG GACGCCGCGG GCTTGGCCGA GGCGCAGGGA TCGCTGCGCC GATCGCGTAC GCTGCTGGAC GAGGCGAAGG ACGCGCGCGC CGAGAACAAC CTGTTCCAGC AGCGGTTGCA GGCGCTCGTG TCGCGCCGCA GCGCGCTGGA GGAGAGCCTG GCCAGCGCGA AGCGCCAACG CCGCCGCCTG TACGAGCGAA TCGACCTGCG CGCCGAGCGC ACCCTCGACG CCGTCGACGA GGCTATCGCG CAGAAGTCGC AGCAGCGCAT GGGGCTCATG GAGGCCAGCG AAGGCCTCAA CCGCCGTTAC GGCGAGCTCA AGCAGGAGCT TTCCCATGCC CAGCACCTGC GCGACTTCGA CGAGTACAAG CTGCTGTACC AGCAGATCCG CACGCGCCAG GACGAGAGCG CGCAGGACTA CGCGCGCCTG CTGCTGGCGC GCCGCATGCT GGAAAGCGCC ATCGCCACTT GGGAGAGCAA GAGCCAGCCG GAGGTGTATC GTCAGGCCAG CCGCCTGTTG TCGTTGATGA CGGACGGCCG CTGGACGAAG GTGAGCCTCA CCGCCGAGGG CCGCTTGCAG GTGACCGACG CGGTGAAGAC CACGCGCGAT CCCGGCCATC TGTCGCTGGG CACGTGCCAG CAGCTGTACC TCGCGCTGCG CATCGCCCTG CTGATGACCG CCGACAACGT GGGTCGCGCC GTGCCCATCC TGGCCGACGA CATCCTCGTG AACTTCGACA CGTCCCGACG TGCGGGCGCC GCCCGCGCGT TGGCCGAGCT CGCCCGTATG CGCCAGGTGA TCCTGTTCAC CTGCCATGAG GAAGTGGTGG AGGCGTTGCG GGAGGCGGAC CCGACCTTGA ACGAGGTCGA ACTGTAG
|
Protein sequence | MKRSYLEHIK IVSFGAFSNK AVGPFAPHLN VVYGPNEAGK TTLASFVGGV LFGWEEARGS RNTYKPANAE RSGSLFFAQR DASSGVSSAP SDAPALAAPD GRVAAVEADG SLAASAAPLP TPEPSRAPEL ELSRVRNSDG LQGDASLVAD IDKETFQTMF SLTSDELRTL RNTTDVTAKL LTAGSGTGAS PAHALATVQE KLAEYTSRAA GVEHSIANLT TQENELRAKM TAAAEEAERF KRQDKEFREL EPQRNELLAR LDALNTSIET LTVQRSNIEK LDHEIDSLTD QVASLCDEED VLVTTHCAAD RSVSELVELP ASEERALRDQ IDSLAVDEAK CEHAVDLAQD NFATSKAAYE ALLETDDERA QRERARRQRS VQVGLSIALP LMFVCTGLPL FIHGREITSL SFTALGIGLV VFAIMLALAA MVMLFRPNKE EDAKEARKQD AQWVMLQDKK KLEACLESQA AFRNRARAQL DAAGLAEAQG SLRRSRTLLD EAKDARAENN LFQQRLQALV SRRSALEESL ASAKRQRRRL YERIDLRAER TLDAVDEAIA QKSQQRMGLM EASEGLNRRY GELKQELSHA QHLRDFDEYK LLYQQIRTRQ DESAQDYARL LLARRMLESA IATWESKSQP EVYRQASRLL SLMTDGRWTK VSLTAEGRLQ VTDAVKTTRD PGHLSLGTCQ QLYLALRIAL LMTADNVGRA VPILADDILV NFDTSRRAGA ARALAELARM RQVILFTCHE EVVEALREAD PTLNEVEL
|
| |