Gene Elen_0018 details


Gene Information

Locus tag: Elen_0018
Symbol:
ID: 8414297
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 25115
End bp: 28177
Gene Length: 3063 bp
Protein Length: 1020 aa
Translation table: 11
GC content: 61%
IMG OID: 645022993
Product: hypothetical protein
Protein accession: YP_003180401
Protein GI: 257789795
COG category:
COG ID:
TIGRFAM ID:
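
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. Neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 25115
END_BP = 28177
GENE_LENGTH_BP = 3063
PROTEIN_LENGTH_AA = 1020


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid-coding codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # compare with Gene Length
print(coding_codons(GENE_LENGTH_BP))   # compare with Protein Length
```

Under these assumptions, 28177 - 25115 + 1 = 3063 bp, and 3063 / 3 = 1021 codons, of which 1020 code for amino acids.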


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.69615
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGAGTACG CAGAGCAGTA CATCGCGCTC TGCCTTGGCG GCGCGGGCAG TGCTTCTGCC 
CCGGCGCCGG GTATCGTGTT GGACGGAACC GCGCCCTTCA CGCTCGATAT GATGGTGCGA
GGCATTCCGG TGGAAAGCGC CGCATCGGTG CTGCACCAGG AAGGGGCGCT CGACGTCAGG
CTGACGGCGA AGGGGTTTTC CTTCTGGCGC GAGGGGTTTG GCATCTTTTC CACTTCAAGC
GACGGCGAAA CGTTTCAACA GGGCGAGTGG AACCACCTGT GCATCGCCTA CGAGCCGGGA
ACGGTGCGCC TGTTCGTCAA CGGCGCGCTC GATTGCGTTG TGCAGAAGCC GTGCAAGGGA
AGCGCGTGCC CGAAGCCGTT CGTCGTGGGA GCGGGCGTCA AGGGAGGGGT TCGTCAACTG
CGGTTGTTCG ACCGCGCGTT CGGCGGGATG GAAGTGCAGG ATCTGCTGCT GATGGACTTT
GCCGATATCC GGGCGTCCTC CTACGCGGGC TCGCTGGCGG CGTTCTACGA CTTCGGATGC
AAGGCTCCTG TCGAGCGCGT GTCCGGCTCG ACTATCGCGC TGCAGGGCGA TGCGAAGATG
CGCGCTCTGT TCCCTTCCGT TCAGCTGCGG GGCAGCGCGT ACCTGGCCAT CTCCAACGAA
CCGGGGATTA ACCCTGCGGG GCGGCGCAAC GACGCCTATT CCATCCAGGC TTGGATCAGG
CTCGAACCGT TCGACGGCCA AGACGCGTAT ACCGTGTTCG CGAACGGCGA CTTGTCGGAG
GAGGCGGGCA TGTCGCTGTA CGTGGCGCGC GACGAAGCGA GCTGGCGCCT ATGCGCGCTG
CGGGGCGACG AGGAGCCCAT GATTTCGAAG GGGCTCGTGC AACCGCAGCT TTGGACGAAC
GTGTGCCTGA CGTATGACGG CCTCCAAACC CAATCGTTGT ACGTGGACGG CGTACTGGAC
AGCCAGATTT CCACATGCCT GCCCATTTCA GACGTGCTCG AGGAGCCGAA ACTTCGCATC
GGCGCCGACC TCTCGAACGG AAGCGACAAC GGCAAAGACT GTTTCTCGGG CGCCATCTCG
CGCGTGGACG TATGGAACCG CGCGCTCACG GCCGAGGAGG TGAAAAGCTA CGCCGCCGAA
GAGCCTTCGT TCGACGCGGA AGGGCTGCAG GCATCCTACG ATTTGAGCTT CGCCGACATC
AACAACGCCG TGTCCAGCGA TCCCATAGGA TTGCGCAACG GCGTAGTGGT CGACGACGTC
AGACAGGAGG CAGGTACGAC TCCGATGCCG ACTGCATGTC CGCCGAAGCC CGATCCGTTG
AGCGACGAGG AGCTGCGGCG TTGCCGAGCC GCGTGCCTGA AGGGGAACGA CTCCTCTCCT
CTACGCGTGA GCCGCTTGGA AAAGGATGGG TATGTGTGCT TCGTCGGCCA CTACCACGAC
GGTTCGCAGA CCATCGCGTG CGCAAAGGAA GGCTACGACG AATGGACGCT GTGGTATATC
GAACTCGTTC TGCTGCTGGT GGGCGGCGTG CTCACCGTGC TGGCAGGCGT GAGGATTGCC
GGAGGCAATA AGATCACCAA CTTCATCGTA ACGAAGATCA TGCCAAACCC GGCGTTTCGC
TCGCTGTTCT CGGGGCCGGT GTCCTTCAAA ACAATCATCA CGTTCTTCTA CCTTTTGAAG
GCGAACGGGT TGCTGACACC GCTTTTGAAG GCCGCAATGA GCGGGCTGCG CTGGTTCAAA
GTGGCCTGGT CGATTGCCGT GATGACAACT ATGGCTGTAG CTATTTGCAC GGGCATGGGT
CTGATCTATT ACGCCGCAGC GTTTGCCGAC CTGGCCGTCA GCCTGATCGT TCACCTGGCC
GACATGCCCG CTTCGGGCAC GTTGTTGCCG TGCGGAGTGA GCGCGTTGTT CTTCGATCAC
CATGCGGTGA CGAGCACTGT TCCGCTGCCT ACGGGCGAAG CCGACGCCAT CGCGCTGGCT
TGGAACGGGA CCCAGCTCGT GTCCAAGCCC GAGTGGGATA GCAGCAAAAG CGACCCGTGC
GCCTACTGCA TCGAGGCGGT CAAGGGAAAG AAGATCACGA TCAAGGCGAA CCTCACGTGC
TCCGACCCTT CATTGGCTTC CGTGAAAGTG CGTGCCGTCG ACAAGAGCCG ATCGACGTTG
CTCGGCGATT CCGACGAGAT CGCGGTGACG TTCAGATACG GGCGGGCCTC GGGCGCGACT
TTGGCGTTTC CTCGTCACGC GCTGGCAAAC AAGGGCGTGG GCAAGCACGA GCTGCAGCTG
GAGTGGCAGT GCTACTATCA GGGCGGATGG AAGAAGATGT CCACTACGAA GCATGTAATG
TATACGTTGC TGTCGTACCC GAACGAGCCG TGGCTCAGCC GCAACGGATC CTCCCAGTAT
CCGTGGGTTT CGCTGCTCGA AAAGGCCTGC TCTTGGGCGT CGGGGAAGAA GACGCCCGCC
GAAGCGGCGG GCACGATCGA GCGAAAGGTG AACGAAGGGC TGGGCCTCGA ATACGATACG
TCGGGATGGG GGCGATCCTA CTACTGCACG AACACGGGCT ACTTCCTGTT GGGCAATTTC
TTGAGGCAAA CCTCTTCTCT GGTCAACTGC ACGGACTGCG CGATCATCGT GACCACGTTC
GCCAACGCGT TGGGCTGCGA CTTGCACGAA GCGCGCATGG AGGATCCTTC GCCGAGCAAC
AAGCAGCAAT TCACGTTCTT GAAGGTGAAA TCGATCGGCA AGAAGGTCTG GCAAGATGGC
AGGTTCACCT ATCATGAAGT GGCCGTATCC AGGAAAGCGG CGACGACGAA CAATCAAGAC
CGTGCGGTGT ACGACGCATG TTGCACGCTC AACGGGTCTG ATACGCCCTC TTCGGCGAGC
AAGCGAGATC CTGTGCTGTC GAACGGCATG AACTTCTCCG ACTTCGACGA TACCGAGCCT
ATCCCGCGTA CGATCACGGC GCGATCCTCC TATCGGGAGC ATTTTGCAAC GAACGACGCG
GCGGGTGTTG GAAGGTGTGC CTACGTTTGG TCGAGTGAGA CCCGTCGTCC GGCTATGCCG
TAA
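
The GC content reported for this gene can be recomputed from the sequence text. The sketch below is a minimal illustration, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases. The helper names `clean_sequence` and `gc_content` are hypothetical and are not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above; paste the full sequence
# to compare against the GC content field of the record.
first_line = "ATGGAGTACG CAGAGCAGTA CATCGCGCTC TGCCTTGGCG GCGCGGGCAG TGCTTCTGCC"
print(f"{gc_content(first_line):.0%}")
```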
 
Protein sequence
MEYAEQYIAL CLGGAGSASA PAPGIVLDGT APFTLDMMVR GIPVESAASV LHQEGALDVR 
LTAKGFSFWR EGFGIFSTSS DGETFQQGEW NHLCIAYEPG TVRLFVNGAL DCVVQKPCKG
SACPKPFVVG AGVKGGVRQL RLFDRAFGGM EVQDLLLMDF ADIRASSYAG SLAAFYDFGC
KAPVERVSGS TIALQGDAKM RALFPSVQLR GSAYLAISNE PGINPAGRRN DAYSIQAWIR
LEPFDGQDAY TVFANGDLSE EAGMSLYVAR DEASWRLCAL RGDEEPMISK GLVQPQLWTN
VCLTYDGLQT QSLYVDGVLD SQISTCLPIS DVLEEPKLRI GADLSNGSDN GKDCFSGAIS
RVDVWNRALT AEEVKSYAAE EPSFDAEGLQ ASYDLSFADI NNAVSSDPIG LRNGVVVDDV
RQEAGTTPMP TACPPKPDPL SDEELRRCRA ACLKGNDSSP LRVSRLEKDG YVCFVGHYHD
GSQTIACAKE GYDEWTLWYI ELVLLLVGGV LTVLAGVRIA GGNKITNFIV TKIMPNPAFR
SLFSGPVSFK TIITFFYLLK ANGLLTPLLK AAMSGLRWFK VAWSIAVMTT MAVAICTGMG
LIYYAAAFAD LAVSLIVHLA DMPASGTLLP CGVSALFFDH HAVTSTVPLP TGEADAIALA
WNGTQLVSKP EWDSSKSDPC AYCIEAVKGK KITIKANLTC SDPSLASVKV RAVDKSRSTL
LGDSDEIAVT FRYGRASGAT LAFPRHALAN KGVGKHELQL EWQCYYQGGW KKMSTTKHVM
YTLLSYPNEP WLSRNGSSQY PWVSLLEKAC SWASGKKTPA EAAGTIERKV NEGLGLEYDT
SGWGRSYYCT NTGYFLLGNF LRQTSSLVNC TDCAIIVTTF ANALGCDLHE ARMEDPSPSN
KQQFTFLKVK SIGKKVWQDG RFTYHEVAVS RKAATTNNQD RAVYDACCTL NGSDTPSSAS
KRDPVLSNGM NFSDFDDTEP IPRTITARSS YREHFATNDA AGVGRCAYVW SSETRRPAMP