Gene Elen_0131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0131 
Symbol 
ID8414415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp178059 
End bp180176 
Gene Length2118 bp 
Protein Length705 aa 
Translation table11 
GC content65% 
IMG OID645023111 
Producthypothetical protein 
Protein accessionYP_003180514 
Protein GI257789908 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGCA CGAGGTCACG CTCGAACGGC ATCGTCGCGA TCGCAATCGC TTTCGCGCTC 
GCTGCAAGCA ATCTCGCCTT CGCGCCTCAG CCCGCGCATG CGCAACCCGA GGCGAACGAA
GTCGTCATCG TGCAGAACAT GACGCCCATC GGCAGCGCCG AGGAATACGA CGCGCTGTTC
CCGTCCGGCT ACCCGAGTGC CGACAAGCCC CTTTCGGCCG GAAATGAAGG CGAATCGACC
GACGAGGGCG CAACCGGGGA AACGCCCCTT TGCGGCGCGC GCTCTTTTGC GCCAAGCGAT
GCCGAGACGA TGTCCTCGCT GAAGCAGCAG ATCGCCGCTG ACGAACCTCG CATCACGACT
GCCGAGAAAA CCGATTACCA GGTAGGCGAT TCGAAGACGT TTCGCGCCGC AGGCCGTCCG
GAAGGATTCA CGGCCACGGC AGTGGCTGTC GGAGAGCTGT TCACCCTGTG GGTCGAGGAC
GCGGAGTCCG ACATGCTCCC CGCAGATCTG GTGCAAAGGC TGGCTGGCAA GATCGACCCC
GTTCTTCGAA AAGTAACGGA CGCCTTCGGG TCGACGGTTC GCGTCGATTT GGACGGCGAC
GGGAAAACCG CGTTCGTCTT CCACCGATTC CCGCCGGAGA CGGAGGTGCT GGACGGCTAT
TTCACCTCGA TCGACTTGTA CACACCCGAG CAGCTGACGG CTGCAGACCT GATCGAAGAG
GCCTCCTACA CCAACGCTAT GGACGTGCTG CACCTCAACG TGCTGAACCG AAAGTCGTTG
GAGGGCGTGG GCGAGTTCGA CGAGAGCCTC GTCCCGCCGA TGATCGCCCA CGAATTCCAG
CACCTGGTGA ACTTCGCGCA GACAGACGGC TCCAGCGAAG CATGGCTCAA CGAGGCGTTC
TCCCAAGCGG CCGTGGCCAT CGCAGGATAC GGCTCCACCC AGAAAACCCG AGCTCAGAAC
TTGGCCGTGA TGGTCAACCT CAGCGGCCGC ATTCCCCCGT TCGTATACGA GGGGAGCTTC
GTGCCGGACG CTTCTCTGGG CGCAGGAGGA ACAGCGGTGT ACGCGCACGG TTACCTGTTC
TCGCGCTATC TCGCCAACCA AACCCGCGGG CTTCCCGGAG GCGGAGACAG CGTGTACCGA
TCGGTGTTCG ACGCGATGCG GGACGAGCGA GGGCTGGGTC AGTGCACGTC GGAGAGCCTG
ATGGCCGCAC TCGACAACAT CGGGTACGCG GGCGTCGGCG ACGACTGCGC CGTGGCCAGC
CTGGACGACC TTGCCCTCGG TTACGCGACG GCGCTTTTCC TGCGCGAGGA AACGGGCCCG
CACAGCTTGG TGAACCGCGC AGGATCCAAT CCGTCCATCG TGGACGGGTT GGAAGTTCCC
CTGCTTTCCG TGCCGGAACC CTCCAAGTCG CTGCAAGGAG GCGGCTCGGC GACGATAGCC
TCACTGGCAG CCTCGGGAGC GCCGGGTGCG AACGCCGGGT CCGGCACCCA AACGAAGTTC
GCCACCTCTT CGCTGCCTGT TTCCTATAAG ATAGCGGCAA ACCCGTCGAG CGGACCGGTG
AAGCCGGGAA GCCAGATCGC ACTGAGCTCG CCTCAGCTGG CAAGCCTCCC CGGCGCGCAT
TACGAGGTTG CGACCCTCAC CACCTACGAG CAGATCCTCA ACTTGAGCGC GCCCTTCCTG
CCGCTGGAAG ACCCGCTGCT GTTCGAGCCG GGCGTTTTGG CGTACGCGGT ACGCATCGCA
AGCGACCGCG GAACGACGCC CCACACCGTA TTCGGCTTTT ACGAAACCGC CGAACCAGAC
GAGGGCGAAG GCGACCAGGG CAACGATCCG TCCGGCGGCA CTCCCCCGTC GGACGATGCC
GGAGACGCGC CTTCCGGCGG GGATCAGCCT TCGGGGGGCA GCCCCTCAAG CGACGGCGCG
CAGCATCCCG GCAGCGCGCC CGATGACAAC GGCACCGACG ACGCGCGGAT CCGCCAGATG
CCCGCAAAAG CGCTCGCCGC CACCGGCGAT GGAGAGGCAC CGATCGCCGC CCTCGCACTG
GCAGCCGCGG CAAGCCTGTG CTGCATGGCG CTCGCACGAT GCGCGAAGAA ACGGAGCGTC
GGCTCCCCAG CAAGGTGA
 
Protein sequence
MLRTRSRSNG IVAIAIAFAL AASNLAFAPQ PAHAQPEANE VVIVQNMTPI GSAEEYDALF 
PSGYPSADKP LSAGNEGEST DEGATGETPL CGARSFAPSD AETMSSLKQQ IAADEPRITT
AEKTDYQVGD SKTFRAAGRP EGFTATAVAV GELFTLWVED AESDMLPADL VQRLAGKIDP
VLRKVTDAFG STVRVDLDGD GKTAFVFHRF PPETEVLDGY FTSIDLYTPE QLTAADLIEE
ASYTNAMDVL HLNVLNRKSL EGVGEFDESL VPPMIAHEFQ HLVNFAQTDG SSEAWLNEAF
SQAAVAIAGY GSTQKTRAQN LAVMVNLSGR IPPFVYEGSF VPDASLGAGG TAVYAHGYLF
SRYLANQTRG LPGGGDSVYR SVFDAMRDER GLGQCTSESL MAALDNIGYA GVGDDCAVAS
LDDLALGYAT ALFLREETGP HSLVNRAGSN PSIVDGLEVP LLSVPEPSKS LQGGGSATIA
SLAASGAPGA NAGSGTQTKF ATSSLPVSYK IAANPSSGPV KPGSQIALSS PQLASLPGAH
YEVATLTTYE QILNLSAPFL PLEDPLLFEP GVLAYAVRIA SDRGTTPHTV FGFYETAEPD
EGEGDQGNDP SGGTPPSDDA GDAPSGGDQP SGGSPSSDGA QHPGSAPDDN GTDDARIRQM
PAKALAATGD GEAPIAALAL AAAASLCCMA LARCAKKRSV GSPAR