Gene Elen_1506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1506 
Symbol 
ID8415804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1796317 
End bp1797891 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content67% 
IMG OID645024474 
ProductNLP/P60 protein 
Protein accessionYP_003181863 
Protein GI257791257 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.483483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAGA GCGCGGCGAC ATACTCGGAC GGATCGGCAC GGCTCGAGGC GGCAAGCCGC 
GAGCAGGCGC GTGCGCTCGG GATGGACGAC CGGCCAGACG CCATCTCGGC GGGGATCGTG
AAAGGATCGA CCGAGCGCGC GGGCGAGGTG CTCTCATCGA AGGGCGAATC TCCGGGGCGG
GGCAGGGCGG ACTTCGCCGA ACCCTCCCCT GTCGGTGCGG GCAGGCCGAA GGTGAAATCC
CGCCTCGCCG CCCAGGCCAA GGGGAACCTC AAGCGCGCGA TCGCCGATGC CGCCGCGTCC
GAGGCGGACG ACTCCGAGGA GCTCTCCGGC ATCGGACAGG CGAGCAACAC CTTCCGCGGC
GCACGCTCCG TCATCGCACG GCATTCCGCC TCCAAGAAGG CTACTGCCGC CTCGAAGCCT
AAAGGCCCGC TGAAAGGGGC CGCAAAGCAT GCCAAGGCGG GGGCCTTCGG TCAAGGCGCT
GCCGCGAAGG CCCGGCATGC GGCCGTCAGC CAGGCATCTG CCGGGGCTGC GAAGGCTGCG
GCATCCGCCG GCGGCAAGGG CGCGGTCGTG AGCGCGGGCT CGTCGGTTGC TGTCCCCGTG
GCGGGCGTGC TCGCGGCGAT CATGGCGTTT TTGCTCGCCG TGCTCGCAAT CTCCCAGATC
GTGAGCGCCC TTTTCGGGTT CTGGGAAAAC GAGGCGTCCA AAGCCTCCCT CGAGGGGCTG
CCGCCCTACA TCACCTACGA GATGGTCGAG GAGGCGCTCG CGTGTCAGGA GGAGTACGGG
CACCCCGCAG GATGCACGAT CGCGCAGATC ATCGTCGAGT CGGGGCAGGG CGACCACCTC
TCGGGGCTCG CCACGCAGGA CCACAACCTG TTCGGCATGA AGTGGTCGAG CTCGTATGCG
CTGTGCGAGG AGGTCGCGGG GAAGAGCTCG TGGAGGACCG GCGAGGAGTA CGGCGGCGAG
CAGGTCACCA TCACGGCGGA CTTCATCAGC TTCGTCGGCG ACGCGGAGTG CATCCGCTTC
CGCAGCCGCG TCTTCCTGCA GGCCGATCGC TACGCGTCAA ACGCGCTCAT ACGCGAGGCG
ATTGCGAACC ACGACTCGGA CAAGATGGCC GAGGGGCTCA AGGACGCGGG ATGGGCGACG
AGCTCGAGCT ACGTCGAGAG CCTGAAATCC ACCATGGAGA CCTACAACCT CTACCGTTTC
GACGGCATGA GCCTGGAGGA CTTCAGGTCC GGGGCGGTCT TGGCAGATGC GATCGTCTCC
GCCGCCTACA GCCAGCTCGG TGTCCCCTAC GTGTGGGGCG GGACGACCCC GGGCGTCGGC
CTGGACTGTA GCGGGCTCAC CCAGTATTGC TACAAGCAGG CCGGCATCTC GATACCGCGC
AACACCGAGG CCCAGTACGC GCAGGGAAAG AAGATCGCGC TCTCGGAGGC GCAGCCCGGC
GACATCCTCT ACCGCATGGG GCATGTCGGC ATCTACATAG GGGGCGACCG CTACATCCAC
GCGCCCCATC GGGGCGAGGT CGTGAAGATC GCAAGCGGGA TCTCGAGCTT CACCTGCGCC
CTGTCGTATC GATAG
 
Protein sequence
MPESAATYSD GSARLEAASR EQARALGMDD RPDAISAGIV KGSTERAGEV LSSKGESPGR 
GRADFAEPSP VGAGRPKVKS RLAAQAKGNL KRAIADAAAS EADDSEELSG IGQASNTFRG
ARSVIARHSA SKKATAASKP KGPLKGAAKH AKAGAFGQGA AAKARHAAVS QASAGAAKAA
ASAGGKGAVV SAGSSVAVPV AGVLAAIMAF LLAVLAISQI VSALFGFWEN EASKASLEGL
PPYITYEMVE EALACQEEYG HPAGCTIAQI IVESGQGDHL SGLATQDHNL FGMKWSSSYA
LCEEVAGKSS WRTGEEYGGE QVTITADFIS FVGDAECIRF RSRVFLQADR YASNALIREA
IANHDSDKMA EGLKDAGWAT SSSYVESLKS TMETYNLYRF DGMSLEDFRS GAVLADAIVS
AAYSQLGVPY VWGGTTPGVG LDCSGLTQYC YKQAGISIPR NTEAQYAQGK KIALSEAQPG
DILYRMGHVG IYIGGDRYIH APHRGEVVKI ASGISSFTCA LSYR