Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1506 |
Symbol | |
ID | 8415804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1796317 |
End bp | 1797891 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645024474 |
Product | NLP/P60 protein |
Protein accession | YP_003181863 |
Protein GI | 257791257 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.483483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGA GCGCGGCGAC ATACTCGGAC GGATCGGCAC GGCTCGAGGC GGCAAGCCGC GAGCAGGCGC GTGCGCTCGG GATGGACGAC CGGCCAGACG CCATCTCGGC GGGGATCGTG AAAGGATCGA CCGAGCGCGC GGGCGAGGTG CTCTCATCGA AGGGCGAATC TCCGGGGCGG GGCAGGGCGG ACTTCGCCGA ACCCTCCCCT GTCGGTGCGG GCAGGCCGAA GGTGAAATCC CGCCTCGCCG CCCAGGCCAA GGGGAACCTC AAGCGCGCGA TCGCCGATGC CGCCGCGTCC GAGGCGGACG ACTCCGAGGA GCTCTCCGGC ATCGGACAGG CGAGCAACAC CTTCCGCGGC GCACGCTCCG TCATCGCACG GCATTCCGCC TCCAAGAAGG CTACTGCCGC CTCGAAGCCT AAAGGCCCGC TGAAAGGGGC CGCAAAGCAT GCCAAGGCGG GGGCCTTCGG TCAAGGCGCT GCCGCGAAGG CCCGGCATGC GGCCGTCAGC CAGGCATCTG CCGGGGCTGC GAAGGCTGCG GCATCCGCCG GCGGCAAGGG CGCGGTCGTG AGCGCGGGCT CGTCGGTTGC TGTCCCCGTG GCGGGCGTGC TCGCGGCGAT CATGGCGTTT TTGCTCGCCG TGCTCGCAAT CTCCCAGATC GTGAGCGCCC TTTTCGGGTT CTGGGAAAAC GAGGCGTCCA AAGCCTCCCT CGAGGGGCTG CCGCCCTACA TCACCTACGA GATGGTCGAG GAGGCGCTCG CGTGTCAGGA GGAGTACGGG CACCCCGCAG GATGCACGAT CGCGCAGATC ATCGTCGAGT CGGGGCAGGG CGACCACCTC TCGGGGCTCG CCACGCAGGA CCACAACCTG TTCGGCATGA AGTGGTCGAG CTCGTATGCG CTGTGCGAGG AGGTCGCGGG GAAGAGCTCG TGGAGGACCG GCGAGGAGTA CGGCGGCGAG CAGGTCACCA TCACGGCGGA CTTCATCAGC TTCGTCGGCG ACGCGGAGTG CATCCGCTTC CGCAGCCGCG TCTTCCTGCA GGCCGATCGC TACGCGTCAA ACGCGCTCAT ACGCGAGGCG ATTGCGAACC ACGACTCGGA CAAGATGGCC GAGGGGCTCA AGGACGCGGG ATGGGCGACG AGCTCGAGCT ACGTCGAGAG CCTGAAATCC ACCATGGAGA CCTACAACCT CTACCGTTTC GACGGCATGA GCCTGGAGGA CTTCAGGTCC GGGGCGGTCT TGGCAGATGC GATCGTCTCC GCCGCCTACA GCCAGCTCGG TGTCCCCTAC GTGTGGGGCG GGACGACCCC GGGCGTCGGC CTGGACTGTA GCGGGCTCAC CCAGTATTGC TACAAGCAGG CCGGCATCTC GATACCGCGC AACACCGAGG CCCAGTACGC GCAGGGAAAG AAGATCGCGC TCTCGGAGGC GCAGCCCGGC GACATCCTCT ACCGCATGGG GCATGTCGGC ATCTACATAG GGGGCGACCG CTACATCCAC GCGCCCCATC GGGGCGAGGT CGTGAAGATC GCAAGCGGGA TCTCGAGCTT CACCTGCGCC CTGTCGTATC GATAG
|
Protein sequence | MPESAATYSD GSARLEAASR EQARALGMDD RPDAISAGIV KGSTERAGEV LSSKGESPGR GRADFAEPSP VGAGRPKVKS RLAAQAKGNL KRAIADAAAS EADDSEELSG IGQASNTFRG ARSVIARHSA SKKATAASKP KGPLKGAAKH AKAGAFGQGA AAKARHAAVS QASAGAAKAA ASAGGKGAVV SAGSSVAVPV AGVLAAIMAF LLAVLAISQI VSALFGFWEN EASKASLEGL PPYITYEMVE EALACQEEYG HPAGCTIAQI IVESGQGDHL SGLATQDHNL FGMKWSSSYA LCEEVAGKSS WRTGEEYGGE QVTITADFIS FVGDAECIRF RSRVFLQADR YASNALIREA IANHDSDKMA EGLKDAGWAT SSSYVESLKS TMETYNLYRF DGMSLEDFRS GAVLADAIVS AAYSQLGVPY VWGGTTPGVG LDCSGLTQYC YKQAGISIPR NTEAQYAQGK KIALSEAQPG DILYRMGHVG IYIGGDRYIH APHRGEVVKI ASGISSFTCA LSYR
|
| |