Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0385 |
Symbol | |
ID | 8414669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 495239 |
End bp | 496321 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645023361 |
Product | NLP/P60 protein |
Protein accession | YP_003180764 |
Protein GI | 257790158 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGATT TCACGCGCGG CCTCGATCGC CGCACGTTTA TATTTGGCGC GGCTGCCCTG GGTGCGGCCA CGCTGTTGCG GCCGACGCTC GCTTTTGCCG AGCCCACGTC CGCAGACAAG TTCGCCGAGG CCGATTCCGT TCGCGCTCGC ATCAACGAGA TGCAGGAGCA GCTGACCATC GCCGGTGAGA ACTATTACAA AGCGCTCGAC GAGCACGAAG CGGCCGTCCA AGCCGTTGCC GACGCCCAGG CGCGCATCGA CGAAGCGAAC GCCCAGATCG CCGAGTTGCA GGACAAGCTG AGCAAGCGGG CGCGCAGTAT GTATCGCAGC GGCCAGACCA CGGCGCTCGA CGTCATCTTG GGAGCCACGA CGTTCGAGGA ATTCGCCACA AGCTGGGATT TGCTGAACGA TATCAACGAC AACGATGCCG CGATGGTGCA GCAGACGAAA GACCTGCGCG CCGAGGTGGA GGCGGCGAAG ATCGAACTCG AAAAGCAAGA GCGTATCGCT GCCGAAAAAG CCGAAGAGGC GGCGCGCATC AAAGCTGAGG CAGAGCAGAC CATCCAAACG CTGGAAGCCA CGCTGCAACA GCTTGACGCC GAGGCTCAGG CGCTGCTGGT CCAGGAACAG GAAGCGGCAC GTGCTGCCGA AGCTGCCGCT GCGGCAGCCG AGGTCAAGCG CACGTACGCC TACAGCACGC CGAGCGCGTC CATCCCGTCC CAGGGCTCGG TGGTGGACTA CGCGCTGTCG CGCATCGGCT GCCCCTACGT GTGGGGTGCC GCCGGTCCGA ACGAATTCGA CTGCTCCGGA TTGGTGACGT GGGCATACGC TCAAGTGGGT ATTTCGGTGC CCCATCAGAC CGAGTCGATG TACTATGCCG CCGCCGCGCG TTTGCCGGTC TCCGAAGCGC AACCCGGCGA TGTGCTGTGG ATCAGCTACG GCGACGGGTA CAACGGACAT GCTGGCATCG CCTGCAATGC CGGGGGCACG CACTACGTTC ACGCCCCCAC GTTTGGCGCG CGCGTGCGCG ATACCGATCC GCTGAGCTGG GCCGGGTTTA CGCATGCGCT CCGATTCGCA TAA
|
Protein sequence | MTDFTRGLDR RTFIFGAAAL GAATLLRPTL AFAEPTSADK FAEADSVRAR INEMQEQLTI AGENYYKALD EHEAAVQAVA DAQARIDEAN AQIAELQDKL SKRARSMYRS GQTTALDVIL GATTFEEFAT SWDLLNDIND NDAAMVQQTK DLRAEVEAAK IELEKQERIA AEKAEEAARI KAEAEQTIQT LEATLQQLDA EAQALLVQEQ EAARAAEAAA AAAEVKRTYA YSTPSASIPS QGSVVDYALS RIGCPYVWGA AGPNEFDCSG LVTWAYAQVG ISVPHQTESM YYAAAARLPV SEAQPGDVLW ISYGDGYNGH AGIACNAGGT HYVHAPTFGA RVRDTDPLSW AGFTHALRFA
|
| |