Gene Information |
Locus tag | Elen_1966 |
Symbol | |
ID | 8416277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2303345 |
End bp | 2304496 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645024943 |
Product | peptidase M42 family protein |
Protein accession | YP_003182319 |
Protein GI | 257791713 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
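The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 2303345
END_BP = 2304496


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumption: the CDS is a whole number of codons and, when
    has_stop_codon is True, the last codon is a stop codon.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1152, matching "Gene Length"
print(protein_length(length_bp))  # 383, matching "Protein Length"
```

Because the gene is on the minus strand, the start and end positions describe the span on the replicon, not the direction of transcription; the length calculation is the same for either strand.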
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAACA AGCAGGTGAA ATTCCTCAAG CAACTGCTCG AGACCCCGTC GGCCACCGGC ACCGAGATCG CCGTGGCCGA ACTGGTGCGC GAGCGCCTGG CCGGCACGGC CGACGAGATC CAGACCGACG TCATGGGCTC GGTCCATGCC CGCCTGTCGG GCACGGGCGT GGCCCCGTCG CTCATGCTGT CGGCCCACAT GGACGAGATC GGCCTCATGG TCACGTACAT CTCCGACGAG GGCTTCCTGT CCGTGTCGTC CGTCGGCGGC GTGGACGCGG CCATCCTCCC GGGCATGCGC GTGGACGTGC ACGCGTCGAA CTCCTTCGAG CCGCTGCGCG GCGTGGTGGG CCGCAAGCCC ATCCACCTCA TCGAGCCCGA CGAGCGCAAG AACGTCACGC CCATCGACAA GCTGGTCATC GACCTCGGCA TGCCGGCGAA GCGTGTGAAG AAGCTCGTCA TGGTGGGCGA CGTCATCACC TTCGGCGTGG GCTTCGAGCG CTTCGGCAAG AACATGGCTG TCTCGCGCGC CTTCGACGAC AAGGCGGGCG TGTGGGTCGC TGTGCGCGTG CTGGAGACGC TCGCTAAGGA GGGCCGCGCG CCCGGCGACT TCATCGTGGC CGCCACCGTT CAGGAGGAGA TCGGCACGCG CGGCGCCATT ACGTCCGCGT ACGGCCTGGA CCCCGATGTG GCCATCGCGT TCGACGTGAC GCACGCCACC GACTATCCCG GCATCGACAA GACGAAGTAC GGAAAGATCG TCTGCGGCGA GGGTCCCGTT ATCGCGCGCG GCCCCAACAT CAACCCGGCT GTGTTCGAGC GCCTCGTGGC GGCGGCCGAG GCCGAGGGTC TGCCGTACCA GATCGAGGCC GAGCCCGGCG TCACCGGCAC CGACGCCCGT TCCATCCAGA TCTCCCGCGG CGGCGTCCCC ACCGGCCTCG TGTCGGTGCC CCTGCGCTAC ATGCATACGC CCACCGAGGT GGTCAGCCTC GACGACCTCG ACGCAACGGT GAAGCTGCTC GCCCGTTTCG CGCGCGACCT GGACGAGGAC GCCTGCTTCG TGCCGGGCAT GGGCGACGCG GTGACCGAGG GCGACGGCGC TGGCGCCGAG TCCGCTGCGC AGATGCAGTT CGACGAGACG GGCGTGGAGT AG
|
Protein sequence | MKNKQVKFLK QLLETPSATG TEIAVAELVR ERLAGTADEI QTDVMGSVHA RLSGTGVAPS LMLSAHMDEI GLMVTYISDE GFLSVSSVGG VDAAILPGMR VDVHASNSFE PLRGVVGRKP IHLIEPDERK NVTPIDKLVI DLGMPAKRVK KLVMVGDVIT FGVGFERFGK NMAVSRAFDD KAGVWVAVRV LETLAKEGRA PGDFIVAATV QEEIGTRGAI TSAYGLDPDV AIAFDVTHAT DYPGIDKTKY GKIVCGEGPV IARGPNINPA VFERLVAAAE AEGLPYQIEA EPGVTGTDAR SIQISRGGVP TGLVSVPLRY MHTPTEVVSL DDLDATVKLL ARFARDLDED ACFVPGMGDA VTEGDGAGAE SAAQMQFDET GVE
|
| |
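The gene and protein sequences above can be checked against each other and against the GC content field. The sketch below is illustrative only: it assumes the gene sequence is shown in coding orientation (the first codons translate to the start of the listed protein) and uses the codon assignments shared by NCBI translation tables 1 and 11. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino acid
# assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGAAGAACA AGCAGGTGAA ATTCCTCAAG"
print(translate(first_blocks))  # MKNKQVKFLK
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the protein sequence and the GC content field of the record.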