Gene Information |
Locus tag | Elen_2232 |
Symbol | |
ID | 8416555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2621109 |
End bp | 2622365 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645025218 |
Product | peptidase U32 |
Protein accession | YP_003182582 |
Protein GI | 257791976 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
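
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Elen_2232.
length = gene_length_bp(2621109, 2622365)
print(length)                     # 1257
print(protein_length_aa(length))  # 418
```

The strand field does not enter this calculation: a gene on the minus strand spans the same coordinates and is simply read from the reverse complement.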
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.138304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.83326 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAACTC AATCGACAAG GACTCCCGAG CTCCTCGCGC CCGCGGGCGG GCTTGCGCAG CTGGAGGCCG CGCTGCGCTT CGGTGCGGAC GCCGTGTACC TGGCCGCCGA TCGTTTCGGG CTGCGGCAGC GCGCGGCGAA CTTCGCGCTG TACGACGTTC CCGCTGCGGC AGCTCGCGCG CACGATGCGG GCGCGAAGGC GTACGCGACG CTGAACGCCC TCATGGACGC CGACGACCTC AAGGCGCTTC CCGCGTACCT CGAAGCGCTG GCCGCGGCCG GCGTCGACGC GTTCATCGTG AGCGACCTGG GCGCGCTGCG CCTGGCACAG CGGCACGCGC CGAACGTCGA GCTGCACGTG AGCACCCAGG CCTCGGTATG CAACGCCGAG GCGGCGCGCG TATGGCACGA GCTGGGCGCG AGCCGCGTGG TGTGCGCGCG AGAGATGAGC GTGGAGGACA TCGCGCGACT GCGCGCCGGC GCCCCGCGCG AGCTGGAGCT GGAGGCGTTC GTGCACGGCG CCATGTGCAT GGCCGTGTCG GGCCGCTGCC TGATCAGCGC CGCGCTCACC GGCCGCTCCG GCAACAAGGG CCATTGCACC CAGCCGTGCC GGTGGAGCTA CGCGCTGGTG GAGGAGCAGC GTCCCGGCGA GTTCTTTCCC GTGGAGGAGG ACGTGCGCGG AACCTATGTC ATGAACGCGC AGGACCTCAA CATGCTGGCG CACCTCGACG ACTTGGCCGC GGCCGGCATC GACTCGTTCA AGATCGAAGG CCGCAACAAG AAGGCGTTCT ACGTGGCTTC GGTGGTGCGC GCTTACCGGC TGGCCCTGGA CGGCGTTCCC TCCTCCGAGC TGGCCGACGA GCTGCTGGCC GTGTCGCATC GCCCGTACGG CACGGGCTTC TACTACGGCG ACGCCAGGCA ATCGCCCGAC GTGGACGGCT ACACCGCCGA ATGCCGGCAT GCCGCCACGG TGGAAGCGTG CGAACCGGCC GGCGAAGGCG CGTTCCGCGT GATCGCGCGG TGCTACAACC GCTTCTGCGA AGGCGACGAG CTGGAGGCGC TGTCGCCGGG TCCGCACGTC CCTCGCGTGC GCGTGCGTAA CCTCGCCTGG CTCCCCGAGC CCGACGGGGA CGACGCGCAG CCAAAGCGGG TGCCGGTTGC CGTGGCGAAC CGCTCGGCCG AGCGCTATGC GTTCGAAACG GGGGAGGAGC TGGCTCCCGG CGACTTTCTG CGCATGCGTA TCAACGTTGA GCGATAG
|
Protein sequence | MRTQSTRTPE LLAPAGGLAQ LEAALRFGAD AVYLAADRFG LRQRAANFAL YDVPAAAARA HDAGAKAYAT LNALMDADDL KALPAYLEAL AAAGVDAFIV SDLGALRLAQ RHAPNVELHV STQASVCNAE AARVWHELGA SRVVCAREMS VEDIARLRAG APRELELEAF VHGAMCMAVS GRCLISAALT GRSGNKGHCT QPCRWSYALV EEQRPGEFFP VEEDVRGTYV MNAQDLNMLA HLDDLAAAGI DSFKIEGRNK KAFYVASVVR AYRLALDGVP SSELADELLA VSHRPYGTGF YYGDARQSPD VDGYTAECRH AATVEACEPA GEGAFRVIAR CYNRFCEGDE LEALSPGPHV PRVRVRNLAW LPEPDGDDAQ PKRVPVAVAN RSAERYAFET GEELAPGDFL RMRINVER
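
The gene and protein sequences above can be cross-checked against the Translation table and GC content fields. The following is a minimal sketch in plain Python, not the tool that produced this record. It assumes the gene sequence is already given in coding orientation (it starts with ATG even though the gene lies on the minus strand) and that the spaces between 10-character blocks are display formatting only. Translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as start codons, so the standard codon table is used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError on any character other than A, C, G, or T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three display blocks of the gene sequence from the record.
print(translate("ATGCGAACTC AATCGACAAG GACTCCCGAG"))  # MRTQSTRTPE
```

Passing the full gene sequence to `translate` and `gc_content` allows comparison with the listed protein sequence and the reported GC content, which appears to be rounded to a whole percent.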