Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2553 |
Symbol | |
ID | 8416877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2984890 |
End bp | 2986047 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645025534 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_003182897 |
Protein GI | 257792291 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCAGC ACAGGAAGCA AGTCACCTAT TCTCAGCGTC CGAACCATGC AGCTCGCTCG GCTCATGCCC GGGGCGAGCG CCAGTTCCGT ACGTACGATA CCAGCTATAT CCGCCCGAAG AAAAGCAAGG CTCCTGCTAT AGTCGCCGCC GTTTTGGCCG TTCTTGTCGT CGGAGGTTTG GCGTGGGGCG CGCTCACCCT GTTCAACAGC TGTTCCGCGC AATCGGTCGA GCTTCTGGCC GAGGGTCAGG AGGCCACGAT CACGGTGGCC GAAGGTGCTG GTGCCAAGGT CGTCGGAGAG CAGCTTGCGG AAGCCCGTCT GGTTTCCAAT GCGGGAGACT TCACGAAGCG CGTCAACGAG ATGGGCGTTG ATTCCCAGCT CAAGCCCGGT ACCTACACAT TCGCGGGCGG TATGTCGCTC GACGCCATCA TCAACCAGCT GACGGCCGGT CCGGTGGCGA ACGCGCTCAC CATCCCCGAA GGAAGCACGC TCGAGGCCGT TGCCCAGAGC GTGGCAACCT TCACCGAGAA TCGCATCACG GCGGACGCGT TCACGGCCGC TGCGTCGGAT GCCAGCTCAT ACGCGGCCGA CTACGACTTC CTGGCCGACG CGGGCACGAA CAGCCTGGAA GGCTTCCTGT TCCCGAAAAC GTACGAGATC GGCGACGATG CCACGGCCGA GTCGGTAGTG CGCATGATGC TCGACCAGTT CAAGACCGAG ACGTCGGGGC TCGATTGGTC CTACCCGCAA AGCCAGGGCC TCACCATCTA CGATGCCGTG AAGCTGGCTT CCATCGTTGA GCGCGAGTCG TCGGGCGACG AGCAGATCCG CGCCCAGGTG GCCTCGGTGT TCTACAACCG CCTGAACAAC TTCGGCGATC CGAACTACGG CTTCCTGCAA AGCGATGCGA CCACGGCTTA CGAGCTGGGT CACGACCCCA CCCCCGAGGA TATCAAGAAT CCAACACCGT TCAACACCTA CACGAACACG GGTCTGCCTC CCACGCCCAT CTGCTCGCCG GGTCTCGATT GCCTGCAAGC CGTGTGCAAC CCTGCGCAGA CGAACTACTT CTTCTTCTAC TTCGCGCCTG ATGAAAGCGG TACGATGCAG TACTACTTCA GCGAAACGTA CGAAGAGCAT CAGCAGACGT TCTCCTAG
|
Protein sequence | MPQHRKQVTY SQRPNHAARS AHARGERQFR TYDTSYIRPK KSKAPAIVAA VLAVLVVGGL AWGALTLFNS CSAQSVELLA EGQEATITVA EGAGAKVVGE QLAEARLVSN AGDFTKRVNE MGVDSQLKPG TYTFAGGMSL DAIINQLTAG PVANALTIPE GSTLEAVAQS VATFTENRIT ADAFTAAASD ASSYAADYDF LADAGTNSLE GFLFPKTYEI GDDATAESVV RMMLDQFKTE TSGLDWSYPQ SQGLTIYDAV KLASIVERES SGDEQIRAQV ASVFYNRLNN FGDPNYGFLQ SDATTAYELG HDPTPEDIKN PTPFNTYTNT GLPPTPICSP GLDCLQAVCN PAQTNYFFFY FAPDESGTMQ YYFSETYEEH QQTFS
|
| |