Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2004 |
Symbol | |
ID | 8416315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2350609 |
End bp | 2351613 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645024981 |
Product | peptidase S58 DmpA |
Protein accession | YP_003182357 |
Protein GI | 257791751 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3191] L-aminopeptidase/D-esterase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00577301 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0000000124631 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGCAAC CCGCAACGCT CGCCGATTTG CCCGCATTCC TCTGCGCGCA CGCCGAGGAC GCCCGCGCGG GCACCGGCTG CACCGTCTTC ATAGCTCCTG ACGGGGCCAC CTGCGGCGTC GACGTGCGCG GCGGCGGTCC TGCCACGCGA GAGACCGATC TGCTCAAGCC CGAGAACATG ATCCAGGCCG TGCACGGCGT GGTCCTGTCG GGCGGCAGCG CCTTCGGGCT GGCCGCCGCG ACCGGCGTCA TGGACGAGCT GGCCGCACGC GGCATCGGCT TTCCCGTGGA GAGCGCCCGC GTGCCCATCG TCGTGGGAGC CTGCCTGTTC GATCTGCTGG TCGGGCAGAA CGCCCATCCC GATGCCGCCA TGGGGCGCGC TGCTGCGAGG GCGGCGTTCG AGCGCGAGGC GGCGGAACCG CTGGCCGAAG GCAACGTGGG CGCCGGATGC GGCGCATCGG TGGGCAAGCT CCTCGGCGGC GAGCGCGCCA TGAAAGCCGG GCTCGGAATC TGCGGATTGC GCCTGGGCGA GCTCACGGCG TGCGCCGTCG TGGCGGTGAA CGCGCTCGGC AACGTGCGCA GCGCGGACGG TGCCTGGATC GCCGGCTGCC GCGACGGAGA AGGGCGCGTC ATGGATCCCC TCGAGGCGTT CGGCGTCCTC GCGCAGCAGG CGGCCGCGCA TGCGGAGCAG GAAGCCGACC CCGCCGCAGG TCCGTGCGCC AACACCACCA TCGGCGTCGT GCTGACGAAC GCGCGTCTGA CGAAGGCGCA GGCGACGAAG GCTTCTTCGA CCGTCCACGA CGCCTACGCG CGCGCCATCA AGCCCGTGCA CACTTCCGGC GACGGCGACA CCGTGTTCAC GTTCGCATCC GGCGAGGTGG AAGCCGACTA CGATACGTTC GCCATCCTTG CCACCGAGGC CATGCAGGGA GCCGTCGTGC GTGCGGTCGA GCAGGCCGAG GGCGCCTACG GGTTGCCCGC CGCCCGCGAC CTCGTCTCCT GTTAG
|
Protein sequence | MLQPATLADL PAFLCAHAED ARAGTGCTVF IAPDGATCGV DVRGGGPATR ETDLLKPENM IQAVHGVVLS GGSAFGLAAA TGVMDELAAR GIGFPVESAR VPIVVGACLF DLLVGQNAHP DAAMGRAAAR AAFEREAAEP LAEGNVGAGC GASVGKLLGG ERAMKAGLGI CGLRLGELTA CAVVAVNALG NVRSADGAWI AGCRDGEGRV MDPLEAFGVL AQQAAAHAEQ EADPAAGPCA NTTIGVVLTN ARLTKAQATK ASSTVHDAYA RAIKPVHTSG DGDTVFTFAS GEVEADYDTF AILATEAMQG AVVRAVEQAE GAYGLPAARD LVSC
|
| |