Gene Elen_2004 details


Gene Information

Locus tag: Elen_2004
Symbol:
ID: 8416315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2350609
End bp: 2351613
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 72%
IMG OID: 645024981
Product: peptidase S58 DmpA
Protein accession: YP_003182357
Protein GI: 257791751
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3191] L-aminopeptidase/D-esterase
TIGRFAM ID:
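
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function name `cds_lengths` is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above.
print(cds_lengths(2350609, 2351613))  # (1005, 334)
```

The result agrees with the Gene Length (1005 bp) and Protein Length (334 aa) fields.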


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00577301
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000124631
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTGCAAC CCGCAACGCT CGCCGATTTG CCCGCATTCC TCTGCGCGCA CGCCGAGGAC 
GCCCGCGCGG GCACCGGCTG CACCGTCTTC ATAGCTCCTG ACGGGGCCAC CTGCGGCGTC
GACGTGCGCG GCGGCGGTCC TGCCACGCGA GAGACCGATC TGCTCAAGCC CGAGAACATG
ATCCAGGCCG TGCACGGCGT GGTCCTGTCG GGCGGCAGCG CCTTCGGGCT GGCCGCCGCG
ACCGGCGTCA TGGACGAGCT GGCCGCACGC GGCATCGGCT TTCCCGTGGA GAGCGCCCGC
GTGCCCATCG TCGTGGGAGC CTGCCTGTTC GATCTGCTGG TCGGGCAGAA CGCCCATCCC
GATGCCGCCA TGGGGCGCGC TGCTGCGAGG GCGGCGTTCG AGCGCGAGGC GGCGGAACCG
CTGGCCGAAG GCAACGTGGG CGCCGGATGC GGCGCATCGG TGGGCAAGCT CCTCGGCGGC
GAGCGCGCCA TGAAAGCCGG GCTCGGAATC TGCGGATTGC GCCTGGGCGA GCTCACGGCG
TGCGCCGTCG TGGCGGTGAA CGCGCTCGGC AACGTGCGCA GCGCGGACGG TGCCTGGATC
GCCGGCTGCC GCGACGGAGA AGGGCGCGTC ATGGATCCCC TCGAGGCGTT CGGCGTCCTC
GCGCAGCAGG CGGCCGCGCA TGCGGAGCAG GAAGCCGACC CCGCCGCAGG TCCGTGCGCC
AACACCACCA TCGGCGTCGT GCTGACGAAC GCGCGTCTGA CGAAGGCGCA GGCGACGAAG
GCTTCTTCGA CCGTCCACGA CGCCTACGCG CGCGCCATCA AGCCCGTGCA CACTTCCGGC
GACGGCGACA CCGTGTTCAC GTTCGCATCC GGCGAGGTGG AAGCCGACTA CGATACGTTC
GCCATCCTTG CCACCGAGGC CATGCAGGGA GCCGTCGTGC GTGCGGTCGA GCAGGCCGAG
GGCGCCTACG GGTTGCCCGC CGCCCGCGAC CTCGTCTCCT GTTAG
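
A GC content figure like the one in the Gene Information section can be recomputed from a sequence printed in this 10-base block layout. This is a minimal sketch. The helper name `gc_fraction` is hypothetical, and the database may round or compute its reported value differently.

```python
def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between 10-base blocks) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First row of the gene sequence above, pasted as-is with its spaces.
first_row = "ATGCTGCAAC CCGCAACGCT CGCCGATTTG CCCGCATTCC TCTGCGCGCA CGCCGAGGAC"
print(f"{gc_fraction(first_row):.2%}")
```

Passing the full gene sequence instead of one row gives the value for the whole gene.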
 
Protein sequence
MLQPATLADL PAFLCAHAED ARAGTGCTVF IAPDGATCGV DVRGGGPATR ETDLLKPENM 
IQAVHGVVLS GGSAFGLAAA TGVMDELAAR GIGFPVESAR VPIVVGACLF DLLVGQNAHP
DAAMGRAAAR AAFEREAAEP LAEGNVGAGC GASVGKLLGG ERAMKAGLGI CGLRLGELTA
CAVVAVNALG NVRSADGAWI AGCRDGEGRV MDPLEAFGVL AQQAAAHAEQ EADPAAGPCA
NTTIGVVLTN ARLTKAQATK ASSTVHDAYA RAIKPVHTSG DGDTVFTFAS GEVEADYDTF
AILATEAMQG AVVRAVEQAE GAYGLPAARD LVSC
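
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The function name `translate` is hypothetical, and the sketch assumes the sequence is already in the coding orientation.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq):
    """Translate a coding DNA sequence, stopping at the first stop codon.

    Whitespace between 10-base blocks is ignored; trailing bases that do
    not form a complete codon are dropped.
    """
    cleaned = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(cleaned) - len(cleaned) % 3, 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGCTGCAAC CCGCAACGCT CGCCGATTTG"))  # MLQPATLADL
print(translate("CTCGTCTCCT GTTAG"))                  # LVSC
```

Both outputs match the corresponding ends of the protein sequence listed above.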