Gene Elen_0639 details


Gene Information

Locus tag: Elen_0639
Symbol:
ID: 8414929
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 815580
End bp: 816866
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 63%
IMG OID: 645023616
Product: GCN5-related N-acetyltransferase
Protein accession: YP_003181013
Protein GI: 257790407
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins
TIGRFAM ID:
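
The coordinates and lengths in this record agree with each other if Start bp and End bp are read as 1-based inclusive positions and the gene length is taken to include the stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(815580, 816866)
    print(length)                  # 1287
    print(protein_length(length))  # 428
```

Under those assumptions the result matches the listed Gene Length (1287 bp) and Protein Length (428 aa).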


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.621487
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.635347
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTCAT GTGCTCAGAT GCCTTTCACT CACAGCGCTT ACAGGAACAT GCTTCAGATC 
CTGAAAGAAC GGGGGTACGG GTTCTGCGGG TACGGCGATT GGGGAGGTGT TGGGAAGCCT
GTCATTTTGC GGCACGACAT CGACTTCGAT CCGATCGCAG CGCTCGCGAT GGCCGAGCTC
GAATCGAGCG AGGGCGTCCG ATCGACGTAT TTCGTGCTCT TGCGAACGGA TTTCTACAAC
CCTCTAGAGC GCGGAAACGT CGAGAGGCTT CGAGAGATCG CGAGGCTCGG CCACGACATC
GGGCTCCATT ACGACGAAAC GCAGTACGAG GACGGCGACG ACGCGATCGC CGCGATCCAA
CGCGAGGCGG ACACGCTGGG GGGCGCCCTC GGCCTGCCCA TCGAATGCGT TTCCATGCAC
CGTCCGAGCA AGGCGTCGCT CGAAGCGCAG TGGAGCATCC CCGGCATCGT CAACAGCTAT
TCGAGCGAGT TCTTCCAGGG CTTCGAATAC GCTTCGGACA GCCGGAGGCG GTGGCGCAAG
CCCATTTTGG ACATGATCGA GTCCGGGAAG TATCCGCGCC TGCATATCTT GACCCATCCG
TTCTGGTACG GCGGGACGGA GGCCTCGCTC GAGGAATCTC TACGGCGGTT CATAGAAAGG
GCGGGCGCCG ATCGCCTGGG CAGCCTCGAT CGCAACTTTA CCGGGCTCGA CTCCGTGCTC
GGCCCTGCGG ACGTCCTTTC CGCCCGCCTC GCTTCCCTGC GCAATGAGCG GTTTGGGACT
GAAAGGCTCG TCTTGCGTCC CTTGCGGCTG GAGGATGCTG CCGACATGTT CGAATACACG
TCGGACCCCG AGATAAGCAG ATTCCTGAAT TGGGCACCCC ATGGCGAACC CGGGGAGGCG
CGGGATTGGA TAGCCTCCAA GCTCGCCCGA CCGGAGCCGG ACGACCTGCT GCTCGGCATA
GAGCTCCGCG AGCCTCGCAA GCTCATCGGC ACCGTGCGCG CCTACCGCTT CGATGCCGCC
GCCTGCTCCT GCGAGGTGTC TTACGCGCTC AACTCCGCCT TCCAGGGCTG CGGCTACATG
GGAGAAGCTC TGGGAAAGCT CGCCGACATC TGCTTCGACG AGGTGCGCGT GGGCAGGATT
GTCGCCCGCA TCGACGAGGA GAACGCCGCC TCGGCGCACG TTGCCCGCCG CCTGGGCATG
AAGCGCGTCC GTGACGGGGA CTTCGTGGTT CCGATCAAGG GCGAGGAGCG GATCCAGCAC
ACCTACGTTC TCGGAAGGAG GCCGTGA
 
Protein sequence
MPSCAQMPFT HSAYRNMLQI LKERGYGFCG YGDWGGVGKP VILRHDIDFD PIAALAMAEL 
ESSEGVRSTY FVLLRTDFYN PLERGNVERL REIARLGHDI GLHYDETQYE DGDDAIAAIQ
READTLGGAL GLPIECVSMH RPSKASLEAQ WSIPGIVNSY SSEFFQGFEY ASDSRRRWRK
PILDMIESGK YPRLHILTHP FWYGGTEASL EESLRRFIER AGADRLGSLD RNFTGLDSVL
GPADVLSARL ASLRNERFGT ERLVLRPLRL EDAADMFEYT SDPEISRFLN WAPHGEPGEA
RDWIASKLAR PEPDDLLLGI ELREPRKLIG TVRAYRFDAA ACSCEVSYAL NSAFQGCGYM
GEALGKLADI CFDEVRVGRI VARIDEENAA SAHVARRLGM KRVRDGDFVV PIKGEERIQH
TYVLGRRP
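
The GC content and protein sequence listed above can in principle be recomputed from the gene sequence. The sketch below is one way to do it in plain Python. It is not the method used to build this record, and the function names are hypothetical. It relies on translation table 11 sharing the standard codon-to-amino-acid assignments (the table differs in which codons may act as start codons), and it ignores the blank-separated 10-base grouping used in the listing.

```python
from itertools import product

# Standard codon assignments, in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGCCTTCAT GTGCTCAGAT GCCTTTCACT"))  # MPSCAQMPFT
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the listed protein sequence and GC content.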