Gene Elen_3064 details

Gene Information

Locus tag: Elen_3064
Symbol:
ID: 8417399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3561657
End bp: 3563237
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 68%
IMG OID: 645026044
Product: Hydantoinase/oxoprolinase
Protein accession: YP_003183396
Protein GI: 257792790
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
TIGRFAM ID:
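
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3561657
END_BP = 3563237
GENE_LENGTH_BP = 1581
PROTEIN_LENGTH_AA = 526


def span_length(start: int, end: int) -> int:
    """Length of the interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print("span length:", span_length(START_BP, END_BP))
    print("expected protein length:", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions the span length matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.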


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGATC AGAAGCGCAT CTGGCGTCTG GGTATCGACG TGGGCGGCAC GAACACCGAC 
GCGGTGGTCA TCGACGGCGA CCTCAAGCTG GTGGCCGCGA CGAAAAGCCC CACCACCGAG
GACGTCATGA GCGGCATCGT GGCCGCCATG CACGAGGTGA TCACCCAGAT CGGTGCCGAC
GAGGCGCGCA ACATCGGGTT CGCCATGCTG GGCACCACGC ATTGCACGAA CGCCATCGTC
GAGCGCAAGC GCCTGAACAA GGTGGCCGCG CTGCGCGTGG GCGCTCCGGC CACGACGGCC
ATCAGCTGCA TGGCTGACTG GCCCGACGAG CTGAAGAACG CCATGCGCGT GCGCGACTTC
CTCGTGCACG GCGGCAACGA GTTCGACGGT CGCGAGATCA GCGCGCTGTC GGAAGACGAG
ATCCGCGAGG TCGCGCGCGT CGTGCGCGAA GAGGGCTTCG AGTCCGTGGC CGTGACCAGC
GTGTTCTCGC CGGTGTCCGA CGCGCACGAG AAGCGCGCCG CCGCCGTTCT GCGCGAGGAG
CTGGGCGAGG GCTTCCCCAT CACGCTGTCG TCGGAGATCG GGTCGCTCGG CTTCCTCGAG
CGCGAGAACG CGTCCATCCT GAACGCGGCG CTGTACGACG TGGCGCGCAC GACGGCCGAC
AGCTTCGAGG CGGCGCTCGC GTCTGAGGGC CTCGCCGATG TGGCTGTGTA CCTGGGCCAG
AACGACGGCA CGCTCATGAG CGTGGACTAC GCGAAGCGTT ACCCCATCTT CACCATCGCG
TGCGGGCCTA CGAACTCCAT CCGCGGCGCG TCGTTCTTGA CGCAGGAGAA GGACGCCGTG
GTCGTCGACA TCGGCGGCAC CACCACCGAC GTGGGCGTGC TGGCGCACGG CTTCCCGCGC
GAGAGCATGG TGGCCGTGGA AATCGGCGAC GTGCGCACGA ACTTCCGCAT GCCCGACCTG
GTGTCGGTGG GCCTCGGCGG CGGCTCGCTC GTGCGCCAGC TGGAGGACGG CAGCGTGACG
GTGGGCCCCG ACAGCGTGGG CTACCTGGTC ACGAAGAAGG CGCGGTGCTT CGGCGGCGAC
ACGCTGACTG CGACCGATAT CGTGGTGGCG AAGGGCCTGG CCGAGGGCGT GGGCGATCCG
ACGCTGGTGG CCGACCTCGA GCCGGCGCTC GTGGACGCGG CCTATGCCGA GATCACGCGC
ATCATCGAGG ACGCGGTGGA CGCGATGAAG ACTTCGGCCG GCGACGTGAC GGTGATTCTC
GTGGGCGGCG GCTCCATCCT GGCGCCCGAC CAGCTGGAAG GCTCGGACAA CGTGCTGCGC
CCTGAGAACT TCGGCGTGGC GAACGCGGTG GGTTCGGCCA TCGCGCAGGT GTCCGGCCAG
ATCGCCAAGG TGTTCTCGCT GACCGAGACG CCGCGCGAGC AGGCGCTTGC CGAGTCGAAG
CAGCGTGCGT GCGACGAGGC CATCGAAGCC GGCGCCGATC CGAGCACCGT GGAAGTGGTC
GACGTCGAGG ACATCCCGAT GGCCTATCTG GGCGATGCGC TCTGCATTCG CGTCAAAGCC
GTCGGCGATC TGATGCTTTA A
 
Protein sequence
MADQKRIWRL GIDVGGTNTD AVVIDGDLKL VAATKSPTTE DVMSGIVAAM HEVITQIGAD 
EARNIGFAML GTTHCTNAIV ERKRLNKVAA LRVGAPATTA ISCMADWPDE LKNAMRVRDF
LVHGGNEFDG REISALSEDE IREVARVVRE EGFESVAVTS VFSPVSDAHE KRAAAVLREE
LGEGFPITLS SEIGSLGFLE RENASILNAA LYDVARTTAD SFEAALASEG LADVAVYLGQ
NDGTLMSVDY AKRYPIFTIA CGPTNSIRGA SFLTQEKDAV VVDIGGTTTD VGVLAHGFPR
ESMVAVEIGD VRTNFRMPDL VSVGLGGGSL VRQLEDGSVT VGPDSVGYLV TKKARCFGGD
TLTATDIVVA KGLAEGVGDP TLVADLEPAL VDAAYAEITR IIEDAVDAMK TSAGDVTVIL
VGGGSILAPD QLEGSDNVLR PENFGVANAV GSAIAQVSGQ IAKVFSLTET PREQALAESK
QRACDEAIEA GADPSTVEVV DVEDIPMAYL GDALCIRVKA VGDLML