Gene Elen_0652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0652 
Symbol 
ID8414942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp829495 
End bp831045 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content70% 
IMG OID645023627 
ProductHydantoinase/oxoprolinase 
Protein accessionYP_003181024 
Protein GI257790418 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0481682 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACAAAT TGGGCATCGA CGTGGGCGGC ACCAACACGG ACGCCGTTTT GATCGACGAG 
GACCTGAACG TGGTGGCGGC GGTGAAGAAC CCCACGTCGG ACGATATCTA CACGGGCATC
ATGGGCGCGG TGGACGCCGT GCTGGCCGAC GGCGGCGTGG ACCGCGCGCA GATCGCGCAG
GCCATGCTGG GCACCACCCA GTGCACGAAC GCCATCGTGG AGCGCAAGGG CCTTGCGCCC
ATCGCCATCC TGCGCATCGG CGCGCCGGCC ACGGTTGGCA TCCCGCCGAT GGTGGACTGG
GCCGACGACA TCGCTGCGGT GTGCGTGGAC GCGCTCGTCA TCGAGGGCGG CTTCGAGTAC
GACGGCAAGC GCCTGGCGGA GTTCGACGAG GCCGCGTGCC GCGCGTTCTT CGAGGGCGTG
AAGAGCCGCG TGGAGGCTGT GGCCGTGTCC AGCGTGTTCT CCACGGTGCG CAACGACGAC
GAGCTGCGCG CCGCGACCAT CGCGCGCGAG GTGCTGGGCG AGGACGTGCA TGTGTCCATT
TCCAGCGAGA TCGGCTCGAT GGGCCTGATC GAGCGCGAGA ACGCCACCAT CTTGAACGCG
GCGCTGTACG ACGTGGCGCG CAAGTTCACC GAGGGCTTCG CGGCCAGTCT GGCCGACAAG
GGCGTGACGA ACGCCGAAGT GTACCTGTCG CAGAACGACG GCACGCTCAT GACCATGGAA
CACGCGCGCC GCTATCCCAT CCTCACCATC GCGTGCGGGC CGACGAACTC CATCCGCGGC
GCCAGCTACC TATCGCGCCG CGACGACGCC ATCGTCATCG ACGTGGGCGG CACCACCACC
GACCTGGGCG TTCTGTCGCA CGGCTTCCCG CGCGAGAGCG GCGTGGCGGT GACCATCGGC
GGCGTGCGCA CGAACTTCCG CATGCCCGAC GTGGTGTCCA TCGGCCTGGG CGGCGGTTCC
ATCGTGCGCG TGGCCGATGA CGGCAGCGTC ACCGTGGGGC CCGATTCGGT GGGCTACGCC
ATCACCGAGC GCGCGCTGGT GTTCGGCGGC GATACGATGA CGGCCACCGA CATCGCCGTG
CGCCTGGGGA TGGCCTCGGT GGGGGACGCC TCGCTCGTGG CCGATATCCC GCAGGATGTG
GCCGAACGCG CGATGGCGGC CATCCGCGCG CTGGTGGAGG ACGCCATCGA CGTGATGAAG
GTGTCCAGCG ACGACATCGA CGTGGTGCTG GTGGGCGGCG GTGCCATCGT GCTGCCGCAC
GAGCTGGCCG GCACGGCCGA GGTGGACGCG CCCGAGCACG CGGGTTGCGC GAACGCCATC
GGCTCGGCCA TCTCGAAGGT GAGCGGCGTG TACGAGGCGC TGGTGGACTA CGACGTCACG
CCGCGCGACG AGGCGCTGGC GGCGGCGCGT GCGGCGGCGA TCGAGGCGGC CGTTGAGGCC
GGCGCCGTGC ACGACACCGT GGAGATCATC GACGCCGAAG ACGTGCCGCT GGCGTACTAC
CCGGGCCATA CGAACCGCGT GAAGGTGAAA GCCGCGGGCG ACCTGGCGTA G
 
Protein sequence
MYKLGIDVGG TNTDAVLIDE DLNVVAAVKN PTSDDIYTGI MGAVDAVLAD GGVDRAQIAQ 
AMLGTTQCTN AIVERKGLAP IAILRIGAPA TVGIPPMVDW ADDIAAVCVD ALVIEGGFEY
DGKRLAEFDE AACRAFFEGV KSRVEAVAVS SVFSTVRNDD ELRAATIARE VLGEDVHVSI
SSEIGSMGLI ERENATILNA ALYDVARKFT EGFAASLADK GVTNAEVYLS QNDGTLMTME
HARRYPILTI ACGPTNSIRG ASYLSRRDDA IVIDVGGTTT DLGVLSHGFP RESGVAVTIG
GVRTNFRMPD VVSIGLGGGS IVRVADDGSV TVGPDSVGYA ITERALVFGG DTMTATDIAV
RLGMASVGDA SLVADIPQDV AERAMAAIRA LVEDAIDVMK VSSDDIDVVL VGGGAIVLPH
ELAGTAEVDA PEHAGCANAI GSAISKVSGV YEALVDYDVT PRDEALAAAR AAAIEAAVEA
GAVHDTVEII DAEDVPLAYY PGHTNRVKVK AAGDLA