Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0652 |
Symbol | |
ID | 8414942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 829495 |
End bp | 831045 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645023627 |
Product | Hydantoinase/oxoprolinase |
Protein accession | YP_003181024 |
Protein GI | 257790418 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0481682 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACAAAT TGGGCATCGA CGTGGGCGGC ACCAACACGG ACGCCGTTTT GATCGACGAG GACCTGAACG TGGTGGCGGC GGTGAAGAAC CCCACGTCGG ACGATATCTA CACGGGCATC ATGGGCGCGG TGGACGCCGT GCTGGCCGAC GGCGGCGTGG ACCGCGCGCA GATCGCGCAG GCCATGCTGG GCACCACCCA GTGCACGAAC GCCATCGTGG AGCGCAAGGG CCTTGCGCCC ATCGCCATCC TGCGCATCGG CGCGCCGGCC ACGGTTGGCA TCCCGCCGAT GGTGGACTGG GCCGACGACA TCGCTGCGGT GTGCGTGGAC GCGCTCGTCA TCGAGGGCGG CTTCGAGTAC GACGGCAAGC GCCTGGCGGA GTTCGACGAG GCCGCGTGCC GCGCGTTCTT CGAGGGCGTG AAGAGCCGCG TGGAGGCTGT GGCCGTGTCC AGCGTGTTCT CCACGGTGCG CAACGACGAC GAGCTGCGCG CCGCGACCAT CGCGCGCGAG GTGCTGGGCG AGGACGTGCA TGTGTCCATT TCCAGCGAGA TCGGCTCGAT GGGCCTGATC GAGCGCGAGA ACGCCACCAT CTTGAACGCG GCGCTGTACG ACGTGGCGCG CAAGTTCACC GAGGGCTTCG CGGCCAGTCT GGCCGACAAG GGCGTGACGA ACGCCGAAGT GTACCTGTCG CAGAACGACG GCACGCTCAT GACCATGGAA CACGCGCGCC GCTATCCCAT CCTCACCATC GCGTGCGGGC CGACGAACTC CATCCGCGGC GCCAGCTACC TATCGCGCCG CGACGACGCC ATCGTCATCG ACGTGGGCGG CACCACCACC GACCTGGGCG TTCTGTCGCA CGGCTTCCCG CGCGAGAGCG GCGTGGCGGT GACCATCGGC GGCGTGCGCA CGAACTTCCG CATGCCCGAC GTGGTGTCCA TCGGCCTGGG CGGCGGTTCC ATCGTGCGCG TGGCCGATGA CGGCAGCGTC ACCGTGGGGC CCGATTCGGT GGGCTACGCC ATCACCGAGC GCGCGCTGGT GTTCGGCGGC GATACGATGA CGGCCACCGA CATCGCCGTG CGCCTGGGGA TGGCCTCGGT GGGGGACGCC TCGCTCGTGG CCGATATCCC GCAGGATGTG GCCGAACGCG CGATGGCGGC CATCCGCGCG CTGGTGGAGG ACGCCATCGA CGTGATGAAG GTGTCCAGCG ACGACATCGA CGTGGTGCTG GTGGGCGGCG GTGCCATCGT GCTGCCGCAC GAGCTGGCCG GCACGGCCGA GGTGGACGCG CCCGAGCACG CGGGTTGCGC GAACGCCATC GGCTCGGCCA TCTCGAAGGT GAGCGGCGTG TACGAGGCGC TGGTGGACTA CGACGTCACG CCGCGCGACG AGGCGCTGGC GGCGGCGCGT GCGGCGGCGA TCGAGGCGGC CGTTGAGGCC GGCGCCGTGC ACGACACCGT GGAGATCATC GACGCCGAAG ACGTGCCGCT GGCGTACTAC CCGGGCCATA CGAACCGCGT GAAGGTGAAA GCCGCGGGCG ACCTGGCGTA G
|
Protein sequence | MYKLGIDVGG TNTDAVLIDE DLNVVAAVKN PTSDDIYTGI MGAVDAVLAD GGVDRAQIAQ AMLGTTQCTN AIVERKGLAP IAILRIGAPA TVGIPPMVDW ADDIAAVCVD ALVIEGGFEY DGKRLAEFDE AACRAFFEGV KSRVEAVAVS SVFSTVRNDD ELRAATIARE VLGEDVHVSI SSEIGSMGLI ERENATILNA ALYDVARKFT EGFAASLADK GVTNAEVYLS QNDGTLMTME HARRYPILTI ACGPTNSIRG ASYLSRRDDA IVIDVGGTTT DLGVLSHGFP RESGVAVTIG GVRTNFRMPD VVSIGLGGGS IVRVADDGSV TVGPDSVGYA ITERALVFGG DTMTATDIAV RLGMASVGDA SLVADIPQDV AERAMAAIRA LVEDAIDVMK VSSDDIDVVL VGGGAIVLPH ELAGTAEVDA PEHAGCANAI GSAISKVSGV YEALVDYDVT PRDEALAAAR AAAIEAAVEA GAVHDTVEII DAEDVPLAYY PGHTNRVKVK AAGDLA
|
| |