Gene Information |
Locus tag | Elen_1060 |
Symbol | |
ID | 8415350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1282936 |
End bp | 1284252 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645024023 |
Product | amidohydrolase |
Protein accession | YP_003181420 |
Protein GI | 257790814 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
| |
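The coordinate and length fields above are related by simple arithmetic, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1282936
END_BP = 1284252
GENE_LENGTH_BP = 1317
PROTEIN_LENGTH_AA = 438


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming a complete CDS with one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # coordinates vs. gene length
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)     # gene length vs. protein length
```

Under those assumptions, both comparisons agree with the values in the table.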
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.000132044 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCTCCT ACGTTTTCAC CCATGCGACC GTGCTCGACG GCACCGAGGG CATGGAGCCG CAGCCCAACA TGACCGTCGT GGTGAACGAG GGCGTCATCG AGAAGGTGGG CCCCGCCGCC TCTACCGTGG GGCCGTTGGG CGCGCGCGAG ATCGATCTGG CGGGAGCGTA TCTGGCGCCG GGCCTGGTGA ACCTGCACGT GCACCTGTGC GGCTCGGGCA AGCCCACGAG CGCCGGCGCC GCAGGCGACC TCATCGACAA GGTGGTGGGT AACCCGCTGG GCAGGTGGTA CCTGCGCCGC ACGATCAGGG CGCACGCGCA GCAGCAGCTG GCCAGCGGCG TGACCACGGT GCGCTCGGTG GGCGATCCTG GGTTCGCCGA CGTGGACGTG CGCGATGCCA TCAACGCGGG GAAGCATCCG GGTCCGCGGC TGGTCACGTC CGGTGTGGGG GTCACGGTGC CCGGCGGCCA CGGGGCGGGT TTGTTCGCGC ATGTCGCGTC CACGCCGGAA GAGGCGCGCG CCATCGTGCG CGACTGCTTC TCGCACAAGT GCGACCTGGT GAAGCTGTTC GTCACGGGAG GCGTGTTCGA CGCCGAGGTG GAGGGCGAGC CGGGCGTGCT GCGCATGTCG CCCGAGGTCG CGCAGGCGGC TTGCGACGAG GCACGCAAGC TGGGCCTGCG CACCGCCGCG CACATCGAAA GCGCCGAGGG CGTGCGCGTG GGCCTCGAGG CCGGCGTGGA CACCATCGAG CACGGCGCCC CGCTGGACGA CGAGCTGATC GCGCTGTTCA AGCGCAACGG AGCCGGGCGC GCCTCGTCGC TGACCTGCAC CGTCTCGCCC GCGCTTCCGT TCGTGGAGCT CGATCCCGCC AAAACGCATT CCACCGAGGT GCAGAAGGTG AACGGCCGCA TCGTGTTCGA GGGCATCGTG CAGGCGGCGA AGCAGGCGCT GGCGGCGGGG ATCCCCGTAG GTTTGGGAAC CGATTCGTCG TGCCCCTACA TCACCCAGTA CGACATGTGG CGCGAGGTGG TGTACTTCGA GCGCATCGTG GGCGCGTCGC GTCAGATGGC GCTGCATACG GCCACGCTGG GCAACGCGCG CATCCTGGGG CTGGGCGACG AGACGGGCTC CGTCGAGGCG GGCAAGGCGG CCGACCTCAT CGTGCTCGAC CGCAACCCCC TGGAGAACCT GGAGGCGCTT CGCGACGTGC GCATGGTCAT GGCTCGCGGC GTGCTGGACG AGCATCCTCG CGTGAAGCGC CTTGCCGAAC TGGACGCCGA GCTCGACGGC TTCCTGCCGG GTAACCAAAA GCATTGA
|
Protein sequence | MTSYVFTHAT VLDGTEGMEP QPNMTVVVNE GVIEKVGPAA STVGPLGARE IDLAGAYLAP GLVNLHVHLC GSGKPTSAGA AGDLIDKVVG NPLGRWYLRR TIRAHAQQQL ASGVTTVRSV GDPGFADVDV RDAINAGKHP GPRLVTSGVG VTVPGGHGAG LFAHVASTPE EARAIVRDCF SHKCDLVKLF VTGGVFDAEV EGEPGVLRMS PEVAQAACDE ARKLGLRTAA HIESAEGVRV GLEAGVDTIE HGAPLDDELI ALFKRNGAGR ASSLTCTVSP ALPFVELDPA KTHSTEVQKV NGRIVFEGIV QAAKQALAAG IPVGLGTDSS CPYITQYDMW REVVYFERIV GASRQMALHT ATLGNARILG LGDETGSVEA GKAADLIVLD RNPLENLEAL RDVRMVMARG VLDEHPRVKR LAELDAELDG FLPGNQKH
|
| |
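The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, along with a GC-fraction calculation and a stop-codon check for translation table 11. The helper names are hypothetical, and only the first two and last two blocks of the gene sequence are used so the example stays short.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing whitespace; upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS_TABLE_11


if __name__ == "__main__":
    # Excerpts only: the first two and last two blocks of the gene sequence.
    head = clean_sequence("ATGACCTCCT ACGTTTTCAC")
    tail = clean_sequence("GTAACCAAAA GCATTGA")
    print(gc_fraction(head))
    print(head.startswith("ATG"), ends_with_stop(tail))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare with the GC content field in the table.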