Gene Elen_1060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1060 
Symbol 
ID8415350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1282936 
End bp1284252 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content70% 
IMG OID645024023 
Productamidohydrolase 
Protein accessionYP_003181420 
Protein GI257790814 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.000132044 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCTCCT ACGTTTTCAC CCATGCGACC GTGCTCGACG GCACCGAGGG CATGGAGCCG 
CAGCCCAACA TGACCGTCGT GGTGAACGAG GGCGTCATCG AGAAGGTGGG CCCCGCCGCC
TCTACCGTGG GGCCGTTGGG CGCGCGCGAG ATCGATCTGG CGGGAGCGTA TCTGGCGCCG
GGCCTGGTGA ACCTGCACGT GCACCTGTGC GGCTCGGGCA AGCCCACGAG CGCCGGCGCC
GCAGGCGACC TCATCGACAA GGTGGTGGGT AACCCGCTGG GCAGGTGGTA CCTGCGCCGC
ACGATCAGGG CGCACGCGCA GCAGCAGCTG GCCAGCGGCG TGACCACGGT GCGCTCGGTG
GGCGATCCTG GGTTCGCCGA CGTGGACGTG CGCGATGCCA TCAACGCGGG GAAGCATCCG
GGTCCGCGGC TGGTCACGTC CGGTGTGGGG GTCACGGTGC CCGGCGGCCA CGGGGCGGGT
TTGTTCGCGC ATGTCGCGTC CACGCCGGAA GAGGCGCGCG CCATCGTGCG CGACTGCTTC
TCGCACAAGT GCGACCTGGT GAAGCTGTTC GTCACGGGAG GCGTGTTCGA CGCCGAGGTG
GAGGGCGAGC CGGGCGTGCT GCGCATGTCG CCCGAGGTCG CGCAGGCGGC TTGCGACGAG
GCACGCAAGC TGGGCCTGCG CACCGCCGCG CACATCGAAA GCGCCGAGGG CGTGCGCGTG
GGCCTCGAGG CCGGCGTGGA CACCATCGAG CACGGCGCCC CGCTGGACGA CGAGCTGATC
GCGCTGTTCA AGCGCAACGG AGCCGGGCGC GCCTCGTCGC TGACCTGCAC CGTCTCGCCC
GCGCTTCCGT TCGTGGAGCT CGATCCCGCC AAAACGCATT CCACCGAGGT GCAGAAGGTG
AACGGCCGCA TCGTGTTCGA GGGCATCGTG CAGGCGGCGA AGCAGGCGCT GGCGGCGGGG
ATCCCCGTAG GTTTGGGAAC CGATTCGTCG TGCCCCTACA TCACCCAGTA CGACATGTGG
CGCGAGGTGG TGTACTTCGA GCGCATCGTG GGCGCGTCGC GTCAGATGGC GCTGCATACG
GCCACGCTGG GCAACGCGCG CATCCTGGGG CTGGGCGACG AGACGGGCTC CGTCGAGGCG
GGCAAGGCGG CCGACCTCAT CGTGCTCGAC CGCAACCCCC TGGAGAACCT GGAGGCGCTT
CGCGACGTGC GCATGGTCAT GGCTCGCGGC GTGCTGGACG AGCATCCTCG CGTGAAGCGC
CTTGCCGAAC TGGACGCCGA GCTCGACGGC TTCCTGCCGG GTAACCAAAA GCATTGA
 
Protein sequence
MTSYVFTHAT VLDGTEGMEP QPNMTVVVNE GVIEKVGPAA STVGPLGARE IDLAGAYLAP 
GLVNLHVHLC GSGKPTSAGA AGDLIDKVVG NPLGRWYLRR TIRAHAQQQL ASGVTTVRSV
GDPGFADVDV RDAINAGKHP GPRLVTSGVG VTVPGGHGAG LFAHVASTPE EARAIVRDCF
SHKCDLVKLF VTGGVFDAEV EGEPGVLRMS PEVAQAACDE ARKLGLRTAA HIESAEGVRV
GLEAGVDTIE HGAPLDDELI ALFKRNGAGR ASSLTCTVSP ALPFVELDPA KTHSTEVQKV
NGRIVFEGIV QAAKQALAAG IPVGLGTDSS CPYITQYDMW REVVYFERIV GASRQMALHT
ATLGNARILG LGDETGSVEA GKAADLIVLD RNPLENLEAL RDVRMVMARG VLDEHPRVKR
LAELDAELDG FLPGNQKH