Gene Elen_2060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2060 
Symbol 
ID8416376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2425535 
End bp2427178 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content68% 
IMG OID645025041 
ProductAmidohydrolase 3 
Protein accessionYP_003182412 
Protein GI257791806 
COG category[R] General function prediction only 
COG ID[COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000181577 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.134966 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG ATCTGATCAT CGAGAGCCGA AACGTGTTCA CAGGAGTCGG CGACGTCGCC 
CGACCTGCGG CCATCGCCGT CGCGGGCGAT CGTATCGTCG CCGTCGGATC GCGCGAGGAC
GTGCGCGCCT TCGCGCTCGA GGCGAACGCA GGCGGGGGCG CGCCCGAGGT GCGCGACTTC
GGGGACGCGC TCGTGGTTCC GGGATTTCAC GACTCGCACC TGCACTTCTT CCATTCGGCC
GTGTATTCCT CGCCCTTGGC GACCATGTTT CTGGGCGAAA GCGAGACCGA CTGCGTGGCG
CGCATGCAAG CGTTCGCGAA AGACCGGCCG AACGGCTGGC TTCTAGCACA AGGATGGCGC
GAATACCGCT GGAACCCGCC AGTGCTGCCG TCGAAGCGCT CGCTGGACGA AGCGTTCCCC
ACGCGTCCCG TGGCGCTGTA CTCGGGAGAC GCGCACACGC TGTGGCTGAA CTCGGCCGCG
CTCGACGAGC TGGGGCTCAC GCGCGACAGC GTGCCGCCGG CAGGCGGCTC CTACGACCGC
GACGAGACAG GCGAGCTGAC CGGCATCGTG CGCGAGGCGG CCGCCATGGA GTTGATGCCG
CAGATCATGG GGTCGTTCAC CGACGAGGAG GTGGCCGACG CTTACCGCGG CTTCTTCGCG
CGGCTGGCCG AGAACGGCGT GACCAGCGTA TGCGACATGT CGCTCATGGC CCATCCGGGG
CTCGACTTCA TCCGGGACGA CGTGCACGCA TCCCTGCTGG AGCGCGGCGA GCTGACCGCG
CGCGTGCACC TCTTCCCCAC CTTGCTCGAC GACATGAGCC GCTTCGAGGA CATGCGTGCG
CGCTACACGG GTCCGTGCCT GCAGGCACCC GGTTTCAAGC AGTTCTTCGA CGGCGTGTCG
AGCCAGCATA CCGCCTGGGT GACCGAACCG TACGCGAACG CGCACGTCGA AGGCGATTGC
GGCCGACCCA CGGTGGATCC CGAGATCATG CGACGCTACG TGCTTGCCGC AGCCGAGCAG
GGCTTCCCCG TGCGCATCCA CGCCATCGGC GATGCGGCCA TCCATGCGGC GCTCGACATC
TTCGAAGAGG CGCGCGCGAA ATTCGGACCG CTGCCGGAGG GGCGGCGAAA CTGCCTGGAG
CACCTGGAGA ACTTCCTGCC CGGGGATATG AAGCGGCTGG CCGATCTGCA GGTGGTGGCC
GCCGTGCAGC CTCCTCACAT GACGCTGGAT CCCGGCGGCC CCGAGCGCGA CCTGGGGCCC
GAACGCGTTC CGTACATGTG GCCCTTCCGC ACGCTGCTGG ACGACAACAC GGTGCTGGCG
TTCGGCACGG ACTCGCCCGT GGTAGGTGTG AACTCGATGG ACGTGCTGTA CAGCGCGGTA
ACGCGGCAGG ACCCGGGCAC CCACGAGCCG ACAGGCGGAT GGCTGCACGA CGAGCGCATC
GGCATGGCTG AAGCCCTGCG CGCCTATACG CAGGGCAGCG CCGCGTCGGC AGGACGTCGC
AGCGAGCTGG GCACGCTGGA AGCGGGCAAG CTGGCCGACA TCGCCGTGCT CGACCGCAAT
CTGCTGGCCT GCGACGCCGA CGACATCCAG AAGACGAAGG TGCTGGCCAC CTTCATGGGC
GGGACGTGCG TGTTCGAGCG GTGA
 
Protein sequence
MRIDLIIESR NVFTGVGDVA RPAAIAVAGD RIVAVGSRED VRAFALEANA GGGAPEVRDF 
GDALVVPGFH DSHLHFFHSA VYSSPLATMF LGESETDCVA RMQAFAKDRP NGWLLAQGWR
EYRWNPPVLP SKRSLDEAFP TRPVALYSGD AHTLWLNSAA LDELGLTRDS VPPAGGSYDR
DETGELTGIV REAAAMELMP QIMGSFTDEE VADAYRGFFA RLAENGVTSV CDMSLMAHPG
LDFIRDDVHA SLLERGELTA RVHLFPTLLD DMSRFEDMRA RYTGPCLQAP GFKQFFDGVS
SQHTAWVTEP YANAHVEGDC GRPTVDPEIM RRYVLAAAEQ GFPVRIHAIG DAAIHAALDI
FEEARAKFGP LPEGRRNCLE HLENFLPGDM KRLADLQVVA AVQPPHMTLD PGGPERDLGP
ERVPYMWPFR TLLDDNTVLA FGTDSPVVGV NSMDVLYSAV TRQDPGTHEP TGGWLHDERI
GMAEALRAYT QGSAASAGRR SELGTLEAGK LADIAVLDRN LLACDADDIQ KTKVLATFMG
GTCVFER