Gene Elen_0970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0970 
Symbol 
ID8415260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1176578 
End bp1178428 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content68% 
IMG OID645023934 
ProductAmidohydrolase 3 
Protein accessionYP_003181331 
Protein GI257790725 
COG category[R] General function prediction only 
COG ID[COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.31521 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.000183819 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATCACA GCGAGAACCA GAAGACGGGG CGCGGACGAG GACGCGGAGG CGCCCGGGTC 
GCAGGAGCGC CTCTCACGCG CCGCACGTTC GTGGCGGGCG CGACCGTGCT GGCAGCCGGG
GCGCTGCTGG GCGGCCCGCT GGGCTGCAGC GCGCCGGGGC AGGACGGCGC GACGGCGCAG
GCGGCCGGGG ATGCTGCCGA CCTCGTGTTC AAGAACGGGC ACGTGCAGAC GCTCGTGCGC
GAAGGCGATG CGGCCGAGGC GGTGGCCGTG CGCGAAGGAT CCATCGTGTA CGTGGGCGAC
ACCGCCGGCG TCGAGGCGTA CGTGGGCGAC TCCACGAAGG TGGTCGACCT CGAGGGCCGG
TTCCTCTGCC CCGGCTTCAT GGACGGCCAC CTGCACGGGC CCCAGCCGTA CTACGAGCAG
ATGTTCCAGA TCTCCATCCC CGACGGCACC GTCGACAACG ACGAGTACCT CCGCATCATC
CGGGAGTTCG TCGAGGCGCA CCCCGACGAC GAGGCGTACT ACGGCGGCCC CTTCATGCAG
AACGCCTACC TGCAGCCCGA CGGCTCGAAC CCGGGCCCGC AGAAGGAGGA TCTCGACGCC
ATCTGCGCCG ACAAGCCCGT CATGATCCGC GACGTGTCGC ACCACTCCTA CTGGGTGAAC
AGCAAGGCGC TCGAGATCGC CGGCATCACG GCCGACACGC CCGACCCCGA CGGCGGCTCC
ATCGTGCGCA ACGCCGCAGG CGAGCCGAGC GGCCTGCTTA CCGACGCGGC GAAGAACCTC
GTGGCGTCGA AGATCGAGGT GCCCTACTCC ACCGAGAACA TGGCGAAAGC CTACGAGGCG
TTCCAGGAGT ACTGCCATTC GCTGGGCATC ACGGGCCTCA CGAACATCAA CCTGTCGGGC
GTCGAGCTCA TCCACGCCGA GGCGCTTCAC GACATGGACG CGCGGGGCGA CCTGCACCTG
CGCCAGCGCT TCCTCGTGTG GGGGCAGCCG GGCATGGGCT ACGAGGGCAT CAAGGAGAAG
CTCGACGTCG TGGCCGCCTA CGACTCCGAG ATGTTCCAGA CCGGCACGGT GAAGATCGTC
TACGACGGTG TGACCGAGGG CGCGACGGCC GTCATGCTGG AGCCGTACCT GCCGGCCGCC
GGCAAGGGCG AGGGCTGGAC CGCCACGAGC GACTGGTCGG TCGAAGAGCT CGACCAGGTG
GTGGCCGACC TCGACAAACA CGGCTACCAG GCGCACATCC ACGCCATCGG CGACGGCGCG
GTGCGCACCT CGCTCGACGC CTACGAGCGC GCCGAGGCGG CCAACGGCAA GCACGACGCG
CGCCACACGA TGGTGCACGT GTGCGCCATC ACGCCCGAGG ACATCAAGCG CTGCGCCGAT
CTCGAAGTGG TCAGCGACCT GCAGTTCCTC TGGATGTACA ACGATCCGCT GTGTCAACTT
GAAACCGCGT TCGTCGGTAA GGAGCGCGCC TTCGCCATGT ACCCGGCCAA GGACATGCTC
GAGGCCGGCT GCATCCTCAG CGGCGGCAGC GACGGCGCCG TGACCGCCTA CGACCCGCTG
CTCGAGATCG AGGTGGGCAT CACGCGCAAC AGCCCCTTCC CCGGCGAGGA GGACGAGGAC
TTCTACCGCT GGCCCGAGCA GGGGCTGACC GCCTACCAGA TGCTCGAGAT GTACACGAAG
AACGTGGCGT ACGAGAACTT CATGGAGGAC GTCGTGGGCA CCGTGGAAGT GGGCAAGAAG
GCCGACTTCG TCGTGCTCGA CCAGAACATC CTCGACATCG ACCCCAAGCA GATCTCCGAG
ACGAAGGTGG TGTGCACCGT CTCGAACGGC AACATCGTCT TCGAAGGCTA G
 
Protein sequence
MNHSENQKTG RGRGRGGARV AGAPLTRRTF VAGATVLAAG ALLGGPLGCS APGQDGATAQ 
AAGDAADLVF KNGHVQTLVR EGDAAEAVAV REGSIVYVGD TAGVEAYVGD STKVVDLEGR
FLCPGFMDGH LHGPQPYYEQ MFQISIPDGT VDNDEYLRII REFVEAHPDD EAYYGGPFMQ
NAYLQPDGSN PGPQKEDLDA ICADKPVMIR DVSHHSYWVN SKALEIAGIT ADTPDPDGGS
IVRNAAGEPS GLLTDAAKNL VASKIEVPYS TENMAKAYEA FQEYCHSLGI TGLTNINLSG
VELIHAEALH DMDARGDLHL RQRFLVWGQP GMGYEGIKEK LDVVAAYDSE MFQTGTVKIV
YDGVTEGATA VMLEPYLPAA GKGEGWTATS DWSVEELDQV VADLDKHGYQ AHIHAIGDGA
VRTSLDAYER AEAANGKHDA RHTMVHVCAI TPEDIKRCAD LEVVSDLQFL WMYNDPLCQL
ETAFVGKERA FAMYPAKDML EAGCILSGGS DGAVTAYDPL LEIEVGITRN SPFPGEEDED
FYRWPEQGLT AYQMLEMYTK NVAYENFMED VVGTVEVGKK ADFVVLDQNI LDIDPKQISE
TKVVCTVSNG NIVFEG