Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0970 |
Symbol | |
ID | 8415260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1176578 |
End bp | 1178428 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645023934 |
Product | Amidohydrolase 3 |
Protein accession | YP_003181331 |
Protein GI | 257790725 |
COG category | [R] General function prediction only |
COG ID | [COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.31521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.000183819 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATCACA GCGAGAACCA GAAGACGGGG CGCGGACGAG GACGCGGAGG CGCCCGGGTC GCAGGAGCGC CTCTCACGCG CCGCACGTTC GTGGCGGGCG CGACCGTGCT GGCAGCCGGG GCGCTGCTGG GCGGCCCGCT GGGCTGCAGC GCGCCGGGGC AGGACGGCGC GACGGCGCAG GCGGCCGGGG ATGCTGCCGA CCTCGTGTTC AAGAACGGGC ACGTGCAGAC GCTCGTGCGC GAAGGCGATG CGGCCGAGGC GGTGGCCGTG CGCGAAGGAT CCATCGTGTA CGTGGGCGAC ACCGCCGGCG TCGAGGCGTA CGTGGGCGAC TCCACGAAGG TGGTCGACCT CGAGGGCCGG TTCCTCTGCC CCGGCTTCAT GGACGGCCAC CTGCACGGGC CCCAGCCGTA CTACGAGCAG ATGTTCCAGA TCTCCATCCC CGACGGCACC GTCGACAACG ACGAGTACCT CCGCATCATC CGGGAGTTCG TCGAGGCGCA CCCCGACGAC GAGGCGTACT ACGGCGGCCC CTTCATGCAG AACGCCTACC TGCAGCCCGA CGGCTCGAAC CCGGGCCCGC AGAAGGAGGA TCTCGACGCC ATCTGCGCCG ACAAGCCCGT CATGATCCGC GACGTGTCGC ACCACTCCTA CTGGGTGAAC AGCAAGGCGC TCGAGATCGC CGGCATCACG GCCGACACGC CCGACCCCGA CGGCGGCTCC ATCGTGCGCA ACGCCGCAGG CGAGCCGAGC GGCCTGCTTA CCGACGCGGC GAAGAACCTC GTGGCGTCGA AGATCGAGGT GCCCTACTCC ACCGAGAACA TGGCGAAAGC CTACGAGGCG TTCCAGGAGT ACTGCCATTC GCTGGGCATC ACGGGCCTCA CGAACATCAA CCTGTCGGGC GTCGAGCTCA TCCACGCCGA GGCGCTTCAC GACATGGACG CGCGGGGCGA CCTGCACCTG CGCCAGCGCT TCCTCGTGTG GGGGCAGCCG GGCATGGGCT ACGAGGGCAT CAAGGAGAAG CTCGACGTCG TGGCCGCCTA CGACTCCGAG ATGTTCCAGA CCGGCACGGT GAAGATCGTC TACGACGGTG TGACCGAGGG CGCGACGGCC GTCATGCTGG AGCCGTACCT GCCGGCCGCC GGCAAGGGCG AGGGCTGGAC CGCCACGAGC GACTGGTCGG TCGAAGAGCT CGACCAGGTG GTGGCCGACC TCGACAAACA CGGCTACCAG GCGCACATCC ACGCCATCGG CGACGGCGCG GTGCGCACCT CGCTCGACGC CTACGAGCGC GCCGAGGCGG CCAACGGCAA GCACGACGCG CGCCACACGA TGGTGCACGT GTGCGCCATC ACGCCCGAGG ACATCAAGCG CTGCGCCGAT CTCGAAGTGG TCAGCGACCT GCAGTTCCTC TGGATGTACA ACGATCCGCT GTGTCAACTT GAAACCGCGT TCGTCGGTAA GGAGCGCGCC TTCGCCATGT ACCCGGCCAA GGACATGCTC GAGGCCGGCT GCATCCTCAG CGGCGGCAGC GACGGCGCCG TGACCGCCTA CGACCCGCTG CTCGAGATCG AGGTGGGCAT CACGCGCAAC AGCCCCTTCC CCGGCGAGGA GGACGAGGAC TTCTACCGCT GGCCCGAGCA GGGGCTGACC GCCTACCAGA TGCTCGAGAT GTACACGAAG AACGTGGCGT ACGAGAACTT CATGGAGGAC GTCGTGGGCA CCGTGGAAGT GGGCAAGAAG GCCGACTTCG TCGTGCTCGA CCAGAACATC CTCGACATCG ACCCCAAGCA GATCTCCGAG ACGAAGGTGG TGTGCACCGT CTCGAACGGC AACATCGTCT TCGAAGGCTA G
|
Protein sequence | MNHSENQKTG RGRGRGGARV AGAPLTRRTF VAGATVLAAG ALLGGPLGCS APGQDGATAQ AAGDAADLVF KNGHVQTLVR EGDAAEAVAV REGSIVYVGD TAGVEAYVGD STKVVDLEGR FLCPGFMDGH LHGPQPYYEQ MFQISIPDGT VDNDEYLRII REFVEAHPDD EAYYGGPFMQ NAYLQPDGSN PGPQKEDLDA ICADKPVMIR DVSHHSYWVN SKALEIAGIT ADTPDPDGGS IVRNAAGEPS GLLTDAAKNL VASKIEVPYS TENMAKAYEA FQEYCHSLGI TGLTNINLSG VELIHAEALH DMDARGDLHL RQRFLVWGQP GMGYEGIKEK LDVVAAYDSE MFQTGTVKIV YDGVTEGATA VMLEPYLPAA GKGEGWTATS DWSVEELDQV VADLDKHGYQ AHIHAIGDGA VRTSLDAYER AEAANGKHDA RHTMVHVCAI TPEDIKRCAD LEVVSDLQFL WMYNDPLCQL ETAFVGKERA FAMYPAKDML EAGCILSGGS DGAVTAYDPL LEIEVGITRN SPFPGEEDED FYRWPEQGLT AYQMLEMYTK NVAYENFMED VVGTVEVGKK ADFVVLDQNI LDIDPKQISE TKVVCTVSNG NIVFEG
|
| |