Gene Information |
Locus tag | Elen_2189 |
Symbol | |
ID | 8416511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2569195 |
End bp | 2571966 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645025175 |
Product | metallophosphoesterase |
Protein accession | YP_003182540 |
Protein GI | 257791934 |
COG category | [R] General function prediction only |
COG ID | [COG1409] Predicted phosphohydrolases |
TIGRFAM ID | |
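The length fields above follow from the coordinates by simple arithmetic. A minimal sketch of that check, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon (the function names are hypothetical, chosen for illustration only):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2569195, 2571966)
print(length)                     # 2772
print(protein_length_aa(length))  # 923
```

Both results agree with the Gene Length and Protein Length rows of this record.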
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATCCCC AACAGCTCGC GTCAGCCTAC GACCGCGGCA CATTCGGCGT GTCGCGACGC GCGTTCCTCA AGTTGGCCGC CGTCGCCGGC GTGTCAGCCG CCGCAGGGAT GCTCGTCTTT CCGGAGCAGC AGGCCGCAGC CGCGTTCACG CAGCCCGCAG CCCCGAACTG GCCGCCTTTG TCCGGCGACA AGATCCGCTT CACCGTGCAC AGCGACACCC ATGTCGGCGC AAGCCCGAAC AACAACTACC GCGACAAGAT CCCTGAAGCG TTCTCGGCCA TCTACGCCAT GGCACCCGAC GTGGACGCGC ATTTCTTCGT GGGCGACTCG GCCGACACTG GCCACCCGGA CCAGTACGTC GAGCTCGCGC AACTGCTGAA CGCTAACGCG CGCAAGCCGG TCGGCATCGT CATGGGCAAC CACGAGTACT ACAACTGGAG CGGCAGCAAG CAGAACGCCC AGGACGCGTT CAAGACGTTC CTCGCCTCGG AGCTGAACGT GCCCGGTTCG TTCCAAATGC CAGGCGGTGC GAACGAAGGC CAGGTGGATG CCGACTTCGC CGTAGGAGGC TACCACGTGC TGGCGGTCGC GCCCGAACCG GGCGGCTACG ACAACAGCTG GTACGGAGCG AAGCGCGATT GGATCCTCGA ACGCTGCGCG GCCGCGGCGG CGGAGGATCC GGCCAAGCCC ATATTCCTGC TCACGCATCA CCCTTTCCCG AACACCGTAT GGTACTCGGG TTCGAACAGC TGGAACGGCC AGTTCGACGA GAATGCCAAC ACGGCGCAGT CCGGCGCGTT CTACCAGGAG CTGTGCGAGA AGTATCCGCA GATCATCCAC TTCTCCGGCC ACACCCACAT CCCTATGGCG GACCCGCGCT CCATTTATCA GGACGACGGG TTCACGCTCA TCCAAACCGC CACGTTCGCC AATAACTTCT GGATGGAAAA CGACGGGCAC GACGAAACCG GCAGCGCAGG CGGCCACCCG AACGCCGGCT GGGACGCGAA TCAATGCGAA CTCGTGGAAA TCGACCCGGC GACCAACTCC GTGTCCGTAT ACCGACTCGA CTTCCGCAGC GGATGCGCCC TCGGCGCACC GTGGGTCATC GAGCCGTCGA AGGGAACGAC CGCCTTTCGC TACACGCACG CAGGCATGGC CGCGCGCAGC AAGCCGCCCC TCGTGCTGGA CGGCGCCGAA GTAGCGGTGA TCGAGGAGTC CGTCACGGCG AACGGTGCCT CGTTCAGCGT AAAGGCCGCC CGGGTGGCGC CCGACGCAAG CGGGCTCGAA GACGACATCG TCATTTCGTA TCGCGCGGTA GTCGCCTTGG CCGAAGCGCC TGACGCGGCG GTGTACGATG CGCGCTTCAT GTCCGACTAC TACAAGGCCG AGGCGAACCG CGCCGAGGTG TTCGAGCGTC CGCTGTTCGG CGCGGGGCTT GCCGAGGACA CGTCGTACGT GCTGCGCGTG TTCGCCGCGA ACCCGTTCGG GAAGGAGACG CTCGTCGGGG AGGCGGCGTT CCGCACGGCG GCCCGGGTCA CGCCGGCTCT GGGCAACCCG CTGCTGTCCG TGGACTTCTC CACAGGAAGC CACGCAGATG CCGCGTTGGC GCCGCACAAC GCCGTGCCCA CCGGCACGCT GACCTACGAG AGCGATTCGT TCGGCGTGCC CGTAGCCGTG TTCGACGGCT CATCTGCCGT AGGATACGAT TTCACCGCCG ACGATTACGC GGCCATCGCG CAAGCCGAAA CGATCGAAGT GCTGTTCCAA TTCACGGCGA ACCCGACGAG CGGCTACTTC 
GACTTGTTCT CCAGCGCGCA AGGCGCCGGC CAGGACCTCA GCTACTATGC GCCCCAGCTG CAGCACTACG TCAACACGGG ATCGGGCTAC CGCTGCACCG AAGCCACTGT GCCGCTGAAC GCCTGGACGC ACGTGCTAGC CACCTACGAT GGCGTCATCA TGAGCTACTA CCTGAACGGG GAGCTCGTAT CGACGCTGGA CAATCCGGGA TCGATCCCTG CGCCCACCGC TTCGGCGACC CGTTGGTTCG TAGGAGCCGA CGTCAATTCG AACGGCGAGA TGGAGAAGCC CATGACGGGC AAGGTAGCGT TCGCGAAGCT GACGCCCGGC GTGGCGACGG CAGCGCAGGC AGCCGAACTC TACGTGGCGG CTGCGCCCGT TGCGGCGGCG GTCGCACCCC CGAGCGCCGA CGCCATCGGG ACGGCGACCG CAGGCGAGGT CTACGCCATC CCCCCGCTGC CGTTCGCCGA CGCGAACGGA CGGGAGCTGC AGGGCATCCC GTCGGTCGTC GGTCCCAACG GGGCGGCGGT GGACGTCGTC AGGACGGAGG CCGAGGGGGT CGCCGCCTAC CGCTTCACGC CCGCCGTCGA AGGCGCGCAC ACGGTGACGT ACGCCGCCGG CTACGCGCAG CGGCCCACGT TCGAGCTGGC CGTGGCCGCC GCCGTCGCGA AGCCCGATCC CGACCCGGAA CCCGATCCCT CGCCCGACCC GGAGCCGACG CCCGACCCGG AGCCGGAACC CGATCCCGAA CCCGAGCCGA CGCCCGATCC TGCGCCGGAG CCCTCCCCCG CTCCCGACGC CTTCGCGCCC GAAGTCGTCA CGTCTACGCC AACGCCGCAA GCGAAGAAGC TGGCCCGGAC GGGCGACCCC GTCGCCATCG GTGGCATCGT CGTCGGCGTC GCCGCCACCG CCGGAGCCGC CATCTGCGTG GCCCGCAGCC TCATGAAAGG AGAGTTGAGC GAGGAGGAGT AG
Protein sequence | MDPQQLASAY DRGTFGVSRR AFLKLAAVAG VSAAAGMLVF PEQQAAAAFT QPAAPNWPPL SGDKIRFTVH SDTHVGASPN NNYRDKIPEA FSAIYAMAPD VDAHFFVGDS ADTGHPDQYV ELAQLLNANA RKPVGIVMGN HEYYNWSGSK QNAQDAFKTF LASELNVPGS FQMPGGANEG QVDADFAVGG YHVLAVAPEP GGYDNSWYGA KRDWILERCA AAAAEDPAKP IFLLTHHPFP NTVWYSGSNS WNGQFDENAN TAQSGAFYQE LCEKYPQIIH FSGHTHIPMA DPRSIYQDDG FTLIQTATFA NNFWMENDGH DETGSAGGHP NAGWDANQCE LVEIDPATNS VSVYRLDFRS GCALGAPWVI EPSKGTTAFR YTHAGMAARS KPPLVLDGAE VAVIEESVTA NGASFSVKAA RVAPDASGLE DDIVISYRAV VALAEAPDAA VYDARFMSDY YKAEANRAEV FERPLFGAGL AEDTSYVLRV FAANPFGKET LVGEAAFRTA ARVTPALGNP LLSVDFSTGS HADAALAPHN AVPTGTLTYE SDSFGVPVAV FDGSSAVGYD FTADDYAAIA QAETIEVLFQ FTANPTSGYF DLFSSAQGAG QDLSYYAPQL QHYVNTGSGY RCTEATVPLN AWTHVLATYD GVIMSYYLNG ELVSTLDNPG SIPAPTASAT RWFVGADVNS NGEMEKPMTG KVAFAKLTPG VATAAQAAEL YVAAAPVAAA VAPPSADAIG TATAGEVYAI PPLPFADANG RELQGIPSVV GPNGAAVDVV RTEAEGVAAY RFTPAVEGAH TVTYAAGYAQ RPTFELAVAA AVAKPDPDPE PDPSPDPEPT PDPEPEPDPE PEPTPDPAPE PSPAPDAFAP EVVTSTPTPQ AKKLARTGDP VAIGGIVVGV AATAGAAICV ARSLMKGELS EEE
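The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute GC fraction and a conceptual translation. It is an illustration under stated assumptions, not the database's own method: the helper names are hypothetical, and the codon table is the standard genetic code, which is assumed here to be sufficient because translation table 11 uses the same amino acid assignments and differs only in its permitted start codons.

```python
# Standard genetic code in the usual compact T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate frame 0 codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence in this record.
prefix = "ATGGATCCCC AACAGCTCGC"
print(translate(prefix))    # MDPQQL
print(gc_fraction(prefix))  # 0.6
```

The translated prefix matches the first six residues of the protein sequence listed in this record. Running `gc_fraction` over the full gene sequence gives the figure to compare against the GC content row.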