Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2834 |
Symbol | |
ID | 8417165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3289819 |
End bp | 3291174 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 645025814 |
Product | amidohydrolase |
Protein accession | YP_003183170 |
Protein GI | 257792564 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTTGT GCGCACAATA CATTCTTCCC ATCACGTCCG AGCCGTTTCA GAAGGGCGCG GTGCTTGTCC GCGACAACGT CATCCGCGAC ATCGGCACGG CCGAGATGCT CAAGCTGCGC TATCCCGACG AAGAAGTGGT CGATTTCGGC CAGGCGGCTA TCATGCCGGG CCTCGTCGAC CTGCACACGC ACCTCGAGAA CTCCGTGATG CGCGGTATCG TGCACGATGT GCCCTATACC ACGTGGGTCA CGTCCATGTT GGAGAAAAGC GCAAAGATGG ACGTGAGCGA CTGGTACGAC TCCGCTATCC TCGGGGGTCT TGAAGCGCTG TCCAGCGGCA TTACCTGCGT CGCCGATATC ACTGCCACCG GCGCCGCATG CACCGCCACG CAGAAGTTGG GCATGCGCAG CGTCATCTAC CGCGAGGTGG GCGCCATGGA CAAGCGCCGC GTCGATTACG CCATGCGCAT CGCCGAGAAC GACATCATGC ACTGGCGCGA AGAGGTTGAC GGCGACCGCA TCACCATCGG CGTGGCTCCC GCGGCTATGT ATGCCTGCCA TCCGTCCATG TTCTCGAAGG TGTCCGAATT CGCTCGGCGC GAGAACGTGC CCGTCGCCAT GCACGTGGCC GGCAACCGCG AAGAGTACAA CTTCATCAAG TACGGCTCGT CGCCGTTCTC GGTGCACACG ATGGACCAGA AGCGCGGCTT CGTGGAGATT CCGCCGTGGC TGCCCACCGG CACGACGCCC GTGCGCTACG CTTTGAACTG GGGCGCGTTC GAGTCCGACA ACGTGCTGGC CATCCACTGC GTGCACGTGG ACGACAAAGA CGTGCAGAAG CTGAAAGAGT ACGACGTGGC CGTGGCCGTG TGCCCGCGCT GCAACGCGCA GCTGGGCATG GGCGTGGCTC CCATCAACGA GTTCATGCGC GCAGGCCTTC GCCTGGGCAT GGGGACCGAT TCGCCGGCCG CAACCGACTC CACCGACATG CTCACCGAGA TGCGCATCGG CATGCTGGTG CAGCGCGCGG TGAACGTGGG CGAGTTCCTG GATTCGGCCA CCATGCTGGA GATGGCCACC ATCGGCGGCG CCCGCGCGCT CAAGCTGGAC GACAAGATCG GCTCCCTGGA AATAGGCAAG CTGGCCGACA TCATCGCGGT CGACCTGTCC GGCTCGCATC AGACGCCCAC CACCGATCCG GTTTCGGCCG TGGTCAACAC CTGCAGCGGC GCCGACATCC TCATGACCAT GGTGAACGGC ACCGCGCTGT ACGAGAAGAA CAAGTGGAAC GTGGGCGTCG AGGTTGCCAG GAACATCGCC CGCATCATCG AAATCCGCGG TAAGTTGAGG TTGTAA
|
Protein sequence | MLLCAQYILP ITSEPFQKGA VLVRDNVIRD IGTAEMLKLR YPDEEVVDFG QAAIMPGLVD LHTHLENSVM RGIVHDVPYT TWVTSMLEKS AKMDVSDWYD SAILGGLEAL SSGITCVADI TATGAACTAT QKLGMRSVIY REVGAMDKRR VDYAMRIAEN DIMHWREEVD GDRITIGVAP AAMYACHPSM FSKVSEFARR ENVPVAMHVA GNREEYNFIK YGSSPFSVHT MDQKRGFVEI PPWLPTGTTP VRYALNWGAF ESDNVLAIHC VHVDDKDVQK LKEYDVAVAV CPRCNAQLGM GVAPINEFMR AGLRLGMGTD SPAATDSTDM LTEMRIGMLV QRAVNVGEFL DSATMLEMAT IGGARALKLD DKIGSLEIGK LADIIAVDLS GSHQTPTTDP VSAVVNTCSG ADILMTMVNG TALYEKNKWN VGVEVARNIA RIIEIRGKLR L
|
| |