Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2226 |
Symbol | |
ID | 8416548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2613352 |
End bp | 2614653 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645025211 |
Product | amidohydrolase |
Protein accession | YP_003182576 |
Protein GI | 257791970 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.306812 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTTCG CTGATATCGA CCTGTTGGAT GAGAACCTTG ATTTCCGCTC CCATTGCTGG GTGGGCGTGC GCGACGGGCG CGTCGCCTAC GTGGGGGATG CGGCTCCGGC GGGCGAGGAA GCGGCGAGGT ACGGAGAGGT GTACGACGGA CGTGGCAAGC TCTTGTGTCC TGCGTTCTAC AACGCCCACG CGCACGCGCC CATGACGCTG TTGCGCGGCT ACGCCGAGAA CCTGCCTTTG CAGGCGTGGC TGAACGACAT GGTGTTTCCG TTCGAGGCGA AGATCACGCC CGAAGACTGC TACTGGGGCA CGCTTCTGTC CTGCGCCGAG ATGGCGCGGT ACGGCTGCGT GAGCTTCTCG GACATGTACT ATCACATGGA GGAGGGCGCC CGCGCAGCGC TCGACGCCGG CATCAAGATG AACCTGTCCG ACTCGCTTCT TGCCTTCAAC GGCGAAGGGT TGGACGACCT GCCGGTGAAG GGGAACCTCG ACCGTCTCAT CCGCGACCTC CAGGGCGCAG GCGATGGCCG CATCGTGGTG GACTGCAACA TCCATGCCGA GTACACGTCG AACCCGCGCG CCGTGGCCGA TTTGGCGGCG TACGCGAAGG AGCACGGGCT TCGGTTGCAG GTGCACGTCT CCGAGACGCG CCTCGAGCAC GAGGAGTGCA AGCAGCGCCA CGACGGTTTG ACGCCGGTGC GCTACTTCGA GAGCCTGGGC GTGCTCGACG TGCCCGTGAC GGCGGCGCAC TGCGTGTGGG TGGACGACGG CGACATCGAC GTGCTGGCGG AGCGCGGGGT GTTCGTGGCG GCGAACCCGG CGTCGAACAT GAAGCTGGGC AGCGGTTTCG CCCCTGTGGC AAAGATGCTC GCGCGCGGCG TGAACGTGTG CCTGGGCACC GACGGCATGG CGTCGAACAA CAACCACGAC ATGATGCAGG ATATGCACCT GCTGGCGCTG ACGGCGAAGG GATCGACGAA CGATCCGGCC GTGGTCACGC CGAAGCAGGC GCTTACGGCC GCTACGCGCG TGGGCGCGCT TTCGCAGGGG CGCGACGACT GCGGGTACGT GGCCGTGGGG GCGAAGGCTG ACTTGTGCGT GCTGGACACG TCGGGGCCGT CGTGGGCGCC GATGACGAAC CCGCTGGTGA ACGTCGTGTA CGCGGGGCAT GGCGCCGACG TGTGCCTGAC GATGTGCGAC GGGGTCGTGG TGTATCGCGA GGGCGAGTGG CCCACGCTGG ACATCGAGCG AGCGAAGGCC GAGGTCGAGG CCCGCACGAA GCGCATCATC GGCGAGCTGT AG
|
Protein sequence | MLFADIDLLD ENLDFRSHCW VGVRDGRVAY VGDAAPAGEE AARYGEVYDG RGKLLCPAFY NAHAHAPMTL LRGYAENLPL QAWLNDMVFP FEAKITPEDC YWGTLLSCAE MARYGCVSFS DMYYHMEEGA RAALDAGIKM NLSDSLLAFN GEGLDDLPVK GNLDRLIRDL QGAGDGRIVV DCNIHAEYTS NPRAVADLAA YAKEHGLRLQ VHVSETRLEH EECKQRHDGL TPVRYFESLG VLDVPVTAAH CVWVDDGDID VLAERGVFVA ANPASNMKLG SGFAPVAKML ARGVNVCLGT DGMASNNNHD MMQDMHLLAL TAKGSTNDPA VVTPKQALTA ATRVGALSQG RDDCGYVAVG AKADLCVLDT SGPSWAPMTN PLVNVVYAGH GADVCLTMCD GVVVYREGEW PTLDIERAKA EVEARTKRII GEL
|
| |