Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0678 |
Symbol | |
ID | 8414968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 858355 |
End bp | 859515 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645023652 |
Product | agmatine deiminase |
Protein accession | YP_003181049 |
Protein GI | 257790443 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2957] Peptidylarginine deiminase and related enzymes |
TIGRFAM ID | [TIGR03380] agmatine deiminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.421408 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACGA TTCACGAGAG CGTTTCCACC CCGAAGGCCG ACGGCTACCG CATGCCCGGC GAGTTCGAGC CGCAGACCCG CATCTGGATG GCGTGGCCGC ACCGCACCGA CACGTGGGCC TGGGGCGCGA AGCCGGCTCA GAAGCAGTAC GCCGACGTGG CGCGCGCTAT CGCCGAGTTC GAGCCCGTCA CCATGTGCGT GAACCAGGTG GACTACGCCA ACGCCAAGGC CGTGTTCGAG GACGACGAGA ACGTCACCGT CGTCGAGATG ACCACCGACG ACGCGTGGGT GCGCGACACC GGCGCCACCT GGGTGGTCAA CGACGAGGGC GACAAGCGCG CCGTGCATTG GCACTTCAAC GCCTACGGCG GCTTCGAGAA CGGCCTGTAC TTCCCGTGGG ACAAAGACGA GCAGATCGCC CTCAAGATGG CCGAGATGAG CGGCTGCCGT CGCTATCGCC CCGAAAGCTT CATCCTCGAG GGCGGCTCCA TCCACGTGGA CGGCGAGGGC ACGGTCATCA CCACCGACAT GTGCCTGCTC GATCCCGGCC GCAACGCGTC CGTGACCGAC TACGAGCCCT GGTCCGAGGA GCTGCGCGCG TACTGCGACG AGCAGCTGAA GAAGTACCTG GGCGTGGAGA AGGTCATCTG GGTCAAGGAC GGCATCGACC CCGAGGAGAC GAACGGCCAC ATCGACGATG TCGCCCAGAT CGTCGCTCCC GGCAAGGTGC TGTGCATCTG GTCCGACGAC CCGGACTACC CGTTCTACAA CGAGTGTCAT GCCGCTTACG AGACGCTGTC CAACGCCGTG GACGCCAAGG GCCGCAAGCT CGAGGTGACC AAGCTCTGCA TGCCCGTGAA GCCGCTGTAC ATGGACCAGG CGTCCTGCGA CTCCATCGAC ACCGAGGAGT ACGCCGAGCC GCGCGTGGCC GATGAGCCGC TGATCGCGTC GTACATGAAC TTCCTCATCG TCAACGGCGG CGTCATCGTG CCGCAGTACG GCGACGAGAA CGACGCGCTG GCCGTCCAGC AGATCCAGGC TGCGTTCGAC GAGGCGTGGG GCGAGGGCGC GTACAAGGCC GTGGGCGTGA AGACCGACCA GGTGGTCTTC GGCGGCGGCA ACATCCACTG CATCACCCAG CAGGAGCCGG CCGGCAAGTA G
|
Protein sequence | MKTIHESVST PKADGYRMPG EFEPQTRIWM AWPHRTDTWA WGAKPAQKQY ADVARAIAEF EPVTMCVNQV DYANAKAVFE DDENVTVVEM TTDDAWVRDT GATWVVNDEG DKRAVHWHFN AYGGFENGLY FPWDKDEQIA LKMAEMSGCR RYRPESFILE GGSIHVDGEG TVITTDMCLL DPGRNASVTD YEPWSEELRA YCDEQLKKYL GVEKVIWVKD GIDPEETNGH IDDVAQIVAP GKVLCIWSDD PDYPFYNECH AAYETLSNAV DAKGRKLEVT KLCMPVKPLY MDQASCDSID TEEYAEPRVA DEPLIASYMN FLIVNGGVIV PQYGDENDAL AVQQIQAAFD EAWGEGAYKA VGVKTDQVVF GGGNIHCITQ QEPAGK
|
| |