Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0672 |
Symbol | |
ID | 8414962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 849922 |
End bp | 851037 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645023646 |
Product | agmatine deiminase |
Protein accession | YP_003181043 |
Protein GI | 257790437 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2957] Peptidylarginine deiminase and related enzymes |
TIGRFAM ID | [TIGR03380] agmatine deiminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.915937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00000489111 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGATACGA TTTACGAGAA CGAATCGACC CCGAAGAAAG ACGGCTACCG CATGCCCGGC GAATTCGAGC CGCAGGAGTG CATCTGGATG CTGTGGCCGC ATCGTCCCGA CAACTGGCGC GACGGCGCGA AGCCCGCGCA GAAGGCGTAC GCCGACGTGG CGCGCGGCAT CGCCCAGTTC GAGCCGGTCA TCGTGGGCGT GAACCCCGAG GACTACGCCG CCGCGCACTA CGTGCTGGCG GGCGAGGAGA ACATCCTGGT TGTGGAGATG ACTAGCGACG ACTCGTGGAT CCGCGACTGC GGCCCCACGT TCGTGGTGAA CGACGACGGC GACGTGCGCG CGGTGCACTG GCACTTCAAC GCATGGGGCG GGCTGGTGGA CGGCCTGTAC TTCCCGTGGG ACCAGGACGC GCTCGTGGGC CTGAAGGTGG CCGACCTCGC CGGCGTGGAC CGCTACCGCC CGGACTCGTT CGTGCTGGAG GGCGGCTCCA TCCACGTGGA CGGCGAAGGC ACCGTGATGA CCACGGAGAT GTGCCTCTTG TCCGAGGGGC GCAACCCCGA GCTCTCGAAG GAGCAAATCG AGAACTACCT GTGCGAGTAC CTGGGCGTCG ACAAGGTGAT CTGGATCAAG GACGGCATAG ACCCCGAGGA GACGAACGGG CACATCGACG ACGTGGCCTG CTTCGTGCGC CCGGGCGAGG TGGCCTGCAT CTGGACCGAC GACGAGGACA ACCCGTTCTA CGAAGCCGCG CACGCCGCCT ACGAGACGCT GTCGAACGCC ACCGACGCCA AGGGGCGGGC GCTCAAGGTG CACAAGCTGA CCATGCCGAA GGAGCCGGTC TACATGACGC AGGAGGAAGT GGACGCCATC GACGTGGTGG AGGGCACCAT CCCGCGCACC ACCGAGGACG TGTGCATCGC CTCGTACATG AACTTCCTCA TCGGCAACGA TTTCGTGCTG GTGCCCCAGT ACGACGACGA ATACGACGAG ATGGCGTTGC AGCAGGTGCA GCAGATGTTC CCCGAACGCG AAGTCGTGGG CGTGCCCACG CGCGAAGTGG TGTACGGCGG CGGCAACATC CACTGCATCA CCCAGCAGCA GCCGGCTGGC GTGTAA
|
Protein sequence | MDTIYENEST PKKDGYRMPG EFEPQECIWM LWPHRPDNWR DGAKPAQKAY ADVARGIAQF EPVIVGVNPE DYAAAHYVLA GEENILVVEM TSDDSWIRDC GPTFVVNDDG DVRAVHWHFN AWGGLVDGLY FPWDQDALVG LKVADLAGVD RYRPDSFVLE GGSIHVDGEG TVMTTEMCLL SEGRNPELSK EQIENYLCEY LGVDKVIWIK DGIDPEETNG HIDDVACFVR PGEVACIWTD DEDNPFYEAA HAAYETLSNA TDAKGRALKV HKLTMPKEPV YMTQEEVDAI DVVEGTIPRT TEDVCIASYM NFLIGNDFVL VPQYDDEYDE MALQQVQQMF PEREVVGVPT REVVYGGGNI HCITQQQPAG V
|
| |