Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0412 |
Symbol | |
ID | 8414696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 528646 |
End bp | 529887 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645023387 |
Product | Agmatine deiminase |
Protein accession | YP_003180790 |
Protein GI | 257790184 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2957] Peptidylarginine deiminase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.512784 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.000333823 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTTGGAAA ACTATCGGCA TCCAGGGGAA TTCGAACCCC AATCAGACGT TTTCATGGAA TGGATTCCCG ACGCGTATCA GATGAAGGGC TACGACAACA GCCGGTCGTG CGCCGAGATC GTCAAAGCGC TGCAGGAGTT CGGGGGCGTG ACGCTCCATA TCAACTGCGG TGCGGAAGGC GTTCTCGAAC GTGCCAAGTC GAGCTTGGCG GAGAAGGGGG TCGATACGGC CGACATCCGC TTCGTGCAAT TCGCCGATCC GAACTTCTAT GTGCGGGACA ACGGCCCGAC GGTTATGGTG GACGATCGAG GCGGCAGAAT CCTGATCAAC CCGAATTGGA GCTACTACGG CACGCTGCCG CCCGACGACC CGTACTGCGT GCAGTCGCGC ATCGCCGCCG TGCAGATGGG GGTGTCCTTG GGCATCTTCG ACGTGGTGAA TTCCGATGTG GTGTCCGAAG GAGGGGATCG GGAGTTCAAC GGTCAGGGCG TCATGATGTG CATCGAGGAC ACGGAAGTGC GCAAACGTAA TCCGGGTCTT ACGAAAGAGC AGGTAGAAGC CGAATTCAAG AGACTCTACA ACGTGGAGAA GATCATCTGG ATCCCACAGC CTTTGCTAGA AGACGACGAT TTCAGGATGG GGCCGTTGGA ATACCGCGAC GGCGTGCCGT ACCTCGGCTC CAGCTTCGCG GCCCATATCG ACGAGCTGTG CCGCTTCGTG GATGCGAACA CCGTCGTGCT TGCCGAGGTG ACCGACGATG AGGCGGCGGA AAGCGCGATC GGCGCAGAGA ACAAACGACG CATCGAAGCC GCCTACGATG TGCTCTCGAA GGCGACGGAC GTCCATGGCA ACCCGTTCGC CATCAAGCGC ATGCCCGTGC CTATCTCCAT CGATTACGTC TTGACCGAGG ACGACGAGAA CTACGGGCTG TACGAGGGGC CCGTGATGGA GATGGGCGGC GCCTTCGCCG ACGGCACGCC GTGGCCCGGC GGCCCCATCC ATCTCATAGC CTCGACGGGG TACTGCAATT TCCTCATCTG CAACGGCGTG GTCATCGGCC AGCGCTACTG GCATGAGGGG ATGGATCCGG CAATCAAGGG GAAGGACGAA GCCGCCCAAG CGGTTCTCGA GGAGTGCTTC CCGGATCGCA CGGTGGTGAT GGTGGACAGC TTGGCGCTGA ACATGACCGG CGGCGGCGTG CATTGCTGGA CGAAGAACGT TGCGGCGTCC GAGCCGCGAT GA
|
Protein sequence | MLENYRHPGE FEPQSDVFME WIPDAYQMKG YDNSRSCAEI VKALQEFGGV TLHINCGAEG VLERAKSSLA EKGVDTADIR FVQFADPNFY VRDNGPTVMV DDRGGRILIN PNWSYYGTLP PDDPYCVQSR IAAVQMGVSL GIFDVVNSDV VSEGGDREFN GQGVMMCIED TEVRKRNPGL TKEQVEAEFK RLYNVEKIIW IPQPLLEDDD FRMGPLEYRD GVPYLGSSFA AHIDELCRFV DANTVVLAEV TDDEAAESAI GAENKRRIEA AYDVLSKATD VHGNPFAIKR MPVPISIDYV LTEDDENYGL YEGPVMEMGG AFADGTPWPG GPIHLIASTG YCNFLICNGV VIGQRYWHEG MDPAIKGKDE AAQAVLEECF PDRTVVMVDS LALNMTGGGV HCWTKNVAAS EPR
|
| |