Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0033 |
Symbol | |
ID | 8414312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 42668 |
End bp | 43894 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645023008 |
Product | Glutamate N-acetyltransferase |
Protein accession | YP_003180416 |
Protein GI | 257789810 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) |
TIGRFAM ID | [TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGGC CGAGGGCGGT GCGGCATCCT GAGATGCCGA GCCTTGCGAC GATCGACGAG GGAGGCGTGA CTTCCGCCCT CGGCTTCACG GCTTCGGGCG TGCATGCGGG CTTCTACGAG GGTAACGACC GCCTCGACTG CGCGCTGGTG TCGGCTGACG TACCCTGCCC CTGCGCGGCG CTGTTCACGC GCAACGCGTT CAGCGCCGCG CCTGTGGACG TGTCGCGGGA CCATCTTCGC CGCGTCTCGT TCGGATTCGT GCGTGCCGTG CTGATCAACT CGGGCAACGC CAACGCGCTG ACCGGCGAGA ACGGCCTCGA GGTGGCGCGG CGCTCGGCAA GCCTCGCGTC GGGAGAGCTG GGTTGCCGGG AAGGCGAGGT GCTGGTGGCT TCCACGGGAA TCATGGGCTC GCGCCCTCCC GTCGAGCCGT TCGAGCGCGG TGTTCCGCTT GCGTGCAGGC GGGCGGCGCG CGACGGCGGC CACGATGCGG CGCGCGCCAT CCTGACCTCC GGCGCGCACC CGAAGGAAGC TGCCGTTTCG TACCGCAGCA CCGATGCTGC GTACCGGGGC TGCACGTTCA CCGTAGGCGG CATGGCGAAA GGCCCCGAGA TGCTGTTGGT GCTGACCACT GACGCGCCGC TTTCTCCTGC GCTGGCATAC CGGGCGCTTG AGAAGTCGGC TTCCGCAAGC TTCAACAAGG TGATTGTCGA TGCCGGCTCG TCCACGAACG ATAGCTGCTT CCTGCTTGCC AGCGGCTATG GCGCGAAGCC GGGAAAGCCC ATTCGCGAGG GCACCCAGGC GTTTCGCGAG TTCTCCGAGG CCTTGAAAGA GGTGAGCGGC CGCCTCGCGC GTTGCATAGC GTCTGACGAG CAATGTGTAT CGTGCCTGAT CACCGTGCAT GTCGTCGGAG CCTTCGACGA GGCCGACGCC GACCGGGTGG CGCGCTCGGT CGCCCATTCG CTGGTGGTTC GGTCCACCGT TGCCGGACGT CATGCGAACT GGTCGCATAT CGTCTCTTCG ATCGGGTACG CCGACGCGCT GTTCATGAGA GAGCGCGTGT CGGTGGATGT CATGGGCGTT CCTGTGCTGA GACGCGGAGC GCTATGCCCC TTCGACGAGC AACGGCTGCT GCGCGAAGCG GGCGATCGGG AGATCGTTAT CCGCGTGGAC CTTGGGGCGG GCGGTGCGCA AACGACGTAC TGGACTGGCG ATCTGCCGCC GGGCTAG
|
Protein sequence | MRRPRAVRHP EMPSLATIDE GGVTSALGFT ASGVHAGFYE GNDRLDCALV SADVPCPCAA LFTRNAFSAA PVDVSRDHLR RVSFGFVRAV LINSGNANAL TGENGLEVAR RSASLASGEL GCREGEVLVA STGIMGSRPP VEPFERGVPL ACRRAARDGG HDAARAILTS GAHPKEAAVS YRSTDAAYRG CTFTVGGMAK GPEMLLVLTT DAPLSPALAY RALEKSASAS FNKVIVDAGS STNDSCFLLA SGYGAKPGKP IREGTQAFRE FSEALKEVSG RLARCIASDE QCVSCLITVH VVGAFDEADA DRVARSVAHS LVVRSTVAGR HANWSHIVSS IGYADALFMR ERVSVDVMGV PVLRRGALCP FDEQRLLREA GDREIVIRVD LGAGGAQTTY WTGDLPPG
|
| |