Gene Elen_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0033 
Symbol 
ID8414312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp42668 
End bp43894 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content67% 
IMG OID645023008 
ProductGlutamate N-acetyltransferase 
Protein accessionYP_003180416 
Protein GI257789810 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) 
TIGRFAM ID[TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGGC CGAGGGCGGT GCGGCATCCT GAGATGCCGA GCCTTGCGAC GATCGACGAG 
GGAGGCGTGA CTTCCGCCCT CGGCTTCACG GCTTCGGGCG TGCATGCGGG CTTCTACGAG
GGTAACGACC GCCTCGACTG CGCGCTGGTG TCGGCTGACG TACCCTGCCC CTGCGCGGCG
CTGTTCACGC GCAACGCGTT CAGCGCCGCG CCTGTGGACG TGTCGCGGGA CCATCTTCGC
CGCGTCTCGT TCGGATTCGT GCGTGCCGTG CTGATCAACT CGGGCAACGC CAACGCGCTG
ACCGGCGAGA ACGGCCTCGA GGTGGCGCGG CGCTCGGCAA GCCTCGCGTC GGGAGAGCTG
GGTTGCCGGG AAGGCGAGGT GCTGGTGGCT TCCACGGGAA TCATGGGCTC GCGCCCTCCC
GTCGAGCCGT TCGAGCGCGG TGTTCCGCTT GCGTGCAGGC GGGCGGCGCG CGACGGCGGC
CACGATGCGG CGCGCGCCAT CCTGACCTCC GGCGCGCACC CGAAGGAAGC TGCCGTTTCG
TACCGCAGCA CCGATGCTGC GTACCGGGGC TGCACGTTCA CCGTAGGCGG CATGGCGAAA
GGCCCCGAGA TGCTGTTGGT GCTGACCACT GACGCGCCGC TTTCTCCTGC GCTGGCATAC
CGGGCGCTTG AGAAGTCGGC TTCCGCAAGC TTCAACAAGG TGATTGTCGA TGCCGGCTCG
TCCACGAACG ATAGCTGCTT CCTGCTTGCC AGCGGCTATG GCGCGAAGCC GGGAAAGCCC
ATTCGCGAGG GCACCCAGGC GTTTCGCGAG TTCTCCGAGG CCTTGAAAGA GGTGAGCGGC
CGCCTCGCGC GTTGCATAGC GTCTGACGAG CAATGTGTAT CGTGCCTGAT CACCGTGCAT
GTCGTCGGAG CCTTCGACGA GGCCGACGCC GACCGGGTGG CGCGCTCGGT CGCCCATTCG
CTGGTGGTTC GGTCCACCGT TGCCGGACGT CATGCGAACT GGTCGCATAT CGTCTCTTCG
ATCGGGTACG CCGACGCGCT GTTCATGAGA GAGCGCGTGT CGGTGGATGT CATGGGCGTT
CCTGTGCTGA GACGCGGAGC GCTATGCCCC TTCGACGAGC AACGGCTGCT GCGCGAAGCG
GGCGATCGGG AGATCGTTAT CCGCGTGGAC CTTGGGGCGG GCGGTGCGCA AACGACGTAC
TGGACTGGCG ATCTGCCGCC GGGCTAG
 
Protein sequence
MRRPRAVRHP EMPSLATIDE GGVTSALGFT ASGVHAGFYE GNDRLDCALV SADVPCPCAA 
LFTRNAFSAA PVDVSRDHLR RVSFGFVRAV LINSGNANAL TGENGLEVAR RSASLASGEL
GCREGEVLVA STGIMGSRPP VEPFERGVPL ACRRAARDGG HDAARAILTS GAHPKEAAVS
YRSTDAAYRG CTFTVGGMAK GPEMLLVLTT DAPLSPALAY RALEKSASAS FNKVIVDAGS
STNDSCFLLA SGYGAKPGKP IREGTQAFRE FSEALKEVSG RLARCIASDE QCVSCLITVH
VVGAFDEADA DRVARSVAHS LVVRSTVAGR HANWSHIVSS IGYADALFMR ERVSVDVMGV
PVLRRGALCP FDEQRLLREA GDREIVIRVD LGAGGAQTTY WTGDLPPG