Gene Elen_0672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0672 
Symbol 
ID8414962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp849922 
End bp851037 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content65% 
IMG OID645023646 
Productagmatine deiminase 
Protein accessionYP_003181043 
Protein GI257790437 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2957] Peptidylarginine deiminase and related enzymes 
TIGRFAM ID[TIGR03380] agmatine deiminase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.915937 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00000489111 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGATACGA TTTACGAGAA CGAATCGACC CCGAAGAAAG ACGGCTACCG CATGCCCGGC 
GAATTCGAGC CGCAGGAGTG CATCTGGATG CTGTGGCCGC ATCGTCCCGA CAACTGGCGC
GACGGCGCGA AGCCCGCGCA GAAGGCGTAC GCCGACGTGG CGCGCGGCAT CGCCCAGTTC
GAGCCGGTCA TCGTGGGCGT GAACCCCGAG GACTACGCCG CCGCGCACTA CGTGCTGGCG
GGCGAGGAGA ACATCCTGGT TGTGGAGATG ACTAGCGACG ACTCGTGGAT CCGCGACTGC
GGCCCCACGT TCGTGGTGAA CGACGACGGC GACGTGCGCG CGGTGCACTG GCACTTCAAC
GCATGGGGCG GGCTGGTGGA CGGCCTGTAC TTCCCGTGGG ACCAGGACGC GCTCGTGGGC
CTGAAGGTGG CCGACCTCGC CGGCGTGGAC CGCTACCGCC CGGACTCGTT CGTGCTGGAG
GGCGGCTCCA TCCACGTGGA CGGCGAAGGC ACCGTGATGA CCACGGAGAT GTGCCTCTTG
TCCGAGGGGC GCAACCCCGA GCTCTCGAAG GAGCAAATCG AGAACTACCT GTGCGAGTAC
CTGGGCGTCG ACAAGGTGAT CTGGATCAAG GACGGCATAG ACCCCGAGGA GACGAACGGG
CACATCGACG ACGTGGCCTG CTTCGTGCGC CCGGGCGAGG TGGCCTGCAT CTGGACCGAC
GACGAGGACA ACCCGTTCTA CGAAGCCGCG CACGCCGCCT ACGAGACGCT GTCGAACGCC
ACCGACGCCA AGGGGCGGGC GCTCAAGGTG CACAAGCTGA CCATGCCGAA GGAGCCGGTC
TACATGACGC AGGAGGAAGT GGACGCCATC GACGTGGTGG AGGGCACCAT CCCGCGCACC
ACCGAGGACG TGTGCATCGC CTCGTACATG AACTTCCTCA TCGGCAACGA TTTCGTGCTG
GTGCCCCAGT ACGACGACGA ATACGACGAG ATGGCGTTGC AGCAGGTGCA GCAGATGTTC
CCCGAACGCG AAGTCGTGGG CGTGCCCACG CGCGAAGTGG TGTACGGCGG CGGCAACATC
CACTGCATCA CCCAGCAGCA GCCGGCTGGC GTGTAA
 
Protein sequence
MDTIYENEST PKKDGYRMPG EFEPQECIWM LWPHRPDNWR DGAKPAQKAY ADVARGIAQF 
EPVIVGVNPE DYAAAHYVLA GEENILVVEM TSDDSWIRDC GPTFVVNDDG DVRAVHWHFN
AWGGLVDGLY FPWDQDALVG LKVADLAGVD RYRPDSFVLE GGSIHVDGEG TVMTTEMCLL
SEGRNPELSK EQIENYLCEY LGVDKVIWIK DGIDPEETNG HIDDVACFVR PGEVACIWTD
DEDNPFYEAA HAAYETLSNA TDAKGRALKV HKLTMPKEPV YMTQEEVDAI DVVEGTIPRT
TEDVCIASYM NFLIGNDFVL VPQYDDEYDE MALQQVQQMF PEREVVGVPT REVVYGGGNI
HCITQQQPAG V