Gene Elen_0678 details


Gene Information

Locus tag: Elen_0678
Symbol:
ID: 8414968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 858355
End bp: 859515
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 66%
IMG OID: 645023652
Product: agmatine deiminase
Protein accession: YP_003181049
Protein GI: 257790443
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2957] Peptidylarginine deiminase and related enzymes
TIGRFAM ID: [TIGR03380] agmatine deiminase
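
The coordinates and lengths in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 858355
end_bp = 859515
gene_length_bp = 1161
protein_length_aa = 386

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# a stop codon that contributes no amino acid.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, codons, remainder, residues)  # 1161 387 0 386
```

Under these assumptions the span matches the stated gene length, the length is a whole number of codons, and the codon count minus one stop codon matches the stated protein length.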


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.421408
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACGA TTCACGAGAG CGTTTCCACC CCGAAGGCCG ACGGCTACCG CATGCCCGGC 
GAGTTCGAGC CGCAGACCCG CATCTGGATG GCGTGGCCGC ACCGCACCGA CACGTGGGCC
TGGGGCGCGA AGCCGGCTCA GAAGCAGTAC GCCGACGTGG CGCGCGCTAT CGCCGAGTTC
GAGCCCGTCA CCATGTGCGT GAACCAGGTG GACTACGCCA ACGCCAAGGC CGTGTTCGAG
GACGACGAGA ACGTCACCGT CGTCGAGATG ACCACCGACG ACGCGTGGGT GCGCGACACC
GGCGCCACCT GGGTGGTCAA CGACGAGGGC GACAAGCGCG CCGTGCATTG GCACTTCAAC
GCCTACGGCG GCTTCGAGAA CGGCCTGTAC TTCCCGTGGG ACAAAGACGA GCAGATCGCC
CTCAAGATGG CCGAGATGAG CGGCTGCCGT CGCTATCGCC CCGAAAGCTT CATCCTCGAG
GGCGGCTCCA TCCACGTGGA CGGCGAGGGC ACGGTCATCA CCACCGACAT GTGCCTGCTC
GATCCCGGCC GCAACGCGTC CGTGACCGAC TACGAGCCCT GGTCCGAGGA GCTGCGCGCG
TACTGCGACG AGCAGCTGAA GAAGTACCTG GGCGTGGAGA AGGTCATCTG GGTCAAGGAC
GGCATCGACC CCGAGGAGAC GAACGGCCAC ATCGACGATG TCGCCCAGAT CGTCGCTCCC
GGCAAGGTGC TGTGCATCTG GTCCGACGAC CCGGACTACC CGTTCTACAA CGAGTGTCAT
GCCGCTTACG AGACGCTGTC CAACGCCGTG GACGCCAAGG GCCGCAAGCT CGAGGTGACC
AAGCTCTGCA TGCCCGTGAA GCCGCTGTAC ATGGACCAGG CGTCCTGCGA CTCCATCGAC
ACCGAGGAGT ACGCCGAGCC GCGCGTGGCC GATGAGCCGC TGATCGCGTC GTACATGAAC
TTCCTCATCG TCAACGGCGG CGTCATCGTG CCGCAGTACG GCGACGAGAA CGACGCGCTG
GCCGTCCAGC AGATCCAGGC TGCGTTCGAC GAGGCGTGGG GCGAGGGCGC GTACAAGGCC
GTGGGCGTGA AGACCGACCA GGTGGTCTTC GGCGGCGGCA ACATCCACTG CATCACCCAG
CAGGAGCCGG CCGGCAAGTA G
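
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks must be stripped before any computation. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its own 66% figure was derived, so this is only the usual definition: G plus C bases divided by total length.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, as printed above.
first_line = "ATGAAGACGA TTCACGAGAG CGTTTCCACC CCGAAGGCCG ACGGCTACCG CATGCCCGGC"
print(len(clean_sequence(first_line)))  # 60
print(gc_percent("ATGC"))               # 50.0
```

Applied to the full 1161 bp sequence, `gc_percent` should land near the reported 66% if the record's value was computed the same way.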
 
Protein sequence
MKTIHESVST PKADGYRMPG EFEPQTRIWM AWPHRTDTWA WGAKPAQKQY ADVARAIAEF 
EPVTMCVNQV DYANAKAVFE DDENVTVVEM TTDDAWVRDT GATWVVNDEG DKRAVHWHFN
AYGGFENGLY FPWDKDEQIA LKMAEMSGCR RYRPESFILE GGSIHVDGEG TVITTDMCLL
DPGRNASVTD YEPWSEELRA YCDEQLKKYL GVEKVIWVKD GIDPEETNGH IDDVAQIVAP
GKVLCIWSDD PDYPFYNECH AAYETLSNAV DAKGRKLEVT KLCMPVKPLY MDQASCDSID
TEEYAEPRVA DEPLIASYMN FLIVNGGVIV PQYGDENDAL AVQQIQAAFD EAWGEGAYKA
VGVKTDQVVF GGGNIHCITQ QEPAGK
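
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that NCBI translation table 11 assigns the same amino acids to the 64 codons as the standard code (the two differ only in the permitted start codons). The function name `translate` is illustrative, and it raises `KeyError` on any base other than A, C, G or T.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Table 11 uses the same assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAAGACGA TTCACGAGAG CGTTTCCACC"))  # MKTIHESVST
print(translate("GCCGGCAAGTAG"))                      # AGK
```

Both fragments agree with the corresponding ends of the protein sequence shown above.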