Gene Elen_2192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2192 
Symbol 
ID8416514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2574823 
End bp2575944 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content65% 
IMG OID645025178 
Productagmatine deiminase 
Protein accessionYP_003182543 
Protein GI257791937 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2957] Peptidylarginine deiminase and related enzymes 
TIGRFAM ID[TIGR03380] agmatine deiminase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCA TCACCGACAG CACGCCGAAG CAAGACGGAT ACCGCATGCC CGGAGAGTTC 
GAGCCGCGCA AGCAGGACTT CCTCATCTGG CCCGAGCGTC AGGACACGTG GCGCAACGGC
GGCAAGCCCG CCCAGGCCGT CCTCGTGGAG GTGGCGAAGG AGATCATCAA GCACGAGCCG
CTGACGGTGT TCTGCTCGGC CGACCAGTAC GAGAACGCCC GCACCCGCCT GCCCGAGGGC
GTGCGCGTGG TTGAGATGAC CATCGACGAC GCGTGGGCGC AGGACAAAGG CCCGTTCTAC
GTGGTGAACG ACAAGGGCGA CATGCGCGGC GTGACCTGGG GATGGAACGC GTACGGCGGG
CTCGAGGGCG GCCTGTACTT CCCGTGGAAG CGCGACCAGG AATTCGCCAC GAAGCTCTTG
GACCTCGAGA ACTACGACGC GTACGACGCC ACGAAGATGG TGTTCGAGGG CGGCGCCATG
CAGATCGACG GCGAGGGGAC GCTTATCGTG ACGGAGAACA GCGTGCTGAA CCACAACCGC
AACCCGCACC TCACCAAGGA AGAGGCGGAG TGGTACTTCA AGGAGTACAT GGGCTTGGAG
AAGGTCATCT GGCTGAAGGA CGGCATGGCC TTCGACGAGA CTGACGGCCA CATCGACGAC
GTGTGCTTCT TCGTCCGTCC CGGCGTGCTG GCGCTTTCGT GGACCGACGA CGAGGACAAC
CCGCAGTACC CGAACCTCAA GGCCGCCTAC GACGTGCTGT CCGAGGCGAC CGACGCGAAG
GGCCGCACGT TCGAGATCCA CAAGATCCCG ATCCCCGGCA TCATCCGTAT CTCCGAGGAG
GAGAGCGCCG GCGTGGATCT CTGCAAGGAC GCCGCGTCCC GCGAGGCCGA CCTGCCGTTG
GCCATCACGT ACATCAACAG CTACTTCGTG AACGGCGGCC TGCTGGTTCC CCAGTACGGC
GATCCGATGG ACCAGGTGGC GTGCGATATG TTCGCCGAGC TCATGCCCGA CCGCGAGATC
ATCAAGATCT ACACCCGCGA GTGGTCGCTG TGCGGCGGCA ACATCCACTG CATGGCCCTG
CAGCAGCCCG ACCCGGCCGC CATCGCCGCG AAGCTGGGCT AG
 
Protein sequence
MKIITDSTPK QDGYRMPGEF EPRKQDFLIW PERQDTWRNG GKPAQAVLVE VAKEIIKHEP 
LTVFCSADQY ENARTRLPEG VRVVEMTIDD AWAQDKGPFY VVNDKGDMRG VTWGWNAYGG
LEGGLYFPWK RDQEFATKLL DLENYDAYDA TKMVFEGGAM QIDGEGTLIV TENSVLNHNR
NPHLTKEEAE WYFKEYMGLE KVIWLKDGMA FDETDGHIDD VCFFVRPGVL ALSWTDDEDN
PQYPNLKAAY DVLSEATDAK GRTFEIHKIP IPGIIRISEE ESAGVDLCKD AASREADLPL
AITYINSYFV NGGLLVPQYG DPMDQVACDM FAELMPDREI IKIYTREWSL CGGNIHCMAL
QQPDPAAIAA KLG