Gene Elen_2226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2226 
Symbol 
ID8416548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2613352 
End bp2614653 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content67% 
IMG OID645025211 
Productamidohydrolase 
Protein accessionYP_003182576 
Protein GI257791970 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.306812 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTTCG CTGATATCGA CCTGTTGGAT GAGAACCTTG ATTTCCGCTC CCATTGCTGG 
GTGGGCGTGC GCGACGGGCG CGTCGCCTAC GTGGGGGATG CGGCTCCGGC GGGCGAGGAA
GCGGCGAGGT ACGGAGAGGT GTACGACGGA CGTGGCAAGC TCTTGTGTCC TGCGTTCTAC
AACGCCCACG CGCACGCGCC CATGACGCTG TTGCGCGGCT ACGCCGAGAA CCTGCCTTTG
CAGGCGTGGC TGAACGACAT GGTGTTTCCG TTCGAGGCGA AGATCACGCC CGAAGACTGC
TACTGGGGCA CGCTTCTGTC CTGCGCCGAG ATGGCGCGGT ACGGCTGCGT GAGCTTCTCG
GACATGTACT ATCACATGGA GGAGGGCGCC CGCGCAGCGC TCGACGCCGG CATCAAGATG
AACCTGTCCG ACTCGCTTCT TGCCTTCAAC GGCGAAGGGT TGGACGACCT GCCGGTGAAG
GGGAACCTCG ACCGTCTCAT CCGCGACCTC CAGGGCGCAG GCGATGGCCG CATCGTGGTG
GACTGCAACA TCCATGCCGA GTACACGTCG AACCCGCGCG CCGTGGCCGA TTTGGCGGCG
TACGCGAAGG AGCACGGGCT TCGGTTGCAG GTGCACGTCT CCGAGACGCG CCTCGAGCAC
GAGGAGTGCA AGCAGCGCCA CGACGGTTTG ACGCCGGTGC GCTACTTCGA GAGCCTGGGC
GTGCTCGACG TGCCCGTGAC GGCGGCGCAC TGCGTGTGGG TGGACGACGG CGACATCGAC
GTGCTGGCGG AGCGCGGGGT GTTCGTGGCG GCGAACCCGG CGTCGAACAT GAAGCTGGGC
AGCGGTTTCG CCCCTGTGGC AAAGATGCTC GCGCGCGGCG TGAACGTGTG CCTGGGCACC
GACGGCATGG CGTCGAACAA CAACCACGAC ATGATGCAGG ATATGCACCT GCTGGCGCTG
ACGGCGAAGG GATCGACGAA CGATCCGGCC GTGGTCACGC CGAAGCAGGC GCTTACGGCC
GCTACGCGCG TGGGCGCGCT TTCGCAGGGG CGCGACGACT GCGGGTACGT GGCCGTGGGG
GCGAAGGCTG ACTTGTGCGT GCTGGACACG TCGGGGCCGT CGTGGGCGCC GATGACGAAC
CCGCTGGTGA ACGTCGTGTA CGCGGGGCAT GGCGCCGACG TGTGCCTGAC GATGTGCGAC
GGGGTCGTGG TGTATCGCGA GGGCGAGTGG CCCACGCTGG ACATCGAGCG AGCGAAGGCC
GAGGTCGAGG CCCGCACGAA GCGCATCATC GGCGAGCTGT AG
 
Protein sequence
MLFADIDLLD ENLDFRSHCW VGVRDGRVAY VGDAAPAGEE AARYGEVYDG RGKLLCPAFY 
NAHAHAPMTL LRGYAENLPL QAWLNDMVFP FEAKITPEDC YWGTLLSCAE MARYGCVSFS
DMYYHMEEGA RAALDAGIKM NLSDSLLAFN GEGLDDLPVK GNLDRLIRDL QGAGDGRIVV
DCNIHAEYTS NPRAVADLAA YAKEHGLRLQ VHVSETRLEH EECKQRHDGL TPVRYFESLG
VLDVPVTAAH CVWVDDGDID VLAERGVFVA ANPASNMKLG SGFAPVAKML ARGVNVCLGT
DGMASNNNHD MMQDMHLLAL TAKGSTNDPA VVTPKQALTA ATRVGALSQG RDDCGYVAVG
AKADLCVLDT SGPSWAPMTN PLVNVVYAGH GADVCLTMCD GVVVYREGEW PTLDIERAKA
EVEARTKRII GEL