Gene Elen_0233 details

Gene Information

Locus tag: Elen_0233
Symbol:
ID: 8414517
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 320527
End bp: 321885
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 70%
IMG OID: 645023211
Product: amidohydrolase
Protein accession: YP_003180614
Protein GI: 257790008
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3454] Metal-dependent hydrolase involved in phosphonate metabolism
TIGRFAM ID: [TIGR02318] phosphonate metabolism protein PhnM
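
The start and end coordinates, the gene length, and the protein length listed above are consistent with one another, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not say so, but the listed length matches under that reading) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(320527, 321885)
print(length, protein_length(length))  # 1359 452
```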


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACACTG ATTCTTGCAT CATCCGCGGC GGCACGGTGG TGTGCGCCGA CCGCGTGCTT 
CCCGACTGCG ACGTCGTGGT CATCGACGGG CGCATCGCCG CCATCGAGCC GGTGGGCGCG
TCCGACTTCG ACGCGCAGCC CGATGCCACG ATGGGCGTGC TGCCCGTGGT GGACGCGCGC
GGCGCGTACG TGGCGCCCGG CCTCATCGAC ATCCACTCGG ACTACGTGGA GAACGTGGCC
TCGCCGCGCC CCAGCGTGGT CATGGACCTG TCCACGTCGC TGTACAAGGC CGACCGCGAG
CTGGTGTCGC ACGGCGTGAC CACCATCTTC CACTCGCTGT CGGTGTACGG CGCGCACGTG
TTCGACCACA AGCCCATCCG CGATTTCGGC AACGTGAGCG CCCTCATCGA CCGCGTGGCC
GCCCTGCGCG CGGGCGAGGA GCGCGACCAC CTCATCCGCC ACCGCCTGCA CATGCGCGTG
GAGCTGGACT CGGTGGATTT GTACGACGAC ATCGAGAGCT TCCTGCGCTC GGGCAAGGTG
GACCTCGTGT CGTTCATGGA CCACACGCCG GGGCAGGGCC AGTACCGCGA CCTGCTGGTG
TTCGGCGACA CGCTGAAGGG CTACCGCGAC GTCAGCGACG AGGACGTGCG CGACATCGTG
CGTCAGCAGC AGGAGAGCCA GAAGCTCACG TACGCCCAGA TCACAGCACT GGCGGCCGTG
GCGCGCGAGC GCGGCGTGTC CATCGCCTCG CACGACGACG ACAGCGAGGA CAAGCTGGCG
TTCATGGACG GCCTCGAGGC CACTATCTCC GAGTTCCCCA TCTCGCTGGA CATCGCGCGG
GCGGCGCGGG CGCGCGGCAT GCACACCATC GCAGGCGCGC CGAACGTGAT GTTGGGCCAC
AGCCACTCGG GCAACCTCAG CGCGCGCGAG GCCGTGCAGG CCGGCGCCAT CGACGTGCTG
TGCAGCGACT ACTACCCGGC GGCGCTGCTG GACGCGGTGT TCACGCTGCG CGATCAGTGC
GGGCTCGACA TCGCGAAAGC GTTCGCGCTG GTCACTATCA ACCCGGCGAA GGCCGCGGGC
ATCGCCGACG AGGTGGGCTC CATCGCGGTG GGCAAGCGCG CCGACGTGCT GCTGGTGCGC
GAGATCTCCT GCGGCGAAGG CGAAGGCTCG GGCGAGCACC CGGGCGCAAG GCCGGACGGT
CGCGTCGCGC GCACGATGCC CGTGGTCACG CGCGCGTTCG TGGGCGGCCG CTCGGTGTTC
CGCTCGCACT ATCCCGACCA GCCGCTCGGC TACGGGCGCG ACACCGAGCA GCTCGTCTCG
CTCGACCAGC TGACCCGCCC TCTGGCCAAG GCGGTGTAG
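
The GC content reported in the gene information can be recomputed from a listing in this layout. The sketch below is one way to do it in Python, assuming that only the characters A, C, G, and T count as bases, so the spaces and line breaks that group the sequence into blocks of ten are ignored. The name `gc_fraction` is hypothetical. The example call uses only the first 60-base row, so its result need not equal the whole-gene figure.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the unambiguous bases A, C, G, T."""
    # Keep only nucleotide letters; this drops spaces and newlines.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


# First 60-base row of the gene sequence, copied from the listing above.
first_row = "ATGGACACTG ATTCTTGCAT CATCCGCGGC GGCACGGTGG TGTGCGCCGA CCGCGTGCTT"
print(f"{gc_fraction(first_row):.1%}")
```

Passing the full 1359-base listing as a single string gives the whole-gene value.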
 
Protein sequence
MDTDSCIIRG GTVVCADRVL PDCDVVVIDG RIAAIEPVGA SDFDAQPDAT MGVLPVVDAR 
GAYVAPGLID IHSDYVENVA SPRPSVVMDL STSLYKADRE LVSHGVTTIF HSLSVYGAHV
FDHKPIRDFG NVSALIDRVA ALRAGEERDH LIRHRLHMRV ELDSVDLYDD IESFLRSGKV
DLVSFMDHTP GQGQYRDLLV FGDTLKGYRD VSDEDVRDIV RQQQESQKLT YAQITALAAV
ARERGVSIAS HDDDSEDKLA FMDGLEATIS EFPISLDIAR AARARGMHTI AGAPNVMLGH
SHSGNLSARE AVQAGAIDVL CSDYYPAALL DAVFTLRDQC GLDIAKAFAL VTINPAKAAG
IADEVGSIAV GKRADVLLVR EISCGEGEGS GEHPGARPDG RVARTMPVVT RAFVGGRSVF
RSHYPDQPLG YGRDTEQLVS LDQLTRPLAK AV
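
The protein sequence follows from the gene sequence through translation table 11. The Python sketch below is a hedged illustration of that step, not the database's own pipeline. It builds the codon table from the usual compact ordering (bases in T, C, A, G order), strips the whitespace used in the listing, and stops at the first stop codon. Table 11 assigns the same amino acids as the standard code and differs in which codons may initiate translation. Since this gene starts with ATG, the sketch does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGGACACTG ATTCTTGCAT CATCCGCGGC"))  # MDTDSCIIRG
print(translate("GCGGTGTAG"))  # AV
```

The two printed fragments match the first ten and last two residues of the protein sequence listed above.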