Gene Elen_2834 details

Gene Information

Locus tag: Elen_2834
Symbol:
ID: 8417165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3289819
End bp: 3291174
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 63%
IMG OID: 645025814
Product: amidohydrolase
Protein accession: YP_003183170
Protein GI: 257792564
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
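
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state this), and the variable names are made up for the example.

```python
# Values copied from the record above.
start_bp = 3289819
end_bp = 3291174
gene_length_bp = 1356
protein_length_aa = 451

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons. The last codon is the stop
# codon, which is not counted in the protein length.
codons, remainder = divmod(span, 3)
coding_codons = codons - 1

print(span, codons, remainder, coding_codons)  # 1356 452 0 451
```

Under that assumption the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.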


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTTTGT GCGCACAATA CATTCTTCCC ATCACGTCCG AGCCGTTTCA GAAGGGCGCG 
GTGCTTGTCC GCGACAACGT CATCCGCGAC ATCGGCACGG CCGAGATGCT CAAGCTGCGC
TATCCCGACG AAGAAGTGGT CGATTTCGGC CAGGCGGCTA TCATGCCGGG CCTCGTCGAC
CTGCACACGC ACCTCGAGAA CTCCGTGATG CGCGGTATCG TGCACGATGT GCCCTATACC
ACGTGGGTCA CGTCCATGTT GGAGAAAAGC GCAAAGATGG ACGTGAGCGA CTGGTACGAC
TCCGCTATCC TCGGGGGTCT TGAAGCGCTG TCCAGCGGCA TTACCTGCGT CGCCGATATC
ACTGCCACCG GCGCCGCATG CACCGCCACG CAGAAGTTGG GCATGCGCAG CGTCATCTAC
CGCGAGGTGG GCGCCATGGA CAAGCGCCGC GTCGATTACG CCATGCGCAT CGCCGAGAAC
GACATCATGC ACTGGCGCGA AGAGGTTGAC GGCGACCGCA TCACCATCGG CGTGGCTCCC
GCGGCTATGT ATGCCTGCCA TCCGTCCATG TTCTCGAAGG TGTCCGAATT CGCTCGGCGC
GAGAACGTGC CCGTCGCCAT GCACGTGGCC GGCAACCGCG AAGAGTACAA CTTCATCAAG
TACGGCTCGT CGCCGTTCTC GGTGCACACG ATGGACCAGA AGCGCGGCTT CGTGGAGATT
CCGCCGTGGC TGCCCACCGG CACGACGCCC GTGCGCTACG CTTTGAACTG GGGCGCGTTC
GAGTCCGACA ACGTGCTGGC CATCCACTGC GTGCACGTGG ACGACAAAGA CGTGCAGAAG
CTGAAAGAGT ACGACGTGGC CGTGGCCGTG TGCCCGCGCT GCAACGCGCA GCTGGGCATG
GGCGTGGCTC CCATCAACGA GTTCATGCGC GCAGGCCTTC GCCTGGGCAT GGGGACCGAT
TCGCCGGCCG CAACCGACTC CACCGACATG CTCACCGAGA TGCGCATCGG CATGCTGGTG
CAGCGCGCGG TGAACGTGGG CGAGTTCCTG GATTCGGCCA CCATGCTGGA GATGGCCACC
ATCGGCGGCG CCCGCGCGCT CAAGCTGGAC GACAAGATCG GCTCCCTGGA AATAGGCAAG
CTGGCCGACA TCATCGCGGT CGACCTGTCC GGCTCGCATC AGACGCCCAC CACCGATCCG
GTTTCGGCCG TGGTCAACAC CTGCAGCGGC GCCGACATCC TCATGACCAT GGTGAACGGC
ACCGCGCTGT ACGAGAAGAA CAAGTGGAAC GTGGGCGTCG AGGTTGCCAG GAACATCGCC
CGCATCATCG AAATCCGCGG TAAGTTGAGG TTGTAA
 
Protein sequence
MLLCAQYILP ITSEPFQKGA VLVRDNVIRD IGTAEMLKLR YPDEEVVDFG QAAIMPGLVD 
LHTHLENSVM RGIVHDVPYT TWVTSMLEKS AKMDVSDWYD SAILGGLEAL SSGITCVADI
TATGAACTAT QKLGMRSVIY REVGAMDKRR VDYAMRIAEN DIMHWREEVD GDRITIGVAP
AAMYACHPSM FSKVSEFARR ENVPVAMHVA GNREEYNFIK YGSSPFSVHT MDQKRGFVEI
PPWLPTGTTP VRYALNWGAF ESDNVLAIHC VHVDDKDVQK LKEYDVAVAV CPRCNAQLGM
GVAPINEFMR AGLRLGMGTD SPAATDSTDM LTEMRIGMLV QRAVNVGEFL DSATMLEMAT
IGGARALKLD DKIGSLEIGK LADIIAVDLS GSHQTPTTDP VSAVVNTCSG ADILMTMVNG
TALYEKNKWN VGVEVARNIA RIIEIRGKLR L
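
The gene and protein sequences above are related through translation table 11. As a minimal sketch of that relationship, the Python below strips the spaces from the 10-base blocks and translates codon by codon until the first stop. The helper names (clean, translate, gc_fraction) are hypothetical and are not part of any database tool. The amino acid string is the standard codon assignment, which table 11 shares for internal codons; the alternative start codons that table 11 allows are not handled here, which does not matter for this gene because it starts with ATG. The GC helper is included only to show how a value such as the listed GC content could be computed; the whole-gene figure is not recomputed here.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCTTTTGT GCGCACAATA CATTCTTCCC ATCACGTCCG AGCCGTTTCA GAAGGGCGCG"
last_line = "CGCATCATCG AAATCCGCGG TAAGTTGAGG TTGTAA"

print(translate(first_line))  # MLLCAQYILPITSEPFQKGA
print(translate(last_line))   # RIIEIRGKLRL
```

Both outputs agree with the start and end of the protein sequence listed in the record. The last line can be translated on its own because every preceding line holds 60 bases, so it begins on a codon boundary.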