Gene Elen_2553 details


Gene Information

Locus tag: Elen_2553
Symbol:
ID: 8416877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2984890
End bp: 2986047
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 61%
IMG OID: 645025534
Product: aminodeoxychorismate lyase
Protein accession: YP_003182897
Protein GI: 257792291
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family
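
The coordinate and length fields in this record are related by simple arithmetic. Assuming 1-based, inclusive coordinates (an assumption, though the listed values are consistent with it), the gene length is End - Start + 1, and a CDS that ends in a stop codon encodes length / 3 - 1 residues. A minimal sketch of that check follows; the function names are hypothetical and are not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length = gene_length(2984890, 2986047)
print(length)                  # 1158
print(protein_length(length))  # 385
```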


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCAGC ACAGGAAGCA AGTCACCTAT TCTCAGCGTC CGAACCATGC AGCTCGCTCG 
GCTCATGCCC GGGGCGAGCG CCAGTTCCGT ACGTACGATA CCAGCTATAT CCGCCCGAAG
AAAAGCAAGG CTCCTGCTAT AGTCGCCGCC GTTTTGGCCG TTCTTGTCGT CGGAGGTTTG
GCGTGGGGCG CGCTCACCCT GTTCAACAGC TGTTCCGCGC AATCGGTCGA GCTTCTGGCC
GAGGGTCAGG AGGCCACGAT CACGGTGGCC GAAGGTGCTG GTGCCAAGGT CGTCGGAGAG
CAGCTTGCGG AAGCCCGTCT GGTTTCCAAT GCGGGAGACT TCACGAAGCG CGTCAACGAG
ATGGGCGTTG ATTCCCAGCT CAAGCCCGGT ACCTACACAT TCGCGGGCGG TATGTCGCTC
GACGCCATCA TCAACCAGCT GACGGCCGGT CCGGTGGCGA ACGCGCTCAC CATCCCCGAA
GGAAGCACGC TCGAGGCCGT TGCCCAGAGC GTGGCAACCT TCACCGAGAA TCGCATCACG
GCGGACGCGT TCACGGCCGC TGCGTCGGAT GCCAGCTCAT ACGCGGCCGA CTACGACTTC
CTGGCCGACG CGGGCACGAA CAGCCTGGAA GGCTTCCTGT TCCCGAAAAC GTACGAGATC
GGCGACGATG CCACGGCCGA GTCGGTAGTG CGCATGATGC TCGACCAGTT CAAGACCGAG
ACGTCGGGGC TCGATTGGTC CTACCCGCAA AGCCAGGGCC TCACCATCTA CGATGCCGTG
AAGCTGGCTT CCATCGTTGA GCGCGAGTCG TCGGGCGACG AGCAGATCCG CGCCCAGGTG
GCCTCGGTGT TCTACAACCG CCTGAACAAC TTCGGCGATC CGAACTACGG CTTCCTGCAA
AGCGATGCGA CCACGGCTTA CGAGCTGGGT CACGACCCCA CCCCCGAGGA TATCAAGAAT
CCAACACCGT TCAACACCTA CACGAACACG GGTCTGCCTC CCACGCCCAT CTGCTCGCCG
GGTCTCGATT GCCTGCAAGC CGTGTGCAAC CCTGCGCAGA CGAACTACTT CTTCTTCTAC
TTCGCGCCTG ATGAAAGCGG TACGATGCAG TACTACTTCA GCGAAACGTA CGAAGAGCAT
CAGCAGACGT TCTCCTAG
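
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before the length or GC content can be checked. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence (6 blocks of 10 bases).
first_line = "ATGCCCCAGC ACAGGAAGCA AGTCACCTAT TCTCAGCGTC CGAACCATGC AGCTCGCTCG "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applying the same two functions to all lines of the gene sequence, joined together, should give a length and GC percentage comparable to the Gene Length and GC content fields.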
 
Protein sequence
MPQHRKQVTY SQRPNHAARS AHARGERQFR TYDTSYIRPK KSKAPAIVAA VLAVLVVGGL 
AWGALTLFNS CSAQSVELLA EGQEATITVA EGAGAKVVGE QLAEARLVSN AGDFTKRVNE
MGVDSQLKPG TYTFAGGMSL DAIINQLTAG PVANALTIPE GSTLEAVAQS VATFTENRIT
ADAFTAAASD ASSYAADYDF LADAGTNSLE GFLFPKTYEI GDDATAESVV RMMLDQFKTE
TSGLDWSYPQ SQGLTIYDAV KLASIVERES SGDEQIRAQV ASVFYNRLNN FGDPNYGFLQ
SDATTAYELG HDPTPEDIKN PTPFNTYTNT GLPPTPICSP GLDCLQAVCN PAQTNYFFFY
FAPDESGTMQ YYFSETYEEH QQTFS
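
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free translator rather than the method the database used. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares, and it ignores the table's alternative start codons because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGCCCCAGC ACAGGAAGCA AGTCACCTAT"))  # MPQHRKQVTY
print(translate("CAGCAGACGT TCTCCTAG"))               # QQTFS
```

The two outputs match the first block and the last five residues of the listed protein sequence.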