Gene Elen_2203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2203 
Symbol 
ID8416525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2587356 
End bp2589005 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content64% 
IMG OID645025189 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003182554 
Protein GI257791948 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAC TCACGCGACG CGATTTCGCG AAACTGACGG GTGCGACGGC GGCGACGCTG 
TCGTTGGGCG GGCTGCTGGC AAGCTGCGCG AGCGGCGAGG CCGAGAAGCC GGCCGAAGGC
GCGACGGAAG GTGCGGCCGA CAAGGCCTCT TCGCAGGTTA TCGTCTCGAT GACCACCGGA
TCCGAGCCGG CCGCCGGCTT CGACCCGATG GTGTCGTGGG GCTGCGGCGA GCACGTTCAC
GAGCCGCTGA TCCAGTCCAC GCTGATCACC ACCGATGCGG ACCTCAACTT CAAGAACGAC
CTCGCCACGT CCTACGAAGC GTCCGAAGAC GGCATGACCT GGACGTTCAC CGTCCGCGAC
GACGTGAAGT TCACCGACGG CACCCCGCTC ACAGCGCGCG ACGTGGCCTT CACCATCAAC
GGCATCTTGA ACTCGGAAGC ATCCGAGTGC GACATGTCCA TGGTGAAAGA GGCCGTGGCC
ACCGACGACG CCACCGTCGT CGTGCACATG GAGAAGCCGT TCAACGCGCT GCTGTACACG
CTGGCCGTGG TGGGCATCGT GCCCGAGCAC GCCTACGGCG ACACGTACGG CGACAACCCC
ATCGGCTCGG GGCGCTACAT GCTGGAGCAG TGGGACAAAG GCCAGCAGGT CATCCTCAAG
GCGAACCCCG ACTACTACGG CGAGGCGCCG AACATCCAGC GCGTCGTGGT AGTGTTCATG
GAAGAGGACG CCTCGCTTGC GGCGGCGAAG TCCGGACAGG TCGACGTTGC ATACACCTCG
GCGACGTTCG CGGCTCAGCA GCCGAGCGGC TACGACTTGC TGAACTGCGC GTCGGTCGAC
TCTCGCGGCA TCTCGCTGCC GGTGATTCCG GCGGGCGCCA TGAAAACCGA CGAGAAGGGC
GAAGCGGCGG CCGGCAACGA TGTCACGTGC GACCTGGCCA TTCGCCAAGC CATCAACTAC
GGCGTCGACC GCGACAAGAT GATCGACAAC GTGCTGAACG GCTACGGCAC CGTGGCCTAC
AGCGTGGGTG ACGGCATGCC GTGGTCCTCG CCCGACATGA AGTGCTCCAC CGATGTCGAG
AAGGCGAAGA AGCTGCTCGA CGACGGCGGC TGGACGGCCG GTGCGGACGG CATCCGCGAG
AAGGACGGCA CGCGCGCTGC GTTCAACCTG TACTACTCGG CCGGCGACAC CGTGCGCCAA
GGTATCGCCG AGGAGTTCAC CAACCAGATG AAAGAGCTGG GCATCGAAGT ATCCATCAAG
GGCGCCAGCT GGGACGATCT GTACCCGCAT CAGTTTACCG ATCCGGTGGT GTGGGGCTGG
GGCACGAACG CGCCCACCGA GATTTACAAC CTGTTCTACT CCAAGGGCAC GGGCAACTAC
GCCTGCTACA CGAGCGAAAC CACCGACAAG TACCTCGACG AGGCGCTGGC CCAGCCTACT
GTGGAAGAGT CGTTCGATCT GTGGAAGAAG GCTCAGTGGG ACGGCCAGTC CGGCATCGCG
CCGCAGGGGG ACGCGCCGTG GGTGTGGTTC GCGAACATCG ACCACTTGTA CTTTGCGAAG
GACAACCTCA AGATCGCGAA GCAGAAGCCT CATCCGCACG GACACGGCTG GTCGCTGGTG
AATAACGTCG ACCAGTGGTC CTGGGCGTAA
 
Protein sequence
MATLTRRDFA KLTGATAATL SLGGLLASCA SGEAEKPAEG ATEGAADKAS SQVIVSMTTG 
SEPAAGFDPM VSWGCGEHVH EPLIQSTLIT TDADLNFKND LATSYEASED GMTWTFTVRD
DVKFTDGTPL TARDVAFTIN GILNSEASEC DMSMVKEAVA TDDATVVVHM EKPFNALLYT
LAVVGIVPEH AYGDTYGDNP IGSGRYMLEQ WDKGQQVILK ANPDYYGEAP NIQRVVVVFM
EEDASLAAAK SGQVDVAYTS ATFAAQQPSG YDLLNCASVD SRGISLPVIP AGAMKTDEKG
EAAAGNDVTC DLAIRQAINY GVDRDKMIDN VLNGYGTVAY SVGDGMPWSS PDMKCSTDVE
KAKKLLDDGG WTAGADGIRE KDGTRAAFNL YYSAGDTVRQ GIAEEFTNQM KELGIEVSIK
GASWDDLYPH QFTDPVVWGW GTNAPTEIYN LFYSKGTGNY ACYTSETTDK YLDEALAQPT
VEESFDLWKK AQWDGQSGIA PQGDAPWVWF ANIDHLYFAK DNLKIAKQKP HPHGHGWSLV
NNVDQWSWA