Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2203 |
Symbol | |
ID | 8416525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2587356 |
End bp | 2589005 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645025189 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003182554 |
Protein GI | 257791948 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACAC TCACGCGACG CGATTTCGCG AAACTGACGG GTGCGACGGC GGCGACGCTG TCGTTGGGCG GGCTGCTGGC AAGCTGCGCG AGCGGCGAGG CCGAGAAGCC GGCCGAAGGC GCGACGGAAG GTGCGGCCGA CAAGGCCTCT TCGCAGGTTA TCGTCTCGAT GACCACCGGA TCCGAGCCGG CCGCCGGCTT CGACCCGATG GTGTCGTGGG GCTGCGGCGA GCACGTTCAC GAGCCGCTGA TCCAGTCCAC GCTGATCACC ACCGATGCGG ACCTCAACTT CAAGAACGAC CTCGCCACGT CCTACGAAGC GTCCGAAGAC GGCATGACCT GGACGTTCAC CGTCCGCGAC GACGTGAAGT TCACCGACGG CACCCCGCTC ACAGCGCGCG ACGTGGCCTT CACCATCAAC GGCATCTTGA ACTCGGAAGC ATCCGAGTGC GACATGTCCA TGGTGAAAGA GGCCGTGGCC ACCGACGACG CCACCGTCGT CGTGCACATG GAGAAGCCGT TCAACGCGCT GCTGTACACG CTGGCCGTGG TGGGCATCGT GCCCGAGCAC GCCTACGGCG ACACGTACGG CGACAACCCC ATCGGCTCGG GGCGCTACAT GCTGGAGCAG TGGGACAAAG GCCAGCAGGT CATCCTCAAG GCGAACCCCG ACTACTACGG CGAGGCGCCG AACATCCAGC GCGTCGTGGT AGTGTTCATG GAAGAGGACG CCTCGCTTGC GGCGGCGAAG TCCGGACAGG TCGACGTTGC ATACACCTCG GCGACGTTCG CGGCTCAGCA GCCGAGCGGC TACGACTTGC TGAACTGCGC GTCGGTCGAC TCTCGCGGCA TCTCGCTGCC GGTGATTCCG GCGGGCGCCA TGAAAACCGA CGAGAAGGGC GAAGCGGCGG CCGGCAACGA TGTCACGTGC GACCTGGCCA TTCGCCAAGC CATCAACTAC GGCGTCGACC GCGACAAGAT GATCGACAAC GTGCTGAACG GCTACGGCAC CGTGGCCTAC AGCGTGGGTG ACGGCATGCC GTGGTCCTCG CCCGACATGA AGTGCTCCAC CGATGTCGAG AAGGCGAAGA AGCTGCTCGA CGACGGCGGC TGGACGGCCG GTGCGGACGG CATCCGCGAG AAGGACGGCA CGCGCGCTGC GTTCAACCTG TACTACTCGG CCGGCGACAC CGTGCGCCAA GGTATCGCCG AGGAGTTCAC CAACCAGATG AAAGAGCTGG GCATCGAAGT ATCCATCAAG GGCGCCAGCT GGGACGATCT GTACCCGCAT CAGTTTACCG ATCCGGTGGT GTGGGGCTGG GGCACGAACG CGCCCACCGA GATTTACAAC CTGTTCTACT CCAAGGGCAC GGGCAACTAC GCCTGCTACA CGAGCGAAAC CACCGACAAG TACCTCGACG AGGCGCTGGC CCAGCCTACT GTGGAAGAGT CGTTCGATCT GTGGAAGAAG GCTCAGTGGG ACGGCCAGTC CGGCATCGCG CCGCAGGGGG ACGCGCCGTG GGTGTGGTTC GCGAACATCG ACCACTTGTA CTTTGCGAAG GACAACCTCA AGATCGCGAA GCAGAAGCCT CATCCGCACG GACACGGCTG GTCGCTGGTG AATAACGTCG ACCAGTGGTC CTGGGCGTAA
|
Protein sequence | MATLTRRDFA KLTGATAATL SLGGLLASCA SGEAEKPAEG ATEGAADKAS SQVIVSMTTG SEPAAGFDPM VSWGCGEHVH EPLIQSTLIT TDADLNFKND LATSYEASED GMTWTFTVRD DVKFTDGTPL TARDVAFTIN GILNSEASEC DMSMVKEAVA TDDATVVVHM EKPFNALLYT LAVVGIVPEH AYGDTYGDNP IGSGRYMLEQ WDKGQQVILK ANPDYYGEAP NIQRVVVVFM EEDASLAAAK SGQVDVAYTS ATFAAQQPSG YDLLNCASVD SRGISLPVIP AGAMKTDEKG EAAAGNDVTC DLAIRQAINY GVDRDKMIDN VLNGYGTVAY SVGDGMPWSS PDMKCSTDVE KAKKLLDDGG WTAGADGIRE KDGTRAAFNL YYSAGDTVRQ GIAEEFTNQM KELGIEVSIK GASWDDLYPH QFTDPVVWGW GTNAPTEIYN LFYSKGTGNY ACYTSETTDK YLDEALAQPT VEESFDLWKK AQWDGQSGIA PQGDAPWVWF ANIDHLYFAK DNLKIAKQKP HPHGHGWSLV NNVDQWSWA
|
| |