Gene Elen_1725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1725 
Symbol 
ID8416024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2031565 
End bp2033292 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content69% 
IMG OID645024691 
Productoligopeptide transporter, OPT superfamily 
Protein accessionYP_003182079 
Protein GI257791473 
COG category[S] Function unknown 
COG ID[COG1297] Predicted membrane protein 
TIGRFAM ID[TIGR00728] oligopeptide transporters, OPT superfamily
[TIGR00733] putative oligopeptide transporter, OPT family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.116386 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.268809 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCTG TCAAAGGGCA GCTGACGCTG CGCGGCATCG TCATAGGCTG CGTCGGTTGC 
GCCATCATCA CCGCGGCATC GGTGTACACG GCCCTCAAGA TGGGGGCGCT TCCTTGGCCC
ATCGTGTTCG CGGCCATCAT CTCGCTGTTC TTCCTCAAGG CCCTCGGGCA CGGCAAGGCC
AGCCTCAACG AGGCGAACGT CACGCACACC GTGATGAGCG CCGGCGCCAT GGTGGCGGGC
GGCCTGGCCT TCACCATCCC CGGCATCTGG ATGCTGGGCT ACGCCGACGA GGTGGGCTGG
TTCGAGATGC TCCTCGTAGC CGTCTCGGGC GTCATCATGG GCCTCGTCTG CACCGCGCTT
CTGCGCCGCC ACTTCATCGA GGACTCCGAG CTGGAGTACC CCATCGGCGA AGCGGCCGCC
CAGACGCTCA TCGCGGGCGA TTCCGGCGGC AAGACCGGCT GGAAGCTGTT CGGCTCCATG
GGCTTCGCGG GCGCGTTCAC GGCGTTGCGC GATTTCTTCG GCGTGATCCC CGCGATGCTC
TTCGGCAACG CCGCGGTCCC CGGCGTTGCG TTCGGCATCT ACCTGTCGCC GATGCTTCTG
GCCGTGGGCT TCCTCGTGGG CACGGGCGCC GTCGTCGTGT GGTTCGTCGG AGCGCTGCTG
GCGAACTTCG GCATCATCGT GGGCGGCTCG GCCGCGGGAC TCTGGGACGT GGCGTCCGCG
CAGGGCATCG TGTCCAGCCT CGGTATGGGC GTCATGATGG GCGCGGGCCT GGGCGTCATC
TTCAAGAACA TCCTGCCCAA GGCCTGGCGC ATGCTGCGCG ACGCCCGCAG CTCGAACGCG
TTCGGCCTCA CCGCCGCAAG CACCATGGAT GCGCCCGCAA GCGGCGCTAA GGACGGACGG
CTGCGCATCG GCAGCTTCCG CCTCACCGCC GGGCTGGCCG CGCTCGCCGT GGCCGCCGTC
GCGCTCATCG TCTGCTTCGG CCTGCAGCTG GGCCCCGTCC CGGCCGTCAT CGTGGTGCTG
CTCGCCTTCG TCGCCACGGC CATGAGCGCG CAGAGCGTCG GCCAGACGGG CATCGACCCC
ATGGAGATAT TCGGCCTCAT CGTGCTTTTG GCCGTGGCGG CCGTATCCAG CGTGCCGCAG
GTGCAGCTTT TCTTCGTCGC GGGCATCGTG GCCGTGGCGT GCGGGCTGGC CGGCGACGTG
ATGAACGACT TCCACGCCGG CCACGTGCTG GGCACCAGCC CCAAGGCGCA GTGGATCGGC
CAGGCCATCG GCGCCGTGCT GGGCGCCTTG GTGGCCGTCG CCGTCATGGC GATCCTCGTG
AGCGCGTACG GCCCCGAGTC GTTCGGCCCC ACGGCGTCGT TCGTGTCCGC GCAGGCGTCC
GTCGTGGCCA CCATGGTGTC GGGCATCCCC TCGGTGCCGG CGTTCGCCAT CGGGCTTGCG
GCGGGATTCG TCCTCTACCT GCTGAACTTC CCCGCCATGA TGCTGGGGCT GGGCATCTAC
CTGCCCTTCT ACATGTCGCT GACCGCGTTT CTGGGAGCCA TGGCCAAAGT CGCCTACGAC
GCCGTGTGCA AGCTGCGCCG CAAGGGGCTC TCGCCCGAGG CGGCCGCGGA GAAGGAGAAG
GCCCAGGGCG AAACGGGCCT CGTGGTGTCG TCCGGCCTGC TGGGCGGCGA ATCCATCGTA
GGCGTCCTGG TGGCGCTCGC CGCCGTGGCG ACGGGCCTGG GAGCGTAA
 
Protein sequence
MDSVKGQLTL RGIVIGCVGC AIITAASVYT ALKMGALPWP IVFAAIISLF FLKALGHGKA 
SLNEANVTHT VMSAGAMVAG GLAFTIPGIW MLGYADEVGW FEMLLVAVSG VIMGLVCTAL
LRRHFIEDSE LEYPIGEAAA QTLIAGDSGG KTGWKLFGSM GFAGAFTALR DFFGVIPAML
FGNAAVPGVA FGIYLSPMLL AVGFLVGTGA VVVWFVGALL ANFGIIVGGS AAGLWDVASA
QGIVSSLGMG VMMGAGLGVI FKNILPKAWR MLRDARSSNA FGLTAASTMD APASGAKDGR
LRIGSFRLTA GLAALAVAAV ALIVCFGLQL GPVPAVIVVL LAFVATAMSA QSVGQTGIDP
MEIFGLIVLL AVAAVSSVPQ VQLFFVAGIV AVACGLAGDV MNDFHAGHVL GTSPKAQWIG
QAIGAVLGAL VAVAVMAILV SAYGPESFGP TASFVSAQAS VVATMVSGIP SVPAFAIGLA
AGFVLYLLNF PAMMLGLGIY LPFYMSLTAF LGAMAKVAYD AVCKLRRKGL SPEAAAEKEK
AQGETGLVVS SGLLGGESIV GVLVALAAVA TGLGA