Gene Elen_2576 details


Gene Information

Locus tag: Elen_2576
Symbol:
ID: 8416901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3013127
End bp: 3014971
Gene Length: 1845 bp
Protein Length: 614 aa
Translation table: 11
GC content: 61%
IMG OID: 645025556
Product: extracellular solute-binding protein family 1
Protein accession: YP_003182918
Protein GI: 257792312
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
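
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3013127, 3014971)
print(length, protein_length_aa(length))
```

With the values from this record, the two functions give 1845 bp and 614 aa, matching the Gene Length and Protein Length fields.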


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
GTGAAAGCGT TGAAGAGAGC CGGTAAGGCG GCGAAGACGT TCGCGTCGAT CGTGCTAGCG 
GGGTCGCTGG TCGGCATCGG ACTGGTCGGC TGTTCCGGCG AGGCTATGGG CAATGCGTCC
GGACGCGACG ATGAGATAGT GCTGACGGTA TCGAAATGCC AGACCAAGGA GCTGCCGCAG
GAGCTGTTGG ACGAGATGAC CAATCGCCAC CCCAACCTGC GCTTCGAGTT CGATACGTAT
TCGAACAGCA ACTATTCCGC GCAGATCGTC ACCGAGCTCG AGCAGCGCGA TATCCCCGAT
ATCCTGATCA ACACGCGCAA TCAGGATCTG ACCGAGGATT TGGAGCATAA TCTGGTGGAT
CTCGCGGCGT ACGATTTTTC CAGCGAGTAC CTTCCCAGCG TTCTCGATCG CATGACCATC
GACGGCAGCC TCTACTATCT GCCGGGTTAC CTGACGCTTG CAGGGTTCTT CTACAACAAG
GACCTTTTCG CCGAGCACGG CTGGGAGGCG CCTCAGTCGC TCGAGGAACT CATTGCCCTC
AACGAGCAGG CGAAGGCGGA GGGCATCCGG CTTATGGCGT ACTCGATGGA GCTGACGGGT
CAACGCTTCC TGCAGCTGAC CAATATCGCT TCGGCGCAGT TTCTGCACAC GCCGCAGGGG
TCGTCTTGGG AACAGGATTA TCTTGCGGGC GAAGCGAGCA TGGTGGGCAC GTTCGGGCCT
TTCATGGACG AGTATCGGCT TTGGCTCGAC AGCGGATTGA TTTCGGCTGA CGACCTTTCC
CTGTCGAATT CGGACGCCGC GGAGATGTTC GCGAACGGTG ACGTGGCCAT GATCTACGGC
GTTGCCAACA ACGTGAAGAC CACTGATTTC GACTTCGATT TGGGCCAAGC TCCGTTCCTT
GCGAGAGGCG AGGGCGAAGA TAACGGCTGG TACCTGTATG CGGTCAGCTC GTACTACGGT
ATTAACAAGA AGCTGGAAGA GCCGGGCAAC GAGGAGAAGC TGGCCATAGC GCTGGAGATG
TTCGACCTGA TGAGCACCCC CGAGGGCCAG AGCATGTTCA CGGATGGCGC GGAAGGGCGA
TATCCGGCCA CGAGAAAGGC GGACGGCGAG CTCCACGCGC CGCTTTTGAG CGATTACCGC
AACGTGGTCG ACCGCAATAA CCTTGTGGAG TTGGCGGCTT ACACTGCGCC GCTCTTGCTG
GGAGGAGAGG CGCTGGGCGG GTACATCGCT GGCACCGTCA GCGCCGAGGA AGCGTTGCAG
GCCTGCGACG AGGCTATGAA GTCCAACAAG TCGGAAACCC AGATCGGCGA CGTAGTGGCA
CATATCGAAC GCGACCTGAG CCGGGAGGAG ACCGTCCGAT ACTTCGCCGA CGCGTTCAGG
GAGTACGGGG GCACCGACCT GTGCCTTATG CTGCCTGGCG GCATGGCAGA CGGCCAAATG
CATCCTTATG GGATTTCCGG CAAGCTGTAC GAAGGGGAGC TGCACGCCAA TGAGCTGACC
GTGCTCTTGC CCAATGCGGG GAAGCCGGTG CCTACGCTGG CCACTGCGCG CATTTCGGGC
GAAGACCTGC GCGCGGTGCT AGAGAGCGGG CGCACGTTCG AGCGGAAGGA CGCGTCGAAG
GAGGCCCTTG CTCCGTTCCG CTACGAGGTG TCGGGAGCCG AGGTGGACTA TGATGCAGAC
CGCAAAGTGC GATCGCTCAA AGTGAACGGC GTTGAAGTCG CCGACGAAGA CGTATTCACG
GTGACGTATT TCGACGGGGC GGTCGAAACG TCCCGCTTGA CCGATGCGGC GGTGTCGGAC
GTGAAGCCCG TCCCCGCGTT CACTGCTTGC AAGGCCGCGC GTTGA
 
Protein sequence
MKALKRAGKA AKTFASIVLA GSLVGIGLVG CSGEAMGNAS GRDDEIVLTV SKCQTKELPQ 
ELLDEMTNRH PNLRFEFDTY SNSNYSAQIV TELEQRDIPD ILINTRNQDL TEDLEHNLVD
LAAYDFSSEY LPSVLDRMTI DGSLYYLPGY LTLAGFFYNK DLFAEHGWEA PQSLEELIAL
NEQAKAEGIR LMAYSMELTG QRFLQLTNIA SAQFLHTPQG SSWEQDYLAG EASMVGTFGP
FMDEYRLWLD SGLISADDLS LSNSDAAEMF ANGDVAMIYG VANNVKTTDF DFDLGQAPFL
ARGEGEDNGW YLYAVSSYYG INKKLEEPGN EEKLAIALEM FDLMSTPEGQ SMFTDGAEGR
YPATRKADGE LHAPLLSDYR NVVDRNNLVE LAAYTAPLLL GGEALGGYIA GTVSAEEALQ
ACDEAMKSNK SETQIGDVVA HIERDLSREE TVRYFADAFR EYGGTDLCLM LPGGMADGQM
HPYGISGKLY EGELHANELT VLLPNAGKPV PTLATARISG EDLRAVLESG RTFERKDASK
EALAPFRYEV SGAEVDYDAD RKVRSLKVNG VEVADEDVFT VTYFDGAVET SRLTDAAVSD
VKPVPAFTAC KAAR
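
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes NCBI translation table 11, whose codon assignments match the standard code, and it reads an alternative initiator codon such as the GTG at the start of this gene as methionine. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical helpers, not part of the record or of any library.

```python
from itertools import product

# Codon assignments for translation table 11 (same as the standard code),
# listed in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons permitted by table 11; as a start codon each is read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # assumption: first codon is the initiator
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, grouped as in the record.
print(translate_cds("GTGAAAGCGT TGAAGAGAGC CGGTAAGGCG"))
```

Applied to the first 30 bases of the gene sequence, the sketch yields MKALKRAGKA, the first ten residues of the protein sequence listed above.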