Gene Elen_0139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0139 
Symbol 
ID8414423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp192154 
End bp193872 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content65% 
IMG OID645023119 
Productphosphoglucose isomerase (PGI) 
Protein accessionYP_003180522 
Protein GI257789916 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0166] Glucose-6-phosphate isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.149343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTATCCG GTGATGTAGA AAAACTGTAT CCTTCCGCGA AAGCCCTCGT GAAGGACTGC 
GTTGCCAGCC GTATCCATGC CAAGGATGCG AGCCTGTACG ACTTCTCGGA GGAAGCGCGC
GCGTGCTCCG AGCAATACAT GGGATGGACG GACCTTGCGA GCAATTCGCC GTACTCCCTG
CGCGACATCC AGAATTTCGC CGACTCGATC ATCGCCCAAG GGCTGAAGAC GGTCGTGCTC
ATCGGCCAGG GCGGTTCCAC GCAGGCGCCC ATGACCATCA CGAAATACAA CAAGCCCGAT
TCGTCGAAGA TCACGTTCAA GACGCTGGAC TCCGACTCGC CCGTGCGCGT GCGCGCCATT
TTGGCCGAAG CGAAGCCCGA GACCACGCTG TTCGTGATCT CGTCGAAGAG CGGCGGCACC
ATCGAGCCGC GCCTGGCCCT GCGTGCCGTG CGCGACGCCG TGGCCGACCG CATCAGCGAA
GAGGAGCTGG TGCAGCACCT CGTGGCCATC ACCGACCCCG GCTCCATGCT TGAGCGCCAG
GCGCGCGAAG AAGGGTGGGC CGCGGTGTTC TCCGGCCAGC CCACCGTGGG CGGGCGCTTC
TCCGCGCTGT CCGTGTTCGG CCTGCTGCCG GCGGCGCTCG TGGGCATCGA CCTGGAAGAG
TTCATGGCGC ACGCCATCGA CGCCGAGCGC CAGTGCAGCG AGGACGCCAT CGACAACCCG
GCCATCGGCC TGGCATCGTT TTTGTACGAC AACTACCTGC AGGGACGCAA CAAGTTCACG
TTCCTCACGC AGAAGCGCGG CCGCGTTCTG GGTCTGTGGA TCGAGCAGCT GGTGGCCGAG
AGCCTGGGCA AGGACGGCCA GGGCATTCTG CCCAACATCG AGGTGGACTC CCTGCTGCTC
AAGAAAGACC CGGGCGATCG CAGCGCCATC GTGTACCTCA CGCGCAACGA CCTGTGGGAC
GAGCGCCGCA ACTTCGAGAT GAGCCTGTCC TACATCGACC CGGCCATCCC GCGCGCCAAC
TACAAGATCG ACTCCGTCGA AGAGCTGGCC GAGCACTTCG TGATGTGGGA ATACGCCATC
GCGATGTGCG GCTACCTCAT GAAGATCTGC CCCTTCGACC AGCCCGACGT GGCGTCGGCG
AAGGCCGTGG TGCTCGACAT CCTCAAGGAG GGCCAGCCCG AGCCCGACTT CGTGCAGGAT
TTCATCGACG AGGTGCACAT GGGCGAGGTG GAAGTGCGCC TGTCTCCGTG CTTCAAAGAT
TGCACCGATG TCCGCAGCGC GCTGCGTGCG CTGCTGGGCA GCATTCAACC GGGCGATTTC
TTCGCGCTCA ACGCGTTCTT GCCGTTCACG GGCGAGGGTC GACGCGAGGC GCTGGAAACC
ATCCGTCACG GCGTGGCTGA GAAGCGCGGC GTGGTATCCT GCCTGGAAGT GGGTCCGCGC
TACCTGCACT CCACCGGCCA GCTGCACAAG GGCGGCCCGA ACTGCGGCGT GTTCCTCATC
CTGTCGGCCG ACGAGCTAAA GGACATCCCG CTGAAGCAGG AGGCTGAAAG CCTGGGCTCG
CTGGCCAAGG CGCAGGCGTC GGGCGACCTC GTTACGCTGG CCGAGCGCGG GCGGCGCGTG
GTGCACCTGC ACCTGCCCGA CAACTCGGGC GTTACGCTGC GCCAGCTGGC TGAAGTGATT
TCCGACATCC TGGAAACCAT GACGGTGCCG ACGGCTTAG
 
Protein sequence
MLSGDVEKLY PSAKALVKDC VASRIHAKDA SLYDFSEEAR ACSEQYMGWT DLASNSPYSL 
RDIQNFADSI IAQGLKTVVL IGQGGSTQAP MTITKYNKPD SSKITFKTLD SDSPVRVRAI
LAEAKPETTL FVISSKSGGT IEPRLALRAV RDAVADRISE EELVQHLVAI TDPGSMLERQ
AREEGWAAVF SGQPTVGGRF SALSVFGLLP AALVGIDLEE FMAHAIDAER QCSEDAIDNP
AIGLASFLYD NYLQGRNKFT FLTQKRGRVL GLWIEQLVAE SLGKDGQGIL PNIEVDSLLL
KKDPGDRSAI VYLTRNDLWD ERRNFEMSLS YIDPAIPRAN YKIDSVEELA EHFVMWEYAI
AMCGYLMKIC PFDQPDVASA KAVVLDILKE GQPEPDFVQD FIDEVHMGEV EVRLSPCFKD
CTDVRSALRA LLGSIQPGDF FALNAFLPFT GEGRREALET IRHGVAEKRG VVSCLEVGPR
YLHSTGQLHK GGPNCGVFLI LSADELKDIP LKQEAESLGS LAKAQASGDL VTLAERGRRV
VHLHLPDNSG VTLRQLAEVI SDILETMTVP TA