Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0139 |
Symbol | |
ID | 8414423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 192154 |
End bp | 193872 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645023119 |
Product | phosphoglucose isomerase (PGI) |
Protein accession | YP_003180522 |
Protein GI | 257789916 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0166] Glucose-6-phosphate isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.149343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTATCCG GTGATGTAGA AAAACTGTAT CCTTCCGCGA AAGCCCTCGT GAAGGACTGC GTTGCCAGCC GTATCCATGC CAAGGATGCG AGCCTGTACG ACTTCTCGGA GGAAGCGCGC GCGTGCTCCG AGCAATACAT GGGATGGACG GACCTTGCGA GCAATTCGCC GTACTCCCTG CGCGACATCC AGAATTTCGC CGACTCGATC ATCGCCCAAG GGCTGAAGAC GGTCGTGCTC ATCGGCCAGG GCGGTTCCAC GCAGGCGCCC ATGACCATCA CGAAATACAA CAAGCCCGAT TCGTCGAAGA TCACGTTCAA GACGCTGGAC TCCGACTCGC CCGTGCGCGT GCGCGCCATT TTGGCCGAAG CGAAGCCCGA GACCACGCTG TTCGTGATCT CGTCGAAGAG CGGCGGCACC ATCGAGCCGC GCCTGGCCCT GCGTGCCGTG CGCGACGCCG TGGCCGACCG CATCAGCGAA GAGGAGCTGG TGCAGCACCT CGTGGCCATC ACCGACCCCG GCTCCATGCT TGAGCGCCAG GCGCGCGAAG AAGGGTGGGC CGCGGTGTTC TCCGGCCAGC CCACCGTGGG CGGGCGCTTC TCCGCGCTGT CCGTGTTCGG CCTGCTGCCG GCGGCGCTCG TGGGCATCGA CCTGGAAGAG TTCATGGCGC ACGCCATCGA CGCCGAGCGC CAGTGCAGCG AGGACGCCAT CGACAACCCG GCCATCGGCC TGGCATCGTT TTTGTACGAC AACTACCTGC AGGGACGCAA CAAGTTCACG TTCCTCACGC AGAAGCGCGG CCGCGTTCTG GGTCTGTGGA TCGAGCAGCT GGTGGCCGAG AGCCTGGGCA AGGACGGCCA GGGCATTCTG CCCAACATCG AGGTGGACTC CCTGCTGCTC AAGAAAGACC CGGGCGATCG CAGCGCCATC GTGTACCTCA CGCGCAACGA CCTGTGGGAC GAGCGCCGCA ACTTCGAGAT GAGCCTGTCC TACATCGACC CGGCCATCCC GCGCGCCAAC TACAAGATCG ACTCCGTCGA AGAGCTGGCC GAGCACTTCG TGATGTGGGA ATACGCCATC GCGATGTGCG GCTACCTCAT GAAGATCTGC CCCTTCGACC AGCCCGACGT GGCGTCGGCG AAGGCCGTGG TGCTCGACAT CCTCAAGGAG GGCCAGCCCG AGCCCGACTT CGTGCAGGAT TTCATCGACG AGGTGCACAT GGGCGAGGTG GAAGTGCGCC TGTCTCCGTG CTTCAAAGAT TGCACCGATG TCCGCAGCGC GCTGCGTGCG CTGCTGGGCA GCATTCAACC GGGCGATTTC TTCGCGCTCA ACGCGTTCTT GCCGTTCACG GGCGAGGGTC GACGCGAGGC GCTGGAAACC ATCCGTCACG GCGTGGCTGA GAAGCGCGGC GTGGTATCCT GCCTGGAAGT GGGTCCGCGC TACCTGCACT CCACCGGCCA GCTGCACAAG GGCGGCCCGA ACTGCGGCGT GTTCCTCATC CTGTCGGCCG ACGAGCTAAA GGACATCCCG CTGAAGCAGG AGGCTGAAAG CCTGGGCTCG CTGGCCAAGG CGCAGGCGTC GGGCGACCTC GTTACGCTGG CCGAGCGCGG GCGGCGCGTG GTGCACCTGC ACCTGCCCGA CAACTCGGGC GTTACGCTGC GCCAGCTGGC TGAAGTGATT TCCGACATCC TGGAAACCAT GACGGTGCCG ACGGCTTAG
|
Protein sequence | MLSGDVEKLY PSAKALVKDC VASRIHAKDA SLYDFSEEAR ACSEQYMGWT DLASNSPYSL RDIQNFADSI IAQGLKTVVL IGQGGSTQAP MTITKYNKPD SSKITFKTLD SDSPVRVRAI LAEAKPETTL FVISSKSGGT IEPRLALRAV RDAVADRISE EELVQHLVAI TDPGSMLERQ AREEGWAAVF SGQPTVGGRF SALSVFGLLP AALVGIDLEE FMAHAIDAER QCSEDAIDNP AIGLASFLYD NYLQGRNKFT FLTQKRGRVL GLWIEQLVAE SLGKDGQGIL PNIEVDSLLL KKDPGDRSAI VYLTRNDLWD ERRNFEMSLS YIDPAIPRAN YKIDSVEELA EHFVMWEYAI AMCGYLMKIC PFDQPDVASA KAVVLDILKE GQPEPDFVQD FIDEVHMGEV EVRLSPCFKD CTDVRSALRA LLGSIQPGDF FALNAFLPFT GEGRREALET IRHGVAEKRG VVSCLEVGPR YLHSTGQLHK GGPNCGVFLI LSADELKDIP LKQEAESLGS LAKAQASGDL VTLAERGRRV VHLHLPDNSG VTLRQLAEVI SDILETMTVP TA
|
| |