Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1878 |
Symbol | |
ID | 8416182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2208679 |
End bp | 2210352 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645024848 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_003182231 |
Protein GI | 257791625 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.000119972 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGCTAC GCAGCGATGC CGTGCGGTGC GGCACGGCGC GCGCCCCGCA CCGCAGCCTG CTCAAGGCCG ACGGCATCAC CGACGAGGAG ATGGAGCGGC CGCTCGTCGC CGTCTTCAAC TCCCGCAACG ACATCATCCC CGGCCACAAC AACCTCGACA AGATCGCCGA AGCCGTGAAG GCGGGCATCT ACATGGCCGG CGGCGTGCCG TTCGAGATAT CCACCATCGG CGTGTGCGAC GGCATCGCCA TGAACCACGA CGGCATGCAC TACTCGCTCG TGTCGCGCGA AGCCATCGCC GACTCGCTGG AATGCGCGGT GCAGGGCCAT GCCTTCGACG CGCTCGTGTG CATCCCGAAC TGCGACAAGA TCGTGCCGGG CATGCTGCTG GGCGCGCTGC GCGTGAACAT CCCCACGGTG TTCGTGTCGG GCGGCCCCAT GCTTCCCGGC AAGCAGCCGG GCGGGTGCGG GCCCACCACC GACCTCAACA CGCTGTTCGA CGGCGCGGCG CAGGTGATGA ACGGCGACAT GTCCGAGGCC GAGCTCAAGT ACTACGAGGA CACCGCGTGC CCCACCTGCG GCAGCTGCTC GGGCATGTTC ACGGCGAACT CGATGAACTG CCTGTGCGAG GCCCTCGGCA TCGCGCTTCC CGGCAACGGC ACCGTCCCCG CGGTGTACTC CGAGCGCATC CGCCTGGCGA AGCACGCGGG CATGAAGGTG ATGGAGCTGC TGGAGCAGGG GATCTGCGCG CGCGACATCG TGAGCGAGGC CGCCATCCAC AACGCCATGG AATGCGACAT GGCCTTCGGC GGCTCCACGA ACACGGTGCT GCACCTCACG GCCGTCGCCC GCGAGGCGGG CCATCCCATC ACGATGGACG ACTGGGACGC CGCGAGCGCG CGCACGCCGC ACCTCGTGAA GCTGCAGCCC TCGGGCCCGC GCCCGCTGTC CGACCTGTAC GAGGTGGGCG GCGTGCCCGT GGTCATGCAC GAGCTGGCCG AGCTCGACCT GCTCGACCGG CGCGCGATCA CCTGCATGGG CCCGCTCGAC GAGTACCTGC GGACGTGCAC GAAGCCGGCC GACGGCGAGG TATGCCGCAC GCACGACAAC CCGTTCTCCC CGGTGGGCGC GCTCAAGGTG CTGCACGGCA ACATCGCGCC CGACGGCGCC ATCGTGAAGA AGTCGGCCGT CGACCCCTCG ATGCTCACCC ACACGGGCCC CGCGCGCTGC TTCGACAGCG AGGAGGAGGC CTGCGCCGCC ATCAACGGCG GGCGGGTCGA GGCGGGCGAC GTGGTGGTCA TCCGCTACGA GGGCCCCAAG GGCGGCCCGG GCATGCGCGA GATGCTCACG CCCACGTCGT CCATCGTGGG GATGGGCCTG TCCACGAGCG TCGCGCTCAT CACCGACGGG CGCTTCTCGG GCGCCACGAA GGGCCCGGCC GTGGGGCACG TGAGCCCCGA GGCGGCCGCG GGCGGCCCCA TCGCGCTCAT CGAGGAAGGT GACACGGTGA CGGTGGACAT CGAGGGCGGC GCGCTCACGC TGGGCGTGGA CGATGCCGAG CTCGAACGCC GCCGCGCGGC ATGGACGCCG CCGGCGCCGA AGCACGACCA CGGCGTGCTC GCGAAGTACG CGAAGCTCGT CTCATCCGCA GACAAGGGGG CGTATGTGTC ATGA
|
Protein sequence | MQLRSDAVRC GTARAPHRSL LKADGITDEE MERPLVAVFN SRNDIIPGHN NLDKIAEAVK AGIYMAGGVP FEISTIGVCD GIAMNHDGMH YSLVSREAIA DSLECAVQGH AFDALVCIPN CDKIVPGMLL GALRVNIPTV FVSGGPMLPG KQPGGCGPTT DLNTLFDGAA QVMNGDMSEA ELKYYEDTAC PTCGSCSGMF TANSMNCLCE ALGIALPGNG TVPAVYSERI RLAKHAGMKV MELLEQGICA RDIVSEAAIH NAMECDMAFG GSTNTVLHLT AVAREAGHPI TMDDWDAASA RTPHLVKLQP SGPRPLSDLY EVGGVPVVMH ELAELDLLDR RAITCMGPLD EYLRTCTKPA DGEVCRTHDN PFSPVGALKV LHGNIAPDGA IVKKSAVDPS MLTHTGPARC FDSEEEACAA INGGRVEAGD VVVIRYEGPK GGPGMREMLT PTSSIVGMGL STSVALITDG RFSGATKGPA VGHVSPEAAA GGPIALIEEG DTVTVDIEGG ALTLGVDDAE LERRRAAWTP PAPKHDHGVL AKYAKLVSSA DKGAYVS
|
| |