Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0107 |
Symbol | aceE |
ID | 6269105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 114643 |
End bp | 117306 |
Gene Length | 2664 bp |
Protein Length | 887 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641724363 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_001878922 |
Protein GI | 187733027 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00148549 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGAAC GTTTCCCAAA TGACGTGGAT CCGATCGAAA CTCGCGACTG GCTCCAGGCG ATCGAATCGG TCATCCGTGA AGAAGGTGTT GAGCGTGCTC AGTATCTGAT CGACCAACTG CTTGCTGAAG CCCGCAAAGG CGGTGTAAAC GTAGCCGCAG GCACAGGTAT CAGCAACTAC ATCAACACCA TCCCCGTTGA AGAACAACCG GAGTATCCGG GTAATCTGGA ACTGGAACGC CGTATTCGTT CAGCTATCCG CTGGAACGCC ATCATGACGG TGCTGCGTGC GTCGAAAAAA GACCTCGAAC TGGGCGGCCA TATGGCGTCC TTCCAGTCTT CCGCAACCAT TTATGATGTG TGCTTTAACC ACTTCTTCCG TGCACGCAAC GAGCAGGATG GCGGCGACCT GGTTTACTTC CAGGGCCACA TCTCCCCGGG CGTGTACGCT CGTGCTTTCC TGGAAGGTCG TCTGACTCAG GAGCAGCTGG ATAACTTCCG TCAGGAAGTT CACGGCAATG GCCTCTCTTC CTATCCGCAC CCGAAACTGA TGCCGGAATT CTGGCAGTTC CCGACCGTAT CTATGGGGCT GGGGCCGATT GGTGCTATTT ACCAGGCTAA ATTCCTGAAA TATCTGGAAC ACCGTGGCCT GAAAGATACC TCTAAACAGA CCGTTTACGC GTTCCTCGGC GACGGTGAAA TGGACGAACC GGAATCCAAA GGTGCGATCA CCATCGCTAC CCGTGAAAAA CTGGATAACC TGGTCTTCGT TATCAACTGT AACCTGCAGC GTCTTGACGG CCCGGTCACC GGTAACGGCA AGATCATCAA CGAACTGGAA GGCATCTTCG AAGGTGCTGG CTGGAACGTG ATCAAAGTGA TGTGGGGTAG CCGTTGGGAT GAACTGCTGC GTAAAGATAC CAGCGGTAAA CTGATCCAGC TGATGAACGA AACCGTTGAC GGCGACTACC AGACCTTCAA ATCGAAAGAT GGTGCGTACG TTCGTGAACA CTTCTTCGGT AAATATCCTG AAACCGCAGC ACTGGTTGCA GACTGGACTG ACGAGCAGAT CTGGGCACTG AACCGTGGTG GTCACGATCC GAAGAAAATC TACGCTGCAT TCAAGAAAGC GCAGGAAACC AAAGGCAAAG CGACAGTAAT CCTTGCTCAT TCCATTAAAG GTTACGGCAT GGGCGACGCG GCTGAAGGTA AAAACATCGC GCACCAGGTT AAGAAAATGA ACATGGACGG CGTGCGTCAC ATCCGCGACC GTTTCAATGT GCCGGTGTCT GATGCAGATA TCGAAAAACT GCCGTACATC ACCTTCCCGG AAGGTTCTGA AGAGCATACC TATCTGCACG CACAGCGTCA GAAACTGCAC GGTTATCTGC CAAGCCGTCA GCCGAACTTC ACCGAGAAGC TTGAGCTGCC GAGCCTGCAA GACTTCGGCG CGCTGCTGGA AGAGCAGAGC AAAGAGATCT CTACCACTAT CGCTTTCGTT CGTGCCCTGA ACGTAATGCT GAAGAACAAG TCGATCAAAG ATCGTCTGGT ACCGATCATC GCCGACGAAG CGCGTACTTT CGGTATGGAA GGTCTGTTCC GTCAGATTGG TATTTACAGC CCGAACGGTC AGCAGTACAC CCCGCAGGAC CGCGAGCAGG TTGCTTACTA TAAAGAAGAC GAGAAAGGTC AGATTCTGCA GGAAGGGATC AACGAGCTGG GCGCAGGTTG TTCCTGGCTG GCAGCGGCGA CCTCTTACAG CACCAACAAT CTGCCGATGA TCCCGTTCTA CATCTATTAC TCGATGTTCG GCTTCCAGCG TATTGGCGAT CTGTGCTGGG CGGCTGGCGA CCAGCAAGCG CGTGGCTTCC TGATCGGCGG TACTTCCGGT CGTACCACCC TGAACGGAGA AGGTCTGCAG CATGAAGATG GTCACAGCCA CATTCAGTCG CTGACTATTC CGAACTGTAT CTCTTACGAC CCGGCTTACG CTTACGAAGT TGCTGTCATT ATGCATGACG GTCTGGAGCG TATGTACGGT GAAAAACAAG AGAACGTTTA CTACTACATC ACCACGCTGA ACGAAAACTA CCACATGCCG GCAATGCCGG AAGGTGCTGA GGAAGGTATC CGTAAAGGTA TCTACAAACT CGAAACCATT GAAGGTAGCA AAGGTAAAGT TCAGCTGCTG GGCTCCGGTT CTATCCTGCG TCACGTCCGT GAAGCGGCAG AGATCCTGGC GAAAGATTAC GGCGTAGGTT CTGACGTTTA TAGCGTGACC TCCTTCACCG AGCTGGCGCG TGATGGTCAG GATTGTGAAC GCTGGAACAT GCTTCATCCG CTGGAAACTC CGCGCGTTCC GTATATCGCT CAGGTGATGA ACGACGCTCC GGCAGTGGCA TCTACCGACT ATATGAAACT GTTCGCTGAG CAGGTCCGTA CTTACGTACC GGCTGACGAC TACCGCGTAC TGGGTACTGA TGGCTTCGGT CGTTCCGACA GCCGTGAGAA CCTGCGTCAC CACTTCGAAG TTGATGCTTC TTATGTCGTG GTTGCGGCGC TGGGCGAACT GGCTAAACGT GGCGAAATCG ATAAGAAAGT GGTTGCTGAC GCAATCGCCA AATTCAACAT CGATGCAGAT AAAGTTAACC CGCGTCTGGC GTAA
|
Protein sequence | MSERFPNDVD PIETRDWLQA IESVIREEGV ERAQYLIDQL LAEARKGGVN VAAGTGISNY INTIPVEEQP EYPGNLELER RIRSAIRWNA IMTVLRASKK DLELGGHMAS FQSSATIYDV CFNHFFRARN EQDGGDLVYF QGHISPGVYA RAFLEGRLTQ EQLDNFRQEV HGNGLSSYPH PKLMPEFWQF PTVSMGLGPI GAIYQAKFLK YLEHRGLKDT SKQTVYAFLG DGEMDEPESK GAITIATREK LDNLVFVINC NLQRLDGPVT GNGKIINELE GIFEGAGWNV IKVMWGSRWD ELLRKDTSGK LIQLMNETVD GDYQTFKSKD GAYVREHFFG KYPETAALVA DWTDEQIWAL NRGGHDPKKI YAAFKKAQET KGKATVILAH SIKGYGMGDA AEGKNIAHQV KKMNMDGVRH IRDRFNVPVS DADIEKLPYI TFPEGSEEHT YLHAQRQKLH GYLPSRQPNF TEKLELPSLQ DFGALLEEQS KEISTTIAFV RALNVMLKNK SIKDRLVPII ADEARTFGME GLFRQIGIYS PNGQQYTPQD REQVAYYKED EKGQILQEGI NELGAGCSWL AAATSYSTNN LPMIPFYIYY SMFGFQRIGD LCWAAGDQQA RGFLIGGTSG RTTLNGEGLQ HEDGHSHIQS LTIPNCISYD PAYAYEVAVI MHDGLERMYG EKQENVYYYI TTLNENYHMP AMPEGAEEGI RKGIYKLETI EGSKGKVQLL GSGSILRHVR EAAEILAKDY GVGSDVYSVT SFTELARDGQ DCERWNMLHP LETPRVPYIA QVMNDAPAVA STDYMKLFAE QVRTYVPADD YRVLGTDGFG RSDSRENLRH HFEVDASYVV VAALGELAKR GEIDKKVVAD AIAKFNIDAD KVNPRLA
|
| |