Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1037 |
Symbol | aceE |
ID | 4709777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1118262 |
End bp | 1120937 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639855508 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_001002615 |
Protein GI | 121997828 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATCGA TACCCGAGTT GCGCGACGAT CCGGATCCGG AGGAGATCCA GGAGTGGCTC GACGCCCTGG ATGCGGTGAT CGAACACGAG GGCCCCGAGC GGGCACAGCA GATCCTGGAG CACCTGGTCA GCAAGGGCCG CCGACGCTTG GGGCACCTAC CATTCAAGGC CACGACCGGC TACATCAACA CCATCCCCCG CCACCTGGAG GTCCGTCCGC CGGAGTACAC CGATCACCAC CTGGAGTGGC GCATCCGGGC GCTGGTGCGC TGGAACGCCA TCGCCATGGT GGTCGCCGCT AACCGTGAGC ACGACGGCAT CGGTGGGCAC ATCGCCAGCT ACGCCTCGGC CTGCACGCTC TACGAGGTCG GCTTCAACCA CTTCTGGCGG GCCCCCAACG AGGAGCAGGA CGGCGATCTG GTCTTCTTCC AGGGCCACTC GGCACCGGGC ATCTACGCCC GCGCCTACCT CGAAGGCCGG CTCAGCGCCG AGCAACTGGC GGGCTTCCGC CAGGATGTCG ACGGCGACGG TGTCAGCTCC TACCCGCACC CGTGGCTGAT GCCCGACTTC TGGCAGTTCC CCACCGTCTC CATGGGCCTC GGCCCCATCC AGGCGGTGCA GCAGGCGCGC ATGATGAAGT ACCTCCACCA CCGCGGGATT GAGGACACCA GCGGTCGGAA GGTCTGGTGC TTCATGGGTG ATGGCGAGAT GGACGAGCCG GAGTCCATGG GCTCCATCGG CCTCGCCGCC CGCGAGCAGC TCGACAACCT GATCTTCGTG GTCAACTGCA ACCTGCAGCG TCTCGACGGT CCGGTACGTG GCAACGGCAA GATCATCCAG GAACTCGAGG GCGAGTTCCG CGGCGCCGGC TGGAACGTCA TCAAGGTGAT CTGGGGCTCG CGCTGGGACC CACTGCTGGA GCTCGACCAC GAGGGGCTCC TGCAGCAGCG CATGGAGGAG GCCGTCGACG GCGAGTACCA GGCCTTCAAG GCGCGCGGCG GCGATTACAC GCGCAAGCAC TTCTTCGCCA AGGATCCAGA GCTGGAGAAG ATGGTCGCCG GCATGTCGGA CTACGACATC TACCGGCTCA ACCGCGGTGG CCACGACCCG CACAAGGTCT ACGCCGCCTA CCACGCCGCG GTGAACCACA CCGGGCAGCC CACCGTGATC CTGGCCAAGA CGGTCAAGGG CTACGGCATG GGCGAGGCCG GCGAGGGGCA GAACATCACC CACCAGCAGA AGAAGATGGG CGAGAACGCC CTGCGCCGGT TCCGGGATCA CTACGAGATC CCCATCCCCG ACGAGCAGCT CAAGGAGACG CCCTTCTACA AGCCCGACGA CGACGCCCCG GAGATGCGCT ATCTCCACGA GCGTCGCCAG GCCCTGGGCG GCTATATGCC GGTGCGCTAC GAACGGGCCC CGGCGCTCGA GGTGCCGGAG CTCTCCGCCT TCGACGCCCT GCTCAAAGAC AGCGGCGAGC GGGAACTCTC CACCACCATG GCCTTCGTGC GCGCCCTGAC GGTACTCACC CGCGACAAAC AGGTCGGCCA GCGGGTGGTG CCCATCATCC CCGACGAGGC GCGCACCTTC GGCATGGAGG GGCTGTTCCG TCAGCTCGGC ATCTACTCCA ACGTCGGCCA GCTCTACGAG CCCGAAGACG CCGACCAGCT GATGTCCTAC CGCGAGGCGC AGACCGGGCA GATCCTCGAG GAAGGCCTCG ACGAGGCCGG GGCCATGTCC TCGTGGATGG CCGCGGCCAC GTCCTACGCC AACCACGGCG TCAACATGAT CCCCTTCTAC ATCTTCTACT CCATGTTCGG CTTCCAGCGC GTGGGGGATC TGTGCTGGGC GGCTGGGGAT ATCCAGGCCC GCGGCTTCCT CATCGGCGGC ACCGCTGGCA GGACCACCCT CAACGGCGAG GGGCTGCAGC ACCAGGACGG CCACAGCCAC GTCCTCGCCT CGACCATCCC CAACTGCGTC TCCTACGACC CGGCCTTCGA CTACGAGCTG GCGGTGATCG TCCAAGACGG GCTGCGGCGG ATGTACGCCG AGCAGGAGAA CTGCTTCTAC TACCTGACCG TCTACAACGA GAACCACACC CACCCGGCGA TGCCGGAGGG GGCCGAGGAG GGGATCCGCC GCGGCATGTA CCTCTTCCGG GCCGGGCCCG AGAAGAAGGG GCCGCGGGTG CAGCTGATGG GCTCGGGCAG CATCTTCCGC GAGGTGTTGG CGGCGGCGGA TCTGCTCGCC AACGACTTCG GCGTCCACGC CGACATCTGG AGCTGCCCCT CGTTCACCGA GTTGGCCCGC GACGGGATGG TCTGCGCCCG GGCCAATCGG CTCCACCCGG AGGCGGAGCG CCGCCAGTCC TACCTGCAGG CGTGCCTGGA GAGGTACAGC GGTCCGGCGG TCGCCGCCAC GGACTACATG CGCGCCTACC CCGATCAGAT CCGCCCCTAC ATCGGGCGCA AGTTCTGGTC CCTGGGGACC GACGGTTTTG GCCGCTCGGA TACCCGGGAG AAGCTCCGGC GCTTCTTTGA GGTGGACCGC CACCACATCG CCGCAGCGGC CCTCTACGCC CTCGCCGATG AGGGCACCAT CGAGGCCGCG AAGGTCAGCG AGGCCATCGG CAAGTACCAG CTGGCGGCGG ATGCCCCGAA CCCGTGGGAG GTGTGA
|
Protein sequence | MQSIPELRDD PDPEEIQEWL DALDAVIEHE GPERAQQILE HLVSKGRRRL GHLPFKATTG YINTIPRHLE VRPPEYTDHH LEWRIRALVR WNAIAMVVAA NREHDGIGGH IASYASACTL YEVGFNHFWR APNEEQDGDL VFFQGHSAPG IYARAYLEGR LSAEQLAGFR QDVDGDGVSS YPHPWLMPDF WQFPTVSMGL GPIQAVQQAR MMKYLHHRGI EDTSGRKVWC FMGDGEMDEP ESMGSIGLAA REQLDNLIFV VNCNLQRLDG PVRGNGKIIQ ELEGEFRGAG WNVIKVIWGS RWDPLLELDH EGLLQQRMEE AVDGEYQAFK ARGGDYTRKH FFAKDPELEK MVAGMSDYDI YRLNRGGHDP HKVYAAYHAA VNHTGQPTVI LAKTVKGYGM GEAGEGQNIT HQQKKMGENA LRRFRDHYEI PIPDEQLKET PFYKPDDDAP EMRYLHERRQ ALGGYMPVRY ERAPALEVPE LSAFDALLKD SGERELSTTM AFVRALTVLT RDKQVGQRVV PIIPDEARTF GMEGLFRQLG IYSNVGQLYE PEDADQLMSY REAQTGQILE EGLDEAGAMS SWMAAATSYA NHGVNMIPFY IFYSMFGFQR VGDLCWAAGD IQARGFLIGG TAGRTTLNGE GLQHQDGHSH VLASTIPNCV SYDPAFDYEL AVIVQDGLRR MYAEQENCFY YLTVYNENHT HPAMPEGAEE GIRRGMYLFR AGPEKKGPRV QLMGSGSIFR EVLAAADLLA NDFGVHADIW SCPSFTELAR DGMVCARANR LHPEAERRQS YLQACLERYS GPAVAATDYM RAYPDQIRPY IGRKFWSLGT DGFGRSDTRE KLRRFFEVDR HHIAAAALYA LADEGTIEAA KVSEAIGKYQ LAADAPNPWE V
|
| |