Gene Information |
Locus tag | Csal_0855 |
Symbol | aceE |
ID | 4027819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 955255 |
End bp | 957945 |
Gene Length | 2691 bp |
Protein Length | 896 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637966024 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_572911 |
Protein GI | 92112983 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
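The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; neither is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 955255
end_bp = 957945
gene_length_bp = 2691
protein_length_aa = 896


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)        # coordinates vs. Gene Length
print(computed_protein_length == protein_length_aa)  # Gene Length vs. Protein Length
```

Under these assumptions, 957945 - 955255 + 1 gives 2691 bp, and 2691 / 3 - 1 gives 896 aa, matching the record.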
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCTGG AGGCAAGAGA AGATCTCGAT CCGATCGAAA CCACGGAATG GCTGGATTCC TTGGCCTCGG TCATCGAGCA TGAGGGCGAA GAACGTGCTC GGTATATCAT GAGTCGCCTG GCGGACCGCC TGCGTCGCGA TGGCTCTTCA CCGCCGTTTT CCGGGACCAC GCCGCATCGC AATACCATCC CGATGCATCG TGAAGCCAAG ATGCCCGGCG ACATGTTCAT CGAGCGCAAG ATTCGCTCGG CCATTCGCTG GAACGCGATG GTGCAGGTGC TGCGCGCCAA CAAGAAGCAC AAGGGCCTGG GCGGTCACAT TGCCAGCTAC CAGTCCAGTG CGACCTTGTA CGAAGTGGGC TTCAACCACT TCTTCCGCGC GGACAATGGC GATCACAAGG GCGACCTGCT GTACATCCAG GGGCACGTGA CGCCGGGCAT CTATGCGCGT TCCTTCCTCG AAGGCCGCCT GACCGAAGAG CAGATGGACA ACTTCCGTCA GGAAGTCGAC GGCGACGCGC TGCCGTCTTA CCCCCACCCG TACCTGATGC CGGATTACTG GCAGTTCCCC ACGGTCTCCA TGGGACTGGG GCCGATCCAG GCGATCTATC AGGCTCACGT GATGAAGTAC CTGCATTCGC GTGAACTCGA GGACATGAAG GACCGCAAGG TCTGGGCATT CCTGGGGGAC GGCGAGTGCG ACGAGCCCGA AACCCTCGGC GCGCTGCATC TGGCCAGCCG TGAAAAGCTC GACAACCTGA TCTTCGTCAT CAACTGCAAC CTGCAGCGTC TCGATGGCCC GGTGCGTGGC AACTCGCGCA TCATCGACGA GCTGGAAGGC GTCTTCCGCG GTGCCGGCTG GAACGTGATC AAGGTCGTGT GGGGCGGTCA GTGGGACCCG CTGTTCGAGC AGGACGCCAA CGGCATGCTG CAAAAGCGCA TGGACGAGGC GGTCGACGGC GAGTACCAGA ACTACAAGGC TCAGGGCGGT GCCTACACCC GCGAGCATTT CTTCGGCAAG TACGAAGAAA CCGCGGCGAT GGTCAAGGAC TACACCGACG AGCAGATCTT CCGTCTCAAC CGGGGCGGCC ACGACCCGCA GAAGGTCTAT GCGGCCTACC ATGAAGCGGT CAACAACGCG GGCAACCGTC CCACCGTGAT CCTGGCGCAC ACCGTCAAGG GTTACGGCAT GGGCGGCGGC AGCGGCGAAG CCGACATGGA AGCCCACCAG ATCAAGTCGA TGGATACCGA CGCCCTCAAG GCGTTCCGCG ACCGGTTCGG CATCCCGATC TCCGACAAGC AGATCGAAAG CGGCGATATT CCGTACTACA AGCCCGACGA CGACAGCCCG GAGATGAAGT ACCTGCATCT CCAGCGCGAG AAGCTCGGCG GCTTCTTGCC GGCACGCAAG AACGACTTCG AGGCGCTCGA GATCCCCGGG CTCGACGACA AGATGTTCGC GTCCCAGCTC AAGGGCTCCG GCGACCGTGA AGTGTCCAGC ACGATGTCCT TCGTGCGCGT GCTCAACGGC TTGGTCAAGA ACAAGCAGCT CGGCTCGCGG GTGGTGCCGA TCATTCCCGA CGAGGCGCGT ACCTTCGGGA TGGAGGGCAT GTTCCGTCAG CTTGGCATCT ACGCCGCCGA AGGCCAGAAG TACGAGCCGA TGGATGCCGG TCAGATCATG TACTACCGCG AGGATCAGAA AGGCCAGATT CTCGAGGAAG GCATCACCGA AGCCGGCTCC ATGTCCGCGT GGATCGCGGC GGCGACGGCG TACGCCAACC ACCACCTGCC GCTGATTCCG 
TTCTACGTCT ACTACTCGAT GTTCGGCTAT CAGCGTGTCG GCGACCTGGT CTGGGCCGCC GGTGACCTGC AGGCGCGTGG CTTCATGATC GGCGGTACCG CCGGGCGTAC CACCATCAAC GGCGAAGGCC TGCAGCACCA GGACGGCCAC AGCCATATCC TGATGTCGAC CGTGCCGACC TGCCGTGCCT ACGACCCGTG CTATGGCCAC GAGGTGGCGG TCATCCTGCA GGACGGGCTG AAGCGGATGT TCCAGGACAA GGAAAACTGC TTCTACTACC TGACCGTGAT GAACGAGAAC TACCCGCACC CGGAAATGCC GGAGGGCAGC GAGGAAGGCA TCGTTCGCGG CATGTATCGT CTCTCGGAAG GCAAGAAGGA CGGCCGCAAG AAACAGCCGC GCGTGCAGCT GTTGGGCAGC GGCACGATCC TGCGCGAAGT CGAGGCCGCC GCCGAGATGC TGGCGGAAGA GCATGGCGTG GTCGCCGACG TGTGGAGCGT GACCAGCTTC AACGAGCTGC GTCGCGAAGC GCTCGAATAC GACCGCCTGC AGTTCCTCGA CTACGATGAG AAGCGTGAGA AGCCGTGGGT GGTGCAGCAG CTGGATAGCG CCGAAGGGCC GGTCGTCGCC TCGACCGACT ACATGAAGCT CTACGCCGAC CAGATCCGCG CCTGGGTACC GAGCGACTTC CACGTGCTGG GGACCGATGG TTTCGGACGT TCCGACACGC GTGAAGCGCT GCGTCGTTTC TTCGAAGTCG ACCGCTACTA CGTCACCATT CAAGCGCTGC GCGCGCTGGC CAATCGCGGT GAGATCGACG TCAAGGTCGT CAACGATGCC ATGAAGAAAT ACGGCATCGA CCCCGCCAAG CCCAGCCCGC TGGTGTCTTG A
|
Protein sequence | MSLEAREDLD PIETTEWLDS LASVIEHEGE ERARYIMSRL ADRLRRDGSS PPFSGTTPHR NTIPMHREAK MPGDMFIERK IRSAIRWNAM VQVLRANKKH KGLGGHIASY QSSATLYEVG FNHFFRADNG DHKGDLLYIQ GHVTPGIYAR SFLEGRLTEE QMDNFRQEVD GDALPSYPHP YLMPDYWQFP TVSMGLGPIQ AIYQAHVMKY LHSRELEDMK DRKVWAFLGD GECDEPETLG ALHLASREKL DNLIFVINCN LQRLDGPVRG NSRIIDELEG VFRGAGWNVI KVVWGGQWDP LFEQDANGML QKRMDEAVDG EYQNYKAQGG AYTREHFFGK YEETAAMVKD YTDEQIFRLN RGGHDPQKVY AAYHEAVNNA GNRPTVILAH TVKGYGMGGG SGEADMEAHQ IKSMDTDALK AFRDRFGIPI SDKQIESGDI PYYKPDDDSP EMKYLHLQRE KLGGFLPARK NDFEALEIPG LDDKMFASQL KGSGDREVSS TMSFVRVLNG LVKNKQLGSR VVPIIPDEAR TFGMEGMFRQ LGIYAAEGQK YEPMDAGQIM YYREDQKGQI LEEGITEAGS MSAWIAAATA YANHHLPLIP FYVYYSMFGY QRVGDLVWAA GDLQARGFMI GGTAGRTTIN GEGLQHQDGH SHILMSTVPT CRAYDPCYGH EVAVILQDGL KRMFQDKENC FYYLTVMNEN YPHPEMPEGS EEGIVRGMYR LSEGKKDGRK KQPRVQLLGS GTILREVEAA AEMLAEEHGV VADVWSVTSF NELRREALEY DRLQFLDYDE KREKPWVVQQ LDSAEGPVVA STDYMKLYAD QIRAWVPSDF HVLGTDGFGR SDTREALRRF FEVDRYYVTI QALRALANRG EIDVKVVNDA MKKYGIDPAK PSPLVS
|
| |
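The sequence fields can be related to the GC content and Translation table fields with a short script. This is a minimal sketch, not the database's own method: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, whitespace in the listed sequence is assumed to be formatting only, and alternative start codons are not special-cased. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard assignments are used here.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base groups of the gene sequence listed above.
print(translate("ATGAGTCTGG AGGCAAGAGA AGATCTCGAT"))
```

Applied to the first thirty bases of the gene sequence, `translate` yields MSLEAREDLD, which matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_content` would give the value to compare against the GC content field.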