Gene Information |
Locus tag | Sare_3613 |
Symbol | aceE |
ID | 5706639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4169297 |
End bp | 4172035 |
Gene Length | 2739 bp |
Protein Length | 912 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641273038 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_001538402 |
Protein GI | 159039149 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
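The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the annotated CDS includes its stop codon; neither convention is stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS that includes its stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record for Sare_3613.
length = gene_length_bp(4169297, 4172035)
print(length)                     # 2739
print(protein_length_aa(length))  # 912
```

Under these assumptions the computed values agree with the Gene Length (2739 bp) and Protein Length (912 aa) fields.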
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000348525 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCCACGG AACGCAAGCG CCCGGTGATC ACCGCCGGTC TGCCGAGCCA GCTTCCGGAT ATCGACCCCG AAGAAACCGG TGAGTGGGTC GAGTCGCTTG ACGGTGTTAT CGACGATCGC GGAACCAAAC GCGCCCGCTA CGTCATGCTG CGCCTGCTGG AGCGGGCCCG CGAGCGTCAG GTCGGGGTGC CGTCCCTGAC CACCACGGAC TACATCAACA CCATCACTCC GGAACGGGAG CCCTGGTTCC CCGGCGACGA GCACGTCGAG CGGCGTATCC GGGCCTACAT CCGGTGGAAC GCCGCGATGC TGGTGCACCG GGCACAGCGG CCGGAGATCG GTGTCGGCGG GCACATCTCG ACGTTCGCCA GTTCGGCCTC GCTCTACGAG GTCGGCTTCA ACCACTTCTT CCGGGGCAAG GACCACCCGG GTGGCGGCGA CCACATCTTC TACCAGGGGC ACGCCTCCCC GGGCATGTAC GCGCGGGCGT TTCTTGAGGG GCGGCTCAGC GAACACCAGC TCGACGGGTT CCGCCAGGAG CTGTCGCACC CCGGCGGCGG CCTGCCGTCC TACCCTCACC CCCGCCTGAT GCCGGACTTC TGGGAGTTCC CCACCGTCTC GATGGGTCTC GGCGGTGTCA ACGCCATCTA CCAGGCGCGG TTCAACCGTT ACCTGCACCA CCGCGGCATC AAGGACACCT CCGACCAGCA CGTGTGGGCG TTCCTCGGCG ACGGCGAGAT GGACGAGCCG GAGTCGCTTG GCGCGATCGG AACGGCCGCC CGGGAGGAAC TGGACAACCT CACCTTCGTC ATCAACTGCA ACCTGCAACG CCTGGACGGG CCGGTCCGGG GCAACGGCAA GGTCATGCAG GAGTTGGAGG CATTCTTCCG AGGTGCCGGC TGGAACGTCA TCAAGGTCGT CTGGGGCCGC GAGTGGGATC CGCTGCTCGC CCGGGACACC GACGGTGCGC TGGTCAACCT CATGAACACC ACGCCCGACG GTGACTACCA GACCTACAAG GCAGAATCCG GGGCGTACAT CCGGGAGCAC TTCTTCGGCC GCGATCCGCG GACCCGCAAG ATGGTCGAGC ACCTCAGCGA CGACGAGATC TGGAACCTGA AGCGGGGTGG CCACGACTAC CGCAAGCTCT ACGCGGCGTA CAAGGCCGCG ATGGAGCACA CCGGACAGCC CACGGTGATC CTGGCCAAGA CCATCAAGGG TTGGACGCTC GGCTCGCACT TCGAGGGGCG CAACGCCACC CACCAGATGA AGAAGCTGAC GTTGGAGGAC CTGAAGACCT TCCGCGACCG GCTCTACCTG GATATCCCGG ACAAGGCACT GGAGGAGAAC CCCTACCTGC CGCCGTACTA CCGTCCGGAG GCCAAGTCCG ACGAGCTCGA GTACCTACAC GAGCGTCGCC GGCAGCTCGG CGGCTACCTG CCGTCCCGAC GGCCCGGCAC CAAGCGGCTC ACCATTCCCG GCCCGGAGCG CTTCGCCGAC GTCAAGCGCG GTTCGGGCAA GCAGAAGGTG GCCACCACGA TGGCCTTCGT CCGCCTGCTC AAGGACCTGA TGAAGGACCG GGAGTTCGGC CGACGCTGGG TGCCGATCGT CCCGGACGAG GCCCGCACCT TCGGCATGGA CTCACTGTTC CCGACGCAGA AGATCTACTC GCCGCATGGC CAGCGGTACA CGTCGGTCGA CCGGGAGCTG TTCCTGTCGT ACAAGGAGGC GACCGGCGGG CAGATCCTGC ACGAGGGCAT CAACGAGGTC GGCTCGGTCG CCTCCTTCAC CGCGGCCGGT 
TCCTCGTATG CCACGCACGA CGAGCCGATG ATCCCGATGT ACATCTTCTA CTCGATGTTC GGGTTCCAGC GGACCGCGGA CGGGCTCTGG GCAGCGGCCG ACCAGATGAC CCGTGGCTTC CTGCTCGGCG CGACCGCCGG ACGGACCACG CTGAACGGCG AGGGTCTCCA GCATGAGGAT GGTCATTCGC TGTTGATCGC CGCCACCAAC CCGGCGGTGG TCGCCTACGA TCCGGCGTTC GCCTACGAGA TCGCCCACAT CGTGGAGAAC GGCCTGCACC GCATGTACGG CGCGGCGCAG GAGAACGTCT TCTACTACCT GACGGTCTAC AACGAGCCGA TGGTGCAGCC GGCGGAGCCG ACGGACGTCG ACGTCGAGGG TGTGCTGAAG GGAATCTATC GGTACGCGCC GGCGCCCCAG GTGGACGGTC CGAAGGCACA GCTACTCGCC TCCGGTACCG GCATGCAGTG GGCGCTCAAG GCACAGGAGC TACTCGCCCA GGACTGGGGG GTTGCGGCCA GCGTCTGGTC AGTCACCTCC TGGACGGAGC TACGCCGGGA CGCGGTCGAC GCGGAGGAGC ACAATCTGCT CAACCCGACG GGTGAGCAGC GGGTGCCGTA CGTGACGACA AAGCTGGCCG ACGCCGATGG TCCGAAGGTC GCGGTCAGTG ACTGGATGCG CGCGGTGCCG GATCTGATCG CCCGTTGGGT ACCCGGCGAC TACACCTCGC TCGGCACCTG CGGGTTCGGC AAGTCCGACA CACGGCACGC ACTGCGCCGC TACTTCCACG TGGACGCCGA GTCGATCGTG GTCGCCACGC TGCGGCAGCT CGCCCTCCGC GGCGCGGTAC CGGCGGGAGT TCCCGCCGAG GCCGCCAAGA AGTACGCCAT TGACGACATC GGGGCCGCCC CGGTCGGTGA GACCGGCGGC GACAGCTGA
|
Protein sequence | MATERKRPVI TAGLPSQLPD IDPEETGEWV ESLDGVIDDR GTKRARYVML RLLERARERQ VGVPSLTTTD YINTITPERE PWFPGDEHVE RRIRAYIRWN AAMLVHRAQR PEIGVGGHIS TFASSASLYE VGFNHFFRGK DHPGGGDHIF YQGHASPGMY ARAFLEGRLS EHQLDGFRQE LSHPGGGLPS YPHPRLMPDF WEFPTVSMGL GGVNAIYQAR FNRYLHHRGI KDTSDQHVWA FLGDGEMDEP ESLGAIGTAA REELDNLTFV INCNLQRLDG PVRGNGKVMQ ELEAFFRGAG WNVIKVVWGR EWDPLLARDT DGALVNLMNT TPDGDYQTYK AESGAYIREH FFGRDPRTRK MVEHLSDDEI WNLKRGGHDY RKLYAAYKAA MEHTGQPTVI LAKTIKGWTL GSHFEGRNAT HQMKKLTLED LKTFRDRLYL DIPDKALEEN PYLPPYYRPE AKSDELEYLH ERRRQLGGYL PSRRPGTKRL TIPGPERFAD VKRGSGKQKV ATTMAFVRLL KDLMKDREFG RRWVPIVPDE ARTFGMDSLF PTQKIYSPHG QRYTSVDREL FLSYKEATGG QILHEGINEV GSVASFTAAG SSYATHDEPM IPMYIFYSMF GFQRTADGLW AAADQMTRGF LLGATAGRTT LNGEGLQHED GHSLLIAATN PAVVAYDPAF AYEIAHIVEN GLHRMYGAAQ ENVFYYLTVY NEPMVQPAEP TDVDVEGVLK GIYRYAPAPQ VDGPKAQLLA SGTGMQWALK AQELLAQDWG VAASVWSVTS WTELRRDAVD AEEHNLLNPT GEQRVPYVTT KLADADGPKV AVSDWMRAVP DLIARWVPGD YTSLGTCGFG KSDTRHALRR YFHVDAESIV VATLRQLALR GAVPAGVPAE AAKKYAIDDI GAAPVGETGG DS
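The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG is among the permitted start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline. The `translate_table11` function and its handling of whitespace are assumptions made for illustration; the codon assignments and start codon set follow NCBI translation table 11.

```python
# Amino acid assignments for translation table 11, codons ordered over TCAG
# (same assignments as the standard code; the table differs in its start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO_ACIDS
    )
}
# Start codons listed for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_table11("GTGGCCACGG AACGCAAGCG CCCGGTGATC"))  # MATERKRPVI
```

The first GTG is read as M because it is the start codon, whereas the in-frame GTG at codon 9 is read as V, matching the first ten residues of the protein sequence (MATERKRPVI).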