Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0358 |
Symbol | aceE |
ID | 3784550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 388273 |
End bp | 390942 |
Gene Length | 2670 bp |
Protein Length | 889 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637810434 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_411058 |
Protein GI | 82701492 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACCTA TACCGGATAT AGACCCGACT GAAACCCAGG AATGGCTGGA GGCGCTGGAA TCCGTGTTGA CGCACGAGGG GACGGAGCGG GCGCACTATC TGCTGGAAAG ACTGGTGGAA AAAGCGCGCC GGTCAGGTGC CTATCTTCCC TATAGCGCCA CCACGGCTTA TATCAATACC ATTCCTCCGG GAAAGGAAGA GTGGTCACCG GGAAATCATG CGCTGGAGCA CCGCATCCGT TCCTATGTCC GCTGGAATTC CATGGCCATG GTGTTGCGCG CCAACCGGAA TTCCAATGTC GGCGGGCATA TTGCCAGCTT TGCCTCGGCC GCTACGCTAT ATGATGTCGG CTACAACCAT TTCTGGCATG CCAGGTCGGA GAATCACGGT GGCGACCTGA TATTTGCACA GGGGCATTCA TCGCCCGGTC TTTATGCCTA TGCTTTTCTG CTGGGGGAAC TGACAGAGGA GCAGCTCAAT AATTTCCGGC GGGAGGTGGG GGGAAAGGGA CTGTCTTCCT ATCCGCATCC ATGGCTGATG CCTGACTTCT GGCAGTTCCC GACGGTATCA ATGGGACTTG GGCCGCTGAT GGCGATATAT CATGCGCGCT TCATGAAGTA TCTGGATAGC CGCGGACTGG TCAAGACGGA GGGCCGAAAA GTGTGGGCCT TCATGGGCGA TGGAGAAATG GATGAACCTG AATCACTGGG GTCCATTTCG CTTGCGTCGC GCGAGAACCT GGACAACCTG ATTTTTGTCA TCAACTGCAA CCTTCAGCGT CTCGACGGGC CGGTGCGTGG AAATGGCAAG ATCATTCAGG AACTGGAAGC CGCTTTTCGC GGTTCGGGCT GGAACGTCAT CAAGGTAATC TGGGGCTCAT ACTGGGACCC ATTGCTTGCC AAAGACACGA AGGGGTTGCT GCAACAACGC ATGATGGAAT GCGTGGATGG CGAATATCAG ACTTTCAAGT CGAGAGACGG CGCCTATGTG CGTGAACATT TCTTCGGCAA ATATCCGGAG CTGCTCGAGA TGGTGGCGAA TATGTCCGAC GATGACATAT GGCGCCTGAA CCGGGGCGGA CACGATCCGC ATAAGGTCTA TGCCGCGTAT TCAGCGGCGG TGAAGCACAA GGGCCAGCCC ACCGTGATCC TCGCCAAGAC GATCAAGGGT TACGGGATGG GTGAAGCGGG CGAAGCCCAG AATATCACTC ATCAGCAGAA AAAGATGGGT ACCACTTCAC TCAAAGCGTT TCGTGACCGT TTCGGTCTGC CAATCAGCGA TGATGATATC GAGTCGGTAC CCTACCTGAA ATTCGACAAG GATTCGCCGG AATCCATCTA CATGCACCAG CGACGCGAGG CATTGGGGGG GTTCATCCAT CGTCGCCAGC GTAAGGCCGA GCCCTTGCAG ATACCCCCGC TGTCCGCCTT CGACACTTTG CTCAAGGCAA GCGGTGAGGG AAGGGAATCT TCCACCACCA TGGCTTTTGT GCGCATCCTC AATATCCTCA TAAAGGACAA GAACATCGGC AAGCGGGTCG TGCCGATTGT GGCGGACGAG TCGCGCACCT TTGGCATGGA AGGCATGTTC CGGCAACTGG GCATCTGGTC TTCCACCGGG CAGCTTTACA CGCCCGAGGA TGCTGAGCAA CTGATGTACT ATAAGGAAGA CAAGAACGGC CAGATCCTGC AGGAAGGTAT CAATGAGGCG GGGGCCATGT CATCATGGAT GGCCGCGGCA ACTGCCTACA GTTCTCATGG CGTGCAGATG ATCCCGTTCT ACATTTACTA TTCGATGTTC GGCTTCCAGC GCGTGGGGGA TCTTTGCTGG GCGGCGGGAG ACATGCGCTG CCGCGGCTTC CTGCTGGGGG GCACTGCCGG CCGCACCACA CTGAATGGCG AGGGATTGCA GCATGAGGAC GGTCACAGCC ATCTGGCGGC ATCGACTGTT CCCAATTGCA TATCCTATGA CCCGACCTTT GCCTACGAAC TCACGGTCAT TATCCGGGAT GGCCTGCGCC GCATGTGCGA AATGCAGGAG GATGTCTATT ACTACATCAC AGTGATGAAC GAAAACTATT CGCATCCGGA AATGCCTGCA GGGGCGGAAG AAGGAATTCT GAAAGGCATG TACCTGTTCC GTGAAGGCAA GCCGGCAGGA GAAAAGGATT CCGGGTTGCG TGTTCAATTG CTCGGGTCCG GCGCCATTCT GCGCGAGGTG ATTGCTGCGG CTGAAATACT GGAGGAGGAA TTCGGTGTGA CGGGTGATAT CTGGAGTGTA ACCAGCTTTA CCCAGTTGAG ACGCGAAGCG CTGGCGACAA CGCGCTGGAA CATGCTGCAC CCCACAGAAC CGGCAAGGCT GTCGCACGTC GGCACGTGTC TCAAGGACCG GGAAGGTCCA GTGGTGGCGG CTACGGACTA CATGAAGATT TTTGCCGACC AGATACGTGA ATTTATTCCG GGGCGATACA AGGTGCTGGG TACAGATGGA TTCGGACGTT CCGATACGCG CGAGCAATTG CGCCGCTTCT TCGAGGTCGA CCGCCATTAC ATTACGATAG CCGCTCTGAA AGCGCTGGCC GAAGATGGCA GAATCGAGGC GGAAAGGGTG GCGCAGGCCA TGGGCAAATT CGGCCTCGAT CCTGATAAAC CCAATCCGAT GACCATATAA
|
Protein sequence | MEPIPDIDPT ETQEWLEALE SVLTHEGTER AHYLLERLVE KARRSGAYLP YSATTAYINT IPPGKEEWSP GNHALEHRIR SYVRWNSMAM VLRANRNSNV GGHIASFASA ATLYDVGYNH FWHARSENHG GDLIFAQGHS SPGLYAYAFL LGELTEEQLN NFRREVGGKG LSSYPHPWLM PDFWQFPTVS MGLGPLMAIY HARFMKYLDS RGLVKTEGRK VWAFMGDGEM DEPESLGSIS LASRENLDNL IFVINCNLQR LDGPVRGNGK IIQELEAAFR GSGWNVIKVI WGSYWDPLLA KDTKGLLQQR MMECVDGEYQ TFKSRDGAYV REHFFGKYPE LLEMVANMSD DDIWRLNRGG HDPHKVYAAY SAAVKHKGQP TVILAKTIKG YGMGEAGEAQ NITHQQKKMG TTSLKAFRDR FGLPISDDDI ESVPYLKFDK DSPESIYMHQ RREALGGFIH RRQRKAEPLQ IPPLSAFDTL LKASGEGRES STTMAFVRIL NILIKDKNIG KRVVPIVADE SRTFGMEGMF RQLGIWSSTG QLYTPEDAEQ LMYYKEDKNG QILQEGINEA GAMSSWMAAA TAYSSHGVQM IPFYIYYSMF GFQRVGDLCW AAGDMRCRGF LLGGTAGRTT LNGEGLQHED GHSHLAASTV PNCISYDPTF AYELTVIIRD GLRRMCEMQE DVYYYITVMN ENYSHPEMPA GAEEGILKGM YLFREGKPAG EKDSGLRVQL LGSGAILREV IAAAEILEEE FGVTGDIWSV TSFTQLRREA LATTRWNMLH PTEPARLSHV GTCLKDREGP VVAATDYMKI FADQIREFIP GRYKVLGTDG FGRSDTREQL RRFFEVDRHY ITIAALKALA EDGRIEAERV AQAMGKFGLD PDKPNPMTI
|
| |