Gene Information |
Locus tag | Noc_1254 |
Symbol | aceE |
ID | 3706366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 1378475 |
End bp | 1381150 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637737756 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_343285 |
Protein GI | 77164760 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
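The start, end, gene length, and protein length fields above are linked by simple arithmetic. The following is a minimal sketch of that check. It assumes the coordinates are 1-based and inclusive (the listed values are consistent with this) and that the unspliced CDS ends in a single stop codon. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1378475, 1381150)
print(gene_len, protein_len)  # 2676 891
```

Both values match the Gene Length and Protein Length fields in the record.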
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGAGA ATAAGGCGCA GACCATTGAA GATATCGAGG CTCAAGAAAC CCGGGAATGG TTGGAGTCTC TGGATTATGT CTTGCAGCAA GGCGGCCCCC AACGTACGGT ACGGCTGCTG GATCGTTTGC GGCTTCATGC CCAAAAAGCA GGGGTAAGCC TGCCTTATCC GGCCAACACG CCTTATATTA ATACCATTCC GGTGGAGCAA GAGACTTCTT TTCCTGGCAG TCAGGAAATC GAGCGGCGTA TTCGGAGTTT GGTGCGCTGG AATGCCATGG CCATGGTAGT GCGGGCTAAC CGGGAAGAAG AGGGGATTGG TGGCCATATT TCCACCTTCT CTTCAGCGGC TACCCTCTAT GAAATTGGTT TTAACCATTT TTTCCGAGCC AGAAATGAAG AGCAAGAGGC CGACATTGTT TATTTTCAAG GCCACGCCTC GCCGGGTCCC TACGCTCGCG CTTTTCTTGA AGGCCGTTTA TCCGAGCAGC AGCTGGAAAA CTTTCGCCGG GAGTTGAAAC CGGAAGGAGG CTTGCCATCC TATCCCCATC CTTGGCTCAT GCCTGATTTT TGGGAATTCC CCACGGTCTC TATGGGACTT GGTCCCATTA TGGCTATTTA TCAAGCCCGG TTTAACAGTT ATTTAGAGGA CCGGGGGCTG AAAAAACCCT CCGGACAGAA AGTTTGGGCC TTTATTGGGG ATGGGGAAAC CGATGAGCCG GAAACGTTGG GGGCGATTAG TCTGGCCGTC CGGGAACGTT TAGATAATCT TATTTTTGTC GTCAACTGCA ATCTCCAGCG GCTCGATGGC CCGGTGCGGG GCAACGGAAA GATCATCCAG GAGTTGGAAG CTATTTTTCG AGGTGCCGGC TGGAACGTTA TTAAAGTGAT TTGGGGTCGA GATTGGGACC CGTTGCTGGC CAAAGACTAT GAAGGCGTGC TGGTCCGGCG GATGGAACAG GCCGTCGATG GAGATTATCA AAAGTACGCT GTTGAATCCG GGAGCTATAT TCGCAAACAT TTCTTTGGCA CGGATCCTCG CCTTCAGGAG ATGGTCAAAC ACCTTTCGGA CGAGCAATTA CGCCGCCTGC GCACGGGAGG GCATGATCCA GAGAAAGTCT ATGCGGCTTA TAAAGCGGCG GTAGAGCATC AGGGTTCACC GACCGTCATT TTGGCTCGAA CCATTAAAGG CTATGGCTTG GGTGAGGCGG GTGAGGGAAA AAATATTACC CACCAGCAAA AGAAATTAAA TGAGCAGGAA CTTCGTGATT TCCGGACTCG CTTCGGCATT CCTATTTCTG ATAACCAGGT AGCTCAGGCA CCTTTTTATA AACCGCCGGA AGAGAGTCCG GAATTTAAAT ATCTGCATGA GCGCCGCCAG GCTCTAGGCG GTTATCTACC GGCGCGGCAG GTGCGCGATG AGTCCCTGCA AGTGCCTTCC GAGGAGTTTT TCGCTGAATT CCATGCCGGC ACCGGTGAGC GGACCATGGC AACCACCATG GCTTATGTCA GAGTGCTGAC CAAATTACTC CGCGACCCTG AAATCGGCAA GCTCATTGTC CCTATTATCC CCGATGAGGC GCGCACCTTT GGCATGGAAT CCCTTTTCCG CCAGGTGGGA ATTTACTCCC ACGTGGGCCA GCTTTATGAG CCAGTGGATA GTTCCACCTT GCTCTATTAT AAAGAAGCCA CTGATGGGCA GATTCTGGAA GAGGGAATTA CCGAGGCGGG TTCTATGTCC TCCTTTATTG CCGCGGGTAC GGCCTATGCC ACCCATGGAA TCAGCACCAT TCCTTTTTTC 
ACTTTCTATT CCATGTTTGG CTTTCAGCGC ATTGGGGATC TCATTTGGGC GGCCGGGGAT ATGCGCTGCC GGGGCTTCCT GGTGGGGGCA ACAGCGGGCC GCACCACGCT TGCCGGCGAA GGGCTTCAGC ATCAGGATGG TCAAAGCCAA GTGCTGGCCT ATTCTGTTCC TAACCTGAAG GTCTACGATC CCGCCTATGC TTATGAAATT GCGGTCATTA TCCAGGATGG CATGCGCCGT ATGTATCAAC AGCAGGAGGA TATTTTCTAT TATCTAACGG TGACCAACGA GCTCTACCCC ATGCCAGAGA TGCCTGACGG GGTTCGGGAA GGCATTTTAA AGGGCATGTA CCGGCTGAAA ACCACCGCGA ACCCGGATGC GAAGCTGAGG GCGCAGCTTT TTGGCAGTGG CGCTATTTTA AATGAAGTCC TTAAGGCCCA GGCGTTGCTG GAAGATTATC AGGTGGCGGC AGATATTTGG AGCATTACCA GCTATAAGGA GCTTTACTTG GACGGTCATT CCGTAGAGCG CTGGAATATG TTGCATCCGC TGGAATCGCC CCGGGTGCCG TATGTGAAAC AATGCTTAGA AGAAGCCCCG GGGGTTTTTG TGGCCGCTTC CGACTACTTA AAGGTATTAC CGGACTCAAT TTACCGTTGG TTTCCTAGGC CGGTGGTTGC CTTGGGGACG GATGGATTTG GCCGTAGTGA CGGCCGCCGG GCGTTGCGCG ATTTTTTTGA AGTCGATGCC CGGTTTATTA CCCTGGCGAC CCTGGCGGCG CTAGCCCGTG AAAAGAAGCT GGAGCCAGGG GTGGTCCAAC AAGCGATTCA GGATCTGAAG ATCGATCCAG AAAAAGTCAA TCCCATGATT TCTTAA
|
Protein sequence | MAENKAQTIE DIEAQETREW LESLDYVLQQ GGPQRTVRLL DRLRLHAQKA GVSLPYPANT PYINTIPVEQ ETSFPGSQEI ERRIRSLVRW NAMAMVVRAN REEEGIGGHI STFSSAATLY EIGFNHFFRA RNEEQEADIV YFQGHASPGP YARAFLEGRL SEQQLENFRR ELKPEGGLPS YPHPWLMPDF WEFPTVSMGL GPIMAIYQAR FNSYLEDRGL KKPSGQKVWA FIGDGETDEP ETLGAISLAV RERLDNLIFV VNCNLQRLDG PVRGNGKIIQ ELEAIFRGAG WNVIKVIWGR DWDPLLAKDY EGVLVRRMEQ AVDGDYQKYA VESGSYIRKH FFGTDPRLQE MVKHLSDEQL RRLRTGGHDP EKVYAAYKAA VEHQGSPTVI LARTIKGYGL GEAGEGKNIT HQQKKLNEQE LRDFRTRFGI PISDNQVAQA PFYKPPEESP EFKYLHERRQ ALGGYLPARQ VRDESLQVPS EEFFAEFHAG TGERTMATTM AYVRVLTKLL RDPEIGKLIV PIIPDEARTF GMESLFRQVG IYSHVGQLYE PVDSSTLLYY KEATDGQILE EGITEAGSMS SFIAAGTAYA THGISTIPFF TFYSMFGFQR IGDLIWAAGD MRCRGFLVGA TAGRTTLAGE GLQHQDGQSQ VLAYSVPNLK VYDPAYAYEI AVIIQDGMRR MYQQQEDIFY YLTVTNELYP MPEMPDGVRE GILKGMYRLK TTANPDAKLR AQLFGSGAIL NEVLKAQALL EDYQVAADIW SITSYKELYL DGHSVERWNM LHPLESPRVP YVKQCLEEAP GVFVAASDYL KVLPDSIYRW FPRPVVALGT DGFGRSDGRR ALRDFFEVDA RFITLATLAA LAREKKLEPG VVQQAIQDLK IDPEKVNPMI S
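The sequence fields are printed in blocks of ten characters separated by spaces, so they need to be cleaned before any analysis. The following is a minimal sketch, assuming plain whitespace-separated blocks. The helper names `clean_sequence` and `gc_percent` are hypothetical. The example uses only the first two blocks of the gene sequence, so its result is not the 53% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace between sequence blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence from the record.
example = clean_sequence("ATGGCGGAGA ATAAGGCGCA")
print(len(example), gc_percent(example))  # 20 55.0
```

The same `clean_sequence` helper applies to the protein sequence field, which uses the same ten-character block layout.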