Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2105 |
Symbol | aceE |
ID | 4599949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 2248629 |
End bp | 2251307 |
Gene Length | 2679 bp |
Protein Length | 892 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639776708 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_923301 |
Protein GI | 119716336 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00865423 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCACTG CGAACCTCCC CCACGACCCG ATCGACTGGC ACGACGAGCG CGCGGAGTGG CTCGACTCGG TCAGCGGCCT GGTCCGCACC CACGGACCGC ACGCCGCGAG CCTGCTCCTC CGCGACGTGG CCACCCACGC CGGCCGGCTC GGGGTCGCGA CCGGGCCGCG TACGCCGTAC CTCAACACGA TCCCGACCCA GGAGTGGGCC GCCTATCCCG GCGACCTCGC CCTCGAGGAG CGGATCGACG CCTACCTGCG GTGGAACGCG ATGGCGATGG TGGTCCGGGC CAACAAGCGC CACGCCGGCC TCGGCGGGCA CCTGTCGACC TACGCGTCCA CGCTGACCCT GTGGGAGGTC GGCTTCCAGC ACTTCTTCCG CGGCCGCGGC TCGGCGGGCG AGGCCGTCGT CCCCGGAGAC CAGGTGTTCT TCCAGGGCCA CGCCAGCCCG GGCATCTACG CCCGGGCCTT CCTCGAGGGC CGGCTGAGCG CCGAGCAGCT CGACCACTTC CGGCGCGAGG TAGCCGGCGC GGGGCAGGGC CTCCCGTCGT ACCCGCACCC CCGGAGCATG GCCGGCTTCT GGGAGTTCCC GACCGTGAGC CTCGGCATCG GCCCGCTGCA CGCCGTCTAC CAGGCCCGAT TCAACCGCTA CCTCGCCGCG CGCGGCCTCG CCGACACGGC GGCCGCCCGG GTGTGGTGCT TCGTCGGCGA CGGCGAGATC GACGAGCCCG AGACGCTGGC CGCGATCCGG CTGGCCGCCC GCGAGCACCT GGACAACATC GTCTTCGTCG TCAACGGCAA CCTCCAGCGC CTCGACGGGC CGGTCCGCGG CAACAGTCGC GTCCTCGACG AGCTCGAGGG CATCTTCGCC GGCGCGGGTT GGCACGTCCT CAAGGTGCTG TGGGGCAGCC GGTGGACCTC GCTCTTCGAG CGCCCCGGCG GCGAGGCCCT GCTCGACCGG CTCGAGGCGA TGAACGACGG GGACCTGCAG CGGCTGGCGA TCCTCGAGCC CGCCGAGCTG CGCGCGACCC TGTTCGGCGG CGGCGACGCC GCGCTGGAGG CGCTCGGCGC GACGCTCTCC GATGCCGACC TCGCCGGGCT GCGGCGCGGC GGGCACGACC CGCGAGCCGT CTACGCTGCG TACGCCGAGG CCGTCGCGCA CCGGGGCGAG CCCACCGTGG TGCTGGCGCA GACGATCAAG GGCTACGCGC TCGGCCCGAA CTTCGAGGGC CGCAACGCCA CTCACCAGAT GAAGAAGATG ACGCCGGACC AGCTGCGCAT CTTCCGCGAC ATCCTCCGGG TGCCGGTGAC CGACGAGGAG CTGGCCGACG GGCTGCCGCC GTACCTGCCG CTGCCCGAGG GCTCGCCGGA GCTGGAGTAC CTCCACGCCC ACCGGCGCAC GCTCGGCGGC GCGCTCCCCC GCCGGCCGCT GCGCCCCGCC GTCCTGGCCG CCCAGCCGGC CGACGCGCCC TTCACCCAGT TCGACGCCGG GTCCGGCAAG CGCACCGTGT CGACCACGGT CGCCTACACC CGGCTGCTGC ACTCGCTGAT GCGCGACCCC GAGGTCGGGC GGCGAATCGT GCCGATCGTC CCGGACGAGG GCCGCACCTT CGGCTACGAG CCGCTGTACA GCGAGTTCGG CATCTACGCG CCCGACGGGC AGCAGTACAC GCCGGTCGAC GCCGGCCTGC CGCTGTCCTA CCGGGAGAGT GCGGACGGGC AGGTGCTGCA GGAGGGGATC ACCGAGGCCG GCGGGCTCGC CGAGCTGACG GCGGCGGCCA CGGCCGGGCA CACCTGGGAC ACCGCGGTCG TCCCGTTCTT CACCTACTAC TCGATGTTCG GCTTCCAGCG CGTCGGCGAC CTGATCTGGG CCCTCGCCGA CGCCCGGGGC CGCGGGTTCC TGGTCGGCGC GACGGCCGGT CGTACGACGC TCGCCGGCGA GGGCCTGCAG CACACCGACG GCTCCTCGCA GCTGGCCGCG CTCGCCGTCC CGAGCTGCCA CGCCTACGAC CCGGCCTTCG CCTACGAGAC CGCCACCATC GTCCGCGACG GGATCCGGCG GATGTACGAC GCCGGCGAGG AGTCCTTCTA CTACCTGACC GTCTACAACG AGGACTACGT CCAGCCGGTC AAGCCGGTCG ACTCCGGTGT CCGCAGCGTC AGCGTGGACG AGGCGATCGT CGCCGGCCTG TACCGGGTCG ACGCGACCGA GGGCAATCGG CTCCCCCAGG TCCGCCTCCT GGCCTCCGGG CCGGCCGTCC GCACCGCGCA GCAGGCGGCC GCGGACCTGG CCGCGCGTGA CGGCGTCACC GCCGAGGTGT GGTCGGTGAC CAGCTGGAAG GCGCTGCGCG ACGACGCGCT CGAGACCGAG CGCTGGAACC GCGAGCACCC CCACGAGCCG GCCCGCGTCA GCTACCTGCG CCGTGCCCTC GGCGACCTGC CGGTGCCGGT GGTCGCCGTG AGCGACTACG TGTCCGCGTT GCCGGACCAG ATCGCGCGCT TCGTCGACGC CCCCTTCGTG AGCCTCGGCA CCGACGGCTA CGGCCTCTCC GACACGCGCG AGGAGCTCCG CGCCCACTTC GGTGTCGACG CGGAGGGGAT CCGCGCGACG GCTGCCGGGG TCGCCGACCG GCCGGCGACC CACCGCGCGG CCCGCCCCGA CGAGGGCCTG GTGGCCTGA
|
Protein sequence | MTTANLPHDP IDWHDERAEW LDSVSGLVRT HGPHAASLLL RDVATHAGRL GVATGPRTPY LNTIPTQEWA AYPGDLALEE RIDAYLRWNA MAMVVRANKR HAGLGGHLST YASTLTLWEV GFQHFFRGRG SAGEAVVPGD QVFFQGHASP GIYARAFLEG RLSAEQLDHF RREVAGAGQG LPSYPHPRSM AGFWEFPTVS LGIGPLHAVY QARFNRYLAA RGLADTAAAR VWCFVGDGEI DEPETLAAIR LAAREHLDNI VFVVNGNLQR LDGPVRGNSR VLDELEGIFA GAGWHVLKVL WGSRWTSLFE RPGGEALLDR LEAMNDGDLQ RLAILEPAEL RATLFGGGDA ALEALGATLS DADLAGLRRG GHDPRAVYAA YAEAVAHRGE PTVVLAQTIK GYALGPNFEG RNATHQMKKM TPDQLRIFRD ILRVPVTDEE LADGLPPYLP LPEGSPELEY LHAHRRTLGG ALPRRPLRPA VLAAQPADAP FTQFDAGSGK RTVSTTVAYT RLLHSLMRDP EVGRRIVPIV PDEGRTFGYE PLYSEFGIYA PDGQQYTPVD AGLPLSYRES ADGQVLQEGI TEAGGLAELT AAATAGHTWD TAVVPFFTYY SMFGFQRVGD LIWALADARG RGFLVGATAG RTTLAGEGLQ HTDGSSQLAA LAVPSCHAYD PAFAYETATI VRDGIRRMYD AGEESFYYLT VYNEDYVQPV KPVDSGVRSV SVDEAIVAGL YRVDATEGNR LPQVRLLASG PAVRTAQQAA ADLAARDGVT AEVWSVTSWK ALRDDALETE RWNREHPHEP ARVSYLRRAL GDLPVPVVAV SDYVSALPDQ IARFVDAPFV SLGTDGYGLS DTREELRAHF GVDAEGIRAT AAGVADRPAT HRAARPDEGL VA
|
| |