Gene Information |
Locus tag | Noca_3769 |
Symbol | |
ID | 4597515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 3990516 |
End bp | 3991550 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639778377 |
Product | pyruvate dehydrogenase (acetyl-transferring) |
Protein accession | YP_924956 |
Protein GI | 119717991 |
COG category | [C] Energy production and conversion |
COG ID | [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit |
TIGRFAM ID | |
|
|
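The coordinate and length fields above agree with one another, which a short check can confirm. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed for Noca_3769.
length_bp = gene_length(3990516, 3991550)
print(length_bp, protein_length(length_bp))  # 1035 344
```

The result matches the listed Gene Length (1035 bp) and Protein Length (344 aa).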
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0012564 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACTC AGGCAGCGGC ACAGCCCGCG ACTCCCCAGC AGGCCGGCGG CGGGACGGGC CTCGACCCCG CCGTGCGGCT CGACCTCTAC GAGACGATGG TGCTCTCGCG CACCTACGAG GAGGCGATCC TGCGCGAGTA CCACGCGGAC AAGGGCCCGG GCTTCGACAT CGGCAAGGGC CTGATCCCGG GCGAGATGCA CCTGTCCGCG GGTCAGGAGC CCGTCGCCGC CGGGGTCTGC GCGCACCTCA CCACCGACGA TGCCGTCACC GCCACCCACC GGCCGCACCA CTTCGCGGTC GCGCACGGCG TGGACCTGCG GCGGATGACC GCCGAGATCT TCGGCCGCGA GGACGGGCTG GGCCGAGGGC GGGGCGGCCA CATGCACCTG TTCGACCCGG ACACGCACTT CTCCTGCTCC GGCATCATCG CCGAGGGCTA TCCGCCCGCT CTCGGCCAGG CCTTCGCGTT CCACCGCCAG GGCACCGACC GGATCGCCGT GGCCGTCACC GGCGAGGGCG CGGCCAACCA GGGCGCGTTC CACGAGTCGT TGAACCTCGC CGCCCGCTGG TCGTTGCCGG TCGTCTTCGT CGTGGAGGAC AACGACTGGG GGATCTCGGT GCCGCGGACC GCGTCGACCT CCGTGGCCTC GAACGCCGAT CGGGCCGCGG CGTACGGCAT CCCCGGCGAG CGGATCGAGG GCAACGACGT CGAAGGGGTG TACGACGCCG CGAGGCGTGC GGTCGCCCGC GCCCGTGCCG GGGAGGGCCC CTCGCTCATC GAGGTGCACA CGCTGCGGTT GTGGGGCCAC TTCGAGGGCG ACGCCCAGGG CTATCGGCTC GACCTGGAGG ACGCGCCGAG CCACGACCCG ATCCCCCGCT ACGAGACCCG GCTCCGCGAG GCCGGCGTAC TCGACGACGA AACGGTCACG CGGATCAGGA GCGCCGCCAG CGAGCGCACC GAGGACGCGA TCGCGTTCGC GAAGAACAGC CCCGTGCCGG ACCCTGCGTC GGCCACGTCC TACGTGTTCG CCTGA
|
Protein sequence | MTTQAAAQPA TPQQAGGGTG LDPAVRLDLY ETMVLSRTYE EAILREYHAD KGPGFDIGKG LIPGEMHLSA GQEPVAAGVC AHLTTDDAVT ATHRPHHFAV AHGVDLRRMT AEIFGREDGL GRGRGGHMHL FDPDTHFSCS GIIAEGYPPA LGQAFAFHRQ GTDRIAVAVT GEGAANQGAF HESLNLAARW SLPVVFVVED NDWGISVPRT ASTSVASNAD RAAAYGIPGE RIEGNDVEGV YDAARRAVAR ARAGEGPSLI EVHTLRLWGH FEGDAQGYRL DLEDAPSHDP IPRYETRLRE AGVLDDETVT RIRSAASERT EDAIAFAKNS PVPDPASATS YVFA
|
| |
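The listed gene and protein sequences can be related with a small translation routine. What follows is a minimal sketch, not the pipeline that produced this record. It assumes the sequence is given in the space-separated 10-base groups shown above, and it uses the standard sense-codon assignments, which translation table 11 shares with the standard code. Alternative start codons permitted by table 11 are not handled; this gene starts with ATG, so that does not matter here. All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence above.
print(translate("ATGACCACTC AGGCAGCGGC ACAGCCCGCG"))  # MTTQAAAQPA
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare against the listed protein sequence and GC content.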