Gene Information |
Locus tag | Noca_4858 |
Symbol | |
ID | 4595239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008697 |
Strand | - |
Start bp | 187761 |
End bp | 190481 |
Gene Length | 2721 bp |
Protein Length | 906 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639772644 |
Product | 2-oxoacid dehydrogenase subunit E1 |
Protein accession | YP_919304 |
Protein GI | 119714162 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
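The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 187761
end_bp = 190481
gene_length_bp = 2721
protein_length_aa = 906

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon of the CDS is the stop codon, so a CDS of
# N codons encodes N - 1 residues.
codons, remainder = divmod(gene_length_bp, 3)

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```

Under these assumptions the three checks agree with the record: 190481 - 187761 + 1 = 2721 bp, and 2721 / 3 = 907 codons, one of which is the stop.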
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0734587 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGCTG ACCTTCAAGG CTGGCGACAC GTCAGCGATC GGCTGGTCCA GCCGAGAGAC ATCGACCCCG ACGAGACCCA GGAATGGGTC GAGTCGCTCG ACGCGGTGGT TGACCACGCC GGTCAGGCGA GGGCTCGTGA GTTGTTGCTG CGCCTGCAGC GGCGGGCTCA GCGTCGAGGC GTGCTGATGC CCGGGTGGGG CCCGTCCGAC TACATCAACA CGATTCCCAC TGCTGCCGAG CCTGCGTACC CCGGCGATGA GGATCTCGAA CGGCGGATCC GTGCGGCCGT GCGGTGGAAC GCCGCGGTGA TGGTGCACCG TGCGCAACGC CCGGGGCTCG GTCTAGGAGG TCACCTGTCG ACGTACGCGT CGACTGCCAC GTTGTTCGAG GTCGGGTTCA ACCACTTCTT TCGCGGCCCG GACGCGGATG GAGATGGGGA CCAAGTCTAC TTCCAGGGCC ATGCGTCACC GGGGGTTTAC GCGCGCGCCT TCCTCGAGAA CCGCCTGGAC GAGGGCCAGC TGGACGGCTT TCGCCAGGAA CTCTCCCACG GCGGCCTAGG AGCTGGCCTT CCGTCCTACC CGCACCCGCG TCTCATGCCC ACCTTCTGGG AGTTCAGCAC CGTCTCGATG GGGCTGAGCC CGATCAACGC GATCTACCAG GCTCGCTTCA ACCGCTACCT GCGCGACCGG GGGATCAAGG ACACCAGTCG CCAACACGTG TGGGCCTTCC TCGGGGATGG CGAGATGGAC GAACCCGAGG CGCTGGGCGC ACTGGGCGTG GCCGCTCGGG AAGAGCTGGA CAACCTGACG TTCGTGGTCA ACTGCAACCT GCAACGGCTG GACGGCCCGG TGCGGGGAAA CGGCAAGATC ATCCAAGAGC TCGAGTCCTA CTTCCGTGGC GCCGGTTGGA ACGTCATCAA GGTCATCTGG GGACGCGAGT GGGACCGGCT CTTAGACACT GACATCCAGG GCGCCCTGGT CGATCTGATG AACGTCACGC CAGACGGGGA CTTCCAGACC TTCAAGGCAG AGTCCGGCCA CTTCGTGCGC GAGCACTTCT TCGGACGGGA TCCGCGTGCC CTCCGGCTGG TCGAGACCCT GACGGACGAG GAGATCTGGG ACCTCAAACG CGGGGGCCAC GACACCCACA AGGTCTTCGC CGCCTACGAC GCCGCACTGC GTCACGTCGG CCAGCCGACC GTCATCTTGG CCCAGACCAT CAAGGGATAC GGTCTCGGTT CTCACTTCGA GAGCCGCAAC TCCACCCACC AGATGAAGAA GCTCACCGTC GAGGACCTCA TTGAGCTACG CGATCGTCTG CGTCTCCCGA TCCCGGACCA CCAGCTCGAC CATGACCTTC CGCCCTACTT CAAACCTGCC GAGGGTTCAC CCGAGCTGGA ATACCTGGAG GAGTGCCGAC TCCGCCTCGG TGGGCACCTC CCGAGCCGAC GCGTCCGCAG CCGGCCCCTC GTGCTGCCGG GCGACACGGC GTACGCGGTG GCCACGCTCG GGTCCCACCA GCCGGTCGCC ACCACCATGG CCTTCGTCCG CCTGCTCCGC AGCCTGATGA ACGACCCCGC GATCGGGCAT CGCTTCGTCC CGATCATCCC CGACGAGGCC CGCACCTTCG GACTGGATGC GTTGTTCCCC ACCAAGAAGA TCTACTCTCC GCAGGGACAG CACTATCTCT CGGTCGACCG TGGGCAGCTG CTGAGCTACC AGGAGGACAC CGCGGGGGTG GTGCTGCACG AGGGACTGAC CGAAGCAGGC TGCACTGCCT CTTGGACCGC CGCGGGAACG 
TCGCACGCCA CCCATGACGA GCCGATGATC CCGATCTATG TCTTCTACTC GATGTTCGGC TTTCAGCGCA CCGGCGACCT GTTGTGGGCG GCAGCTGACC AGCTGACCCG AGGGTTCCTG CTCGGTGCGA CCGCAGGCCG CACCACGCTC AATGGAGAAG GCTTGCAGCA TCAGGACGGA CACTCCCAAC TTCTTGCTGC AACCAATCCT GCGTGCGTCT CCTACGACCC CGCCTACGCC TACGAGATCG GCCACATCGT CCGCGACGGG CTACGAAGGA TGTACGGCGA GCCCGCGCTC GGGGAAGACC CGGACGTCTT CTACTACCTC ACCCTCTACA ACGAGCCGAT CGTGCAACCC GCCGAACCCG ACGACGTGGA CATCGAAGGA ATCCTCGCCG GCATGCACCG CATCTCGCCG GCCCCGGCAA GTGAGCGGCC GGGCGCGCAG ATCCTCGCGT CGGGCATCGC GGTTCCCTGG GCGCTGGAGG CACAAGCGAT GCTTCACGAC GATTGGGACG TGCACGCCGA CGTGTGGTCG GTGACCTCGT GGAACGAGCT ACGCCGGCGG GCGCTCACCA TCGACGACTG GAACCACGTC CACCCCGATG GGCCGCGCCG GGTCCCCTAT GTGACCAGGC GACTGCTCGG ACAGCGCGGG CCGGTCGTAG CGGTGTCGGA CTGGATGCGC GCGGTGCCCG ACCAGATCGC CCCGTTCGTG GACGCAGACT GGTCCTCGTT GGGCACAGAC GGGTTTGGCC TCTCCGACAC ACGCGCTGCG CTCCGTCGGC ACTTCCGGGT CGACCCACCC TGGATCGTCG CGCGCGTGCT CGCCCACCTC GCCCGCCAAG AGCAGGTCGA TCAGACCGCA CCAGCCAGGG CGATCAAGCG GTATCAGGCG ACCTCCCTCC ATGCGGGCTG A
|
Protein sequence | MTADLQGWRH VSDRLVQPRD IDPDETQEWV ESLDAVVDHA GQARARELLL RLQRRAQRRG VLMPGWGPSD YINTIPTAAE PAYPGDEDLE RRIRAAVRWN AAVMVHRAQR PGLGLGGHLS TYASTATLFE VGFNHFFRGP DADGDGDQVY FQGHASPGVY ARAFLENRLD EGQLDGFRQE LSHGGLGAGL PSYPHPRLMP TFWEFSTVSM GLSPINAIYQ ARFNRYLRDR GIKDTSRQHV WAFLGDGEMD EPEALGALGV AAREELDNLT FVVNCNLQRL DGPVRGNGKI IQELESYFRG AGWNVIKVIW GREWDRLLDT DIQGALVDLM NVTPDGDFQT FKAESGHFVR EHFFGRDPRA LRLVETLTDE EIWDLKRGGH DTHKVFAAYD AALRHVGQPT VILAQTIKGY GLGSHFESRN STHQMKKLTV EDLIELRDRL RLPIPDHQLD HDLPPYFKPA EGSPELEYLE ECRLRLGGHL PSRRVRSRPL VLPGDTAYAV ATLGSHQPVA TTMAFVRLLR SLMNDPAIGH RFVPIIPDEA RTFGLDALFP TKKIYSPQGQ HYLSVDRGQL LSYQEDTAGV VLHEGLTEAG CTASWTAAGT SHATHDEPMI PIYVFYSMFG FQRTGDLLWA AADQLTRGFL LGATAGRTTL NGEGLQHQDG HSQLLAATNP ACVSYDPAYA YEIGHIVRDG LRRMYGEPAL GEDPDVFYYL TLYNEPIVQP AEPDDVDIEG ILAGMHRISP APASERPGAQ ILASGIAVPW ALEAQAMLHD DWDVHADVWS VTSWNELRRR ALTIDDWNHV HPDGPRRVPY VTRRLLGQRG PVVAVSDWMR AVPDQIAPFV DADWSSLGTD GFGLSDTRAA LRRHFRVDPP WIVARVLAHL ARQEQVDQTA PARAIKRYQA TSLHAG
|
| |
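The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below translates only the first 30 bases as a spot check. The codon table is deliberately partial (just the codons in that prefix), the start-codon set is a subset of those allowed by table 11, and the function name `translate_prefix` is hypothetical.

```python
# First 30 bases of the gene sequence and first 10 residues of the
# protein sequence, copied from the record.
gene_prefix = "GTGACCGCTGACCTTCAAGGCTGGCGACAC"
protein_prefix = "MTADLQGWRH"

# Partial standard codon table: only the codons that occur in gene_prefix.
CODONS = {
    "GTG": "V", "ACC": "T", "GCT": "A", "GAC": "D", "CTT": "L",
    "CAA": "Q", "GGC": "G", "TGG": "W", "CGA": "R", "CAC": "H",
}

# Subset of the initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a coding-strand prefix, reading an initial start codon as M."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiation codon is read as methionine
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


translated = translate_prefix(gene_prefix)
print(translated, translated == protein_prefix)
```

For a full translation, a complete codon table (for example from an established bioinformatics library) would be needed; the partial table here raises `KeyError` on any codon outside the prefix.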