Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4804 |
Symbol | |
ID | 9248687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5693801 |
End bp | 5695027 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | pyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit |
Protein accession | YP_003682694 |
Protein GI | 297563720 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGGG TGGCTGCGCA CGCCACGGTC GCACCCGACA GCGCGGCCAC GGCGGGCCTG GCACCCCGGT CCACAGACCC GGGGCCCCAG GCCTTCTCGT ACGTAGAAAG GCACACCATG GCTGACACGG CCAAGCCGAA GGGCCGCCCC GCCAGGTCCT CAGGGACCCG GCGGCGCACC ACCCGCAAGG ACACCTCCCC CGGCCTCGCT GACGAGAAGC CCGACACCCT GCTCGACTAC TACCGCCAGA TGCTGTTCAT CCGGCGGTTC GAGGAGCGCA CGGCCCAGGC CTACACCCAG GCCAGGATCG GCGGCTACTG TCACCTCAAC CTCGGCGAGG AGGCCACGGT CGTCGGCCTG ATGACCGCCC TCCAGGAGCG CGACTACCTG TTCACGAACT ACCGTGACCA CGGGTACGCG ATCGGCAAGG GCATGGACCC CAAGCGGGTC ATGGCCGAGC TCTACGGGCG CGTCGACGGC GTGTCCAAGG GCTGGGGCGG CTCGATGCAC ATGTACGACA CCGAGACCCG CATGCTCGGC GGCTACGGCA TCGTCGGCGG TCAGCTGCCG CTGGCCGCCG GCGCCGCGCT GGCCGTCTCC TACCGGGGCG GCGACGAGGT CGTCATGTGC CAGATGGGCG ACGGCACCAC CAACATCGGC GCCTGGCACG AGACGCTCAA CATCGCCAAG CTGTGGAACC TGCCGATCGT CTTCGTGGTG ATCAACAACT TCACCGGCAT GGGCACCACG GTCGAGATGT CCTCCGCCGA GCCCGAGCTG TACAAGCGCG GCTCCGCCTT CCGCATCGAG GGCGAGCGCG TGGACGGCCG CGACGTGCTC GCGGTCCGCG ACACCGCGTC CAGGCTCATC GAGCGCGCCC GCAAGGAGCA GACCCCGTTC CTGCTGGAGG CGTGGAGCTA CCGGATGAAG GGCCACTCCG TGGTCGACCC GGCCAAGTAC CGCACCGACG AGCAGAAGGA CGAGGCGCGG TCGGAGGAGA ACGACCCGAT CGCCCTGTTC GAGGCCAGGC TCACCGAGGA GGGCCTCCTC ACCGACGAGC TGCGCGAGGA GATCGCCGCC TCCGTCAAGG CCGAGGTGAC CGAGGCCGCG GACTTCGCCG AGAACAGCCC GCACCCGGAG GTCTCCACCC TCTTCGACTA CACCTACGCC ACCCCGGTGC CCAACGAGTC CACCCGCATG CCCGCCGACC CGGTGTTCGC GGAGTAG
|
Protein sequence | MSRVAAHATV APDSAATAGL APRSTDPGPQ AFSYVERHTM ADTAKPKGRP ARSSGTRRRT TRKDTSPGLA DEKPDTLLDY YRQMLFIRRF EERTAQAYTQ ARIGGYCHLN LGEEATVVGL MTALQERDYL FTNYRDHGYA IGKGMDPKRV MAELYGRVDG VSKGWGGSMH MYDTETRMLG GYGIVGGQLP LAAGAALAVS YRGGDEVVMC QMGDGTTNIG AWHETLNIAK LWNLPIVFVV INNFTGMGTT VEMSSAEPEL YKRGSAFRIE GERVDGRDVL AVRDTASRLI ERARKEQTPF LLEAWSYRMK GHSVVDPAKY RTDEQKDEAR SEENDPIALF EARLTEEGLL TDELREEIAA SVKAEVTEAA DFAENSPHPE VSTLFDYTYA TPVPNESTRM PADPVFAE
|
| |