Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5417 |
Symbol | |
ID | 9249320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 596980 |
End bp | 598095 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | pyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit |
Protein accession | YP_003683302 |
Protein GI | 297564329 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGGACA CGACCGTGCA CGAGGAACCG GAGCTGATCC AGCTCCTGAC ACCGGAAGGG GAGTTGACGG GGCACCCCGA CTACCCCCTG GACATCAGCG CCGAGGAGAT CCGCGCCCTG TACCGCGACC TGGTCCTGGT GCGCAGGTTC GACAGCGAGG CGGTCTCCCT CCAGCGCCAG GGCGAGCTGG GCCTGTGGGC CTCGCTGCTG GGCCAGGAGG CCGCGCAGAT CGGCTCCGCA CGCGCGCTGG GCGCGAAGGA CATGGCCTTC CCCTCCTACC GCGAGCACGG CGTCGCGTGG TGCCGGGGCA TCGAGCCCCG TGAACTGCTC GGCATGTTCC GCGGCGTCAC CAACGGGGGC TGGGACCCCC ACGAGCACGG CTTCCACCTG TACACGATCG TCATCGGCAG CCAGACCCTG CACGCCACCG GCTACGCCAT GGGCGTCCAG CGCGACGGCG CCGTCGGCGA GGACGGCACC GCCGTCATCT CCTACTTCGG CGACGGGGCC ACCAGCCAGG GCGACACCAA CGAGGCGTTC AACTTCGCCT CGGTCAACAA CGCCCCGGTG GTCTTCTTCT GCCAGAACAA CCAGTGGGCG ATCTCCGAAC CGCTGGAGCG CCAGGCCCGC GTGCCCATCT ACCGGCGCGC CGCCGGGTTC GGCTTCCCCG GCCTGCGCGT GGACGGCAAC GACGTCCTGG CCTGCCTGGC CGTGACCCGG GTCGCGCTGT CCAACGCCCG CGAGGGCAAC GGCCCCACGC TCGTGGAGGC GTTCACCTAC CGGATGGGCG CCCACACCAC CAACGACGAC CCCACCCGCT ACCGCGCGTC GGCCGAGCTC GACGAGTGGA AGGCCAAGGA CCCGATCCTG CGGGTCCGCC GCTACCTGGA GCGGGGCGGC CACGCCGACG AGGAGTTCTT CGCGTCCGTG GACGCCGAGG CGGACCGGCT GGGCGAGCAG GTGCGCACCG AGTGCCGTTC CCTGCCCGAC CCCGAGCCCC TCGACATCTT CCACGAGGTC TACGCCGAGC CCAACGTCCA CATCGACCAG CAGCGGTCCG AGTTCGCCGA CTACCTGGCC TCCTTCGAGG GCGCGGGCGC GGAAGGGGGC CGTTAG
|
Protein sequence | MSDTTVHEEP ELIQLLTPEG ELTGHPDYPL DISAEEIRAL YRDLVLVRRF DSEAVSLQRQ GELGLWASLL GQEAAQIGSA RALGAKDMAF PSYREHGVAW CRGIEPRELL GMFRGVTNGG WDPHEHGFHL YTIVIGSQTL HATGYAMGVQ RDGAVGEDGT AVISYFGDGA TSQGDTNEAF NFASVNNAPV VFFCQNNQWA ISEPLERQAR VPIYRRAAGF GFPGLRVDGN DVLACLAVTR VALSNAREGN GPTLVEAFTY RMGAHTTNDD PTRYRASAEL DEWKAKDPIL RVRRYLERGG HADEEFFASV DAEADRLGEQ VRTECRSLPD PEPLDIFHEV YAEPNVHIDQ QRSEFADYLA SFEGAGAEGG R
|
| |