Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5088 |
Symbol | |
ID | 9248977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 231122 |
End bp | 232213 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | thiamine pyrophosphate protein domain protein TPP-binding protein |
Protein accession | YP_003682975 |
Protein GI | 297564002 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.993677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.910187 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTGAGA ACGTGACGGG TACCGGCGCG AACGGCCACG TCCACGGGGT CCCGGAGGCC CTGGGCGGGC TGCGCCTGGT GCCCAGGACC GACACCGCGT ACAAGATGAA GGACTTCAAG TCCGACCAGG AGGTGCGCTG GTGCCCGGGC TGCGGCGACT ACGCGATCCT GGCCGCCTTC CAGTCCTTCC TGCCCGAGCT GGGCGTGCCG CGCGAGAACG TGGTGATGGT GTCGGGTATC GGCTGCTCCT CCCGATTCCC GTACTACCTG AGCACGTACG GCATGCACTC GATCCACGGG CGCGCCCCGG CGATCGCGAC CGGGCTGGCC ACCAGCCGCC CGGACCTGTC GGTGTGGGTG GTGACCGGTG ACGGCGACGG GTTGTCCATC GGCGGCAACC ACCTCGTCCA CGCGCTGCGC CGCAACGTCA ACATCAACAT CCTGTTGTTC AACAACCGGA TCTACGGGCT GACCAAGGGT CAGTACTCCC CCACCTCCGA GCCGGGCAAG ATCACCAAGT CCTCGCCGGT GGGGTCGCTG GACCACCCGT TCAACCCGCT GTCGCTGGCG CTGGGCGCGG AGGCCACGTT CGTGGCCCGC ACGATCGACT CCGACCGCAA GCACCTCACG TCGGTGCTGC GGGCGGCGGC CGACCACCCC GGCGCGTCGT TCGTGGAGAT CTACCAGAAC TGCCCGATCT TCAACGACGA CGCGTTCGAG CCGCTGAAGG ACCCGGCGGC GCGGGACGTC CGGCTGCTGC GCCTGGAGCA CGGCGAGCCG CTGCGGCTGG GCCCGGACCG GGGCGTGGTC GCCGGGGAGT TCGGCGGCTT GGAGGTCGTG GACGTGGACT CGGTGGGAGA GGACCGGCTG CTGCGGCACG ACGCGCACCG GGAGGACCCG GGGTACGCGT TCGCGCTGTC GCGCCTGGAC CAGCCCGCGT TCGAGCACGT GCCGATCGGG GTGCTGCGGG ACGTGCGCCG CCCGGCCTAC GACGAGCTGG TGAACGAGCA GGTGGCCGAC GCGCGGGCCG AGCGCGGCGC CGGCGAGCTG GCCGCCCTTC TGGCCAGCGG GGACACCTGG AGGGTGGAGT AG
|
Protein sequence | MTENVTGTGA NGHVHGVPEA LGGLRLVPRT DTAYKMKDFK SDQEVRWCPG CGDYAILAAF QSFLPELGVP RENVVMVSGI GCSSRFPYYL STYGMHSIHG RAPAIATGLA TSRPDLSVWV VTGDGDGLSI GGNHLVHALR RNVNINILLF NNRIYGLTKG QYSPTSEPGK ITKSSPVGSL DHPFNPLSLA LGAEATFVAR TIDSDRKHLT SVLRAAADHP GASFVEIYQN CPIFNDDAFE PLKDPAARDV RLLRLEHGEP LRLGPDRGVV AGEFGGLEVV DVDSVGEDRL LRHDAHREDP GYAFALSRLD QPAFEHVPIG VLRDVRRPAY DELVNEQVAD ARAERGAGEL AALLASGDTW RVE
|
| |