Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0204 |
Symbol | |
ID | 9244038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 256457 |
End bp | 257449 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | thiamine-monophosphate kinase |
Protein accession | YP_003678160 |
Protein GI | 297559186 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGAACA CCATTGGGGG TCTTGGTGAG TTCGCTCTGA TCGCACGCGT GACGAGCCAA TTCCCCACCA CGGACGATGT AATCCTCGGA CCAGGGGACG ACGCGGCCGT CGTCGCGTCC CCCGACGGCC GGACGGTCGC GACCACCGAC CTGCTGGTCG AGGGCCGCCA CTTCCGGCGC GAGTGGTCCA CCGCCCGTGA CGTGGGGCAC CGGGCCGTCG CGCAGAACTT CGCCGACGTC GCCGCGATGG GCGCCCGCCC CACCGGCCTG CTCATCGGCT TCGCCGCGCC CGCCGACCTC CCGGTGCCCT GGGCGGAGGA GTTCACCGCC GGGGTCCGCG ACGAGTGCGC GGTCGCGGGC GGCGCCGTCG TGGGCGGGGA CATGGTCGGC TCGGACACCC TCACCATCGC GGTCACCGCG CTCGGCGACC TCCAAGGGCG GGCGCCTGTC CGCCGCGACG GCGCCCGCCC CGGCGACGTG GTCGCCTACA CCGGCCACCT GGGGCTGTCG GCCGCCGGGC TCGCCCTGCT GGAACAGGGG ATCGACGGCC CGGCCGAGTG CCTGTCCGAG CACCGCAGGC CCAGCCCGCC CTACGCCCGC GGGGCGGAGG CGGCCCGGCT CGGGGCCACG GCCATGCTCG ACGTCAGCGA CGGCCTCGCC CAGGACCTCG GGCACGTCTG CCGGGCCAGC GGGGTGCGCA TCGACCTGGA GGGGGAGGCG CTGCGCCCCG AGCCCGCGCT CGTCGAGGCG GTCCGCGTCC TGGGGGCCGG AGCCGACACG GCCGAGCGGG CCGCGCGGGA CCTCATGGTC GCCGGGGGCG AGGACCACGC CCTCGCCGCC GTGTTCCCGC CGCACACCGT TCTGCCCGCC CACTGGTCCA GAGTGGGAAC CGTCGCGCGA ACAACAGGTG AAAACGCTGA GGAAAACACA ATTCCGGTCA CGGTCGACGG GCGGGTTCCA CCTCGCGGAG GATGGGATCA TTTCCGGCGC TGA
|
Protein sequence | MRNTIGGLGE FALIARVTSQ FPTTDDVILG PGDDAAVVAS PDGRTVATTD LLVEGRHFRR EWSTARDVGH RAVAQNFADV AAMGARPTGL LIGFAAPADL PVPWAEEFTA GVRDECAVAG GAVVGGDMVG SDTLTIAVTA LGDLQGRAPV RRDGARPGDV VAYTGHLGLS AAGLALLEQG IDGPAECLSE HRRPSPPYAR GAEAARLGAT AMLDVSDGLA QDLGHVCRAS GVRIDLEGEA LRPEPALVEA VRVLGAGADT AERAARDLMV AGGEDHALAA VFPPHTVLPA HWSRVGTVAR TTGENAEENT IPVTVDGRVP PRGGWDHFRR
|
| |