Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5418 |
Symbol | |
ID | 9249321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 598099 |
End bp | 599079 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Transketolase central region |
Protein accession | YP_003683303 |
Protein GI | 297564330 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAACCA ACCTCACCAT CGGCAAGGCC ATCAACGCCG GCCTGCGCGC CGCCATGGAG CACGACCCCA AGGTCCTGGT CATGGGCGAG GACGTCGGCA GGCTCGGCGG CGTCTTCCGG GTCACCGACG GCCTGTACAA GGACTTCGGC GCCGACCGGG TCATCGACAC CCCCCTCGCC GAGTCCGGCA TCGTCGGCAC CGCGATCGGC ATGGCCATGC GCGGCTACCG CCCCGTGGTG GAGATCCAGT TCGACGGCTT CTTCTTCCCG GCCGCGAACC AGACCTTCAC CCAGCTGGCC AAGATGCGGC GCCGCTCGGC GGGCACGCTC AGCATGCCCG TGGTCATGCG CATCCCCTAC GGCGGCGGCA TCGGCGCGGT CGAGCACCAC AGCGAGTCCC CGGAGGCCTA CTTCACCCAC ACCGCCGGGC TGCGTGTGGT GTCGGTGGCC AACCCCGAGG ACGCCTACTG GATGATCCAG CAGGCCGTGC GCTCCGACGA TCCGGTGATC TTCCTGGAGC CCAAGCGGCG CTACTACGAG AAGGCCGAGG TCGACACCGA GGCGTCCATC GCCGAGGCCG CCCCGATGGG CGCCGCGCGC GTGGTCCGTC CCGGCACGGA CGTGACCCTC CTGGCGTACG GGCCCATGGT CAAGACGGCG CTCCAGGCCG CCGAGGCCGA CACCGACCAC TCCGTCGAGG TCGTCGACCT GCGTTCGCTG TCCCCCGTGG ACTACCCGAC CCTGTTCGCC TCGGTGAAGA GGACGGGGCG CCTGGTCGTC GCCCACGAGG CCCCCCTCTC CGGCGGCCCC GGCGCGGAGA TCGCCGCCCG GGTCACCGAG GAGTGCTTCT ACCACCTGGA GTCGCCGGTC ATCCGCGTGG CCGCGTTCGA CACCCCCTAC CCGCAGTCCC GGCTGGAGGA GCACTACCTG CCGGACCTGG ACCGGGTTCT GGACGGTGTC GACCGGGCGT TCGCGTACTA G
|
Protein sequence | MGTNLTIGKA INAGLRAAME HDPKVLVMGE DVGRLGGVFR VTDGLYKDFG ADRVIDTPLA ESGIVGTAIG MAMRGYRPVV EIQFDGFFFP AANQTFTQLA KMRRRSAGTL SMPVVMRIPY GGGIGAVEHH SESPEAYFTH TAGLRVVSVA NPEDAYWMIQ QAVRSDDPVI FLEPKRRYYE KAEVDTEASI AEAAPMGAAR VVRPGTDVTL LAYGPMVKTA LQAAEADTDH SVEVVDLRSL SPVDYPTLFA SVKRTGRLVV AHEAPLSGGP GAEIAARVTE ECFYHLESPV IRVAAFDTPY PQSRLEEHYL PDLDRVLDGV DRAFAY
|
| |