Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0017 |
Symbol | |
ID | 9243844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 20922 |
End bp | 22721 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | thiamine pyrophosphate protein TPP binding domain protein |
Protein accession | YP_003677975 |
Protein GI | 297559001 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0984309 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGAAC TGGTGTCCGA CCACGTACTG AAGCGTCTGC GCGAGTGGGG GGTCGACCGC GTCTTCTCCT ACGCGGGCGA CGGCATCAAC GGCCTGCTCG CCGCGTGGGA GCGGGCCGAC GACCGGCCCC GCTTCATCCA GTCACGGCAC GAGGAACTGG CGGCCTTCGA GGCCACGGGG TACGCCAAGT TCTCCGGCCG GGTGGGGGTG TGCGCGGCCA CGTCGGGTCC CGGCGCCATC CACCTGCTGA ACGGCCTCTA CGACGCCAAG CTCGACCACG TGCCGGTGGT GGCCATCCTC GGCCAGACCG CGCGCAGCGC GATGGGCGGC TCCTACCAGC AGGAGGTCGA CCTGATGTCG CTGTACAAGG ACGTGGCCAG CGACTACCTC CAGATGGTGA CCGTCCCCGA GCAGCTGCCC AACGTGCTGG ACCGGGCGAT CCGGATCGCC GCGAGCAGGC GCACGGTCAC AGCGGTCATC ATCCCCGCCG ACGTCCAGGA TCTGGAGTAC TCGCCGCCCG AGCACGAGTT CAAGATGGTG CCCTCCAGCC TCGGCCTCCC CTCCCCGCGG TCCACGCCGT CCCCGGAGGG GCTGGCCGAG GCCGCCGAGA TCCTCAACTC CGGTGAGCGC GTCGCCATGC TGGTCGGACA GGGGGCCAGG GGAGCGGCGG ACGCCGTCGT CGAGATGGCC GACAGGCTCG GCGCCGGGGT GGCCAAGGCG CTGCTGGGCA AGGACGTGCT CCCCGACGAC CTGCCCTTCG TGACCGGGTC GATCGGGCTG CTCGGCACCC GGCCCTCCTA CGAGATGATG CGGGACTGCG ACACCCTGCT CGTGGTGGGA TCCAGCTTCC CGTACAGCCA GTTCCTGCCC GAGTTCGACC AGGCGCGCGC CGTGCAGATC GACATCGACC CGACCATGGT CGGCATGCGC TACCCGTTCG AGTGCAACCT GGTCGGCGAC TCCGCGCAGA CGCTGCGGAT GCTGCTGCCG CTCGTGGAGC GCAAGACCGA CCGCTCCTGG CGGGAGAAGG TCGAGGACGG CGTCGCGCGG TGGAGGCGGG TCCTCGAACG GCGCGCCCAC GTGGACGCCG ACCCGGTCAA CCCCGAGCGC GTCTTCCACG AGCTGTCCCC GCTGCTGCCC GACGACGTGA TGGTGACCGC GGACTCCGGT TCGGCGGCCA ACTGGTACGC GCGCCACCTG GTGTTCCGCG AGGGCATGCG CGGAACGCTG TCGGGCACGC TGGCCACGAT GTGCCCCGGC GTCCCCTACG CCACGGGGGC GAAGTTCGCC CACCCCGACA GGCCGGTGGT CGCGCTCGTG GGGGACGGCG CCATGCAGAT GAGCGGCATC AACGAGCTGA TCACCATCGG CCACTACTGG AGGGAGTGGG AGGACCCGCG GGTGGTCGTC GCCGTCCTCA ACAACCGCGA CCTCAACCAG GTGACCTGGG AGCTGCGCGC GATGGGCGGA GCGCCGCAGT TCCTCCCCTC GCAGCGGATC CCCGACTTCC CCTACGCCGG GTTCGCCGAG AGCATCGGCC TGAGGGGGAT CAAGGTGGAC GACCCCTCCG ACGTGCGCGA CGCCTGGCAG CGGGCGCTGT CGGCGGACCG GCCCTGCGTG GTCGAGTTCG TCACCGACCC CGCCGTCCCG CCGATCCCGC CGCACGCGAC GCTCGACCAG ATGGAGAGCG TGGCCAAGGC CCTGGCCAAG GGCGACCCCG AGGCGTGGTC CGTGGTCAAG CGGGGCGTCG TGTCCAAGGC ACAGGAGTTC CTGCCGGGAG ACGGGCGGGG AGGCCGCTGA
|
Protein sequence | MAELVSDHVL KRLREWGVDR VFSYAGDGIN GLLAAWERAD DRPRFIQSRH EELAAFEATG YAKFSGRVGV CAATSGPGAI HLLNGLYDAK LDHVPVVAIL GQTARSAMGG SYQQEVDLMS LYKDVASDYL QMVTVPEQLP NVLDRAIRIA ASRRTVTAVI IPADVQDLEY SPPEHEFKMV PSSLGLPSPR STPSPEGLAE AAEILNSGER VAMLVGQGAR GAADAVVEMA DRLGAGVAKA LLGKDVLPDD LPFVTGSIGL LGTRPSYEMM RDCDTLLVVG SSFPYSQFLP EFDQARAVQI DIDPTMVGMR YPFECNLVGD SAQTLRMLLP LVERKTDRSW REKVEDGVAR WRRVLERRAH VDADPVNPER VFHELSPLLP DDVMVTADSG SAANWYARHL VFREGMRGTL SGTLATMCPG VPYATGAKFA HPDRPVVALV GDGAMQMSGI NELITIGHYW REWEDPRVVV AVLNNRDLNQ VTWELRAMGG APQFLPSQRI PDFPYAGFAE SIGLRGIKVD DPSDVRDAWQ RALSADRPCV VEFVTDPAVP PIPPHATLDQ MESVAKALAK GDPEAWSVVK RGVVSKAQEF LPGDGRGGR
|
| |