Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4120 |
Symbol | |
ID | 9247994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4919616 |
End bp | 4921322 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | thiamine pyrophosphate protein TPP binding domain protein |
Protein accession | YP_003682021 |
Protein GI | 297563047 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACA AGGCAACCAG CGAGTCCACC AGCAGCGAGA AGACCACGAT CTCCGGCGGC CACCTGGTCG CCAAGGCCCT CAAGGCCGAG GGGATCGACG TCATCTTCAC GCTCTGCGGC GGACACATCA TCGACATCTA CGACGGATGC GCGGACGAGG GCATCGACGT GGTCGACGTG CGGCACGAGC AGGTCGCCGC GCACGCCGCC GACGGCTACG CCCGCGTCAC CGGCAAGCCC GGGTGCGCCG TCGTGACCGC CGGACCGGGG ACCACCGACG CGGTCACCGG CATCGCCAAC GCCTACCGCG CGGAGAGCCC GATGCTGGTC ATCGGCGGCC AGGGCGCCCT GAGCCAGCAC AAGATGGGCT CGCTCCAGGA CCTGCCGCAC GTGGACATGA TCAACCCGAT CTCCAAGTTC GCCGCCACCG TGCCCCACAC CGAGCGGGTC GCCGACCTGG TCTCGATGGC CTTCCGCGAG GCCAACAGCG GCGCCCCCGG CCCGGCCTTC CTGGAGATCC CCCGGGACGT CCTGGACGCC GAGGTGCCCG TGGAGCGGGC CCGCGTCCCC GCCAAGGGCC GCTACCGCGC CTCCACCCGG CAGGCGGGCG ACCCCGCCGC GATCGAGCGG CTCGCCGACC TGATCGTGCG CTCCGAGAAG CCCAGCATCC TGCTCGGCAA CCAGGTGTGG ACCACCCGGG CCACGCAGTC CGCCACCGAC CTGGTGCGCG CGCTCAACAT CCCCGCCTAC ATGAACGGCG CGGGCCGGGG CACCCTGCCG CCCGGGGACC CGCACCACTT CCAGCTCTCC CGGCGCTACG CCTTCACCAA CTCCGACCTG ATCATCATCG TCGGCACCCC CTTCGACTTC CGGATGGGCT ACGGCAAGCG CCTCTCGCCC ACCGCCACGG TGGTGCAGAT CGACCTCAAC TACGCCACCG TCGGCAAGAA CCGCGACGTG GACCTGGGGC TGGTCGGGGA CGCCGACGTG ATCCTGTCCT CGGTGCTCCA GGCGACCTCG GGCTACGGGG ACAACGGCGC CCAGAGCCGC AAGACCTGGC TGGAGGAGCT GCGCACCCAG GAGCAGGCCG CGCTGGACAA GCGGGCGCAC CTGCTCACCT CCGACTCCAC GCCCATCCAC CCCTACCGGC TGGTCAGCGA GATCAACCAG TTCCTCACCG AGGACTCCAT CTACGTCGGC GACGGCGGCG ACATCGTCAC CTTCTCCGGC CAGGTGGTCC AGCCCAAGTC GCCGGGCCAC TGGATGGACC CCGGGCCCCT CGGCACGCTG GGCGTGGGCG TCCCGTTCGT GATGGCGGCC AAGTACGCCC GCCCGGACAA GGAGGTGGTG GCCCTCTTCG GCGACGGCGC GTTCAGCCTG ACCGGCTGGG ACTTCGAGAC CCTGGTCCGG TTCGACCTGC CCTTCGTCGG CATCGTGGGC AACAACTCCT CGATGAACCA GATCCGCTAC GGCCAGATCG CCAAGTACGG CGCGGACCGG GGCGAGATCG GCAACACCCT GGGCGACGTC AACTACGCCG AGTTCGCCCG GATGCTGGGC GGCCACGGCG AGGAGGTCCG GGACCCGGCC GACATCGCCC CGGCGCTGCG CCGCGCCCGC GAGTCCGGCA AGCCCTCGCT GATCAACGTC TGGATCGACC CCGAGGTCTA CGCCCCGGGA ACGATGAACC AGACCATGTA CAAGTAG
|
Protein sequence | MADKATSEST SSEKTTISGG HLVAKALKAE GIDVIFTLCG GHIIDIYDGC ADEGIDVVDV RHEQVAAHAA DGYARVTGKP GCAVVTAGPG TTDAVTGIAN AYRAESPMLV IGGQGALSQH KMGSLQDLPH VDMINPISKF AATVPHTERV ADLVSMAFRE ANSGAPGPAF LEIPRDVLDA EVPVERARVP AKGRYRASTR QAGDPAAIER LADLIVRSEK PSILLGNQVW TTRATQSATD LVRALNIPAY MNGAGRGTLP PGDPHHFQLS RRYAFTNSDL IIIVGTPFDF RMGYGKRLSP TATVVQIDLN YATVGKNRDV DLGLVGDADV ILSSVLQATS GYGDNGAQSR KTWLEELRTQ EQAALDKRAH LLTSDSTPIH PYRLVSEINQ FLTEDSIYVG DGGDIVTFSG QVVQPKSPGH WMDPGPLGTL GVGVPFVMAA KYARPDKEVV ALFGDGAFSL TGWDFETLVR FDLPFVGIVG NNSSMNQIRY GQIAKYGADR GEIGNTLGDV NYAEFARMLG GHGEEVRDPA DIAPALRRAR ESGKPSLINV WIDPEVYAPG TMNQTMYK
|
| |