Gene Information |
Locus tag | Ndas_4193 |
Symbol | |
ID | 9248067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5007486 |
End bp | 5008538 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Zeta toxin family protein |
Protein accession | YP_003682092 |
Protein GI | 297563118 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
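The coordinate and length fields above are related by simple arithmetic, sketched below. The sketch assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the record does not state either convention. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 5007486
END_BP = 5008538


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1053, matching "Gene Length"
print(protein_length(length_bp))  # 350, matching "Protein Length"
```

Under these assumptions both derived values agree with the table. The strand does not affect either calculation.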
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTTC CACAGGTGGG CGAGCGGACC CTGGTGGAGG ACAGCGCCTG CGGCGGTTTC CCCGCGCAGG GGGCCGGTTC CCGGCGGGCC CGCGAACGGG CCTACAACCT GTACAGCGCG GCCGTCGGCG CCTGCCTGAG CAGGACCGGG CTCCGGGAGC GCGCCGCCGA ACTCCTCGCC CGGGAGGCAC GGCTGCCCTT CCTCGACGAC GGGGAGGTCG CCCGGGTCAA CCGGGAGCTC CCGGGGATCG TCGCCGAACT GCGGGCCGAG CCGATCCGGG TCCGCGACGG CGCGACCCCC GGGGATCCGG GCCCCGTCCT GTCCGGGGCC GACCTGGACG CGCTGTTCGA CCTCGCGCTC CGCGACCGGC TCACCGGACC GGCCCGCGAG CGGCCCCGGC TCCTGCTCCT GGGCGGCCAG CCCGGCTCGG GCAAGTCCAC CCTCCAGCGC CTGGTCCTGC CGTTCCTGCC CGAGGGCACC GTCAGCTACG ACGGCGACGA CCTGCTGCGC CTGGCGCCGG ACTACGAGTG GGCGATGGCC GCCGACGACC GGGCCGCCTC CGCCGCCCTG GCCCGGCAGG TCGGCGGCCT GCACGGGCTG GCCATGGCGC ACATGCGCGC GGGCCGCGTG GACGCCGTGT GCAGCCACCC CCTCGGCCGC GCCGACTGGG CCGCGTCCTG GGTGGAGGGG TTCCGCGACG CCGGGTACCG GGTGGAGGTC GCCCTCGTGG CCACCCACAG TTCCAACAGC CGCTTCGCGA TCGCCGACCG GTACGCCCGG GCCCGCGCCG ACCAGGGTTT CGGCAGGTGG ATGCCCGAAC TCCACCATGA CCGTTTCTAT CTGGGCCTCC CCAACACGGT GGAGTTCCTG GAGACCCACC GGCTGGCGGA CTCGCTCTAC GTCCTCTCCC GCGACGGCGA CGTGCTCTAC GCCAACCACC GGGAGGAGGA CGACTGGCGG ACCGAGCCGT TCGGGCGCGT CGCCCTGGAG GCCGAACGCG GGCGCCGGGA GCTGTGGCAC GCGTCCGGGC GCGCCCCGCG CAGCGCGGTC TGA
|
Protein sequence | MTVPQVGERT LVEDSACGGF PAQGAGSRRA RERAYNLYSA AVGACLSRTG LRERAAELLA REARLPFLDD GEVARVNREL PGIVAELRAE PIRVRDGATP GDPGPVLSGA DLDALFDLAL RDRLTGPARE RPRLLLLGGQ PGSGKSTLQR LVLPFLPEGT VSYDGDDLLR LAPDYEWAMA ADDRAASAAL ARQVGGLHGL AMAHMRAGRV DAVCSHPLGR ADWAASWVEG FRDAGYRVEV ALVATHSSNS RFAIADRYAR ARADQGFGRW MPELHHDRFY LGLPNTVEFL ETHRLADSLY VLSRDGDVLY ANHREEDDWR TEPFGRVALE AERGRRELWH ASGRAPRSAV
|
| |
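The sequences above are displayed in 10-character blocks separated by spaces, so the spaces have to be stripped before the sequences are used elsewhere. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and the record does not say how its "GC content" value was derived, so this is only an illustration of the usual definition, (G + C) / length. The demonstration uses only the first two blocks of the gene sequence.

```python
def clean_sequence(display_text: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(display_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence, used as a short excerpt.
excerpt = clean_sequence("ATGACGGTTC CACAGGTGGG")
print(len(excerpt))          # 20
print(gc_fraction(excerpt))  # 0.6
```

Passing the full "Gene sequence" field through `clean_sequence` and `gc_fraction` gives a value to compare against the GC content reported in the table, and `ends_with_stop` can confirm that the sequence terminates in a stop codon.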