Gene Information |
Locus tag | Ndas_1940 |
Symbol | |
ID | 9245790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2363726 |
End bp | 2364736 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | protein of unknown function DUF21 |
Protein accession | YP_003679873 |
Protein GI | 297560899 |
COG category | |
COG ID | |
TIGRFAM ID | |
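The listed lengths follow from the coordinates if Start bp and End bp are read as 1-based, inclusive positions on NC_014210. That reading is an assumption, though it is the one consistent with the stated 1011 bp. A minimal Python sketch of the arithmetic, using hypothetical helper names that are not part of the source database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one untranslated stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(2363726, 2364736)
print(length_bp, protein_length(length_bp))  # prints "1011 336"
```

Both results agree with the Gene Length and Protein Length fields of the record.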
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.177932 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.094391 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACA TGAACGTGTG GGTGGCGCTC GCGCTGACCG CGGTGATCAT CGCGCTGAGC GCCTTCTTCG TGGCGATCGA GTTCGCCCTG GTGGCGGCGC GCCGCTACCG GCTGGAGGAG GCCGCCGAGT CCAGCTTCTC GGCACGGGCC GCGGTCAGGA GCGCCCGCGA CCTGTCGCTG CTGCTGGCCG GTTCGCAGCT GGGGATCACC CTGTGCGCCC TGGCGCTGGG CGCGATCTCC AAGCCCGCCG TCCACCACAT GCTGGAGCCG CTGTTCGGCG GCCTGCCGGC GGCGGTGGGC TACGTGGTCT CGTTCGTGCT GTCGCTGATC GTGGTGACCT TCCTGCACCT GGTGGTGGGT GAGATGGCGC CCAAGTCCTG GGCGATCTCG CACCCGGAGA AGTCGGCGAT CATGCTGGCC GTGCCGATGC GGGCGTTCAT GTGGTTCACC CGTCCGCTGC TGCTGATGCT CAACGGCATG GCCAACTGGT GCCTGCACCG GCTGGGCGTG GAGGCGGTGG ACGAGATGTC GTCCGGGCAC GGTCCCGACG ACGTGCGCGA GCTGGTGGAG CACTCGGCCA AGGCCGGTGC GCTCGACCCC GAGCGCCGCG CCCAGCTGGC CACGGCGCTG GAGGTCAACT CCCGTCCGCT GAGCGAGATC GTGACACCGC GCGAGGAGAT CGCGTCGGTG TCCCCGAACT CGACGGTGGA CGACATCAAG CAGGTGTCGC GGGAGTCCAC GCACCTGCGC CTGGTGGTGA TGGACGGCAC CGAACCCGTG GGCGTGCTGC ACGTGCGCGA GGCGCTGACG GGCCCGGAGG GGACCACCGC GGCCGACCTG ATGCGGCCGG TGCTCACCCT GGCCGCGGAG ACGCCGATGT ACGCGGCGAT GGGCATCATG CGGGAGAGCC GCAGCCACCT GTCCCTGGTG GAGACGGACG GCGAGGTGAT CGGCCTGGTC ACCCTCCAGG ACATCCTGGA CCGCCTGCTG CTGCTGGACA CGGCCGCCTG A
|
Protein sequence | MSDMNVWVAL ALTAVIIALS AFFVAIEFAL VAARRYRLEE AAESSFSARA AVRSARDLSL LLAGSQLGIT LCALALGAIS KPAVHHMLEP LFGGLPAAVG YVVSFVLSLI VVTFLHLVVG EMAPKSWAIS HPEKSAIMLA VPMRAFMWFT RPLLLMLNGM ANWCLHRLGV EAVDEMSSGH GPDDVRELVE HSAKAGALDP ERRAQLATAL EVNSRPLSEI VTPREEIASV SPNSTVDDIK QVSRESTHLR LVVMDGTEPV GVLHVREALT GPEGTTAADL MRPVLTLAAE TPMYAAMGIM RESRSHLSLV ETDGEVIGLV TLQDILDRLL LLDTAA
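The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below is one possible way to do that in plain Python: it strips the spacing, computes a GC fraction, and translates a coding sequence up to the first stop codon. The function names are hypothetical. The codon table is the standard NCBI amino-acid assignment, which translation table 11 shares with table 1 (table 11 differs in which codons may act as starts; that case is not handled here). The record lists the gene on the minus strand, but the sequence shown starts with ATG and ends with TGA, so it appears to be given in coding orientation already, and the sketch does not reverse-complement it.

```python
from itertools import product

# Codon -> amino acid, bases ordered T, C, A, G as in the NCBI genetic-code listings.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
head = clean_sequence("ATGAGTGACA TGAACGTGTG GGTGGCGCTC")
print(translate(head))  # MSDMNVWVAL, the first ten residues of the protein sequence
print(round(gc_fraction(head), 2))
```

Running the same functions on the full gene sequence should reproduce the 336-residue protein and a GC content close to the 70% given in the record, provided the sequence is copied without truncation.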