Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3357 |
Symbol | |
ID | 9247221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4011701 |
End bp | 4013320 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | alpha amylase catalytic region |
Protein accession | YP_003681268 |
Protein GI | 297562294 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.925027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCAGT GGTGGCGCGA CGCGGTCCTG TACCAGGTCT ACCCGCGCAG CTTCGCCGAC GCCGACGGGG ACGGGACCGG CGACATCGCG GGGATCACCG CGCGCCTGGG CCACGTCGCC GACCTGGGCG CGGACGGGAT CTGGCTGTCG CCGTTCTACA CCTCGCCGTG GGCGGACGGC GGGTACGACG TGGCCGACTT CCGGGACGTG GACCCGCGCC TGGGCACCCT GGAGGACTTC GACGCGATGG TGGCCGCCGC GCACGCGCTG GGTCTGCGGG TGATGGTCGA CATCGTGCCC AACCACACCT CCGAGGAGCA CCCGTGGTTC CGGGAGGCGC TGGAGGCCGG CCCCGGGTCG CCCGAGCGGG AGCTGTACGT CTTCCGCGAC GGGCGGGGCT CGGACGGCGA GCTGCCGCCG ACCAACTGGC GTTCGACGTT CGGCGGTCCG GCGTGGACCC GCGTGGCGGA CGGGCAGTGG TACCTGCACA TGTTCGCGCC CGAGCAGCCG GACCTGAACT GGGACGACCC GCGGGTGCGC GAGGAGTTCC GCGGCATCCT GCGGTTCTGG AGCGGGCGCG GGGTGGACGG GTTCCGGATC GACGTGGCGT ACGCGCTGGT CAAGGACCTG CGGGAGCCGT TGCGCGACCT GGTGCTGGTG GAGGGCGGCC GGTTCGAGGA CATCGCGGCC AACCCGGACC ACCCGTTCCT GGACCGGCCC GAGGTGCACG AGGTGTACCG GGACTGGCGG CGGGTGCTGG CGGAGTTCGA CCCGCCCCGG GCCACGGTGG GCGAGGTGTG GCTGCCCGGT GAGCGGCGGG TGCTGTACAC GCGCCCGGAC GAGCTGGACC AGGCGTTCAA CTTCGACTTC CTGAGGACCT CGTGGGACGC CGACGCCTAC CGCGGCGTGA TCGACTCCTC GATCGCCGAC GCCGGGCAGG TCGGCACGGT GCCCACGTGG GTGATCGGCA ACCACGACGT GGTGCGGCCG GTGTCGGTGC TGGGGCTGCC CCCGGGCACC GACCAGAAGG CGTGGCTGCT CTCCGACGGG CGCGACCCGG AGCCGGACCT GGAGCTGGGC ACGCGGCGGG CACGGGCGCT GGCACTGCTG GAGCTGTCGC TGCCGGGGTC GGCGTACGTG TACCAGGGCG AGGAGCTGGG GCTGCCGGAG GTGGCGGACC TGCCCGCGCG GGCGCTGGAG GACCCGCGGT GGGTGCGCAG CGGGCACACC GACAAGGGGC GGGACGGGTG CCGGGTGCCG CTGCCGTGGA CACGGGAAGG AGCGTCGTAC GGGTTCGGCG GGGACACCCC GTGGCTGCCC CAGCCCCGAG GGTGGGGCCG GTGGTCGGTG CGGGCCCAGA ACGACGACCC CGGGTCCGTG CTCTCCCTGT ACCGCCGGGC TCTGGCACAC CGCCGGGAGT TCTCCTCGGA CGAGACTCTG AGCTGGGACG ACACGCTGAA CCGGGGGCCG GTGCTGGCCT ACTGGCGGGG TGGGGACGTG CTGGTGCTGG TCAACACGGG CGAGGAGGCG GTGGAGCTGC CGCCGGGCCG GGTGCTGGTG GCCAGCGCGG AGCTGGACGG GCGGCTCCCG GGAAACGCGG CGGTGTGGCT GCGCCGCTGA
|
Protein sequence | MQQWWRDAVL YQVYPRSFAD ADGDGTGDIA GITARLGHVA DLGADGIWLS PFYTSPWADG GYDVADFRDV DPRLGTLEDF DAMVAAAHAL GLRVMVDIVP NHTSEEHPWF REALEAGPGS PERELYVFRD GRGSDGELPP TNWRSTFGGP AWTRVADGQW YLHMFAPEQP DLNWDDPRVR EEFRGILRFW SGRGVDGFRI DVAYALVKDL REPLRDLVLV EGGRFEDIAA NPDHPFLDRP EVHEVYRDWR RVLAEFDPPR ATVGEVWLPG ERRVLYTRPD ELDQAFNFDF LRTSWDADAY RGVIDSSIAD AGQVGTVPTW VIGNHDVVRP VSVLGLPPGT DQKAWLLSDG RDPEPDLELG TRRARALALL ELSLPGSAYV YQGEELGLPE VADLPARALE DPRWVRSGHT DKGRDGCRVP LPWTREGASY GFGGDTPWLP QPRGWGRWSV RAQNDDPGSV LSLYRRALAH RREFSSDETL SWDDTLNRGP VLAYWRGGDV LVLVNTGEEA VELPPGRVLV ASAELDGRLP GNAAVWLRR
|
| |