Gene Information |
Locus tag | Ndas_0930 |
Symbol | |
ID | 9244775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1140807 |
End bp | 1142651 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | alpha amylase catalytic region |
Protein accession | YP_003678880 |
Protein GI | 297559906 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
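The length fields above follow from the coordinates by simple arithmetic. The short sketch below checks this, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon (the gene sequence in this record ends in TGA). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1140807
end_bp = 1142651

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the reported Gene Length (1845 bp) and Protein Length (614 aa).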
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0349089 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCCA TCCGACCCGG CGCCGCCGCC GGGGCGGTCC TGATGGGGAT CGGCCTGCTG GCCGCCCCCG CGCAGGCCGC CCCCGTCCCG GCGGCCCCGA CCCCGACGAC ACCGGTCCCG GCAGCCCCCG CCCCCGCCGA GGACGTCCGG GCCGCCGACA ACGGCGAGAC CATCGTCCAG CTCTTCCAGT GGAACTGGGA CTCCGTCGCC ACCGAGTGCG AGGAGTTCCT CGGCCCCCAC GGCTTCGGCG GGGTGCAGGT CTCCCCGCCC CAGGAGCACG TGGTCATCCC CTTCGCCGAG GGCGGCGACT ACCCCTGGTG GCAGGACTAC CAGCCGACCT CCTACCGCAT CGACAACACC CGGCGCGGCA CCGCCGAGGA GTTCCAGGCG ATGGTCTCCA CCTGTGCCGA CAACGGCGTG AGGATCTACG CCGACGCGAT CATCAACCAC ATGACCGGCG ACGGCTCGGG CACCGGCAGC GCGGGCACGG AGTGGGCCAA GTACGAGTAC CCCGACCTGT TCGGCGACGG CACCGCCTCC CGCACGGGGG AGGACTTCAG CTCCTGCCGG AGGGAGATCA GCAACTGGAA CGACAAGTGG GAGGTGCAGA ACTGCGAGCT GGTCGGCCTG TCCGACCTCG ACACGGGTGA CCCCGAGGTG CGCGCGCAGA TCCGCCGCTA CCTCAACGGC CTGGTGGACA TGGGCGTGGG GGGCTTCCGC GTGGACGCCT CCAAGCACGT CCCCGAGGCC CACGTCGACG CGATCTTCTC CGACCTGAAC GAGGTCCCGG TCTTCGGCGG TCAGCCCGAC GTCTTCCACG AGGTCTACGG GGACCAGACC ATCCCCTACA CCGCCTACAC GCCCTACGGC CGTGTGACCG CCTTCGACTA CCAGCGCGAC ATCTCCAACA AGTTCGCCGG AGGCGACATC TCCGGCCTGG CCCAGCTGCC GGACTACGGC GGGCTCACCG ACGAGCAGGC CACCGTCTTC GTCGACAACC ACGACACCCA GCGCTACCAC CCGACCCTGA CCTTCAAGGA CGGCGACCGC TACCACCTGG CCGTGGCGTT CATGCTGGCC CACCCCTACG GGCGCCCCGT GGTGATGTCC AGCTACGACT TCGGCTCCAA CGTCACCCAG GGCCCGCCCA GCGTCGGCGA GGCGGCGGGC AACCCGGCGG GCTGGATCAC CGCCGACACC GACTGCGCCA GCGCCGAGTG GGTCTGCGAG CACCGCCACC CGACCGTCGC CGGGATGGCC GCCTTCCGCA ACGCCACCGG CGACACCCCC GTCGTCCAGC GCGCCACCGA CGGCTCCTCC CGGCTCGCCT TCGACCGGGG CGACCGCGGC TTCGCCGCCT TCAACGCGAC CGGCGGCACC TGGAACCTGA CCGCCGACAC CGGCCTGCCC GACGGCAGCT ACGACAACGC CGCCGGGAGC GGGACCCTCA CCGTCGCCGA CGGCCGGATC AGCGCCCGGG TCCCCGCGAA CGGGGCCGTC GCCCTGCACG TGGGCGGCAC CTGCGACGAC CCGGCCGAGT GCGGGGGCGG CGGCCCCGGT GAGCCGGGCG AGCCGGGCGA GGTCAACGTC TCCGCCACCG TGGAGACCTG GTACGGCCAG GAGGTGTACG TGGTCGGCTC CACCCCCGGG CTGGGGTCCT GGAACCCCCC GAGCGGGGTG AAGCTGTCCA CCGACGCGTC CACCTACCCC GTGTGGTCGG GCACCGCCCC CATCGGTGCC GACACCGAGT GGAAGCTGGT CAAGATCGAC GGCGCGGGCA ACGTCGAGTG GGAGTCCGGC 
GCCAACCGCG TCGGCCCCGC CGCCAGCGTC ACCTGGCGCG ACTGA
|
Protein sequence | MKPIRPGAAA GAVLMGIGLL AAPAQAAPVP AAPTPTTPVP AAPAPAEDVR AADNGETIVQ LFQWNWDSVA TECEEFLGPH GFGGVQVSPP QEHVVIPFAE GGDYPWWQDY QPTSYRIDNT RRGTAEEFQA MVSTCADNGV RIYADAIINH MTGDGSGTGS AGTEWAKYEY PDLFGDGTAS RTGEDFSSCR REISNWNDKW EVQNCELVGL SDLDTGDPEV RAQIRRYLNG LVDMGVGGFR VDASKHVPEA HVDAIFSDLN EVPVFGGQPD VFHEVYGDQT IPYTAYTPYG RVTAFDYQRD ISNKFAGGDI SGLAQLPDYG GLTDEQATVF VDNHDTQRYH PTLTFKDGDR YHLAVAFMLA HPYGRPVVMS SYDFGSNVTQ GPPSVGEAAG NPAGWITADT DCASAEWVCE HRHPTVAGMA AFRNATGDTP VVQRATDGSS RLAFDRGDRG FAAFNATGGT WNLTADTGLP DGSYDNAAGS GTLTVADGRI SARVPANGAV ALHVGGTCDD PAECGGGGPG EPGEPGEVNV SATVETWYGQ EVYVVGSTPG LGSWNPPSGV KLSTDASTYP VWSGTAPIGA DTEWKLVKID GAGNVEWESG ANRVGPAASV TWRD
|