Gene Ndas_0930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0930 
Symbol 
ID9244775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1140807 
End bp1142651 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content72% 
IMG OID 
Productalpha amylase catalytic region 
Protein accessionYP_003678880 
Protein GI297559906 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0349089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCCA TCCGACCCGG CGCCGCCGCC GGGGCGGTCC TGATGGGGAT CGGCCTGCTG 
GCCGCCCCCG CGCAGGCCGC CCCCGTCCCG GCGGCCCCGA CCCCGACGAC ACCGGTCCCG
GCAGCCCCCG CCCCCGCCGA GGACGTCCGG GCCGCCGACA ACGGCGAGAC CATCGTCCAG
CTCTTCCAGT GGAACTGGGA CTCCGTCGCC ACCGAGTGCG AGGAGTTCCT CGGCCCCCAC
GGCTTCGGCG GGGTGCAGGT CTCCCCGCCC CAGGAGCACG TGGTCATCCC CTTCGCCGAG
GGCGGCGACT ACCCCTGGTG GCAGGACTAC CAGCCGACCT CCTACCGCAT CGACAACACC
CGGCGCGGCA CCGCCGAGGA GTTCCAGGCG ATGGTCTCCA CCTGTGCCGA CAACGGCGTG
AGGATCTACG CCGACGCGAT CATCAACCAC ATGACCGGCG ACGGCTCGGG CACCGGCAGC
GCGGGCACGG AGTGGGCCAA GTACGAGTAC CCCGACCTGT TCGGCGACGG CACCGCCTCC
CGCACGGGGG AGGACTTCAG CTCCTGCCGG AGGGAGATCA GCAACTGGAA CGACAAGTGG
GAGGTGCAGA ACTGCGAGCT GGTCGGCCTG TCCGACCTCG ACACGGGTGA CCCCGAGGTG
CGCGCGCAGA TCCGCCGCTA CCTCAACGGC CTGGTGGACA TGGGCGTGGG GGGCTTCCGC
GTGGACGCCT CCAAGCACGT CCCCGAGGCC CACGTCGACG CGATCTTCTC CGACCTGAAC
GAGGTCCCGG TCTTCGGCGG TCAGCCCGAC GTCTTCCACG AGGTCTACGG GGACCAGACC
ATCCCCTACA CCGCCTACAC GCCCTACGGC CGTGTGACCG CCTTCGACTA CCAGCGCGAC
ATCTCCAACA AGTTCGCCGG AGGCGACATC TCCGGCCTGG CCCAGCTGCC GGACTACGGC
GGGCTCACCG ACGAGCAGGC CACCGTCTTC GTCGACAACC ACGACACCCA GCGCTACCAC
CCGACCCTGA CCTTCAAGGA CGGCGACCGC TACCACCTGG CCGTGGCGTT CATGCTGGCC
CACCCCTACG GGCGCCCCGT GGTGATGTCC AGCTACGACT TCGGCTCCAA CGTCACCCAG
GGCCCGCCCA GCGTCGGCGA GGCGGCGGGC AACCCGGCGG GCTGGATCAC CGCCGACACC
GACTGCGCCA GCGCCGAGTG GGTCTGCGAG CACCGCCACC CGACCGTCGC CGGGATGGCC
GCCTTCCGCA ACGCCACCGG CGACACCCCC GTCGTCCAGC GCGCCACCGA CGGCTCCTCC
CGGCTCGCCT TCGACCGGGG CGACCGCGGC TTCGCCGCCT TCAACGCGAC CGGCGGCACC
TGGAACCTGA CCGCCGACAC CGGCCTGCCC GACGGCAGCT ACGACAACGC CGCCGGGAGC
GGGACCCTCA CCGTCGCCGA CGGCCGGATC AGCGCCCGGG TCCCCGCGAA CGGGGCCGTC
GCCCTGCACG TGGGCGGCAC CTGCGACGAC CCGGCCGAGT GCGGGGGCGG CGGCCCCGGT
GAGCCGGGCG AGCCGGGCGA GGTCAACGTC TCCGCCACCG TGGAGACCTG GTACGGCCAG
GAGGTGTACG TGGTCGGCTC CACCCCCGGG CTGGGGTCCT GGAACCCCCC GAGCGGGGTG
AAGCTGTCCA CCGACGCGTC CACCTACCCC GTGTGGTCGG GCACCGCCCC CATCGGTGCC
GACACCGAGT GGAAGCTGGT CAAGATCGAC GGCGCGGGCA ACGTCGAGTG GGAGTCCGGC
GCCAACCGCG TCGGCCCCGC CGCCAGCGTC ACCTGGCGCG ACTGA
 
Protein sequence
MKPIRPGAAA GAVLMGIGLL AAPAQAAPVP AAPTPTTPVP AAPAPAEDVR AADNGETIVQ 
LFQWNWDSVA TECEEFLGPH GFGGVQVSPP QEHVVIPFAE GGDYPWWQDY QPTSYRIDNT
RRGTAEEFQA MVSTCADNGV RIYADAIINH MTGDGSGTGS AGTEWAKYEY PDLFGDGTAS
RTGEDFSSCR REISNWNDKW EVQNCELVGL SDLDTGDPEV RAQIRRYLNG LVDMGVGGFR
VDASKHVPEA HVDAIFSDLN EVPVFGGQPD VFHEVYGDQT IPYTAYTPYG RVTAFDYQRD
ISNKFAGGDI SGLAQLPDYG GLTDEQATVF VDNHDTQRYH PTLTFKDGDR YHLAVAFMLA
HPYGRPVVMS SYDFGSNVTQ GPPSVGEAAG NPAGWITADT DCASAEWVCE HRHPTVAGMA
AFRNATGDTP VVQRATDGSS RLAFDRGDRG FAAFNATGGT WNLTADTGLP DGSYDNAAGS
GTLTVADGRI SARVPANGAV ALHVGGTCDD PAECGGGGPG EPGEPGEVNV SATVETWYGQ
EVYVVGSTPG LGSWNPPSGV KLSTDASTYP VWSGTAPIGA DTEWKLVKID GAGNVEWESG
ANRVGPAASV TWRD