Gene Ndas_3357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3357 
Symbol 
ID9247221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4011701 
End bp4013320 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content74% 
IMG OID 
Productalpha amylase catalytic region 
Protein accessionYP_003681268 
Protein GI297562294 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.925027 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCAGT GGTGGCGCGA CGCGGTCCTG TACCAGGTCT ACCCGCGCAG CTTCGCCGAC 
GCCGACGGGG ACGGGACCGG CGACATCGCG GGGATCACCG CGCGCCTGGG CCACGTCGCC
GACCTGGGCG CGGACGGGAT CTGGCTGTCG CCGTTCTACA CCTCGCCGTG GGCGGACGGC
GGGTACGACG TGGCCGACTT CCGGGACGTG GACCCGCGCC TGGGCACCCT GGAGGACTTC
GACGCGATGG TGGCCGCCGC GCACGCGCTG GGTCTGCGGG TGATGGTCGA CATCGTGCCC
AACCACACCT CCGAGGAGCA CCCGTGGTTC CGGGAGGCGC TGGAGGCCGG CCCCGGGTCG
CCCGAGCGGG AGCTGTACGT CTTCCGCGAC GGGCGGGGCT CGGACGGCGA GCTGCCGCCG
ACCAACTGGC GTTCGACGTT CGGCGGTCCG GCGTGGACCC GCGTGGCGGA CGGGCAGTGG
TACCTGCACA TGTTCGCGCC CGAGCAGCCG GACCTGAACT GGGACGACCC GCGGGTGCGC
GAGGAGTTCC GCGGCATCCT GCGGTTCTGG AGCGGGCGCG GGGTGGACGG GTTCCGGATC
GACGTGGCGT ACGCGCTGGT CAAGGACCTG CGGGAGCCGT TGCGCGACCT GGTGCTGGTG
GAGGGCGGCC GGTTCGAGGA CATCGCGGCC AACCCGGACC ACCCGTTCCT GGACCGGCCC
GAGGTGCACG AGGTGTACCG GGACTGGCGG CGGGTGCTGG CGGAGTTCGA CCCGCCCCGG
GCCACGGTGG GCGAGGTGTG GCTGCCCGGT GAGCGGCGGG TGCTGTACAC GCGCCCGGAC
GAGCTGGACC AGGCGTTCAA CTTCGACTTC CTGAGGACCT CGTGGGACGC CGACGCCTAC
CGCGGCGTGA TCGACTCCTC GATCGCCGAC GCCGGGCAGG TCGGCACGGT GCCCACGTGG
GTGATCGGCA ACCACGACGT GGTGCGGCCG GTGTCGGTGC TGGGGCTGCC CCCGGGCACC
GACCAGAAGG CGTGGCTGCT CTCCGACGGG CGCGACCCGG AGCCGGACCT GGAGCTGGGC
ACGCGGCGGG CACGGGCGCT GGCACTGCTG GAGCTGTCGC TGCCGGGGTC GGCGTACGTG
TACCAGGGCG AGGAGCTGGG GCTGCCGGAG GTGGCGGACC TGCCCGCGCG GGCGCTGGAG
GACCCGCGGT GGGTGCGCAG CGGGCACACC GACAAGGGGC GGGACGGGTG CCGGGTGCCG
CTGCCGTGGA CACGGGAAGG AGCGTCGTAC GGGTTCGGCG GGGACACCCC GTGGCTGCCC
CAGCCCCGAG GGTGGGGCCG GTGGTCGGTG CGGGCCCAGA ACGACGACCC CGGGTCCGTG
CTCTCCCTGT ACCGCCGGGC TCTGGCACAC CGCCGGGAGT TCTCCTCGGA CGAGACTCTG
AGCTGGGACG ACACGCTGAA CCGGGGGCCG GTGCTGGCCT ACTGGCGGGG TGGGGACGTG
CTGGTGCTGG TCAACACGGG CGAGGAGGCG GTGGAGCTGC CGCCGGGCCG GGTGCTGGTG
GCCAGCGCGG AGCTGGACGG GCGGCTCCCG GGAAACGCGG CGGTGTGGCT GCGCCGCTGA
 
Protein sequence
MQQWWRDAVL YQVYPRSFAD ADGDGTGDIA GITARLGHVA DLGADGIWLS PFYTSPWADG 
GYDVADFRDV DPRLGTLEDF DAMVAAAHAL GLRVMVDIVP NHTSEEHPWF REALEAGPGS
PERELYVFRD GRGSDGELPP TNWRSTFGGP AWTRVADGQW YLHMFAPEQP DLNWDDPRVR
EEFRGILRFW SGRGVDGFRI DVAYALVKDL REPLRDLVLV EGGRFEDIAA NPDHPFLDRP
EVHEVYRDWR RVLAEFDPPR ATVGEVWLPG ERRVLYTRPD ELDQAFNFDF LRTSWDADAY
RGVIDSSIAD AGQVGTVPTW VIGNHDVVRP VSVLGLPPGT DQKAWLLSDG RDPEPDLELG
TRRARALALL ELSLPGSAYV YQGEELGLPE VADLPARALE DPRWVRSGHT DKGRDGCRVP
LPWTREGASY GFGGDTPWLP QPRGWGRWSV RAQNDDPGSV LSLYRRALAH RREFSSDETL
SWDDTLNRGP VLAYWRGGDV LVLVNTGEEA VELPPGRVLV ASAELDGRLP GNAAVWLRR