Gene Ndas_0695 details

Gene Information

Locus tag: Ndas_0695
Symbol:
ID: 9244537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 856354
End bp: 857955
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: alpha amylase catalytic region
Protein accession: YP_003678646
Protein GI: 297559672
COG category:
COG ID:
TIGRFAM ID:
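
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(856354, 857955)
print(length_bp)                  # 1602
print(protein_length(length_bp))  # 533
```

Both values agree with the Gene Length and Protein Length fields listed above.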


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.497083
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCCTC AGTCGCACTG GTGGCGTGAC GCGGCGATCT ACCAGATCTA CGTCCGCAGC 
TTCGCCGACT CCAACGGTGA CGGTGAGGGC GACCTCGCCG GCATCCGCGA GCGCCTCCCC
CACCTGGCCG AACTCGGCGT GGACGCGATC TGGCTGACGC CGTTCTACGT CTCCCCCCTC
GCCGACGGCG GTTACGACGT CGCCGACTAC CGCGACGTCG ATCCCCGTTT CGGGACCCTG
GAGGACTTCG ACGCCCTCCT GGAGACCGCC CACGGGATGG GCATCCGCCT GATCATCGAC
GTGGTGCCCA ACCACTCCTC CTCCGCCCAC CGGTGGTTCA AGGAGGCCGC GGCCGCCGAG
CCCGGCGAGC ACGCCCGCTC GCGGTACGTC TTCCGGGACG GCAGGGGACC CGACGGGGAG
GAGCCGCCGA ACAACTGGAA GTCGATCTTC GGCGGCCCGG CCTGGACCCG CCTCAAGCGG
CCCGACGGCA CCCCCGAACA GTGGTACCTG CACCTGTTCG ACCCCGAGCA GCCCGACTTC
GACTGGACCA ACCAGGAGGT GCACGACGAG TTCGACGACG TGCTGCGCTT CTGGCTGGAC
CGGGGCGTGG ACGGGTTCCG CATCGACGTC GCGCACGGCA TGGTCAAGGA CCCGGCGATG
CCCGACATCG CCGAGGACGC CAAGGCCGAC ATGCTCGACG GCAGCACCTC GCTGCCCTAC
TTCGACCAGG ACGGCGTGCA CGAGATCTAC CGCCGCTGGG CGCGGATCGC CGCGGAGTAC
CCGGGCGACC GCGCCCTGGT GGCCGAGGCG TGGGTGGAGG ACGCCCAGCG GGTGGCCCGC
TACCTGCGCC CGGACGAACT GCACCAGGCG TTCAACTTCG AGTACCTGAC CGCGCCGTGG
GACGCCGGGC GCCTGCGCGA GGTCATCGAC GTGTCCCTGG CCGCCAACAC CGCGGTGGGC
GCGACCACCA CGTGGGTGCT GTCCAACCAC GACGTGACCC GCCACGTGAC CCGCTTCGGC
GGCGGTGAGC AGGGGCTGCG CCGCGCGCGC GCGGCGACGC TGCTGATGCT GGGCCTGCCG
GGTTCGGTGT ACCTGTACCA GGGCGAGGAG CTGGGACTGC CCGAGGTCAC CGACCTGCCC
GAGGACTCGC TGCAGGACCC GACCTGGGAG CGTTCGGGCC GCACCGACCG CGGCCGGGAC
GGCTGCCGGG TGCCGCTGCC GTGGGGCGGC GACCAGGCGC CGTACGGGTT CGGCCCCGAG
GGCAGCGTTC CGTGGCTGCC GATGCCCGAG GGCTGGGGCG CCCTGTCGCG CGCCGCCCAG
CGCGGAGTGG AGGGCTCCAC GCTGGAGCTG TACACGAAGG CGCTGCGCCT GCGCCGTGAG
CTGGACGCGC TCGGCGACGG CTCGATGGCC TGGCTGGACG CGCCCGGCGG CGTGCTGTAC
TTCGAGCGCG AGCCGGGTGT GCGCGTCGCG GTGAACCTCA CCGGGGAGGC CGTGGAGCTG
GCTGCCGACG GCGAGGTGCT CGTGGCCAGC GGTCCGGTCG GCGAGCCCGC CGGAGGCACC
CTGAGCCTGC CCGCCGAGAC GGCGGTGTGG CTGCGCGGTT GA
 
Protein sequence
MEPQSHWWRD AAIYQIYVRS FADSNGDGEG DLAGIRERLP HLAELGVDAI WLTPFYVSPL 
ADGGYDVADY RDVDPRFGTL EDFDALLETA HGMGIRLIID VVPNHSSSAH RWFKEAAAAE
PGEHARSRYV FRDGRGPDGE EPPNNWKSIF GGPAWTRLKR PDGTPEQWYL HLFDPEQPDF
DWTNQEVHDE FDDVLRFWLD RGVDGFRIDV AHGMVKDPAM PDIAEDAKAD MLDGSTSLPY
FDQDGVHEIY RRWARIAAEY PGDRALVAEA WVEDAQRVAR YLRPDELHQA FNFEYLTAPW
DAGRLREVID VSLAANTAVG ATTTWVLSNH DVTRHVTRFG GGEQGLRRAR AATLLMLGLP
GSVYLYQGEE LGLPEVTDLP EDSLQDPTWE RSGRTDRGRD GCRVPLPWGG DQAPYGFGPE
GSVPWLPMPE GWGALSRAAQ RGVEGSTLEL YTKALRLRRE LDALGDGSMA WLDAPGGVLY
FEREPGVRVA VNLTGEAVEL AADGEVLVAS GPVGEPAGGT LSLPAETAVW LRG
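
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The sketch below is a minimal standard-library illustration, not the method used to produce this record. It builds the codon table from the NCBI base ordering (TCAG); translation table 11 assigns the same amino acids as the standard table and differs only in the permitted start codons, which this sketch does not model. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGGAGCCTC AGTCGCACTG GTGGCGTGAC GCGGCGATCT ACCAGATCTA CGTCCGCAGC"
last_line = "CTGAGCCTGC CCGCCGAGAC GGCGGTGTGG CTGCGCGGTT GA"

print(translate(first_line))  # MEPQSHWWRDAAIYQIYVRS
print(translate(last_line))   # LSLPAETAVWLRG
```

The two outputs match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.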