Gene Ndas_0495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0495 
Symbol 
ID9244336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp603360 
End bp606740 
Gene Length3381 bp 
Protein Length1126 aa 
Translation table11 
GC content76% 
IMG OID 
Product6-deoxyerythronolide-B synthase 
Protein accessionYP_003678448 
Protein GI297559474 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.764757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.272973 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACCG ATAGCGCTCA GAACACCAAG CTCGTCGCCG CGCTGCGCTC CGCTCTCAAG 
GAGAACGAGC GCCTGCGCGG CCGCAACCGG GAACTGGACC GCAGAAGCGA CCCCGTGGCC
GTCGTGGGCA TGGGGTGCAG GCTGCCGGGG GGAGTGGACT CGCCAGAGGG TCTGTGGGAT
CTGGTGGAGC AGGGCACCGA CGCGATCGGG CCCTTCCCCG AGGACCGGGG CTGGGACCTG
GACGCCGTGC TCGGCGGTTC CGAGCCCCTG TCCGCGATGG ACCAGGGCGG CTTCTTGGAG
GGCGCCGCCG ACTTCGACGC CGCGTTCTTC GGGATCTCCC CGAAGGAGGC CGCCGCCATG
GACCCCCAGC AGCGCCTGCT GCTCGAACTG AGCTGGGAGG CCCTCGAACG CGCCGGGATC
GACCCCTCCT CGCTGGCCGG GACGCCCACT GGCGTCTACG TGGGGCTGAT GGCTGCCGAG
TACGGCCCCC GGCTCTCCGA CGCCCCCGAG GACGGGCACC TGCTCACCGG CACCATGCCC
AGCGTGGCCT CCGGCCGCAT CGCCTACACC CTGGGCCTGG CGGGTCCCGC GCTGACCGTC
GACACCGCCT GCTCCTCCTC CCTGGTCGCA CTGCACCTGG CGGTGCGCGC CCTGCGCGAG
CGGGAGTGCT CCCTGGCGCT GGTGGGCGGC GCCACCGTCA TGGGAACCCC CGGAGTGCTG
GCCGAGTTCA GCCGCAAGCA GGGGTTGTCC GCCGACGGCC GCTGCCGGGC CTTCGGCGCG
GGGGCCAACG GCACCGGGTT CGCCGAGGGG GCGGGCGCGC TGGTCCTGGA ACGCCTCTCC
GACGCCAGGC GCAACGGACA CCGCGTGCTC GCCCTGGTCC GGGGCAGCGC GGTCAACCAG
GACGGCGCCT CCAACGGGCT CACCGCGCCC AACGGGCCCT CCCAGGTGTC CGTGGTGCGG
GCCGCCCTGG ACGACGCCGG AATCTCCCCG GACGGGGTGG ACGCGGTCGA GGCCCACGGC
ACCGGGACCA CCCTGGGCGA CCCCATCGAG GCCCAGGCCC TCATCGAGGC CTACGGGGGA
GAGCGCGGGG CCCCGCTGCT GGTGGGCTCG CTCAAGTCCA ACATCGGCCA CACCCAGGCC
GCGGCCGGGG TGAGCGGTGT GGTCAAGGCG GTGCAGTCCC TGCTGCACGG GACCGTCGCG
CCCACCCTGC ACGCCGAGGA GCCCTCGACC CACGTGGACT GGGACTCCGG CGCGGTCCGC
CTGGCCACCC GGGCGCACCC GCTGCCCGAC CTGGGCCGGC CCGCCCGGAT CGGGGTCTCC
TCCTTCGGGA TCAGCGGAAC CAACGCCCAC GTCATCCTGG AGGCGCCGCC CGCAGAGGCC
GCCGACACCG GGGCCCCGGC CTCCGCGCGG GAGAGCGGAC CCGTCCCGGC CCCGCGGACC
CTGCCGTTCG TGGTCTCCGC GCGAGGGGAG GAGGCCCTGC GGACACAGGC CGCGCGCCTG
TCGGCCCACC TGGGCGCCGC GCCGGCCACC GATCCGGGCC GCGAACCCGA GGGCGGCCCC
GGGGAACACG CGGCCCTGGT GGCCACGGCC CGTGCCCTGG CCACCACCCG CGCGCGCCTG
GAGGACCGCG CCGTCGTCCT GGCCTCCGAC ACCGGCGGCC TCACCGCCGC CCTGGACGCC
CTGGCCGAGG GGCGCCCCGA CGCCGCCCTC GTGCGCGGGC GCGCGGACAC GGGCGGCTCC
GTGGCCTTCG TCTTCTCCGG TCAGGGATCC CAGCGCCTGG GCATGGGCCG CGGCCTGTAC
GAGGCCCACC CCGCCTTCGC CCGTGCGCTG GACGAGGCCG TCGACGCCCT GGACTGCCAC
CTGCCGCGTC CGCTGCGCAC GGTGATGTGG GCGCGGGAGG GCACCGCCGA GGCCGAACTG
CTGGACCAGA CCCTCTACAC CCAGGCTGGG GTCTTCGCGG TCGAGGTGGC CGTGGTGCGG
CTGCTGGAGT CGCTGGGCGT GGTCCCCGAC CACGTCGTCG GCCACTCCAT CGGCGAACTG
GCCGCCGCGT ACGCGGCGGG CGTGTTCACG CTGGAGGACG CCGCCGCCCT GGTCGCCGCC
CGCGGTGCGC TCATGCAGGA CCTGCCCGAG GGCGGCGCGA TGGTCGCGGT CCAGGCCGAG
GAGCGGGAGG CCCGCCAGGC GGTCGCCCGC TCCGGCGGAC AGCTCTCCGT CGCGGCCGTC
AACGCGCCCG ACAGCGTCGT GCTGTCCGGG GAGGAGACGG CCGTGGCCGC CCTGGCCGAC
CACTTCGCCC GGCAGGGCCG CCGCACCAAG CGGCTGACGG TCTCCCGCGC CTTCCACTCC
CCGCTCATGG ACCCCATGCT CGGCGCCTTC GCCCGCGCCG CCGAACGCGT TTCCTACCAC
AGGCCCGTCC TCCCCCTGGT GTCCAACCTG ACCGGAGCCC GTGCGGGGGA GGAGGTCCGC
GAGCACGGCT ACTGGGTGCG GCACGTGCGC GAGGCCGTCC GCTTCGCCGA CGGCGTCGCC
CACCTGCACA AGGCCGGGGT CACCGAGTTC GTGGAGGTCG GCCCGGGCGG CGTGCTCAGC
TCCATGGTGC AGTCCTGCCT GGGCGGGGAC CGGGGGGCGC GGGTCACCGC GCTGCTCGGC
GGGGAGCGCG ACGAGCAGCG CGGGTTCGCC GAGGGGCTGG CCGCCCTGCA CGCCCGGGGC
GCGCACGTGG ACTGGTCCGC CTACCTGCCG CCCGGTCCGG TGACCGCCCT GCCCACCTAC
CCCTTCCAGC GCGAGCGCTA CTGGATGAAC CCGCCGGCCG GGGGCCGCGT CGTGGTCGAG
ACCGGCGCCG GGGCGACCGC CGCCGGGGGC GGTGCCGCGC CCGGCGCGCG CGCCGACCTC
ACCGCTCTGG GCGATGACGA GCGCACCGAG GCGCTGCTGG ACCTGGTGCG CCGGGAGTCG
GCCGCGCTGC TCGGCCACGC CTCCCCGCGC GACGTCCGCT CCGACCAGGG CTTCCTGGAG
ATGGGCCTGG ACTCCCTGGG CGCGGTCCGC CTCGGCGAGC GCCTGAGCGC CGCGACCGGT
CTGGACGTGT CGGCCACCAC CGTGTTCGAC CACCCCGAGC CCCGGGTCCT GGCCGCCCAC
CTGAGCGCGG AGCTCGCCCC CGAGGACATC GACGCCTCCC AGGCGGCCGG GGGCGACGAG
CGGGTCCGCG CCGCGCTCGC CCGGATCCCC GTGGACCGGC TGCGCTCGGC CGGACTCCTG
GAGGCCCTGC TCGCCCTGGA CACGGGAGTG GCGGCGGGGG ACGCGGGGGA GGACGGCGCA
GCCGAGGAGG AGGGCGCCAA CACGGACGTG GACTCCATGG ACGTCGACGA CCTCATGCGC
ATGGTGTACG GCCGAGACTG A
 
Protein sequence
MATDSAQNTK LVAALRSALK ENERLRGRNR ELDRRSDPVA VVGMGCRLPG GVDSPEGLWD 
LVEQGTDAIG PFPEDRGWDL DAVLGGSEPL SAMDQGGFLE GAADFDAAFF GISPKEAAAM
DPQQRLLLEL SWEALERAGI DPSSLAGTPT GVYVGLMAAE YGPRLSDAPE DGHLLTGTMP
SVASGRIAYT LGLAGPALTV DTACSSSLVA LHLAVRALRE RECSLALVGG ATVMGTPGVL
AEFSRKQGLS ADGRCRAFGA GANGTGFAEG AGALVLERLS DARRNGHRVL ALVRGSAVNQ
DGASNGLTAP NGPSQVSVVR AALDDAGISP DGVDAVEAHG TGTTLGDPIE AQALIEAYGG
ERGAPLLVGS LKSNIGHTQA AAGVSGVVKA VQSLLHGTVA PTLHAEEPST HVDWDSGAVR
LATRAHPLPD LGRPARIGVS SFGISGTNAH VILEAPPAEA ADTGAPASAR ESGPVPAPRT
LPFVVSARGE EALRTQAARL SAHLGAAPAT DPGREPEGGP GEHAALVATA RALATTRARL
EDRAVVLASD TGGLTAALDA LAEGRPDAAL VRGRADTGGS VAFVFSGQGS QRLGMGRGLY
EAHPAFARAL DEAVDALDCH LPRPLRTVMW AREGTAEAEL LDQTLYTQAG VFAVEVAVVR
LLESLGVVPD HVVGHSIGEL AAAYAAGVFT LEDAAALVAA RGALMQDLPE GGAMVAVQAE
EREARQAVAR SGGQLSVAAV NAPDSVVLSG EETAVAALAD HFARQGRRTK RLTVSRAFHS
PLMDPMLGAF ARAAERVSYH RPVLPLVSNL TGARAGEEVR EHGYWVRHVR EAVRFADGVA
HLHKAGVTEF VEVGPGGVLS SMVQSCLGGD RGARVTALLG GERDEQRGFA EGLAALHARG
AHVDWSAYLP PGPVTALPTY PFQRERYWMN PPAGGRVVVE TGAGATAAGG GAAPGARADL
TALGDDERTE ALLDLVRRES AALLGHASPR DVRSDQGFLE MGLDSLGAVR LGERLSAATG
LDVSATTVFD HPEPRVLAAH LSAELAPEDI DASQAAGGDE RVRAALARIP VDRLRSAGLL
EALLALDTGV AAGDAGEDGA AEEEGANTDV DSMDVDDLMR MVYGRD