Gene Ndas_1191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1191 
Symbol 
ID9245041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1445626 
End bp1446924 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content74% 
IMG OID 
ProductErythromycin esterase 
Protein accessionYP_003679138 
Protein GI297560164 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.295106 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCCG GGACCGGCCC GGACGCCCTC CCGCGGCTGT CCGGCCCGAG GTCGCTGGAC 
CCGCTCCTGG AGCGGATCGG CGACGCGCGC TACGTGCTGC TGGGCGAGGC CTCGCACGGC
ACCGCCGAGT TCTACCGCTG GCGGGCGGAA CTGACCCGGC GGCTGATCGA CGAGGCGGGC
TTCTCCTTCG TCGCGGTGGA GGGCGACTGG CCCGACTTCC AGCAGCTGCA CTGCGGCGTC
GTGGGCGCGC CGGGGGCGCC CGAGGACCCC AGACGGATCC TGGAGGGCTT CCGCAGGTGG
CCGCAGTGGA TGTGGGCCAA CACCGAGGTG CTGGAGTTCG CCCGGTGGCT GCGCGAACAC
AACCTGGGGC TCGCGCCGGA GCGGCGCGCC GGGTTCTTCG GCCTGGACGT CTACAGCCTG
TGGGAGTCGC TGTACGCGGT GCTGGGGTGG CTGCGCGAGA ACATGCCCGA GCAGGTGGAA
CCGGCTCTGA ACGCCTACCG CTGCTTCCAG CCCTACGGAG AGGACCCCCA GGCCTACGCC
CGCGCGACGC GGCTGGTGCC CGAGAGCTGC GAGGCGGAGG TCGTGCGCCT GCTGGCCGGG
CTGCGCGAGC GGGCGGGGGA GGCCTCCTCC GCCGAGGACC TGGCCGAGTT CGCCGCGCGC
CAGAACGCCG AGGTGCTCGC GGACGCCGAA CAGTACTACC GGGCGCTGGT GCGCGGCGGC
CCCGAGTCGT GGAACGTCCG CGACCACCAC ATGGCCGACA CGCTCGACCG GCTCATGGAG
TACCACGGGC CCGGAGCCAA GGCCGTGGTG TGGGAGCACA ACACCCATGT CGGCGACGCG
CGCGCCACCG ACATGGCCGC CTCGGGCATG GTCAACGTGG GGCAGTTGGT GCGTGAGCGC
CACGGCGACG AGGGCGTGGT CCTCGTCGGC TTCGGCACCT ACGAGGGCCG GGTCATGGCC
GCCCGGGCCT GGGGGGAGAC CCCCGAACCG ATGCCGGTGC CCGCGGCGCG CCACGGGAGC
GTGGAGGCGC TGCTGCACCA GTCGTTCGAG GGGGAGACGG GCCTGCTGCT CCTGACCGGC
GAGGGGGCGG TCGATCCGTT CGCGGGCGAG GTCCTGCCGC ACCGCGCCGT CGGCGTCATC
TACCACCCCG GACGCGACGG GCTGCGCAAC TACGTGCCGA CCGTGCTGGG GGAGCGCTAC
GACGCCTTCG TGTTCGTCGA CCGCACCCGC GCGCTCACGC CCCTGCACGA GGTCGAGGAG
GACTCGGGTG AGGAGGGGAC CTGGCCCAGC GGCCAGTGA
 
Protein sequence
MSPGTGPDAL PRLSGPRSLD PLLERIGDAR YVLLGEASHG TAEFYRWRAE LTRRLIDEAG 
FSFVAVEGDW PDFQQLHCGV VGAPGAPEDP RRILEGFRRW PQWMWANTEV LEFARWLREH
NLGLAPERRA GFFGLDVYSL WESLYAVLGW LRENMPEQVE PALNAYRCFQ PYGEDPQAYA
RATRLVPESC EAEVVRLLAG LRERAGEASS AEDLAEFAAR QNAEVLADAE QYYRALVRGG
PESWNVRDHH MADTLDRLME YHGPGAKAVV WEHNTHVGDA RATDMAASGM VNVGQLVRER
HGDEGVVLVG FGTYEGRVMA ARAWGETPEP MPVPAARHGS VEALLHQSFE GETGLLLLTG
EGAVDPFAGE VLPHRAVGVI YHPGRDGLRN YVPTVLGERY DAFVFVDRTR ALTPLHEVEE
DSGEEGTWPS GQ