Gene Ndas_1949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1949 
Symbol 
ID9245799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2372813 
End bp2374066 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content75% 
IMG OID 
ProductErythromycin esterase 
Protein accessionYP_003679882 
Protein GI297560908 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.372413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00455868 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACACTG GCACCACGAT CACCGCACAC ACCTTCGAGG CCGCCGCCGC CATGGGGCTG 
CTCCCGGCCC GGCCGCGACT GCTCGCCCTG GGCGAGCCCA CCCACGGGGA GGACGCCCTG
CTGGACCTGC GCAACGGGCT CTTCCGCCAG CTCGTCGAGC AGGAGGGCTA CCGGACGATC
GCCGTCGAGA GCGACTGTCT GGCGGGCCTG GCCGTGGACG CGTACGTCAC CTCGGGCACG
GGCACCCTCG ACGAGGCCAT GGAGCACGGG TTCAGCCACG GGCTCGGCGC GTCGGCGGCC
AACCGCGAAC TCGTACGCTG GATGCGCGCC CACAACGACG GCAGGCCCGC CGCCGAACAG
GTGCGCTTCG CCGGTTTCGA CGGCCCGCTG GAGGTCACCG GCGCCCAGAG CCCCCGGCGG
GCCCTGACCG CGCTCCACGA CTACCTCGCG CGCTGGGTGG AGGCGGACCA GCTCCCGTGC
GACGCGCGGA CGCTGGACCG CCTGGTCGGC GACGACGCGC GGTGGACCAA CCCCGACGCG
ATGCTGGACC CGGCCGAGTC CGTGGGGCGC TCGGACGACG TCCGGGAGCT GCGCATGCTC
GCCGACGACC TGGCGGCGCT GCTCGACGCG CACACGCCGC GCCTGGTCTC GGCGACCTCG
CGCGAGGACT GGGACCGGGC GCGCCTGTAC GGGCGCACCG CCACCGGCCT GCTGCGCTAC
CACTTCTGGA TGGCCGACAC CTCCTCGCGC CGCATGACGC GGCTGGAGGA CCTGGGCATG
ACGGTCGACA CCTCACCGAG CCGGATGACG CGGCTGCTGG GCCTGCGCGA CCAGATGATG
GCCGACAACC TCTTCGCCCT CGCCGAGCGG GGCCCGGTGC TGGTCCACGC CCACAACTCC
CACCTCCAGC GCGGCATGAG CACGATGCGG ATGGGCGGGC CGCCGCTGGA CTGGTGGGGC
GCCGGGGCGA TCGCGGGCGC CCGCCTGGGG CAGGAGTACG CCTTCCTGGC CACGGCCGTG
GGCACGATCC GGCACCGGGG CGTGGACACC CCGCCCCCGG ACAGCGTCGA GGGCCTCCTG
TACGCCCTCG GGGAGGAGCG CTGCGTGGTC GACGCGCCCC GGCTGGCCGC GGACCTGGAC
GGCGCGATCC CCGCACCCCG TGTGTCCCCC TGGTTCGGCT ACGCCCCGCT CGATCCGGCC
CGTCTGGCCG ACAGCGACGG GATCGTGTTC GTCAGGGACC TCCGGCAGGG CTGA
 
Protein sequence
MDTGTTITAH TFEAAAAMGL LPARPRLLAL GEPTHGEDAL LDLRNGLFRQ LVEQEGYRTI 
AVESDCLAGL AVDAYVTSGT GTLDEAMEHG FSHGLGASAA NRELVRWMRA HNDGRPAAEQ
VRFAGFDGPL EVTGAQSPRR ALTALHDYLA RWVEADQLPC DARTLDRLVG DDARWTNPDA
MLDPAESVGR SDDVRELRML ADDLAALLDA HTPRLVSATS REDWDRARLY GRTATGLLRY
HFWMADTSSR RMTRLEDLGM TVDTSPSRMT RLLGLRDQMM ADNLFALAER GPVLVHAHNS
HLQRGMSTMR MGGPPLDWWG AGAIAGARLG QEYAFLATAV GTIRHRGVDT PPPDSVEGLL
YALGEERCVV DAPRLAADLD GAIPAPRVSP WFGYAPLDPA RLADSDGIVF VRDLRQG