Gene Ndas_0267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0267 
Symbol 
ID9244101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp333253 
End bp334572 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content75% 
IMG OID 
Productacyltransferase 3 
Protein accessionYP_003678222 
Protein GI297559248 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.561641 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCCA CCTCCTCCTC CGCGCCCGGG GACCCCGCCG CCGGTGCCCA CTTCGCCAGC 
GCGCGCACCC CCGATCCCTC GCTGGTGCTG CCGCAGGTCG ACGGCCGCCA CGAGTCCCTC
GACGGCGTCC GCGCCGTGGC CGCCTTGATG GTCCTGGTGT TCCACGTGGC CACCGAGACC
GGCGACTCCC TGGCGCCGGG GGTGCTGGGC GGCCTGCTGT CCGCCTTCGA CGTGGCCGTG
CCGCTGTTCT TCGCGCTCTC GGGCGTGCTG CTGTACCGCC CCTGGGCCCG CGCGGCCCTG
GACGGGACGC GCGGGCCGCG CGCCCGCCCC TACCTGTGGC GCCGCGCCGT GCGCGTCCTG
CCCGCCTACT GGCTGGTGGC CGTGACGGCA CTGCTCGTCT ACTCCCGCGA CGAACTCGGC
TCGCTCCGGT ACTGGTGGGA GGTCCTGACC CTCACCTTCC CCTTCAACAC CGACCCGCCG
TGGGTGGGCA CCGGCCCCTA CGGCCTCGGC CAGATGTGGA GCCTGTCGGT GGAGGTCAGC
TTCTACCTGC TGCTCCCGCT GTTCGGTCTG GTCCTGGCGC TGTGGGCGCG CGGCGGCCGG
AGCGTGGACG CCCGGGGCCG GCGGCTGCTC GCGGGCCTGG GGGTCATGGC GCTGCTGGGC
CTGGCCGCGC TGGTCCCGCA GTTCTATCCC GAGCCCCGCG AGTACATGCA CGCCTGGCTG
CCCCGTGCTG CGGGCCTGTT CGCCGTCGGC ATGGCGCTGG CCGTGCTCTC GGAGTGGGCC
TGGCGCGAAC CCGGCGCCGA CGGCCCGGTG CGCCGCCTGT GCCGGACCCT GGGCTCCTCT
CCCGGGGTGT GCTGGCTGGT CGCGGGCGGG TTCTTCGCCC TGAAGGCCAC CGAGGCCTCC
GGCGGCCGCT TCATCGGCTC CGGCGACATC TGGACCTCGG CCGTGGACTC GCTGGCCGGG
ATCGGGTTCG CGTTCTTCCT GATCGCCCCG TGCGCGCTGG CGCCGCGCCC GGCGGGGCCC
GCCCTCCCGC TGCGGGAGGC GCGGGCCTGG CGCGGGGGCC GGTGGCTGGA CTCGCTGCTG
CGGCACCGGG TGTGCCAGTT CCTGGGGCGG ATCTCCTACG GCGTGTTCCT GTGGCAGTTC
ATGGTGCTCT ACCTGTGGCG CGACTTCACC GGCCAGGAGA TCTTCACCGG GTCGTTCTGG
CTGGACGCGG TGCCGGTGAC GATCGGGACC GTGCTGCTGG CCGCCGCGAG CCACCGGTGG
GTGGAGGAGC CCAGCCGCCG ATGGTCCCAG CGCCTCTCCC GCGGACGGTC CTCCGCCTAG
 
Protein sequence
MAPTSSSAPG DPAAGAHFAS ARTPDPSLVL PQVDGRHESL DGVRAVAALM VLVFHVATET 
GDSLAPGVLG GLLSAFDVAV PLFFALSGVL LYRPWARAAL DGTRGPRARP YLWRRAVRVL
PAYWLVAVTA LLVYSRDELG SLRYWWEVLT LTFPFNTDPP WVGTGPYGLG QMWSLSVEVS
FYLLLPLFGL VLALWARGGR SVDARGRRLL AGLGVMALLG LAALVPQFYP EPREYMHAWL
PRAAGLFAVG MALAVLSEWA WREPGADGPV RRLCRTLGSS PGVCWLVAGG FFALKATEAS
GGRFIGSGDI WTSAVDSLAG IGFAFFLIAP CALAPRPAGP ALPLREARAW RGGRWLDSLL
RHRVCQFLGR ISYGVFLWQF MVLYLWRDFT GQEIFTGSFW LDAVPVTIGT VLLAAASHRW
VEEPSRRWSQ RLSRGRSSA