Gene Ndas_3434 details

Gene Information

Locus tag: Ndas_3434
Symbol:
ID: 9247301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4111150
End bp: 4112250
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: N-acetylmuramoyl-L-alanine amidase family 2
Protein accession: YP_003681345
Protein GI: 297562371
COG category:
COG ID:
TIGRFAM ID:
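
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon (the function names are hypothetical, chosen for illustration):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4111150, 4112250)
print(length, protein_length_aa(length))  # 1101 366
```

Both results agree with the Gene Length and Protein Length fields of this record.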


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0021339
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.405491
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAGGC GTACCCTCCT GGCCGGTGCC ACCGCCGCTG CGGGACTGAC CGCGCTGGGC 
ACCGCGCTCA CCTCCGCCCG CCCCGCACTG GCCGACACCG GCGGCGACCG CACGGTCCCC
CGCGTGCTGG CGGAGCCCTC CGCCTCACAC GGACTGGTCC GCCCCGACCT GCGCTTCGAC
ATGGTGGCCG TCACCGGTGA CCCCGGAGAG GCCGACGCGG CGGTCCGCTT CGAGACCGCC
GACGGCCTGG GCTCGTGGAA CCCGGTGCAC CTGCACACCG GGGGCCGCGA CGACCGGGAC
CCGGTCGCCG CCGCGCTGGT GCGCGCGCCC GAGGGCGCCA CCGGTTACGA GGTGCGTTCG
AGAGGGGGGA CCGCGGCCGT GAACCTGCGC GACGGAGAGG GTCTGCGCTT CGGCGGCCCC
GCACAGGCGT CGCTGTCCGC GGAGGCCTCC GGTACGCTGC GCGGCCGGAC CAGCGTCCCG
TTCCGCACCC GCGCGGGCTG GGGCGCCGAC GAGTCCTGGC GTTTCGACGA CCAGGGCGAC
AACCTGTGGG AGGCGGAGTT CCACCCCGTG CAGGCGCTGA CCGTGCACCA CACCGCGATG
CCGACCGGGG ACGACCACGC GGCGGACGTG CGGGCGGTGT ACTACCTGCA CGCGGTGCAG
CAGCTGTGGG GGGACATCGG CTACCACGTG CTCATCGACC CCGACGGGGT GGTCTACGAG
GGCCGCCACT CGGGCGAGGA CGGCGTGCCG GTCTTCTCCG GGATCCCGCG GCCGGGGCGG
GCCGAGTCGG TGACCGCGGG GCACGCCTAC GGGTTCAACC AGGGCAACGT GGGCGTGTGC
CTGCTCGGGG ACTTCACCGA CGAGCTGCCC ACGCGGGCGG CGCAGGACTC CCTGGTCGAC
GTGCTGCGCG TGCTGTGCGC GGTGACCGGC GTGGACCCGG CCGGGCAGAT CGAGTACGTC
AACCCGGGCA CGGGCGTGGT CACGCCGGGC GACGCGATCT CCCGGCACCG CGACTGGCTG
GAGACCGAGT GCCCGGGCAA CGCCTTCTCC GAGGTGTTCG ACAGCGCGGT CCGCCAGCGC
GTCATCGCGG GCCTGGCCTA G
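
The GC content reported in Gene Information can be recomputed from the gene sequence above. The following is a minimal sketch, not the method used to build this record; `gc_percent` is a hypothetical helper, and it assumes the sequence is pasted as shown here, with whitespace between 10-base blocks.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Toy input, not the gene itself: 4 of 8 bases are G or C.
print(gc_percent("ATGC GTAC"))  # 50.0
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.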
 
Protein sequence
MRRRTLLAGA TAAAGLTALG TALTSARPAL ADTGGDRTVP RVLAEPSASH GLVRPDLRFD 
MVAVTGDPGE ADAAVRFETA DGLGSWNPVH LHTGGRDDRD PVAAALVRAP EGATGYEVRS
RGGTAAVNLR DGEGLRFGGP AQASLSAEAS GTLRGRTSVP FRTRAGWGAD ESWRFDDQGD
NLWEAEFHPV QALTVHHTAM PTGDDHAADV RAVYYLHAVQ QLWGDIGYHV LIDPDGVVYE
GRHSGEDGVP VFSGIPRPGR AESVTAGHAY GFNQGNVGVC LLGDFTDELP TRAAQDSLVD
VLRVLCAVTG VDPAGQIEYV NPGTGVVTPG DAISRHRDWL ETECPGNAFS EVFDSAVRQR
VIAGLA
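
The protein sequence corresponds to the gene sequence translated with translation table 11, whose amino acid assignments are the same as those of the standard genetic code. The sketch below illustrates this on the first 30 bases of the gene. The names `CODON_TABLE` and `translate` are hypothetical, the code assumes the sequence contains only A, C, G and T (any other symbol raises `KeyError`), and it does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene starts with ATG.

```python
# Codon table in the conventional T, C, A, G base order (standard amino
# acid assignments, shared by translation table 11); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, truncated to 30 bases.
print(translate("ATGCGAAGGC GTACCCTCCT GGCCGGTGCC"))  # MRRRTLLAGA
```

The output matches the first ten residues of the protein sequence above.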