Gene Ndas_4041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4041 
Symbol 
ID9247913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4834450 
End bp4835484 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content74% 
IMG OID 
Productmetalloendopeptidase, glycoprotease family 
Protein accessionYP_003681944 
Protein GI297562970 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACC TGCCGCTGGT CATGGGGATC GAGACGTCCT GCGACGAGAC CGGCGTGGCC 
CTGGTGCGCG GCTGCGAGCT GCTCGGCGAC GCCGTGGCCT CCAGCGTGGA CCAGCACGTC
CGGTTCGGCG GCGTGGTGCC CGAGGTGGCA AGCCGCGCGC ACCTGGAGGC GATGACCCCG
ACCGTGCACC GGGCGCTGGA AAAGGCCGGG GCCAAGCTGT CGGACGTGGA CGCCATCGCC
GTCACCGCCG GTCCGGGCCT GGCGGGCGCC CTGCTCGTGG GCGTGTCCGC GGCCAAGGCG
TACGCGATGG CCCTGGGCAA GCCGCTCTAC GGCGTGAACC ACCTCGTGGG CCACGTGGCC
GTGGACCAGC TGGAGCACGG ACCGCTGCCC AAGCCCTCGA TCGCCCTGCT GGTGTCGGGC
GGCCACACCT CGCTGCTGCT GGTCAACGAC CTGGCCACCG AGGTGGTCTC GCTCGGCGAC
ACCGTGGACG ACGCCGCCGG TGAGGCCTAC GACAAGGTGG CGCGCCTGCT CGACCTGCCC
TACCCGGGCG GCCCGCCCAT CGACAAGGCG GCGCAGCGGG GCGACCCCAA GGCGATCCGC
TTCCCGCGCG GCAAGTGGGG CGACGGCACC TACGACTTCT CGTTCTCGGG CCTGAAGACC
GCCGTGGCCC GCCACGTGGA GGACACCGAC CGCCGGGGCG AGCCCCTGGT GGTCGCCGAC
ATCGCCGCCG CCTTCCAGGA GTCGGTGGTG GACGTGCTCA CCCGCAAGGC CGTGGACGCC
TGCGTGGAGC ACGGCGTGAG CACGCTGGTC ATCAGCGGGG GCGTGGCGGC GAACTCGGCG
CTGCGCGCGC TGGCCGAGGA GCGCTGCCGG GAGGCCGGCG TCGAACTGCG CGTCCCGCGC
CCGCGCCTGT GCACCGACAA CGGCGCGATG ATCGCCGCCT TGGGCGCCGA GGTCGTGGCG
GCGGGCCTGC CCGCGTCCAC GCTGGACCTG GCCACGGACA CCTCCCTGCC GGTGAGCTCC
CCGCTGGCGG TGTAG
 
Protein sequence
MSDLPLVMGI ETSCDETGVA LVRGCELLGD AVASSVDQHV RFGGVVPEVA SRAHLEAMTP 
TVHRALEKAG AKLSDVDAIA VTAGPGLAGA LLVGVSAAKA YAMALGKPLY GVNHLVGHVA
VDQLEHGPLP KPSIALLVSG GHTSLLLVND LATEVVSLGD TVDDAAGEAY DKVARLLDLP
YPGGPPIDKA AQRGDPKAIR FPRGKWGDGT YDFSFSGLKT AVARHVEDTD RRGEPLVVAD
IAAAFQESVV DVLTRKAVDA CVEHGVSTLV ISGGVAANSA LRALAEERCR EAGVELRVPR
PRLCTDNGAM IAALGAEVVA AGLPASTLDL ATDTSLPVSS PLAV