Gene Ndas_3624 details


Gene Information

Locus tag: Ndas_3624
Symbol:
ID: 9247493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4346222
End bp: 4347577
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: peptidase M50
Protein accession: YP_003681530
Protein GI: 297562556
COG category:
COG ID:
TIGRFAM ID:
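
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4346222
end_bp = 4347577

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the sketch reproduces the listed 1356 bp and 451 aa.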


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.670502
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGCCC TGATGACCGT GGTCGGGATC GTCCTGTTCG TCTTCGGCCT GCTGTTCTCG 
ATCGCCTGGC ACGAGCTGGG ACACATGTCC ACGGCCAAGA TGTTCGGGAT CAAGTGCACC
GAGTTCATGG TGGGCTTCGG CAAGACCCTG TGGTCCGTCC GCAGGGGCGA GACCGAGTAC
GGCATCAAGG CGGTCCCGCT GGGCGGGTTC GTGCGCATGG TCGGCATGCT CCCGCCCTCG
CGTCAGAGCG CCGACGGCAG CTCCCGCAAG CTCTCGCGGT GGCGGGCCAT GGCCGAGGAC
GCCCGTGAGG CGTCCTACGT CGAGCTGTCC CCCGAGGACC AGGACCGGCA GTTCTACCAG
CGGGCGCCCT GGAAGCGCCT CATCGTGATG TTCGCCGGTC CGGGCATGAA CGTCATCCTG
GCCGCGATCC TGCTCGCCGT GCTGTTCATG GGCATCGGGG TGCCGCAGAG CACCACCCAG
ATCGCCACGG TCAGCGAGTG CGTGGTCCCC GCGGGCAGCT CCGTCACCGA CTGCGAGGAC
GCCCCGCCCA CGCCCGCGGC CGAGGCCGGG ATGCTGCCCG GCGACGTCAT CGTCTCCGTC
GGCGGCGAGT CCACCCCGGA CTGGAGCACC GCCAACCGGC AGATCCGCGA GGCCATGGGT
GACACGGAGA TCGTGGTCGA GCGCGACGGC GAGCGGCTGC CGCTGAACGT CGACATCGTC
GAGAACGAGC TCCCGGCCCG CGACGCCGAG GGCGAGTTCG TCTACGAGAC CGACGCCGAC
GGCGAACCGG TCTACGACGA GCAGGGCTAC CGCGTCTACG AGACTGAGGT CGTGGGCTTC
CTCGGCATCG TCTTCGCCAC CGAGCGCGCC CCGCTCACCC TCGCCGAGTC CGCCGCCGAG
ATGGGCAACA TGATGATCGG CGTCGGCGAG GCCCTCATCG CCCTGCCCAG CAAGGTGCCC
GACGTGTTCG CCGCGGCCTT CCTGGGCGAG CAGCGCACCC AGGACTCCCC GGTGGGCATC
GTCGGCATCT CCCGCATCGG CGGAGAGATC ATGGCCCAGG GACTCCCCGT GGCCGACACC
GCCGCGATCA TGATCCAGAT CCTGGCCGGG GTGAACCTCT TCCTGTTCGC CTTCAACCTG
GTGCCCATCC TGCCGCTGGA CGGCGGGCAC ATGGCCGGCG CCATCTGGGA GTGGATCAAG
CGCGGCTGGG CCAAGCTGTT CCGCAGACCC GAACCCGCCC CGGTGGACGT GGCGATGCTG
ACACCGGTGG CCTACGTGGT CGTGGCCTGC TTCCTGGTGT TCAGCGTGGT CCTGCTCGTG
GCCGACCTGT TCAACCCGGT CAGACTCTTC GGCTGA
 
Protein sequence
MVALMTVVGI VLFVFGLLFS IAWHELGHMS TAKMFGIKCT EFMVGFGKTL WSVRRGETEY 
GIKAVPLGGF VRMVGMLPPS RQSADGSSRK LSRWRAMAED AREASYVELS PEDQDRQFYQ
RAPWKRLIVM FAGPGMNVIL AAILLAVLFM GIGVPQSTTQ IATVSECVVP AGSSVTDCED
APPTPAAEAG MLPGDVIVSV GGESTPDWST ANRQIREAMG DTEIVVERDG ERLPLNVDIV
ENELPARDAE GEFVYETDAD GEPVYDEQGY RVYETEVVGF LGIVFATERA PLTLAESAAE
MGNMMIGVGE ALIALPSKVP DVFAAAFLGE QRTQDSPVGI VGISRIGGEI MAQGLPVADT
AAIMIQILAG VNLFLFAFNL VPILPLDGGH MAGAIWEWIK RGWAKLFRRP EPAPVDVAML
TPVAYVVVAC FLVFSVVLLV ADLFNPVRLF G
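
The gene and protein sequences above are related by codon-by-codon translation. The following is a minimal sketch of that step, assuming that translation table 11 assigns internal codons the same amino acids as the standard genetic code (alternative start codons are not handled). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and are not part of the source database.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same assignments for
# internal codons; alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Drop the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First line of the gene sequence, copied from the record.
first_line = "ATGGTCGCCC TGATGACCGT GGTCGGGATC GTCCTGTTCG TCTTCGGCCT GCTGTTCTCG"
print(translate(first_line))  # MVALMTVVGIVLFVFGLLFS
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and GC content.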