Gene Ndas_0402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0402 
Symbol 
ID9244240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp492540 
End bp494228 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content73% 
IMG OID 
Productsignal peptide peptidase SppA, 36K type 
Protein accessionYP_003678356 
Protein GI297559382 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGACC TCGCGAAGAT CATGAAGACC CGACAGCGCC CCCAAGGGCC GCTGGTGCTG 
GAGCTGGACC TGACGGAGGG GGTCGCCGAC CAGGCCCCCG GGGATCCGTT CAGCCAGATC
ATGAACCGGC GCAGGCAGCA GTACCTGGAC GTCCTGGAGG GGATCCGCCG CGGCGCGCGC
GACCCGCGGG TGGCGGCCCT CCTGGTCCGG GTGGACGCCC GGTCCCTGGG ATTCGCCAAG
GTCCAGGAGC TGCGCGACAC CGTCGCCGAC TTCCGCGCGG CGGGCAAACC CGCGGTGGCC
TGGGCCGACT CCTTCGGCGA GACCGGCGAG GGCAACCTGC CCTACTACCT GGCCTGCGCG
TTCTCGCGCG TGGTCATGGC GCCCACCGGC GTGCTCGGCC TGACCGGCCT GATGATGCGC
ACGACCTTCG TCAAGGGCGC CCTGGACAAG CTGGACGTGT CCTACGAGGT GGGCGCGCGC
CACGAGTACA AGAACGCGAT GAACAGCGTC ACCGAGACCG GTTACACCGC CGCCCAGCGC
GAGGCCAGCG ACCGGATCGT CACCTCGCTG GGCGACCAGA TCGTCGAGGC GGTCTCCCTG
GCCCGGGGGC TGCCCCGGGA GGAGGTGCGC GCGCTGGTCT CCAAGGGCCC CTTCCTGGCC
CGCGAGGCGG TCGAGCACAA GCTGGTGGAC GGGCTCGCCC ACCGGGACGA GGTGTACGCG
CAGCTGTTCG GGGAGCTGAG CGGTGAGCCT CAGCTGCAGT TCGTCACCCG CTACCACCGC
AAGCACACCG CGCCCCAGCA GCTGTCCCGC AACACCGGGG GCCACATCGC GCTGATCTCG
GCCACCGGAA CGATCAGCCT GGGCCGGACC CGGCGCTCGC CCCTGGGCGG CGGCACCGTC
ATGGGCTCGG ACACCGTGGC GGCGGCCTTC CGCGCCGCGC GCAAGGACCC CCAGGTCAAG
GCGGTCGTGT TCCGGGTGGA CAGCCGGGGA GGCTCCCCGA CCGCCTCCGA CGCGATCCGC
CGCGAGACCG AGCTGACCAG CAAGGCGGGC ATCCCCGTGG TGGCCGTGAT GGGCGACGTC
GCCGCCTCCG GCGGCTACTA CGTGACCCTG GGCTCGGACG CGGTCGTCGC TCAGCCGGGC
ACCCTGACCG GCTCCATCGG CGTGATCACC GGCAAACCGG TCCTGGGCGC GCTGAAGGAG
CAGTACGGCG TGACCAGCGA CTCCGTGCGC ACCGGCGAGC ACGCGGGCAT GTTCGACACC
GACCGGCCCT TCACCGAGTC CGAGTGGGAG CGGGTCAACG CGCTCCTGGA CGAGATCTAC
GAGGACTTCA CCGGCAAGGT CGCCGCCGCG CGCGGGATGA CCCGCGAGCA GGTGCACGAG
GTGGCCCGGG GCCGGGTGTG GACCGGCCGC GACGCCCACG AGCGCGGTCT GGTGGACGAG
CTGGGCGGCC TGGAGACCGC CGTCCGGCTG GCCCGTGAGA AGGCCGGCGC GGGGCCGCTC
CCGGTGCGGC CCTTCCCCCG CCCGAACCCG CTCGACCGGA TCCGCCAACA CGAGTCCAGC
GAGGACGTGG GTGCCTCGGG CCCGCAGACC GTGGTCAGCG CCTGGGGTCC CCTGGAGCAC
GTGGCCGTGG CGATGGGGCT GCCGGTCGGC GGCCCGCTGA TGATGCCGGG GCTGTGGGAG
ATCCGTTGA
 
Protein sequence
MVDLAKIMKT RQRPQGPLVL ELDLTEGVAD QAPGDPFSQI MNRRRQQYLD VLEGIRRGAR 
DPRVAALLVR VDARSLGFAK VQELRDTVAD FRAAGKPAVA WADSFGETGE GNLPYYLACA
FSRVVMAPTG VLGLTGLMMR TTFVKGALDK LDVSYEVGAR HEYKNAMNSV TETGYTAAQR
EASDRIVTSL GDQIVEAVSL ARGLPREEVR ALVSKGPFLA REAVEHKLVD GLAHRDEVYA
QLFGELSGEP QLQFVTRYHR KHTAPQQLSR NTGGHIALIS ATGTISLGRT RRSPLGGGTV
MGSDTVAAAF RAARKDPQVK AVVFRVDSRG GSPTASDAIR RETELTSKAG IPVVAVMGDV
AASGGYYVTL GSDAVVAQPG TLTGSIGVIT GKPVLGALKE QYGVTSDSVR TGEHAGMFDT
DRPFTESEWE RVNALLDEIY EDFTGKVAAA RGMTREQVHE VARGRVWTGR DAHERGLVDE
LGGLETAVRL AREKAGAGPL PVRPFPRPNP LDRIRQHESS EDVGASGPQT VVSAWGPLEH
VAVAMGLPVG GPLMMPGLWE IR