Gene Ndas_3361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3361 
Symbol 
ID9247226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4015754 
End bp4016809 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content70% 
IMG OID 
Productpeptidase M48 Ste24p 
Protein accessionYP_003681272 
Protein GI297562298 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAACA CACCAGACCG CGTCCGCGTC CGGCTCTCCG AGATCTCGCC CCGCGCCTAC 
GAACACCCCG CAGACCGTGG CGCCCTGGTG GCGCTGCGCT CGCTGCGCGG GTTCGACGAG
GTGTTCAAAC GCCTGTCGGG GCTGTTCAAC GAGCGCGCCC TGCGACTGAT GTTCCTGTCC
AGTGCCGTCC GGGTCGGTCC CACCCAGTTC CCTCACGTGT ACGACTACGT GCGCGACGCC
GCCTACGTCC TGGACATGGA CGAGGTCCCC GAGCTCTACA TCCAGATGAA CCCGAAGCCG
AACGCCATGG CGATCGGCAG CAACCGCCCG TTCATCGTGA TGACCACGGG CCTGTTCGAC
CTGCTGGACG CCGAGGAGCA GCGCTTCGTC ATCGGCCACG AGGTCGGCCA CATCCTCAGC
GGCCACGCCG TCTACCGCAC CATGCTGCTG GCCCTGATCC GGCTGGCCAC CCGCGTGGCG
TGGGTGCCGC TGGGCTTCAT CGGCATCCAG GCCATCGTGG CCGCGCTGGA GGAGTGGTAC
CGCAAGTCCG AGCTGTCCTG CGACCGCGCC GGGATCCTCG CCAGCCAGAA CCCCGAGGCG
GGCAAGCGCG CCCTGATGAA GCTGGCGGGC GGCTCCAAGC TCGTGGAGAT GAACCCCGAC
GCCTTCCTTG AGCAGGCCCG CGAGTACGAG TCCGGCGGCG ACGCCCGCGA CAGCCTCATC
AAGCTGGTCA GCCTGGTCGG ACAGACGCAC CCGTTCGCGG TGGTGCGCCT GGCCGAGCTG
CACCGGTGGA TCGAGGACGG TTCCTACCAG AGCATCGTCA ACGGCGACTA CCCGCGCCGC
TCCACCGACC GCGACGCCCG GGTGGGCGAG GAGGCCCGCA AGGCCGCCGA CTCCTACAAG
CAGTCCTGGG AGCGCACGGG CGACCCGCTG CTGGGCACCC TGCGCGACGT CGCGGGCAGC
GCGGCCAGCG CGGGCGGCAA GATCTTCGAC ACCGTCGCCG ACCGGTGGCG CAACGGCTCC
TCCCGCACCA ACCAGGGCTC GGGAACCTCT TCCTGA
 
Protein sequence
MANTPDRVRV RLSEISPRAY EHPADRGALV ALRSLRGFDE VFKRLSGLFN ERALRLMFLS 
SAVRVGPTQF PHVYDYVRDA AYVLDMDEVP ELYIQMNPKP NAMAIGSNRP FIVMTTGLFD
LLDAEEQRFV IGHEVGHILS GHAVYRTMLL ALIRLATRVA WVPLGFIGIQ AIVAALEEWY
RKSELSCDRA GILASQNPEA GKRALMKLAG GSKLVEMNPD AFLEQAREYE SGGDARDSLI
KLVSLVGQTH PFAVVRLAEL HRWIEDGSYQ SIVNGDYPRR STDRDARVGE EARKAADSYK
QSWERTGDPL LGTLRDVAGS AASAGGKIFD TVADRWRNGS SRTNQGSGTS S