Gene Ndas_3582 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3582 
Symbol 
ID9247451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4295072 
End bp4296406 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content73% 
IMG OID 
Productpeptidase M16 domain protein 
Protein accessionYP_003681489 
Protein GI297562515 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCTG TCCCCATCGC CGCCGAGCAG GACCCGGACA CGACCGTGAC GCTGCTGGAG 
CCGGACGGCG GTACCGGACT GGTGCGCCGC ACGGTGCTCC CCGGCGGCCT GCGCGTGGTC
ACCGAGGCCG TGCCGGGCGT GCGCTCCGCC GCGTTCGGGA TCTCGGCGAC CACGGGTTCC
CGCGACGAGG ACTCCGCGCA CGCGGGCTCG GCGCACTTCC TGGAGCACCT GCTGTTCAAG
GGGACCAAGG AGCGCTCGGC GCTGGAGATC TCCGCGCTGC TGGACGGTGT GGGCGCCGAC
CACAACGCCT ACACCACCAA GGAGCACACC TGCTACTACG CGAAGGTGCT CGACCGCGAC
CTGCCGCTGG CCGTCGACGT CATCGGTGAC ATGGTGGCCA ATTCGGTGCT CGACGAGGGC
GAGGTGGAGA CCGAGCGGGG CGTGATCCTG GAGGAGATCG CCATGTACGA GGACGAGCCC
GCCGACCTGG TGGACGACGT CTTCGCGGCG CACTTCTTCG GCGACTCGCC GCTGGGCCGA
CCGATCCTGG GCACCACCGA CACCATCGAG GCGCTCTCCC GCGACCGCAT CGCCGAGCAG
TACCGCGACG CCTACGTGCC CGGCGAGCTG ATCGTGACCG CGGCGGGCAG CCTGGACCAC
GACCGGGTGG TGGAGCAGGT CCGCGCGCTG TTCGCCGAGC ACTCGGCCGC CGCCGGGGAC
GCCCGCCCCG CGCGTCCCCG CATCGGCGGC TCGCCGGTCG CCACCTACGG CGGCACGGTG
GTGCAGTCGC GCGAGACCGA GCAGGCGCAC ATCATCCTGG GGTCGGAGGG GCTCTGCCGC
ACCGACCCGC GGTGGCACGC GCTGCGGCTG CTCAGCGCAG CCCTGGGCGG CGGGATGTCC
TCGCGCCTGT TCCAGGAGGT GCGCGAGAAG CGCGGCCTGG CCTACGCGGT GCACGCCTAC
AACGCCGACT ACGCCGACAC CGGCAGCTTC CAGATCTACG CGGGCTGCCT GCCGGACAAG
GCCGACGAGG TCATCGGGGT GTGCCGCGAG GAACTGGCGA AGGTGGCCGC CTCGGGCATC
ACCGAGGAGG AGCTGGCCCG GGCCAAGGGC CAGATCCAGG GGTCGCTGGT GCTGGGCAGC
GAGGGCACCA ACGCGAGGAT GGGGCGGCTG CTCTCGCACG AGCTGAACAG GCCCGGGCAC
TACTCGATCG ACGAGAGCCT GGCGCTGTTC GACGCGGTGA CCGGCGCGGA GGTGGCCGAG
GTGGCCGCCG ACCTGCTGTC GCGGCCGCGC GCGCTGGCCG TGATCGGCCC CTACGCGGCC
GACCGGGTCT TCTGA
 
Protein sequence
MSSVPIAAEQ DPDTTVTLLE PDGGTGLVRR TVLPGGLRVV TEAVPGVRSA AFGISATTGS 
RDEDSAHAGS AHFLEHLLFK GTKERSALEI SALLDGVGAD HNAYTTKEHT CYYAKVLDRD
LPLAVDVIGD MVANSVLDEG EVETERGVIL EEIAMYEDEP ADLVDDVFAA HFFGDSPLGR
PILGTTDTIE ALSRDRIAEQ YRDAYVPGEL IVTAAGSLDH DRVVEQVRAL FAEHSAAAGD
ARPARPRIGG SPVATYGGTV VQSRETEQAH IILGSEGLCR TDPRWHALRL LSAALGGGMS
SRLFQEVREK RGLAYAVHAY NADYADTGSF QIYAGCLPDK ADEVIGVCRE ELAKVAASGI
TEEELARAKG QIQGSLVLGS EGTNARMGRL LSHELNRPGH YSIDESLALF DAVTGAEVAE
VAADLLSRPR ALAVIGPYAA DRVF