Gene Ndas_3888 details


Gene Information

Locus tag: Ndas_3888
Symbol:
ID: 9247759
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4659989
End bp: 4661509
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_003681791
Protein GI: 297562817
COG category:
COG ID:
TIGRFAM ID:
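
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information table.
start_bp = 4659989
end_bp = 4661509


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1521 506
```

Both results agree with the listed Gene Length (1521 bp) and Protein Length (506 aa).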


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCTTG ACCCCGGACC GGGCGAGCAG CCCGGCGCCC CCCGATCCGA CCAGGCCTCC 
GCGCACCCCG TGGGCGGAGG GTTCCCCGGT TCCGCCAACC CCTCCGGTCC CGCCGTCTAC
GGCGCTTCCG CCGCCAACCC CGGCTCCGGT GATCCCTCGG GCTCCACCGC CCACACCAGC
TTCGCCAACC CCTCCGGTCC CGGCCACCCG GGTATCGGCG GTCCCTCCGG TCCTGCCAAT
CCCATGAACC CCGGCGCTCC TTCGGCTCCC GGGCAGCACA CCGGCCACAC CGGTCAGTCT
GCCGCCCACG GCGGCTACGC GACGGCCTTC CAGGCGGGGC ACGCCGCCCC CGGCCCGGGC
GCTCCGCCCC CCGGTGTCCC CGGACCCCAC ACGACCGGCC CCCACAGCGC CCCGCAGCCC
CCCGCGCCCA AGCGGCCCCG GCGCGGCGTC CCGGTGTGGA TGGCGCTGTC GGGCATGCTC
GTCGTCGCGC TCATCGCCGG AGGCGCCGGC GGTGTCGCGG GCAACCTCCT CGACGGCTCC
TCCACTGACG AGGCCGCGCA GGAGGAGGGT CCGGTCATGA ACGAGCCGCC GCCCGAGGCC
CCCCGGCGCG ACCCCGACAC CATCGCCGGT GTGGCCCAGC GGGTCAGCCC GAGCGTGGTG
TTCATCCACA GCGCCGATCC CACCATCCCG AGCAGCGGCT CCGGGTTCGT CATCGACGGG
AACTACGTGG TGACCAACGA CCACGTCTCC GCCGGTCTGG AGGCGGACGG CATCGTCGTG
GAGTACAGCG ACGGCAGCCT CTCCAGCGCC TCCGTGGTCG GCTCCGACCC CAGCTCCGAC
CTCGCGGTGC TGTCGCTGGA CGACCCGATC GACGTCGAGC CGCTCCAGTT CGGCGACTCC
GAGCAGGTCA TCGTCGGTGA CGAGGTGATC GCCATCGGCG CGCCCCTGGG CCTGTCCGGA
ACCGTCACGC AGGGCATCAT CAGCGCCGTC AACCGCCCGG TCAGCTCCGG CGAGGGCGAG
AACGCCAGCC GCTTCTACGC GCTCCAGACC GACGCCGCCA TCAACCCGGG CAACTCCGGC
GGCCCGCTGG TGGACCTGGA GGGCCGGGTC ATCGGGGTCA ACTCGATGAT CGTCACCATG
AGCTCCATGG GGGAGCCCAC GGGCAACATC GGCCTGGGCT TCGCGATCCC GTCGGTGGAG
GCCGAACGCG TCGTGAACCG CCTCGTCGAG TACGGCGAGA CCAGCTACGC CGACATCGGG
GCCGAGATCG ACCTGGACAG TCCGGTCGCG GGCGCGGTCA TCGCCGACGG CGGGGGCGCG
GTGGAGAGCG GCGGCCCGGC CGACGAGGCG GGGCTGGAGC CGGGCGACGT CATCCTCTCC
CTGGACGGGC GCCCGGTGAA CTCGGGCCAG GAGCTGCTCG CCATGCTGCG CAGCCGCAGC
CCGGGCGAGG AGGTCGAGGT CGAGTTCGAC CGCGACGGCC GACGCGACAC CGTCACGGTC
ACGCTGGGCT CGTCGGACTG A
 
Protein sequence
MNLDPGPGEQ PGAPRSDQAS AHPVGGGFPG SANPSGPAVY GASAANPGSG DPSGSTAHTS 
FANPSGPGHP GIGGPSGPAN PMNPGAPSAP GQHTGHTGQS AAHGGYATAF QAGHAAPGPG
APPPGVPGPH TTGPHSAPQP PAPKRPRRGV PVWMALSGML VVALIAGGAG GVAGNLLDGS
STDEAAQEEG PVMNEPPPEA PRRDPDTIAG VAQRVSPSVV FIHSADPTIP SSGSGFVIDG
NYVVTNDHVS AGLEADGIVV EYSDGSLSSA SVVGSDPSSD LAVLSLDDPI DVEPLQFGDS
EQVIVGDEVI AIGAPLGLSG TVTQGIISAV NRPVSSGEGE NASRFYALQT DAAINPGNSG
GPLVDLEGRV IGVNSMIVTM SSMGEPTGNI GLGFAIPSVE AERVVNRLVE YGETSYADIG
AEIDLDSPVA GAVIADGGGA VESGGPADEA GLEPGDVILS LDGRPVNSGQ ELLAMLRSRS
PGEEVEVEFD RDGRRDTVTV TLGSSD
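
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first line of the gene sequence only. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for sense codons; the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino acids follow the standard
# assignments (assumed to apply to translation table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, as printed in the record.
first_line = "ATGAACCTTG ACCCCGGACC GGGCGAGCAG CCCGGCGCCC CCCGATCCGA CCAGGCCTCC"
print(translate(first_line))  # MNLDPGPGEQPGAPRSDQAS
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.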