Gene Ndas_4209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4209 
Symbol 
ID9248083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5026408 
End bp5027577 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content77% 
IMG OID 
Productpeptidase S8 and S53 subtilisin kexin sedolisin 
Protein accessionYP_003682107 
Protein GI297563133 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGGGG GCAACAGCCG CAACAGGATA CACACACGCC GCGCCCGTGG CGGGGCCAGC 
GCGGTCTTCG CGGCCGCCAC CGCCGCCGTC CTCGGTCTGA CCTCGGCCCC GGCCGCGGCG
GACCTGATCC CCGACTACCG GCCCGAGCAG TGGGGACTCC AGGCCGTGGG GGCGCCGCAG
CTGTGGGAGG AGAACCAGGG TCAGGGCGCC ACGGTCGCGC TTCCGGGCGT CTCCGTCGAC
GAGGGGCACC CGGACCTGGT CGACAACGTC CAGCTGGACA CGCGGTTCGG TGAGAACGAC
GGCGACATCG AGCAGGGCAA CGCCGCGGCC GGACTGGTCG CGGCCCACGG GTACGGCAGG
GACGCCGACG GCGGCGTCCT GGGCGTGGCG CCCGAGGCCA CCGTGCTCGT GCTGCCCACG
GGGGACCAGC TCGCCGAGGC CGTGCGGTTC GCCTCCCAGG AGGGCGCCCA GGTCATCCTG
CTGCCCGAGC CCGCCGGGCC CGGCCTCGCC GAGGCCACCC AGGAGGCCTC CTCCAACGGC
GCGCTGGTCG TCGGCCCGGC GGGGGAGGAC GAGGACCCCA ACGTGCTCAC CGTCGCGGGG
ACCGACCAGG ACGGAGCCCT CATCCAGGGC GCGCCCGGGG CCGGGATGAT CGCGCTGACC
GCCCCCGGGG CCGACCTGGT CACGGCGGGA CCGGAACCGG GCCAGGCCGA GGTGACCGGG
GCTCCCTACG CCGCCGCGAT GGTCGCCGGG GCCGCCGCCC TGATGCGCGC GGAGCACCCG
CAGCTGCGGC CCGACCAGAT CCGCGACGCC CTGGTGGACG GCTCCCAGCC CGGCCCCGAC
GGCCTGCCCG CGCTGCACCT GCCCAGCGCG GAGCAGCAGG CGTCCGGCGT CGCCCAGGAC
ATCCCGCTCA TCGACGAGGA CCTGGCCGGG CGGGACGACC AGTCGGGGCT GGTGCCCGCG
TGGGCGTGGT TCGTCACCGT CGGCGCCGTG GTGGTCCTGG GAGTGCTCAT CCTGGTCGTG
TGGGTGCGCC GCTCCACCGC CGACCCCTAC GGCGTGAAGG CCGAGCGCCG CGAGCAGGAC
GAGGAGATCG CCGCCGAGCG CGCCGCCGAG GCCGCGCCCG CCAACCGCCG CCGCAAGGGC
GGACGCCGCC GCAAGACGCG CGGTAACTGA
 
Protein sequence
MLGGNSRNRI HTRRARGGAS AVFAAATAAV LGLTSAPAAA DLIPDYRPEQ WGLQAVGAPQ 
LWEENQGQGA TVALPGVSVD EGHPDLVDNV QLDTRFGEND GDIEQGNAAA GLVAAHGYGR
DADGGVLGVA PEATVLVLPT GDQLAEAVRF ASQEGAQVIL LPEPAGPGLA EATQEASSNG
ALVVGPAGED EDPNVLTVAG TDQDGALIQG APGAGMIALT APGADLVTAG PEPGQAEVTG
APYAAAMVAG AAALMRAEHP QLRPDQIRDA LVDGSQPGPD GLPALHLPSA EQQASGVAQD
IPLIDEDLAG RDDQSGLVPA WAWFVTVGAV VVLGVLILVV WVRRSTADPY GVKAERREQD
EEIAAERAAE AAPANRRRKG GRRRKTRGN