Gene Ndas_4617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4617 
Symbol 
ID9248498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5483862 
End bp5485187 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content75% 
IMG OID 
Productsecretion protein snm4 
Protein accessionYP_003682509 
Protein GI297563535 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.245929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.704968 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTT GGAGCCGCGT GACCCTCGTC GGCGAGGAGC GCAGGGTCGA CGCCGTCCTG 
CCCGCGAGCG AACCCGTGGG CGCGCTCATG CCCGAGGTGC TCGACCTCCT CGGCGACCAG
GTGGAGAACC CCGCCAAACT GCGGCACCTG GTGACCGCCT CCGGGATCGT CCTGGAGGGC
GACACCACGC TCGCGGAACG GCAGATCACC GACGGCGCGG TGCTGCGGCT GGTCCGGGCG
GAGGAACCGG TGCCCGCCCC GGTGGTGCAC GAGGTCCCCG AGGCGGTCTC GATGGCCCTG
GACGACCACC AGGGGCGCTG GAACCCGGTG GCGGCCCGCT GGACCGCCAC CGTCTCCCTC
GTCGCCCTCG CCGTGGGCGC GGCCTGGATC GTGCAGGGGT ACTTCTCCGG CAACGGCGGC
CTCGTCGGCC TGGCCGTGGT CGCCGCCGTG CTGGTTGCGG TGGGCGCGTC GATCGGCCCG
ACCTGGCGGG AGCCCCTGGG CACGGCCCTG GCGATCAGCG GCACCGCGGT CGGCGGACTG
GTCCTGTGGC TGGCCTGCGA CCAGCTCGGC TGGCCCGAGT GGGCCCGCTG GGGCGGCGGC
GCCGCCCTGG TCGCGGGACT GGTCCTGCTG CTCGGGCTCA CGTCGGGACT GGGCCGGGGC
GGCCTGACCG GTGGCGGGGT CGGACTCGCC CTGGCGGTCG TGTGGTCGGT GGGCGCGGCG
CTGGGCCTGC CCACCTACCA GATCGCCGTC ATCATGGCGG TGGCCTGCGT GGTGCTGCTG
AGCCTGCTGC TCCGCCTGGC GCTGATGTTC TCCGGGCTGG CGGTCCTGGA CGACCGGCGC
AGCTCCGGGG AGGCGGTGAC CCGGAGCGAC GTGCTCACCT CGGTGGCGGG AGCCCACCGG
AGCCTGGTGA TCGCCACGAT CGCGGTGGCG GTGTCCGCGG CCACGGCGGG GATCGGCCTG
GCCACCCACT TCGACTGGTG GACGGCGGGG CTGTCGGTGG TGCTGGCCCT GGTGGTGGCC
AGCCGCGCCC GGCTGTTCCC GCTGGTCGCG CAGAAGTCGG TGCTCATCGC CGCGAGCCTC
GTGGTGCTGG TGGCCTTCCT CCTCTCCTGG GCCGAGGCCG TGCCCTGGGG GGTGTGGCCC
GCGCTGGGGA TCGCGGTCGC GGTGTCGGCC GTCCCCGCGG TCGTGCTGTC GATCGAGCAG
CCCGAGCACG TGCGGGCGCG GCTGCGCGGT GTCACCAGCA GGTTCGAGGC GGTCGCCGTC
GTCGTGCTGG TGCCCCTGGC GATCGGCGCG TTCGGCACCT TCCAGCGCCT CCTCACGACC
TTCTGA
 
Protein sequence
MTAWSRVTLV GEERRVDAVL PASEPVGALM PEVLDLLGDQ VENPAKLRHL VTASGIVLEG 
DTTLAERQIT DGAVLRLVRA EEPVPAPVVH EVPEAVSMAL DDHQGRWNPV AARWTATVSL
VALAVGAAWI VQGYFSGNGG LVGLAVVAAV LVAVGASIGP TWREPLGTAL AISGTAVGGL
VLWLACDQLG WPEWARWGGG AALVAGLVLL LGLTSGLGRG GLTGGGVGLA LAVVWSVGAA
LGLPTYQIAV IMAVACVVLL SLLLRLALMF SGLAVLDDRR SSGEAVTRSD VLTSVAGAHR
SLVIATIAVA VSAATAGIGL ATHFDWWTAG LSVVLALVVA SRARLFPLVA QKSVLIAASL
VVLVAFLLSW AEAVPWGVWP ALGIAVAVSA VPAVVLSIEQ PEHVRARLRG VTSRFEAVAV
VVLVPLAIGA FGTFQRLLTT F