Gene Ndas_1679 details


Gene Information

Locus tag: Ndas_1679
Symbol:
ID: 9245529
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2050275
End bp: 2051843
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003679614
Protein GI: 297560640
COG category:
COG ID:
TIGRFAM ID:
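
The coordinates and lengths listed above are mutually consistent, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the Gene Information table.
length = gene_length_bp(2050275, 2051843)
print(length)                     # 1569
print(protein_length_aa(length))  # 522
```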


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.789815
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCGCC TGCTCGCGGC TACCGCACTG GAGTCGCGGG TCGGGCTGCG CTACGGCGTC 
GTACCCGTGG CCGCCGCCCT CGGCGCCGTG TGGACCCTGG TCCTGCTCGC GGTCCCCGCC
GAGGCGGCCG GGACGGTGGC GTCCTACCTG CTCTTCCTCG ACACGGCGGG CTTCGGCGCG
CTCTTCGCCG CGGCCCTGCT GCTCTTCGAA CGCACCGAGG GAACGCGTTC GGCGCTGACG
GTGACCCCGC TGCGCGCGGG GGAGGGGGTG GCGGCCCGGC TCGCGGTGCT CACCGCGCTG
ACGCTGCTCA TCGCCGTGCC GATGCTCGCG GCGGCGCTGC GAGGCCGGTT CGCCGACCTC
GCGCAGGCCC TGCCGCCGGT CCTGGGCGGG GTGGCGCTCA CCTGTCTGCT GCTGCTCACT
GTGTGCCTGG CGGTGGGCGC CCGCTCCCGG GACCTGTCGG GCTTCCTCCT GGCGGCCCCG
CTCACCGTCG CGCCGCTGGT CCTGGTCCCC CTCGTACACG TCAGCGGAAT CCTGGAGCAC
CCGCTCCTGT ACGCGGTGCC GACCACCGCG GGCGCCGACC TCATCCGCCT GGGGGCCGCG
CCCGGCTCCC CCGACGCGGC CCCAGCGGCC CTGGTCGCGG GCACCGTCTA CGCCGTGGCG
TGGGCGGCGA CCGGGGTGGT CGCCGCCTCC CGCGCGGTCG GGCGCGGAGC GGCTCCCGCT
CCCCGGAGGC ACGCGAACGG CAGGGGCGCG GGCGGGCCCG CCCGGCCGGT CACCGGGCCT
GTCCACCCGC GCCCCGGGCC CGCCCGGCCG GTCAGCGGGC GCGGCGGCCT CCCGGTGATC
GTGCGCTTCG CCCGCGTCGA CCTCTTCGGC ACCGGGCGCG ACCCCCTGCT GCCGCTCATG
CTCGGCGCCC CGGTCCTGCT GGCGCTGGTC ATCCGCTTCG CCTTCCCGGC GGCCTCGGAG
TTCGTCCTCG GCTCCTACGG GTTCGACCTC GCGCCGCACA CCCCCGTGGT CCTGGCGGCG
CTGGTCCTGC TGCACGTGCC CATGATGTTC GGGGTGGTCG GCGGCCTGCG CGCCGTCGAG
GACTCCGACG AGAACGTCCT GCTGGTGCTG CGCGCCTCGC CGGTGTCCGT GCCCGCCTAC
CTCGGCTACC GGACGGTCCT GGTCACCGTC CTGTCCCTGG CCGGGCTCGC CGCGGCCCTG
CCCCTGAGCG GGCTCATGAT CTCAGGGTGG ACCGCGCCGG TCGCGGTCGC CCTCGTGCTC
GCTGCTCTCC AGGCGCCGCT GCTGACGGCG TCGATGACCG CGCTGTCCGC CAACAAGGTG
GAGGCCCTGG TCGTGGTCAA GGGGATCGGC GCGCTCCTGG CCCTGACCCC CGTGGCGGCC
TGGGTCCTAC CCGCGCCCTG GAACCTCCTG CTGCTGCCGC TCCCGCCGTC CTGGCCCGCC
CTGGCCCTGC CCGGTTACGA CGCCGGACCG CTGGGGCCCT GGCTGTGCCT GGCGGGCGGG
GTCCTGGTCT CGGCCGCCGC GCTGGCGCTC CTGCTGCGGC GCACCGTGCG GCGGATCGAG
GGCGCGTAG
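
A GC content figure such as the one reported for this gene can be recomputed from a nucleotide sequence laid out in 10-base blocks like the one above. The sketch below is one plausible way to do it, not necessarily how the database derived its value; `gc_percent` is a hypothetical helper name, and the sketch assumes the sequence contains only A, C, G and T plus whitespace.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: paste the full gene sequence in place of this short excerpt.
excerpt = "GTGAACCGCC TGCTCGCGGC"
print(round(gc_percent(excerpt), 1))
```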
 
Protein sequence
MNRLLAATAL ESRVGLRYGV VPVAAALGAV WTLVLLAVPA EAAGTVASYL LFLDTAGFGA 
LFAAALLLFE RTEGTRSALT VTPLRAGEGV AARLAVLTAL TLLIAVPMLA AALRGRFADL
AQALPPVLGG VALTCLLLLT VCLAVGARSR DLSGFLLAAP LTVAPLVLVP LVHVSGILEH
PLLYAVPTTA GADLIRLGAA PGSPDAAPAA LVAGTVYAVA WAATGVVAAS RAVGRGAAPA
PRRHANGRGA GGPARPVTGP VHPRPGPARP VSGRGGLPVI VRFARVDLFG TGRDPLLPLM
LGAPVLLALV IRFAFPAASE FVLGSYGFDL APHTPVVLAA LVLLHVPMMF GVVGGLRAVE
DSDENVLLVL RASPVSVPAY LGYRTVLVTV LSLAGLAAAL PLSGLMISGW TAPVAVALVL
AALQAPLLTA SMTALSANKV EALVVVKGIG ALLALTPVAA WVLPAPWNLL LLPLPPSWPA
LALPGYDAGP LGPWLCLAGG VLVSAAALAL LLRRTVRRIE GA