Gene Ndas_0935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0935 
Symbol 
ID9244780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1150469 
End bp1151866 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content71% 
IMG OID 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_003678885 
Protein GI297559911 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0761113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTCG GCCAGTGGGT GGCGTGCGGG GCGACAGGCC TGCTCATCGC GGCCAGTCTC 
ACCGTGTACG CGGGCTACCG CGACGTCCTC AGCATCGCCA CCGAGGAGGT CAACACCGAC
GCCTGGGGTG ACCGTCCGGC CCAGGCGGAG GGCATCCACA ACATCCTGCT GCTGGCCACC
GACCAGCGCG CGGGGGACAA CGCCGAGTAC AGCGTGGTCA ACGGCGTGCG TCCCGACGTG
CTGGTCGTGG TGAGCATCAA CGTGGACGAG GGCGGGGTGA CGATGGTCAA CATGCCCCGC
GACCTGATGG TCCCGATGCC CGACTGCCCG GCCAACGGCG AGAACCCCGG GGTGACGGCG
GGCACGGTGG ACCAGCTCAA CCACGCCATG ACCTACGGCG GGATGGACTG CCAGGGCAAC
ACGGTGGAGA CGGTCACCGA CATCCACCTC GACCACATGG TGATGGTCGA CTTCGCGGGC
TTCCAGGAGA TCGTGGACTC CATCGGCGGC GTGGAGATGT GCGTCCCCCA GCCGATCGAC
GACCCCAAGG CCCACATCAC GCTCGACGCC GGGATGCAGA CCCTCAACGG CGAGGAGGCG
CTGGGCCTGG CCCGCTCCCG GGCCAGCACG GAGCAGGGCA GCGACCTGAA CCGGATCGAG
AACCAGCAGC GGATGATGGG CGCCATCCTG CGCAAGGTCA CCAGCGGCGA GATCATGTCC
AGCCCCGCCA CGCTCTACGA CTTCATGGGC TCGGTCACCG ACAGCCTGGT GACCGACGAC
GGGTTCACCG TGGACCAGAT GACCGAGCTG GCCATCTCGA TGCGCGAGGT CGACCTGGGG
CGGATGCGGA TGGTCACCGC CCCCGTAGTG GACTCTCCCG TCCACAGCGG GAAGCTGGAC
CTCCAGCAGC CCGCCGCCGA TGAGCTGTTC TCCGCGGTCG CCTCCGGTGA CGCCCTGCCC
GAGGAGGAGG GCGGCGACGG AGGCGGCGGC GAGGGCTCGG AGGAGGCCGA GGAGCCCGCC
GTCGAACCCG CGGACGTGTC CGTGCGGGTG CTCAACGGCA CGGGCATCAC CGGTCTGGCG
AGCCAGGTCG GGACGCTGCT CACCGAGCAG GGGTTCAACG TCACCGGTGA GGGCGACCCG
GTCGAGCGGA CCCCCGCCGT CACCACGATC TACCACGGCC CCGACCAGCT CGCGCAGGCC
GAGGAGCTGG CCTCGGCGCT CAGCGTGGCC CAGCTGGAGG AGGTCCCCGA CTTCGGTCCA
GAGCTGGAGC TGGTGATGGG CGCGCAGGAC TGGGACGGCC TGGCCACGAG CGGAGGGGGC
TCCGGCGGCG GCGGGGGCGA TGCCCTGGCA GGTCTGGGCG CCACCAGCGC CGCCGAGGAC
GAGGTCAGCT GCGAGTAG
 
Protein sequence
MSLGQWVACG ATGLLIAASL TVYAGYRDVL SIATEEVNTD AWGDRPAQAE GIHNILLLAT 
DQRAGDNAEY SVVNGVRPDV LVVVSINVDE GGVTMVNMPR DLMVPMPDCP ANGENPGVTA
GTVDQLNHAM TYGGMDCQGN TVETVTDIHL DHMVMVDFAG FQEIVDSIGG VEMCVPQPID
DPKAHITLDA GMQTLNGEEA LGLARSRAST EQGSDLNRIE NQQRMMGAIL RKVTSGEIMS
SPATLYDFMG SVTDSLVTDD GFTVDQMTEL AISMREVDLG RMRMVTAPVV DSPVHSGKLD
LQQPAADELF SAVASGDALP EEEGGDGGGG EGSEEAEEPA VEPADVSVRV LNGTGITGLA
SQVGTLLTEQ GFNVTGEGDP VERTPAVTTI YHGPDQLAQA EELASALSVA QLEEVPDFGP
ELELVMGAQD WDGLATSGGG SGGGGGDALA GLGATSAAED EVSCE