Gene Ndas_0934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0934 
Symbol 
ID9244779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1148831 
End bp1150273 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content70% 
IMG OID 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_003678884 
Protein GI297559910 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0202033 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGAA AACGCTCCGC CCCCCGCCCT CCGTCCGCCG CCATGCACGC CGTGCGCATG 
TCCCCGGGGC AGTGGGTCGC CTGCGTGGTC ACCGCCCTGG CCATCATCGC CAGCCTCGGC
GGCTACGGCT GGTACCAGGG CATCGTCGGC AACATCACCA CCGCGCAGGT GGACACCGAC
GCCTGGGACC GGCCCAACAG CGTCGAGGGC GTGATGAACC TGCTCATCAT CGGCTCCGAC
GTCCGGTCGG GGGACAACGC CAACTACGGC GAGGCCGAGG GCGAGCGCCC CGACACCATG
CTCATCGCCA GCATCAACGT GGACAACGGC GCGGCCACGC TGGTCAACCT GCCCCGCGAC
CTGGTCGTGG ACCTGCCCGG CTGCGAGGCC GTGGAGGGCT ACGAGGGGAT GAGCCCGCAC
AGCGGCATGA TCAACTCGGC GATGACCTTC GGCGGGGTGG GCTGCCAGTG GCAGACCGTC
GAGGAGGTCA CCGACGTGCA CCTGGACCAC TTCGTCATGA TGGACTTCAC CGGGTTCAAG
GACATGGTGG ACGCCATCGG CGGCGTGGAG ATGTGCATCC CCGCGCCGGT GGACGACCCC
AAGGCGCACC TGACGCTGGA CGCAGGGACG CAGACCCTCA GCGGTGAGGA GTCGCTGGGC
TACGTGCGCT CCCGCTACGG CCAGGGCGAC GGCAGCGACC TGTCGCGGAT CGACCGCCAG
CAGGAGTTCA TGGGCGCCAT GCTGCGCCAG GTGCTCAGCA GCGAGGTCAT GACCAGCCCG
GTGACCATCA CCAACTTCCT CAGCGCCGTC ACCGACTCGG TGACCACCGA CGAGGAGCTG
ACCGTGGAGA CGATGACCGA CATCGCCATC TCCATGCGCG AGGTGGACCT GGAGCGCATC
CAGTTCGTCA CCGTGCCCAA CGGCCAGCAC CCCGCCGACG CCAACCGGCT GGCCATGAGC
CAGCCCGCCG CCTCCGAGCT GTTCGCGGCG ATCAACTCCG GCGCCTACCT GGAGGACGAG
GAGCCGGAGG ACGAGGGGGA GGAGTCGGAG GAGGGCTCCG GCGACGCGGC CCCCGCCCCC
GCCGACGTCT CCGTGCAGGT CCTCAACAAC ACCGGTGTCA CCGGCCTGGC GAACGAGGTC
CAGGGCGTCC TGCTGGGGGA GGGGTACGAC GTCACCGGTA TCGGCGAACC CGCGGTGCGC
TTCCCCGAGC TGACCACCGT CTACTACGCT CCGGGTGAGG CGGCCGCCGC CGAACTGCTG
GCGGGTTCGC TGGAGAACGC GGTCACCGAG GAGGTCGCCG ACCTCCCGCA GACGCTGGAA
CTGGTCATCG GCCAGGACTG GAACGGCTTC GCGGGCGGAG GCGGCTCGTC CGGGCCCGAG
GTCTCCATCA CCGAGGACCT GGGCGGCACC ACCGCGGCGG GGGCTCGGGA GAGCGCCTGC
TGA
 
Protein sequence
MAGKRSAPRP PSAAMHAVRM SPGQWVACVV TALAIIASLG GYGWYQGIVG NITTAQVDTD 
AWDRPNSVEG VMNLLIIGSD VRSGDNANYG EAEGERPDTM LIASINVDNG AATLVNLPRD
LVVDLPGCEA VEGYEGMSPH SGMINSAMTF GGVGCQWQTV EEVTDVHLDH FVMMDFTGFK
DMVDAIGGVE MCIPAPVDDP KAHLTLDAGT QTLSGEESLG YVRSRYGQGD GSDLSRIDRQ
QEFMGAMLRQ VLSSEVMTSP VTITNFLSAV TDSVTTDEEL TVETMTDIAI SMREVDLERI
QFVTVPNGQH PADANRLAMS QPAASELFAA INSGAYLEDE EPEDEGEESE EGSGDAAPAP
ADVSVQVLNN TGVTGLANEV QGVLLGEGYD VTGIGEPAVR FPELTTVYYA PGEAAAAELL
AGSLENAVTE EVADLPQTLE LVIGQDWNGF AGGGGSSGPE VSITEDLGGT TAAGARESAC