Gene Ndas_3959 details


Gene Information

Locus tag: Ndas_3959
Symbol:
ID: 9247830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4734162
End bp: 4735553
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: cell envelope-related transcriptional attenuator
Protein accession: YP_003681862
Protein GI: 297562888
COG category:
COG ID:
TIGRFAM ID:
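
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly, but the numbers are consistent with that convention) and that the CDS ends in a stop codon. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4734162
END_BP = 4735553


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1392 463
```

Both results agree with the Gene Length and Protein Length fields.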


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACATTGC GCCGAACCCC GCTACCCTTT TCGCGATCAC ACAGGGTGGA AGTAAGCGAC 
GAGGCGATCC ACGGCGCTGA GCAGATGGGC GATGCGATGG CTGACCGACG GAACCGGGAC
GACCCCGGCC GCACCGGCGA GCACGACAGG CAGGGCGTGT TCCGGCGCGG CGGGACCCCG
GGCAGGCCGG ACGAGTTCGA GAAGCTCTAC CGCCCGCGCG AGTCCGAGCA GAGGGTGGTC
GGCGAGACGC GTCGGCTCCC GCCCGCCGAG GACGTGCCCC CGGTGCGGCC GAGGCGGTCC
CAGAGGCCTC AGCAGTCCCA GAGGTCCCGG CGCACCCACA GCGCCGGGCG CGTCTACGAC
GGCCGCGGCA TCGAGCGCAG TCGCAAGGAG CGCAAGCGCC GGCTGGCCAC CACCGTCACG
GTGACCGTGC TGGTGCTGGT GCTGGTCCTG CCCCTGGTCT TCGTCGGCGG GTTCTACGTG
TACGCGAACT CGCGGCTGGA GCGGGTGGAG GCCCTGCTGG ACTACGAGGG CCGCCCGGAC
GGGCAGCCCG GCACGACCTA CATGATCGTG GGCTCCGACA GCCGCCAGGG CCTGTCCGAG
GAGCAGATGG ACGAGATGGC CACCGGCTAC GCCGAGGGCC GCCGGACCGA CACCATCATG
GTGCTGTACA TCCCCGACGA GGGCGAGCCC ACCATCGTCA GCGTCCCCCG AGACTCCTAC
GTCCCCCTCG CCGTCCCCGG TTACGCCGAC AACAAGATCA ACACCGCCTT CGCCGACGCC
GTGTGCGGCA CGAACGACGC GGGCGAGGAG GTCTGCGGCG GCCCCGCCCC CCTCGTGGAG
ACCTTCGAGC GCGCCTCGGG CGTGCACATC GACCACTACG TGGAGATCGG CATGGGCGGC
TTCGTCGACA TCGTGGACGC GGTGGGCGGC GTGGAGCTGT GCCCGGAGGA GGCCATGGCC
GACCCCAAGG CCGGGCTGGA CATCGAGGCG GGCTGCCAGA TGATGGACGG CGGCACCGCC
CTGGGCTACG TGCGCACCCG GGCCACGCCG CGCGCCGACC TGGACCGCAT CGCCCGCCAG
CGCGAGTTCT TCTCGGCGCT GGTCCAGACG GCCAGCGCGC CCTCCACGCT GTTCAACCCC
TTCGAGTCCA TCCCGCTGGT GCTCGCGGGC ACCGACACCT TCATGGTGGA CGAGGGCGAC
GACCTGCGGC ACCTGGCCAG CATGCTCCTG GCGATGCGCG GCGGTACGCA GACCACCGCG
ATCCCCGTGG GCCAGACCCC CACGCTGGAC GGGGTCGGCT CGGTGGTGGT CTGGGACGAG
GTGCGCTCCG AGGAGATGTT CGCCGCCATG CGGGCCGGGG AGCCGATCCC CGAGAGCGCC
TTCCAGGAGT AG
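
The gene sequence is laid out in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. A minimal sketch of how a GC fraction such as the one in the GC content field could be computed is shown below; the helper names are hypothetical, and the example call uses only the first two blocks of the sequence rather than the whole gene.

```python
def clean_sequence(seq: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    bases = clean_sequence(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_fraction("GTGACATTGC GCCGAACCCC"))  # 0.65
```

Passing the full gene sequence to `gc_fraction` gives the value to compare with the reported GC content.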
 
Protein sequence
MTLRRTPLPF SRSHRVEVSD EAIHGAEQMG DAMADRRNRD DPGRTGEHDR QGVFRRGGTP 
GRPDEFEKLY RPRESEQRVV GETRRLPPAE DVPPVRPRRS QRPQQSQRSR RTHSAGRVYD
GRGIERSRKE RKRRLATTVT VTVLVLVLVL PLVFVGGFYV YANSRLERVE ALLDYEGRPD
GQPGTTYMIV GSDSRQGLSE EQMDEMATGY AEGRRTDTIM VLYIPDEGEP TIVSVPRDSY
VPLAVPGYAD NKINTAFADA VCGTNDAGEE VCGGPAPLVE TFERASGVHI DHYVEIGMGG
FVDIVDAVGG VELCPEEAMA DPKAGLDIEA GCQMMDGGTA LGYVRTRATP RADLDRIARQ
REFFSALVQT ASAPSTLFNP FESIPLVLAG TDTFMVDEGD DLRHLASMLL AMRGGTQTTA
IPVGQTPTLD GVGSVVVWDE VRSEEMFAAM RAGEPIPESA FQE
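
The protein sequence can be checked against the gene sequence by translating it. The sketch below builds the codon table from the compact amino-acid string used for the standard genetic code, whose codon assignments translation table 11 shares; the set of start codons is a deliberately short, assumed subset (ATG, GTG, TTG), and an initiator codon is read as methionine. This is an illustration, not the database's own pipeline.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACATTGC GCCGAACCCC GCTACCCTTT"))  # MTLRRTPLPF
```

The output matches the first ten residues of the protein sequence; the gene begins with GTG rather than ATG, which is why the start-codon handling matters here.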