Gene Information |
Locus tag | Ndas_3959 |
Symbol | |
ID | 9247830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4734162 |
End bp | 4735553 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_003681862 |
Protein GI | 297562888 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACATTGC GCCGAACCCC GCTACCCTTT TCGCGATCAC ACAGGGTGGA AGTAAGCGAC GAGGCGATCC ACGGCGCTGA GCAGATGGGC GATGCGATGG CTGACCGACG GAACCGGGAC GACCCCGGCC GCACCGGCGA GCACGACAGG CAGGGCGTGT TCCGGCGCGG CGGGACCCCG GGCAGGCCGG ACGAGTTCGA GAAGCTCTAC CGCCCGCGCG AGTCCGAGCA GAGGGTGGTC GGCGAGACGC GTCGGCTCCC GCCCGCCGAG GACGTGCCCC CGGTGCGGCC GAGGCGGTCC CAGAGGCCTC AGCAGTCCCA GAGGTCCCGG CGCACCCACA GCGCCGGGCG CGTCTACGAC GGCCGCGGCA TCGAGCGCAG TCGCAAGGAG CGCAAGCGCC GGCTGGCCAC CACCGTCACG GTGACCGTGC TGGTGCTGGT GCTGGTCCTG CCCCTGGTCT TCGTCGGCGG GTTCTACGTG TACGCGAACT CGCGGCTGGA GCGGGTGGAG GCCCTGCTGG ACTACGAGGG CCGCCCGGAC GGGCAGCCCG GCACGACCTA CATGATCGTG GGCTCCGACA GCCGCCAGGG CCTGTCCGAG GAGCAGATGG ACGAGATGGC CACCGGCTAC GCCGAGGGCC GCCGGACCGA CACCATCATG GTGCTGTACA TCCCCGACGA GGGCGAGCCC ACCATCGTCA GCGTCCCCCG AGACTCCTAC GTCCCCCTCG CCGTCCCCGG TTACGCCGAC AACAAGATCA ACACCGCCTT CGCCGACGCC GTGTGCGGCA CGAACGACGC GGGCGAGGAG GTCTGCGGCG GCCCCGCCCC CCTCGTGGAG ACCTTCGAGC GCGCCTCGGG CGTGCACATC GACCACTACG TGGAGATCGG CATGGGCGGC TTCGTCGACA TCGTGGACGC GGTGGGCGGC GTGGAGCTGT GCCCGGAGGA GGCCATGGCC GACCCCAAGG CCGGGCTGGA CATCGAGGCG GGCTGCCAGA TGATGGACGG CGGCACCGCC CTGGGCTACG TGCGCACCCG GGCCACGCCG CGCGCCGACC TGGACCGCAT CGCCCGCCAG CGCGAGTTCT TCTCGGCGCT GGTCCAGACG GCCAGCGCGC CCTCCACGCT GTTCAACCCC TTCGAGTCCA TCCCGCTGGT GCTCGCGGGC ACCGACACCT TCATGGTGGA CGAGGGCGAC GACCTGCGGC ACCTGGCCAG CATGCTCCTG GCGATGCGCG GCGGTACGCA GACCACCGCG ATCCCCGTGG GCCAGACCCC CACGCTGGAC GGGGTCGGCT CGGTGGTGGT CTGGGACGAG GTGCGCTCCG AGGAGATGTT CGCCGCCATG CGGGCCGGGG AGCCGATCCC CGAGAGCGCC TTCCAGGAGT AG
|
Protein sequence | MTLRRTPLPF SRSHRVEVSD EAIHGAEQMG DAMADRRNRD DPGRTGEHDR QGVFRRGGTP GRPDEFEKLY RPRESEQRVV GETRRLPPAE DVPPVRPRRS QRPQQSQRSR RTHSAGRVYD GRGIERSRKE RKRRLATTVT VTVLVLVLVL PLVFVGGFYV YANSRLERVE ALLDYEGRPD GQPGTTYMIV GSDSRQGLSE EQMDEMATGY AEGRRTDTIM VLYIPDEGEP TIVSVPRDSY VPLAVPGYAD NKINTAFADA VCGTNDAGEE VCGGPAPLVE TFERASGVHI DHYVEIGMGG FVDIVDAVGG VELCPEEAMA DPKAGLDIEA GCQMMDGGTA LGYVRTRATP RADLDRIARQ REFFSALVQT ASAPSTLFNP FESIPLVLAG TDTFMVDEGD DLRHLASMLL AMRGGTQTTA IPVGQTPTLD GVGSVVVWDE VRSEEMFAAM RAGEPIPESA FQE
|
| |
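The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene sequence ends with a stop codon (here it ends in TAG). The function names `gene_length_bp`, `protein_length_aa`, and `gc_percent` are hypothetical helpers written for illustration; they are not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces and other separators (such as the 10-base grouping used in
    this record) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length_bp(4734162, 4735553)
    print(length)                     # 1392, matching the Gene Length field
    print(protein_length_aa(length))  # 463, matching the Protein Length field
```

Passing the full Gene sequence string to `gc_percent` gives a value that can be compared against the GC content field.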