Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5463 |
Symbol | |
ID | 9249366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 651604 |
End bp | 652866 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | putative RNA polymerase, sigma-24 subunit, ECF subfamily |
Protein accession | YP_003683348 |
Protein GI | 297564375 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.37967 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCACA CCGACGCCAC GCGCGCGGTC GAGGCGGTCT GGCGTATCGA GTCGGCCCGT CTCGTCGCAG CGCTCACCCG CATGGTCGGC GACGTCGGCC TCGCCGAGGA GTTCGCCCAG GACGCGCTGG TCAGCGCACT GGAGAAGTGG CCCGGCGACG GGGTTCCCGA CAACCCCGGC GCCTGGCTGA CCACCGTCGC CAAGCGCCGC GTGCTCGACC GCTGGCGCCG CGACGAGCGC TTCCAGCGGC GCATGGCCGA CCTCGGCCGC GAGATCGCCG AGCACGACGG GCAGGCCGAG TTCGACGCCG TCCTGGAGGA GGACTTCGGA GACGACCTCC TGCGGCTGAT GTTCGTGTGC TGCCACCCGG TGCTGTCCAC CGAGGCCCGC GTCGCGCTCA CCCTGCGCCT GCTCGGCGGG CTCACCACCG ACGAGATCGC CCGGGCCTTC CTGGTGCCCG AGTCCACCGT CGCCCAGCGC ATCGTGCGCG CCAAGCGCAC CCTGGCCAAG AGGAAAGTGC CCTTCGAGGT GCCCGTCGGC GAGGACCGCG ACGCCCGGGT GGCCTCCGTG CTGGAGGTCG TCTACCTGGT GTTCAACGAG GGCTACACCG CCACCTCGGG TACGGAGTGG ACGCGCCCCA CGCTGTGCGA GGAGGCCATG CGCCTGGGCC GCGTGCTCGC CCAACTGCTG CCCGGGGAGT GCGAGGCGCA CGGGCTGGTG GCGCTGATGG AGCTGCACGC CTCGCGGCTG CGCGCCCGCG TCGGGCCGGG CGGGGAACCC GTCCCCCTGG CCGAGCAGAA CCGCGCGCTC TGGGACCGCC TGCTCATCAC GCGCGGGATG GAGGCCCTGT TCAGGGCACT GCCGCCGGAC CGGAGCCAGC CCGGCGGTCC CTACGTGCTC CAAGCGGCCA TCGCGGCCGA ACACGCCAGG GCGGTCACCG CCGCGGACAC CGACTGGACG GTCATCGCCG GGCTGTACCT GGCCCTGGTC AGGGTCACCG GCTCGCCGGT CGTGGAGCTG AACCGGGCGG TGGCGGTGTC GATGGCCTCG GGCCCGGAGA CCGCGCTGGA GATCGTGGAC GCGCTGCGGG ACCAGCCGGG GATGGGCGAC TACCACCTGC TGCCCTCCGT GCGCGGCGAC CTGCTCGTGC GGCTGGACCG CCGGGCCGAG GCCCGCGCCG AGTTCGAGCG CGCGGCCTCC CTGACCCGCA ACGAGCGCGA GCGCTCGCTG CTCCTGGACC GCGCCCGCGG CTGCGAGGAC TGA
|
Protein sequence | MSHTDATRAV EAVWRIESAR LVAALTRMVG DVGLAEEFAQ DALVSALEKW PGDGVPDNPG AWLTTVAKRR VLDRWRRDER FQRRMADLGR EIAEHDGQAE FDAVLEEDFG DDLLRLMFVC CHPVLSTEAR VALTLRLLGG LTTDEIARAF LVPESTVAQR IVRAKRTLAK RKVPFEVPVG EDRDARVASV LEVVYLVFNE GYTATSGTEW TRPTLCEEAM RLGRVLAQLL PGECEAHGLV ALMELHASRL RARVGPGGEP VPLAEQNRAL WDRLLITRGM EALFRALPPD RSQPGGPYVL QAAIAAEHAR AVTAADTDWT VIAGLYLALV RVTGSPVVEL NRAVAVSMAS GPETALEIVD ALRDQPGMGD YHLLPSVRGD LLVRLDRRAE ARAEFERAAS LTRNERERSL LLDRARGCED
|
| |