Gene Information |
Locus tag | Ndas_0869 |
Symbol | |
ID | 9244714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1064354 |
End bp | 1066309 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | putative RNA polymerase, sigma-24 subunit, ECF subfamily |
Protein accession | YP_003678819 |
Protein GI | 297559845 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
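The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention explicitly. The function name `cds_summary` is hypothetical and used only for illustration.

```python
def cds_summary(start_bp, end_bp):
    """Return (gene_length_bp, codon_count, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    (both are assumptions, not stated in the record).
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codon_count = gene_length // 3
    protein_length = codon_count - 1  # the stop codon encodes no residue
    return gene_length, codon_count, protein_length


# Values taken from the record: Start bp 1064354, End bp 1066309
gene_length, codon_count, protein_length = cds_summary(1064354, 1066309)
print(gene_length, codon_count, protein_length)  # 1956 652 651
```

Under these assumptions the result agrees with the listed Gene Length (1956 bp) and Protein Length (651 aa).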
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.456252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.192635 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGATG TCCACCGCAC CGTCGACGCG GTCTGGAAGA TGGAGTCGGC GCGGATCGTC GCCTCGCTCA CCCGGATCGC GCACGACGTC GGCGCCGCCG AGGAGTGCGC CCAGGACGCC CTCGTCGCCG CCCTGGAGCA GTGGCCGCGT GAGGGGATCC CCGACAACCC CGGCGCCTGG CTCATGACCG CCGCCAAACG CCGCGTCCTG GACCGCCTGC GCCGGGAGCA CCGCCTGGAG AGCAAGCACA AGGAGATCGC CCACGAGCTG GAGCGCGCTC CCGGGGTCGC GCCCGCCCCG GACGACGGCG TCCTGCGCCT GCTGTTCGCG ACCTGCCATC CGGTGCTGTC CACTCCGGAG AGGGTCGCGC TGACCCTGCG CCTGGTGGCC GGGCTGACCA ACGGGGAGAT CGCCCGCGCC TTCCTCACCG GGGAGGGGCG CATCGCCCAG CGCGTGGCGC GGGCCAAACG GCTGCTCGCC GAGGAGGGGG TGGCCTTCGG GCTGCCCGAC GGACGGGAGC TGGCCGAACG CCTCTCCTCC GTCCTCGGCG TCATCTACCT CGTCTTCAAC GAGGGCTACG CGGCGACCTC GGGCGAGGAC CTGATGCGCC CCGGCCTGTG CCTGGAGGCG CTGCGGCTGG GCCGCACCCT GGCCGAACTG GTGCCGCACG AGGCCGAGGC GCACGGCCTG GTGGCGCTGA TGGAGCTCCA GCAGTCCCGG GCCGGGGCCC GGACCGGCCC CTCCGGCGAG ATCGTCCGAC TGCACGAGCA GAACCGGGGC CGCTGGGACC CCCTGCTGGT GCGCCGGGGC TTCGCCGCCA TGCTGCGCGC CCGTGACGCG GGCGGCCCGC CCGGCCCCTA CGTGCTCCAG GCGGCCGTCG CCGTGTGCCA CGCCCGGGCC ACCAGCGAGC AGGACACCGA CTGGGCCCGT ATCGCCGCCC TCTACGACCA GCTGGTCGTC CTACTCCCCA CCCCGGTCGT GCGCCTGAAC CGGGCCGTGG CGGTCGGCAG GGCCCGCGGG CCGGGGGAGG GACTGGCCCT GGCCGACGAG CTGGCGGAGG ACCCGGTCCT GCGCGACTAC CACCTGTTGC CCGGCGTGCG CGGCGACCTG CTCCTGCGGC TGGGCCGGGC CGCCGAGGCG AAGCGGGAGT TCGAGCGCGC CGCCTTGCTG GCCGAGAACA CCGCCGAACG CGCCTTCCTG TCGCGCCGGG CAGAGGAGAC CGCGGTCCCC GAGCCCGCCG GGCCCGACCT GGGCGCGACG GCCCGGGAGT TCCTGGGCCG CGACGACCTG GACCCCCAGA CGCTGCGCTC CTACGGCCAG ACCCTGGACC GGCTGTGCCG TTCGCTGGGG GAGGGTCTGC CCCTGGCCGA CCTGACCCCC GAACGGGTGG CGGGCGTGTT CGCCACCGCC TGGGGCGGTG CGGCCCCGCG CACGTGGAAC CGGCACCGGT CGACCGTGCG CTCCTTCGGC GCCTGGGCGG GGCTGGAGGA TCTCGCGGCG GACCTGGAAC GGCGCGGCGA GACCCGCTCA CCGCACGTGC CGCTCGACCC CGAGACCGTG GCGCGCCTGT GCGACGGGGA GGGTTTCGCG CTTCGCGAGC GCGTGCTGTG GCGGCTGCTG CACGAGTCCG GCGCCCGGGT GAATTCCGTG CTGGCGCTCA ACGTGGAGGA CCTGGACCTG GAGGACCGCC GTGCGCGGGC GGGCGACGGC TGGGTGGGCT GGAGGTCGGG GACCGCCCGG CTGCTGCCCG AGCTCGTGGC GGGGCGCGAG CGGGGGCCGC TCCTTCTGGC AGACCGGCGT 
CCGGGACCGG CGAGGCGCCC CGCCGCGGCC GACCTGTGCC CGCTGACGGG GCGAGGGCGC CTGTCCTACC CGCGTGCGGA GTACCTGTTC AAGCGGGCCA CGCGCTCCCT GGATCCGGCG GGGCGGGGCT ACACCCTGAG CAGGCTCCGG CCCTGA
|
Protein sequence | MTDVHRTVDA VWKMESARIV ASLTRIAHDV GAAEECAQDA LVAALEQWPR EGIPDNPGAW LMTAAKRRVL DRLRREHRLE SKHKEIAHEL ERAPGVAPAP DDGVLRLLFA TCHPVLSTPE RVALTLRLVA GLTNGEIARA FLTGEGRIAQ RVARAKRLLA EEGVAFGLPD GRELAERLSS VLGVIYLVFN EGYAATSGED LMRPGLCLEA LRLGRTLAEL VPHEAEAHGL VALMELQQSR AGARTGPSGE IVRLHEQNRG RWDPLLVRRG FAAMLRARDA GGPPGPYVLQ AAVAVCHARA TSEQDTDWAR IAALYDQLVV LLPTPVVRLN RAVAVGRARG PGEGLALADE LAEDPVLRDY HLLPGVRGDL LLRLGRAAEA KREFERAALL AENTAERAFL SRRAEETAVP EPAGPDLGAT AREFLGRDDL DPQTLRSYGQ TLDRLCRSLG EGLPLADLTP ERVAGVFATA WGGAAPRTWN RHRSTVRSFG AWAGLEDLAA DLERRGETRS PHVPLDPETV ARLCDGEGFA LRERVLWRLL HESGARVNSV LALNVEDLDL EDRRARAGDG WVGWRSGTAR LLPELVAGRE RGPLLLADRR PGPARRPAAA DLCPLTGRGR LSYPRAEYLF KRATRSLDPA GRGYTLSRLR P
|
| |
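As an illustration of how the gene sequence relates to the protein sequence under translation table 11, here is a minimal sketch in plain Python. It builds the codon table from the standard amino acid ordering (table 11 uses the same codon assignments as the standard code) and reads an alternative start codon such as GTG as methionine when it is the first codon. The helper names `translate_cds` and `gc_fraction`, and the reduced `START_CODONS` set, are assumptions made for this example and are not part of the record. Only the first 30 bases of the gene are used, to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only a subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna):
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(dna.split()).upper()


def translate_cds(dna):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence in this record.
PREFIX = "GTGACCGATG TCCACCGCAC CGTCGACGCG"
print(translate_cds(PREFIX))  # MTDVHRTVDA
print(gc_fraction(PREFIX))    # GC fraction of this prefix only, not the whole gene
```

The translated prefix matches the first ten residues of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would be the way to compare against the reported GC content.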