Gene Information |
Locus tag | Ndas_3312 |
Symbol | |
ID | 9247174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3954517 |
End bp | 3956232 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | RNA polymerase, sigma 70 subunit, RpoD subfamily |
Protein accession | YP_003681224 |
Protein GI | 297562250 |
COG category | |
COG ID | |
TIGRFAM ID | |
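The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 3954517
END_BP = 3956232
GENE_LENGTH_BP = 1716
PROTEIN_LENGTH_AA = 571


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1716
print(expected_protein_length(GENE_LENGTH_BP))  # 571
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the record describing an unspliced, non-pseudo gene.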
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCATCTG CCAGTTCGAC TCGCTCGAAG CAGTCCGAGT CACTGCAAGA GCCCGTCATT CAGCAGCTGA TCGAGCGTGG GCGGTCCCAG GGCTTCCTTG AGCCCGAGGA CGTGCGCCGT GCCTTCGAAG AGGCCGAAAT CCCGATGTCG CAGGCGCAGT CCGTGCTCCG CAGCCTCGCG AAGGAGGGCG TGACCGTACT CGTTCCCGAG TCGGCCTCCT CCCGGCGCAA GGCGCCGCGG CGCAAGGCGG CGACCACCAA GACGGCGGCC ACGCGGTCCA CCTCGACCAG GTCCACCAAG ACCACCGCGA CGAGCAGGGC CGCCAAGCCC GTGCGCGCGG CGGCGGCCCA GGAGCAGACC GAGACGGTCA CCGCCGTGGT CGGTTCCGCC GAGGACGCCG GGAACGAGGC CGGGGCGGAG AAGAAGCCCG CGGCCAAGAA GCCCGCCAAG AAGACGGCGA CCAAGAAGCC CGCGACCAAG GCGGCCAAGA CCACCGCGGC CAAGGGCCCG ACCGCCAAGA CCACCGGCCT CAAGGCCGCC GCCGCCAAGA AGACGGCGAC CAAGGAACTC GGACTCGCGG CGGACGGCGA GTTCGACGAG GACGAGGACG GCCTGGACGA CCTCGAGCAC ACCGGCGCCG AGCTGGAGCT GGTCGAGGAC ACCCCGGACC CGGCGGACAA GGACCTCAAG CCCGCCAAGC CCGAGGCCGT GGGCGCCCCG GCCAAGCCCG CCAACGAGGA CGAGTCCTTC GTCCTCTACG ACGACGATGA CGACGCCCCC GCGGCGCAGG TCGTGGCCGC GGGCGCGACG GCGGACCCGG TCAAGGACTA CCTCAAGCAG ATCGGCAAGG TCCCGCTGCT CAACGCCGAG CAGGAGGTCG AACTCGCCAA GCGGATCGAG GCCGGCCTGT TCGCCGAGGA GAAGCTGGCC GAGGAGGCCG AGCTCCTCAC CGTCGAGCTG CGCGACGAGC TGGAGTGGAT CGCCGAGGAC GGCGGCCGCG CCAAGAAGCA CCTGCTGGAG GCCAACCTCC GGCTCGTGGT CTCGCTCGCC AAGCGCTACA CCGGCCGCGG CATGCTCTTC CTGGACCTGA TCCAGGAGGG CAACCTCGGT CTGATCCGCG CGGTGGAGAA GTTCGACTAC ACCAAGGGCT TCAAGTTCTC GACCTACGCC ACGTGGTGGA TCCGCCAGGC GATCACCCGG GCCATGGCCG ACCAGGCGCG CACCATCCGC ATCCCGGTGC ACATGGTCGA GGTCATCAAC AAGCTGGCCC GCGTCCAGCG CCAGATGCTC CAGGACCTGG GCCGCGAGCC CACCCCGGAG GAGCTGGCCA GGGAACTCGA CATGACCCCG GAGAAGGTCG TCGAGGTGCA GAAGTACGGC CGCGAGCCGA TCTCCCTGCA CACCCCGCTG GGCGAGGACG GCGACAGCGA GTTCGGCGAC CTCATCGAGG ACTCCGAGGC GATCCAGCCG GGCGAGGCGG TCAGCTTCAC CCTGCTCCAG GAGCAGCTGC ACTCGGTGCT GGACACGCTG TCCGAGCGCG AGGCGGGCGT GGTGTCCATG CGCTTCGGTC TCACCGACGG CCAGCCGAAG ACTTTAGACG AGATCGGCAA GGTCTACGGG GTCACCCGTG AGCGCATCCG GCAGATCGAG AGCAAGACGA TGTCGAAGCT CCGCCACCCG TCGCGTTCGC AGGTGCTCCG CGACTACCTG GACTAG
|
Protein sequence | MSSASSTRSK QSESLQEPVI QQLIERGRSQ GFLEPEDVRR AFEEAEIPMS QAQSVLRSLA KEGVTVLVPE SASSRRKAPR RKAATTKTAA TRSTSTRSTK TTATSRAAKP VRAAAAQEQT ETVTAVVGSA EDAGNEAGAE KKPAAKKPAK KTATKKPATK AAKTTAAKGP TAKTTGLKAA AAKKTATKEL GLAADGEFDE DEDGLDDLEH TGAELELVED TPDPADKDLK PAKPEAVGAP AKPANEDESF VLYDDDDDAP AAQVVAAGAT ADPVKDYLKQ IGKVPLLNAE QEVELAKRIE AGLFAEEKLA EEAELLTVEL RDELEWIAED GGRAKKHLLE ANLRLVVSLA KRYTGRGMLF LDLIQEGNLG LIRAVEKFDY TKGFKFSTYA TWWIRQAITR AMADQARTIR IPVHMVEVIN KLARVQRQML QDLGREPTPE ELARELDMTP EKVVEVQKYG REPISLHTPL GEDGDSEFGD LIEDSEAIQP GEAVSFTLLQ EQLHSVLDTL SEREAGVVSM RFGLTDGQPK TLDEIGKVYG VTRERIRQIE SKTMSKLRHP SRSQVLRDYL D
|
| |
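The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch only, not the database's own pipeline. It assumes the gene sequence is listed in coding orientation (it begins with GTG and ends with TAG even though the gene lies on the minus strand), uses the standard amino acid assignments that translation table 11 shares with the standard code, and simplifies initiation by reading the first codon as methionine. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acid assignments in TCAG codon order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS given in coding orientation, stopping at the first stop codon.

    Assumption: the first codon is the annotated start, so it is read as Met
    even when it is an alternative initiator such as GTG.
    """
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append("M" if pos == 0 else amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence above.
print(translate_cds("GTGTCATCTG CCAGTTCGAC T"))  # MSSASST
```

Passing the full Gene sequence field to `translate_cds` and comparing the result with the Protein sequence field (after removing spaces) is one way to check the record for internal consistency; `gc_percent` can be compared with the GC content field in the same way.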