Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1916 |
Symbol | |
ID | 9245766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2335898 |
End bp | 2337043 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | putative RNA polymerase, sigma-24 subunit, ECF subfamily |
Protein accession | YP_003679849 |
Protein GI | 297560875 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.28585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.677149 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGAGG CCCTTCTGCG GACCCTCACC CCCACCGTGA TCGCCGTCCT CGGCCGCCGC GGAGCGGACT TCGCCGCGGC CGAGGACGCC GTACAGGAAG CCCTGGCCGA GGCTGTGCGC GTGTGGCCGG ACCACCCGCC GCACGACCCC AGGGGCTGGC TGGTCACGGT GGCCTGGCGC AAGTTCCTCG ACGCCGCCCG CGCCGACACC TCCCGACGGC ACCGCGAGGT ACGCGTCGAG GCCGAACCCG TACCCGGCCC GGGCGAGGCG GTGGACGACA CGCTCCGGTT GTACTTCCTG TGCGCGCACC CGTCCCTGAC ACCGGCCTCG GCCGTGGCGC TCACGCTGCG CGCGGTCGGC GGCCTGACCA CGCGCCAGAT CGCGCGGGCC TACCTCGTGC CGGAGGCGAC CATGGCCCAG CGCATCAGCA GGGCCAAGCG GACCGTCGCT GGCGTCCGGT TCGACCGGCC CGGCGACGTC GCCACCGTGC TGCGCGTGCT CTACCTGGTC TTCAACGAGG GCTACTCCGG AGACGTCGAC CTCGCCGCCG AGGCGATCCG GCTCACCCGC CAGCTCGCGG CCGCGATCGA CCACGAGGAG GTCGCGGGGC TGCTCGCGCT CATGCTTTTG CACCACGCCC GGCGCCCGGC GCGGACCGGT CCCGACGGCA GGCTGGTGCC CCTCGCCGAG CAGGACCGCG GCCTGTGGGA CACCCGCATG ATCGCCGAGG GCGTCGGCGT GCTCCAGGCG GCCCTGGCCC GCGACCGCCT GGGCGAGTAC CAGGCCCAGG CCGCGATCGC CGCCCTGCAC GCCGACGCCC GGACGGCCGA GGAGACCGAC TGGACGCAGA TCGTCGAGTG GTACGACGAA CTGGTGCGCC TCACCGACAG CCCCGTGGCC CGCCTCAACC GGGCCGTCGC GGTCGGTGAG GCCGACGGTC CGCGGGCGGG TCTGGCCGCC CTGGCGGAGG TGGACCCCTC CGTGCCCCGG CACAGCGCCG CCGCCGCGTA CCTGCGTGAG CGCGACGGCG ATCCGGCCAC CGCGGCGCGG CTCTACGCCG AGGCCGCCCG GTCGGCGCCC AACCTGCCCG AACGCGACCA CCTCACGCGG CAGGCCGCAC GGCTCAACGC GCGGCTGCGC GGCTGA
|
Protein sequence | MDEALLRTLT PTVIAVLGRR GADFAAAEDA VQEALAEAVR VWPDHPPHDP RGWLVTVAWR KFLDAARADT SRRHREVRVE AEPVPGPGEA VDDTLRLYFL CAHPSLTPAS AVALTLRAVG GLTTRQIARA YLVPEATMAQ RISRAKRTVA GVRFDRPGDV ATVLRVLYLV FNEGYSGDVD LAAEAIRLTR QLAAAIDHEE VAGLLALMLL HHARRPARTG PDGRLVPLAE QDRGLWDTRM IAEGVGVLQA ALARDRLGEY QAQAAIAALH ADARTAEETD WTQIVEWYDE LVRLTDSPVA RLNRAVAVGE ADGPRAGLAA LAEVDPSVPR HSAAAAYLRE RDGDPATAAR LYAEAARSAP NLPERDHLTR QAARLNARLR G
|
| |