Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5152 |
Symbol | |
ID | 9249045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 292655 |
End bp | 293677 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | DNA-directed RNA polymerase, alpha subunit |
Protein accession | YP_003683038 |
Protein GI | 297564065 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.449988 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGATCG CACAGCGTCC CACGCTCACC GAGGAAGCCT CCTCGGACCT GCGCTCGAAG TTCGTCATCG AGCCGCTGGA GCCGGGCTTC GGCTACACCA TCGGCAACTC GCTTCGGCGC ACCCTGCTGT CCTCCATCCC CGGCGCGGCT GTCACGAGCA TCCGCATCGA GGGCGTCGAG CACGAGTTCA CCACCGTGCC CGGTGTCAAG GAGGACGTCA CCGAGATGAT CCTCAACCTC AAGGGCCTGG TCGTCAGCTC CGAGCACGAC GAGCCGGTGC TGATGTACCT GCGCAAGCAG GGCCCCGGTG TGGTGACCGC CGCGGACATC GCTCCGCCGG CCGGTGTCGA GGTGCACAAC CCCGACCTGC ACATCGCCAC GCTCAACGGC AAGGGCAAGC TGGAGATGGA GCTGACGGTC GAGCGCGGCC GCGGCTACGT CTCGGCGACG CAGAACAAGC AGGCGGGACA GGAGATCGGC CGCATCCCGA TCGACTCCAT CTACTCCCCG GTCCTGCGTG TGACCTACAA GGTCGAGGCC ACCCGTGTCG AGCAGCGCAC CGACTTCGAC CGCCTGATCG TGGACATCGA GACCAAGCCG GCCATCCGTC CCCGGGACGC GGTCGCCAGC GCGGGCAAGA CCCTGGTCGA GCTGTTCGGT CTGGCGCGTG AGCTCAACGT CGACGCCGAG GGCATCGACA TGGGCCCGTC GCCCACGGAC GCCGCCCTGG CGGCGGACCT GGCGCTGCCG ATCGAGGACC TCAACCTCAC GGTCCGGTCC TACAACTGCC TCAAGCGTGA GGGCATCCAC AGCGTCGGTG AGCTGGTTGC CCGCTCCGAG CAGGACCTGT TGGACATCCG CAACTTCGGT GCCAAGTCCA TCGACGAGGT CAAGCAGAAG CTCGTCGACA TGGGCCTGTC GCTGAAGGAC TCCCCGCCCG GATTCGACCC CAGCTCGGCG GCCGACTCCT ACAGCTCCGA GGACGACGAG GGCGAGTCCT TCGTCGAGAC GGAGCAGTAC TAA
|
Protein sequence | MLIAQRPTLT EEASSDLRSK FVIEPLEPGF GYTIGNSLRR TLLSSIPGAA VTSIRIEGVE HEFTTVPGVK EDVTEMILNL KGLVVSSEHD EPVLMYLRKQ GPGVVTAADI APPAGVEVHN PDLHIATLNG KGKLEMELTV ERGRGYVSAT QNKQAGQEIG RIPIDSIYSP VLRVTYKVEA TRVEQRTDFD RLIVDIETKP AIRPRDAVAS AGKTLVELFG LARELNVDAE GIDMGPSPTD AALAADLALP IEDLNLTVRS YNCLKREGIH SVGELVARSE QDLLDIRNFG AKSIDEVKQK LVDMGLSLKD SPPGFDPSSA ADSYSSEDDE GESFVETEQY
|
| |