Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1885 |
Symbol | |
ID | 9245735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2296055 |
End bp | 2297374 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | putative RNA polymerase, sigma-24 subunit, ECF subfamily |
Protein accession | YP_003679819 |
Protein GI | 297560845 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.248059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.283313 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGACC GCGACCGTGC GCAGCGGCGC CCGCCGGTGG CGGACGGCCA GCGCGACGGC GGGCGGCGGC CCCCCGGCAC GGACGCGGAC ATCGAGCACC TGCTGCGCAC CGAGGCGCCG CAGGTGCTCG GCGCCCTGGT GCGGCGCTTC GGCCGGTTCG ACGCCGCTGA GGACGCGGTG CAGGAGGCGC TGCTCGCCGC GAGCCGGGCC TGGCCCGCCG ACGGCGTGCC GGAGAACCCG CGCAGCTGGC TGATCCGCGT CGGCTACCGG CGCCTGGTCG ACCTCCTCCG CGCCGAACAG GCCAGGCACC GGCGCGAACA GGAGATCGGC GCCGCCGAAC TGGCCATGCG GGAGCCGGAC CGGAGGGCGG GCCCCGCCCG GGAGAGCGAC GACAGCCTGG CCCTGCTGTT GCTGTGCTGC CACCCCGCGC TGAGCGCCGC CTCCCAGGTG GCGCTCACGT TGCGCGCGGT GGGCGGCCTG ACCACCGCCG AGATCGCGCA CGCCCACGGG ACCTCGGAGA ACACCATGGG CACGCGGATC AGCCGCGCCA AGCAGCAGCT GGCCCGGGTC GGAGCCCGTG TCACCCCGCC GACCGACGCC GACCGCGACA GCCGGATCAC GGCGGTGGCG AAGGTGCTCT ACCTGGTCTT CAACGAGGGT TACACGACCT CCGAGGGCGA CCAGCTCGCC CGCGTGGACC TGACCGGCGA GGCCATCCGG CTGACACGCA TGCTCCACGA CTCTCTGCCC GACGACGCCG AGGTCACCGG CCTGCTCGCG CTCATGCTGC TCACCGAGGC GCGCCGTCCC GCACGCACCG GCGACCACGA CGAGCTGGTG CCCCTGGACG AACAGGACCG GTCGTTGTGG AACGCCGACC TCGTCCGCGA GGGCACCGCG CTGATCGACG GCGTGTGGAA CCGCGGTGAG GTCGGCCCCT ACCAGTTGCA GGCGGCGATC GCGGCCGTGC ACGCGGCGGC CCCGGCGCCG GAGCGGACCG ACTGGGTGCA GATCGCGGTG CTCTACCTGT GGCTCGAACG GCTCAGCCCC ACCGCTCCCG TGCGGCTGAG CCGGGTGGTG GCGGTGGCCA AGGCGTACGG CCCGGCGCGG GGACTGGCCC TGCTGGACGA CCTCGACCGA CGCTTCGGGC TCGGCCGGGA CCCCCTGACC CGGCAGCGCG AACGCGCGGT GCGCGCTCAC CTGCTGGAGA GGACCGGGGA GGGGGAGGGC GCGGCGGCGC TGTACCGGGA GGCGGCCTCC CTGACCGGCA ACCGGGTCGA GCGGCGGTTC CTGCTGGACC GCGCCGACCG CCTCGGTTGA
|
Protein sequence | MNDRDRAQRR PPVADGQRDG GRRPPGTDAD IEHLLRTEAP QVLGALVRRF GRFDAAEDAV QEALLAASRA WPADGVPENP RSWLIRVGYR RLVDLLRAEQ ARHRREQEIG AAELAMREPD RRAGPARESD DSLALLLLCC HPALSAASQV ALTLRAVGGL TTAEIAHAHG TSENTMGTRI SRAKQQLARV GARVTPPTDA DRDSRITAVA KVLYLVFNEG YTTSEGDQLA RVDLTGEAIR LTRMLHDSLP DDAEVTGLLA LMLLTEARRP ARTGDHDELV PLDEQDRSLW NADLVREGTA LIDGVWNRGE VGPYQLQAAI AAVHAAAPAP ERTDWVQIAV LYLWLERLSP TAPVRLSRVV AVAKAYGPAR GLALLDDLDR RFGLGRDPLT RQRERAVRAH LLERTGEGEG AAALYREAAS LTGNRVERRF LLDRADRLG
|
| |