Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2569 |
Symbol | |
ID | 9246420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3064116 |
End bp | 3065522 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | RNA polymerase, sigma 70 subunit, RpoD subfamily |
Protein accession | YP_003680494 |
Protein GI | 297561520 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.152283 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGCA CTGCCCCAGC CGCTCCGGCA CCCACCGCCG GAGCCCCCGC TGACACCGCT CGCAGCGACG ACGCGTCCGC CGACGCGGTC CTGACCGACC TGCTCGACCG CGGACGCGCC CAGGGGCACC TGTCCCTCTC CGAACTGCGT GCCGCTTTCA CCGCGGCCGC CGTCTCCACC GGCGACGGCC GTTCCATCCT GCGTGAACTC ACCGAGAACG GGGTCAGGCT CGCCAACGGG GGCGACGACT CCGCACCGGT GGCCGATCCC GACGCCGAGG ACGACGTGCT GGAAGACACT CTCATCGCCA CGGTGCCCGA CACCGACGAG GCACCCAGGG CCGAAGGCAA GACCGCGCCG CGCAAGGGCC CCTCCCCCCG GAGGAAGACC CGCTCCACAC GGCGCGTCAA GAAGGCCGAG CCCGCCCAGA CCACCACCGA CGCGGAGACC GGCGAGGCCG ACCTCGACGA CCAGTCCCCC GCCATGGGCG ACTCGGTCCA CACCTACCTC AAGGCCATCG GCCGACGCCA GCTCCTCACC GCGGAGCAGG AGGTCGACCT GGCCAAGCGG GTCGAGGCCG GGCTCTACGC CGAGTACCGG CTCGGCCTGC ACGGCGAGGT GGACGGCTCC GCCCCGCTGG CCGAGGCCGA GGTCGAGGAG CTGGAGTGGG TGGCCGAGGA CGGCCGCAAG GCCAAGTCCC ACATGCTGGA GGCCAACCTG CGCCTGGTGG TGTCGGTGGC CAAGAAGTAC AGCGACCGGG GGATGTCGCT GCTCGACGTG GTCCAGGAGG GCAACCTCGG CCTGATCCGC GCCGTGGAGA AGTTCGACTA CACCAAGGGC TTCAAGTTCT CCACCTACGC CATGTGGTGG ATCCGCCAGG CCATCCAGCG CGGTTTCGCC GACTCCGCGC GCACCATCCG CCTGCCCGTG CACGTCCTGG AGCTGCTGAG CAAGGTCAGC CGCCTGGAGC GCGACATGCA CCAGGCGCTG GGCCGCGAGC CCACGCCGGA GGAGCTGGGC CTGGAGCTGG ACAAGACCCC GGCCCAGATC GAGGAGCTGC TGCGGGTCAC CCGCCAGCCC ATCAGCCTGG ACTCCACGAT CGGCGAGGAC GGCGAGACCC GCATCGGCGA CCTGATCGAG GACGTGGACG CCTCGGAGGC CTCCGAGGTG GTGGACCGCC AGCTCATGGC CGACCAGCTG CGCAACGCGC TGTCGGACCT GGAGCCGCGC GAGGCCACCA TCATGTCCCT GCGCTTCGGC CTCATGGACG GCCGTCCGCG CACCCTGGAC GAGATCGGCA AGCACCTGGG GCTGACCCGC GAGCGCATCC GCCAGCTGGA GAAGCAGTCG CTGTCCAAGC TGCGCCACCC CAGCCGCGCC CAGCAGCTGC TGGACTTCGC CAGCTAG
|
Protein sequence | MTSTAPAAPA PTAGAPADTA RSDDASADAV LTDLLDRGRA QGHLSLSELR AAFTAAAVST GDGRSILREL TENGVRLANG GDDSAPVADP DAEDDVLEDT LIATVPDTDE APRAEGKTAP RKGPSPRRKT RSTRRVKKAE PAQTTTDAET GEADLDDQSP AMGDSVHTYL KAIGRRQLLT AEQEVDLAKR VEAGLYAEYR LGLHGEVDGS APLAEAEVEE LEWVAEDGRK AKSHMLEANL RLVVSVAKKY SDRGMSLLDV VQEGNLGLIR AVEKFDYTKG FKFSTYAMWW IRQAIQRGFA DSARTIRLPV HVLELLSKVS RLERDMHQAL GREPTPEELG LELDKTPAQI EELLRVTRQP ISLDSTIGED GETRIGDLIE DVDASEASEV VDRQLMADQL RNALSDLEPR EATIMSLRFG LMDGRPRTLD EIGKHLGLTR ERIRQLEKQS LSKLRHPSRA QQLLDFAS
|
| |