Gene Information |
Locus tag | Ndas_0393 |
Symbol | |
ID | 9244231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 483761 |
End bp | 485215 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | |
Product | RNA polymerase, sigma-24 subunit, ECF subfamily |
Protein accession | YP_003678347 |
Protein GI | 297559373 |
COG category | |
COG ID | |
TIGRFAM ID | |
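The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(483761, 485215)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1455 484
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.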
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.041427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCAGAG GTAGCGAACC AGGCGAACCG TCGACGGTCG AGGCGCTCTA CGACGCCTTC GCCCCGTCCC TGTACCGGTA CGCCTGGTCG CTACTGGGTG AGGAGGACAC CGGGCAGGCG GCCGAGGCCG TGCACGACGG CCTCGTCGCC GGTGTCGTCC TCGACTCGCG GCGCGCGGAC CCGGCCGACC TCGGCTCCTG GCTGTACGCC CTGGTCCGCT CGGCCTGCCA GCGCCGCGGC CTGGCCCACG TGAGCCCCTA CACCCGGCTG GCCACGGTGC CCGCGGAGGA GCCGGTGGCC CGCATGTTCG CCCGGCTGCC CGCCAGCCAC CGCGAACTGG TGGAGCTGAA CCTGCGGCAC GCGCTGCCCA CCTCCGCGGT CGCGCGCGTC CTGGGCCTGG ACCCCCAGAT CTGCGGCGAG CTGTCGCGCT CGGCGATCCG CCGCGCCGCC GAGAACCTGG ACGGCCGCGA CGGCGGGCGC CGCGTCCCGG ACCCCCGCCC CTCCCGGGAG GCGGGAGCGC CGGACGGTGC CGGGGACACC GGCGGCCCGG GAACCGCCGC CTGGCGCGCC CAGGTGCACG ACGTCTCCCA GGCCCTGGCC CTGCTGCGCC CGCCCGGCCC TCCGCCCGGC CTGCGCGAGG CGGTGGTGCG CACCTGCACC GACCCCGGCC TCGCCGCGGC CCGCGAGCGC ATCGCCGCGC AGATGCACCC CCTCACCGGC GAGGGGTACC CGATGCACCG CTCCCGCGCC GCCGGGGCGG TCGAGGAGGA GGCCGAGGCG GCCGAGCCCG GCCCCGAGGC GCCGCCCCGC GCCCTGCCCG GCGACCGGCT GACCACGCGC GACCACCCGG TGCGCGACGA AGCGGTCACC CCGCTGGCCG GCCCCCGATC CCCCGCCGGA CCCGACTCCG GTGACGACCC CGACGACCAT CGCACCGCCC GCCGCCGCTG GCCGCTGCCC GCCGTCTCCG GCCTCGCCAC CGTGGTGCTG GCGGTCGCGC TGTGGGGCTG GGCCAGCGCC GTGGGCGGCC CCTCGACGAT GATCGGCTCC GGGCCCGACG AGGCCGAGCG CGGTCCGCTG GTCCCGCAGG TGGAGACCGA CGCGACCAGC GCGGACACCA AACCGGAGGC CGGACCGACC GTGGAGCCCC CGGCCGCGCC GACGGAGTCC GCCGAACCCG GGGCCGGGAC GGAGCCGCGG CAGGAGGACG GCGGCGGCAG CGGGCACCAG GCCCCCGACC CGGCTCCGGA GCGGACCACG CCCGCACCGC CCGCGCAGGA GCCGAGCCCG CCCCCCGGAG GCGAGGGCCC CGACGGCGGT CCGGACGCGC CCGAGGGGGA GGAACCCGGA GACGACGACG CCCCCGGCGG CCCGCCCGAG GAGGACGGGG ACGACGACGG GGGCGGTACA GGTCTGCTCG ACGGGCTGTT GGGGCTGCTC TTCGGCGGTG GCTGA
Protein sequence | MTRGSEPGEP STVEALYDAF APSLYRYAWS LLGEEDTGQA AEAVHDGLVA GVVLDSRRAD PADLGSWLYA LVRSACQRRG LAHVSPYTRL ATVPAEEPVA RMFARLPASH RELVELNLRH ALPTSAVARV LGLDPQICGE LSRSAIRRAA ENLDGRDGGR RVPDPRPSRE AGAPDGAGDT GGPGTAAWRA QVHDVSQALA LLRPPGPPPG LREAVVRTCT DPGLAAARER IAAQMHPLTG EGYPMHRSRA AGAVEEEAEA AEPGPEAPPR ALPGDRLTTR DHPVRDEAVT PLAGPRSPAG PDSGDDPDDH RTARRRWPLP AVSGLATVVL AVALWGWASA VGGPSTMIGS GPDEAERGPL VPQVETDATS ADTKPEAGPT VEPPAAPTES AEPGAGTEPR QEDGGGSGHQ APDPAPERTT PAPPAQEPSP PPGGEGPDGG PDAPEGEEPG DDDAPGGPPE EDGDDDGGGT GLLDGLLGLL FGGG
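The protein sequence can be reproduced from the gene sequence using translation table 11 (the bacterial, archaeal and plant plastid code). The following is a minimal sketch, assuming the gene sequence is given 5' to 3' on the coding strand and that the first codon is an initiator; `translate` is a hypothetical helper, not a function from any particular library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 assigns the same amino acids as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons of table 11; as a start codon each is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon, e.g. GTG here
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGACCAGAG GTAGCGAACC AGGCGAACCG"))  # MTRGSEPGEP
```

To check the whole record, pass the full gene sequence to `translate` and compare the result with the protein sequence after removing its spaces.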