Gene Information |
Locus tag | Ndas_2135 |
Symbol | |
ID | 9245985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2555526 |
End bp | 2556812 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | putative RNA polymerase, sigma-24 subunit, ECF subfamily |
Protein accession | YP_003680065 |
Protein GI | 297561091 |
COG category | |
COG ID | |
TIGRFAM ID | |
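The start and end coordinates, gene length, and protein length listed above appear to be mutually consistent. A minimal Python sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2555526, 2556812)
print(length)                     # 1287
print(protein_length_aa(length))  # 428
```

Both results match the Gene Length (1287 bp) and Protein Length (428 aa) fields. The strand does not affect either calculation.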
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.387852 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACC CCACCGGCCG CGAGGCCGTA GCCGCCGTCT GGCGGATCGA CTCGGCGCGG ATCGTCGGCG CGCTCGCGCG CTACACCGGC GACTTCGCCC TGGCCGAGGA CCTGGGCCAG GAGGCGCTGG CCGAGGCGCT GGTGACCTGG CCGCGCGAGG GCGTGCCCCG CAACCCGGCG GGGTGGCTGC TCACCGTCGG CCGCCGCCGC GCGATCGACG CGTTCCGCCG CCGCTCCGCC CTGGACGACA GGTACGCCGT CCTCGCCCGT GACCTGGGCG AGGGCGGCGC CGCCTCGGGC AGCGCGCCCG CCGACCCCGC ACGGGAGGCC GACGACGTGC TGTGGGATCC GGACCGGATC GACGACGACG TGCTCGCGCT GATGTTCACC GCCTGCCACC CCGTGCTGTC GCGGGAGGCC AGGGTGGCGC TCACCCTGCG GGTGGTCGGC GGTCTGACCA GCGACGAGAT CGCCAGGGCG TTCCTGGTCC CGACCGCGAC CGTGCAGGCC AGGATCACCC GGGCCAAGAG GACCCTCGGG GCGGCCCGGG TGCCCTTCGG GGTACCCCCG GCCCAGGAAC GGCCCCAGCG GCTCGGCTCG GTGCTCAACG TGGTCTACCT GATCTTCACC GAGGGCTCCT CGGCCAGCTC CGGCGGCGAC CTGCTCCGGC TCGACCTCGC GGGCGAGGCC CAGCGCCTGG CCCGGGTGCT GGCCCGTCTG GTCCCCGACC AGCCCGAGGT CCACGGCCTG CTGGCGCTGC TGGAGCTGAC CGCGGCGCGC TTCCCCGCCC GGACCGGGGC TGACGGGCGG CCGGTGCTGC TGGAGCACCA GGACCGCCGC CGCTGGGACC GCGCCGCGAT CCGCCGGGGA CGGGCCGCCC TGGCCCGCGC GGGGAGGACC GGCCGGGGGC TGGGCGCCTA CGGCCTCCAG GCCGCGATCG CCGAGTGCCA CGCGCTCGCC GCCTCGGTGC GGGAGACGGA CTGGGAGCGG ATCGTGCTGC TCTACGAGGC GCTCAGCCGT CTGGCGCCCT CCCCGGTGGT GGACCTCAAC CGGGCGGTGG CGGTCTCCAT GGCCCGGGGA CCGGCCGAGG CGCTGCGGAT CGTGGACGAG CTGGCGGCGG CCGGGGCGCT GGCGGACTCG CACCTGCTGC CGAACGTGCG CGGGGAGCTG CTCGTGCGCC TGGGGCGCAC CGGTGAGGCG CGCACCGAAC TGGAGCTGGC CGTGCGCCGG TGCGGCAACG AGCGCGAGCG GGAGGTGCTG GAGCGCAAAC TCGCCGACCT GGGCTGA
|
Protein sequence | MADPTGREAV AAVWRIDSAR IVGALARYTG DFALAEDLGQ EALAEALVTW PREGVPRNPA GWLLTVGRRR AIDAFRRRSA LDDRYAVLAR DLGEGGAASG SAPADPAREA DDVLWDPDRI DDDVLALMFT ACHPVLSREA RVALTLRVVG GLTSDEIARA FLVPTATVQA RITRAKRTLG AARVPFGVPP AQERPQRLGS VLNVVYLIFT EGSSASSGGD LLRLDLAGEA QRLARVLARL VPDQPEVHGL LALLELTAAR FPARTGADGR PVLLEHQDRR RWDRAAIRRG RAALARAGRT GRGLGAYGLQ AAIAECHALA ASVRETDWER IVLLYEALSR LAPSPVVDLN RAVAVSMARG PAEALRIVDE LAAAGALADS HLLPNVRGEL LVRLGRTGEA RTELELAVRR CGNEREREVL ERKLADLG