Gene Information |
Locus tag | Ndas_1198 |
Symbol | |
ID | 9245049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1454716 |
End bp | 1455657 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | RNA polymerase, sigma 28 subunit, FliA/WhiG subfamily |
Protein accession | YP_003679145 |
Protein GI | 297560171 |
COG category | |
COG ID | |
TIGRFAM ID | |
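
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in a few lines. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which the listed 942 bp length implies) and that the protein length excludes the stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(1454716, 1455657)
print(length)                  # 942
print(protein_length(length))  # 313
```

Both results agree with the Gene Length and Protein Length fields of the record.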
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.179495 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.392398 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAC AGGACAGAGC ACCGGGGGGT TTCCTCTTGT CCACGTCTCT CAACGAAACA CACACGTCAA CGACCGTCAC CGACCGGTTG CCCGTGATCC CCAGACCGCG CAGGCCGGTC GAGGACACCG TCCCCGAGGA CACGGGGACC GGAGAGGTCC GGGAGACCGG GGAGGTCCGC CCCGAGGAGC TGATCTCCCA GCTGCGTGAG CTCGCTGAAG ACGACCCCAG GGCCGACGAG CTGCGCGAGC GGGTGGTCGT GCTCTACCAG CCGTTGATCA ACAAGATCGC GCGCCGCTAC GGGGGGCGCG GCGAGCCGCT GGAGGACCTG AAGCAGACCG CGATGGTGGG GTTGGTCAAG GCCGTGCGCG GTTACGACCC GGCCCGCGGC AAGCCCTTCA TCTCCTACCT GCTGCCCACG GTGACGGGGG AGATCAAGCG GCACTTCCGC GACCACACCT GGGCGGTGCG GGTGCCGCGC CGCCACCAGG AGAACCGGGT GAAGCTGCGC CGGGTGACGG GGGAGTTCCA GCAGGCCCAC GCGCGCACGC CCACGGTCCA CGAACTCTCC GAGGAGATGG GGCTGCCCGA GGCCGAGGTC GGCGAGCTCA TCCAGGTCTC GGAGTCCTAC CGGTCGCTGT CCCTGGACGC GCCCGACTCC TCCGATTCCG AGGGGCAGGA GGGCACGCGC CTGGAGGACC ACCTGGGCTG TGAGGACGCG GCGCTGGACC GGGTGGTGGA GCGCGAGTCG CTCAAGCCCG CGCTGGCGCG CCTGCCCGCC CGCGAGCGGG AGATCCTGCG GCTGCGGTTC TTCGGGGACC ACACCCAGTC CGAGATCGCG GACCGGTTGG GGTACTCCCA GATGCACGTG TCCCGGCTGC TGTCGGGGGT GCTGGAACAG CTGCGCGAGG AGGTCGGCGG GCACGCGCCC GGCCACGGCT GA
|
Protein sequence | MSTQDRAPGG FLLSTSLNET HTSTTVTDRL PVIPRPRRPV EDTVPEDTGT GEVRETGEVR PEELISQLRE LAEDDPRADE LRERVVVLYQ PLINKIARRY GGRGEPLEDL KQTAMVGLVK AVRGYDPARG KPFISYLLPT VTGEIKRHFR DHTWAVRVPR RHQENRVKLR RVTGEFQQAH ARTPTVHELS EEMGLPEAEV GELIQVSESY RSLSLDAPDS SDSEGQEGTR LEDHLGCEDA ALDRVVERES LKPALARLPA REREILRLRF FGDHTQSEIA DRLGYSQMHV SRLLSGVLEQ LREEVGGHAP GHG
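
The relationship between the gene sequence and the protein sequence can be checked with a small translation routine. The following is a minimal sketch, not the method used to produce this record. It assumes the sequences are given in whitespace-separated 10-character blocks, as shown above, and it relies on translation table 11 sharing its amino-acid assignments with the standard genetic code. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Alternative start codons, which table 11 permits, are not handled specially; this gene starts with ATG, so that does not matter here.

```python
from itertools import product

# Codons enumerated with each position in the order T, C, A, G,
# paired with the amino-acid string for NCBI translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record above.
prefix = "ATGAGCACAC AGGACAGAGC ACCGGGGGGT"
print(translate(prefix))  # MSTQDRAPGG
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.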