Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1501 |
Symbol | |
ID | 9245351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1841008 |
End bp | 1843059 |
Gene Length | 2052 bp |
Protein Length | 683 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | transcriptional regulator |
Protein accession | YP_003679437 |
Protein GI | 297560463 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.634725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000677289 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCGCGCA GGAAGACGGA ACGACAGCTG AACCTGGTCA TCTGTCTGTT GTCCACGCGC CGCCACCTGA CCGCTCAGGA GATCAGGAAG ACGGTGCACG GCTACGCCGA GGCCGAGACC GAGTCCGCCT TCAAGCGCAT GTTCGAGCGC GACAAGCGCG AACTGCGCGA CAGCGGCATC CCCATCCAGG TGGGCACCGG CGACGCCATC AGCGGCGAGG AGGGCTACCG CATCTCCCGC TCCGACTACG AGCTGCCCGA GATCGAACTC CTGCCCGACG AGGCCCTCGT CCTGGGCCTG GCCGCCCGCG CCTGGCGGCA CGCCTCCCTC GGCGAGGCCG CCGCCAACGC CCTGTTCAAG CTGCGCTCCG CCGGGGTCCC CGTCGACAGC GAGGCCGCCC CCGGACTCAC GCCCGCCATG CGCACCGACG AGCCCTCCTT CGGCCCGGTC TGGCAGGCCG TGCGCGACCG GCGCCCCGTC TCCTTCGCCT ACCGCAAGCC CGGCCAGACC GAGGCCGAGA CCCGCGAGGT CGAACCCTGG GGCATCGTCA ACCTGCGCGG ACACTGGTAC GTGGTCGGTC ACGACCGCCT GCGCGACGCC CGCCGCGTCT TCCGTCTGGG CCGCATCCGC GGTGAGGTGA CCATCCTGCG CGGCGGCCCC GACGTCGTCG TCCCCGAGGG TGTGGACCTG CGCTCGGTGG TCGCGGGCCA CAGCGCCGAG TCCGAGCGCA CCGCCGTCCT GCGGGTGCGC ACCGACTCCG CGCACGCCCT GCGCCGCAGC GCCCTGCGCG TGGTGCCCGG CGAGCGCGAC AGCTCCGGCG TCGGCTGGGA CACGCTCACC CTGCCCTACA CCGACACCCC CGACCTGGTG CGGCGCGTGG CCTACTTCGG CTCCAACGCG GTCGTCGTCG AACCCGTGGG CGCCCGTGAG GCCATGGCCG CCCACCTGAC GGCCACCGCC GAGACGGCCC CCGCCGCCGA GGCCGCGGAC GGTCCCGGCA CGGTGGAGGA GACCGAGGCC CCCGTCCGCG GTCCCCGCTC CGCCGACCAG CTGCGCCGCC TGCTGATGCT CGTCCCCTAC GCCCTGGGTC GCGACGTGCG CGTGCCCGAG GTGGCCCGCC ACTTCGGGCT CAGCGAGAAG CAGGTGGTCG CCGACCTCAG CCTGCTGTGG ATGTGCGGCC TGCCCGGCTA CACCCCCGGC GACCTCATCG ACGTCGACCT CGACGCGGCC AGGGAGACCG GCGAGATCAT CATCGCCAAC GCCGACACCC TCGCCCAGCC GCTGCGCCTG ACCGCCGACG AGGCGGCCAG CCTGGTCGTG GGCCTGTCCC TGCTGGAGGC GCTGCCCGAG ACCGAGGGCC TGGAGACCGG CGCCCTCAAG CGGGTGGGCG AGAAGCTGCG CGCCGCCGCG GGGGCCGCCG TGGGCTCGCT GGCCGACTCG GTCCAGGTGC GCGTGGAGGG CGACGAGCAG GTCGCCCAGA CCCAGCGCCG CTGCGCCGAC GCGCTGGAGG CGGGCCAGCG CCTGCACCTG CGCTACCTGT CGGGCTACCT GGACCAGGTC ACCGAGCGCG ACGTGGACCC CATGGGGCTG GTGGTCCAGG ACGGCTACCC CTTCCTGGAG GGCTACTGCC ACCTGCGCCG GGACGTGCGC CTGTTCCGCC TGGACCGGGT CCTGGAACTG ACGGTGCTGC CCTTCGCCGC CGAGGTGCCC GCGGGGGTGG GCCGCCGCGA CCTGTCCGGG GGCGTCCTCC AGCGCTCGGA GCGCGACGCC CTGGTCGTCC TGGACCTGGA GCCCGCGGCC CGCTGGGTGA CCGAGGACTA CGTGTGCGAG TCGGTGACCG AACTGCCCGG GGGAGGGGTC CGCGCCACTC TGCGCACTCC CGCTCCGGCC TGGGTGGTGC GGCTGGCGCT GCGTCTGGGG CAGCAGGGGC GCGTGGTGTC TCCTGTGGCG CTCGCCGAGG AGGCCCGGGC CGAGGCCCGC CGTGCGCTGG AGCACTACCG GACGTCCCAA ACCTCCGCGA AATTGGTCGA GACCACTTCG TCGTCCGAGT GA
|
Protein sequence | MSRRKTERQL NLVICLLSTR RHLTAQEIRK TVHGYAEAET ESAFKRMFER DKRELRDSGI PIQVGTGDAI SGEEGYRISR SDYELPEIEL LPDEALVLGL AARAWRHASL GEAAANALFK LRSAGVPVDS EAAPGLTPAM RTDEPSFGPV WQAVRDRRPV SFAYRKPGQT EAETREVEPW GIVNLRGHWY VVGHDRLRDA RRVFRLGRIR GEVTILRGGP DVVVPEGVDL RSVVAGHSAE SERTAVLRVR TDSAHALRRS ALRVVPGERD SSGVGWDTLT LPYTDTPDLV RRVAYFGSNA VVVEPVGARE AMAAHLTATA ETAPAAEAAD GPGTVEETEA PVRGPRSADQ LRRLLMLVPY ALGRDVRVPE VARHFGLSEK QVVADLSLLW MCGLPGYTPG DLIDVDLDAA RETGEIIIAN ADTLAQPLRL TADEAASLVV GLSLLEALPE TEGLETGALK RVGEKLRAAA GAAVGSLADS VQVRVEGDEQ VAQTQRRCAD ALEAGQRLHL RYLSGYLDQV TERDVDPMGL VVQDGYPFLE GYCHLRRDVR LFRLDRVLEL TVLPFAAEVP AGVGRRDLSG GVLQRSERDA LVVLDLEPAA RWVTEDYVCE SVTELPGGGV RATLRTPAPA WVVRLALRLG QQGRVVSPVA LAEEARAEAR RALEHYRTSQ TSAKLVETTS SSE
|
| |