Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2662 |
Symbol | |
ID | 9246513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3170537 |
End bp | 3171667 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | putative transcriptional regulator, PucR family |
Protein accession | YP_003680585 |
Protein GI | 297561611 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.384645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.292926 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAAAA CAGCGCCGTC GGACGTCATC CCCCGAGCCG CACAGGCGTG CCTGGAGGAG CTGGAGACCG TCGCCTGGAC CTACGTGCGC AGGGTCCGCG AGCTGACCGG GTACGCCGAG ACCGTCATCG ACGACGCCGA GCTGTACGGC ACGGCCCGGG CCACGCTCGA ACTCCTCCTG GAACTCCTGC GGGGACGGGA CCGCCACGGC GAGCTGCGCG CCCACTCGAT CGAGGTCGGA CGCTCCCGCG CCCGCCGCGG CATCCCCCTG GAGTCGCTGC TGCGGGCCGT GCGCATGGAC TTCCGCTTCC TCTGGGAGGC GATGCGCGCC CACGTCGCCG AGGCCGACTT CCGCGACTTC TCCGAGGAGG TCATCTCCAT CTGGGAGGCG GTGGAGGTGC ACACCACCCA CGTGCAGACC GGCTACACCG ACGAGATCGC CCGGATGCGC GCGGAACTGG AGCTGGAGCA CGCCTTCCTC CTGCGGCACC TGCTGAGCGG TTCCGGCGGG GACCCGCGCC TGAACGAGCA GGCGGCGCGG GCGCTGGGCC TGCGCCCCGA CGGCGTCCAC CTCGTCCTGG TGGCCAACGG CAACCACGCC CGCGACTTCC GGGGCTGGGT CGGCCAGACG TTTCCCCGGG CCGTCCTGCT CCGCCTGGAC GGGGTGGAGT TCGCCATCGT CCCCGCGGGG GACGCCGCGG GGGCCGCCCG GGAGACGCTG CTGGGCAAAC CCGTCGGGAT CTCCCCGAGC GCCCACGGCA TCGGGGAGAT AGCCCCCATG TGGCGGCTCG CCCGCGAGCT GGCGGAGTGG GCGCGGCCCG GCGCGGCGGC CACGGTCGAG GGCCACTGGA CCCGGCTGGC GGGCGCGCGC CTGGGCCCGG CCGCCGGGGC CTTCGCCCGC GACGTGCGGG AGGCGCTCGG CGGGTTCACC GAACGCGAGG TCGATCTGCA GGTCGAGACG GTCGAGGCCT ACTACCGGAC CGGTTCGGTG ACCGAGGTCG CGCAGGCGAT GTTCTGCCAC CGCAACACGG TGATCAACCG CCTGCGCCGC TTCGCCGAGG CCACCGGGCT CGACGTCACC AGGCCGGTCG ACGCGTCCGC GGCCCATCTC GCCCTGACCG TCCTGCGCTA G
|
Protein sequence | MHKTAPSDVI PRAAQACLEE LETVAWTYVR RVRELTGYAE TVIDDAELYG TARATLELLL ELLRGRDRHG ELRAHSIEVG RSRARRGIPL ESLLRAVRMD FRFLWEAMRA HVAEADFRDF SEEVISIWEA VEVHTTHVQT GYTDEIARMR AELELEHAFL LRHLLSGSGG DPRLNEQAAR ALGLRPDGVH LVLVANGNHA RDFRGWVGQT FPRAVLLRLD GVEFAIVPAG DAAGAARETL LGKPVGISPS AHGIGEIAPM WRLARELAEW ARPGAAATVE GHWTRLAGAR LGPAAGAFAR DVREALGGFT EREVDLQVET VEAYYRTGSV TEVAQAMFCH RNTVINRLRR FAEATGLDVT RPVDASAAHL ALTVLR
|
| |