Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4554 |
Symbol | |
ID | 9248435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5395950 |
End bp | 5397005 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | transcriptional regulator, AraC family |
Protein accession | YP_003682447 |
Protein GI | 297563473 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCCTT TGAGCGACAT GCTCAGCGGC ATCCGCTCCG CCGGAGCCTC CGTCAGCCAG GCCTCGCTGG AACCGCCCTG GACGATCCGC TTCACCGGCG GCGCGCCGCT GACCATGCTG ACGGTGATCA GCGGCGGCGG TTCGGTGGTC ATGTCCGACG GGACGGCCCT CGCGATCAGC GCGGGCGACA CCGCGCTCGT GCGCGGCCCC GACCCCTTCC ACCTCGCGGA CAGCCCCGCC TCGCCGCGCC GCCCCCACCG GGACCACGAG ATCGACTGCC TCGACCCGGA CGCCGGGGGC GGGCCGGTCA GCGCTCACCC GGTCGGTCTC CCCCGGCTCG ACGGCCTCCG GGACGGCGGC CCCGGCGGCG GACTCCCCAC CGGGCTCCCG GACGGCGGCC TCCCCGACGG GGCTCCCGAG GGCGCCACCA CCCTCCTCGT CGCCGCCTAC CGGGCCACCC GGAGCCGCCA CGAGCGGCTG CTGCGCACCC TGCCGCGCAC GCTGGTCCTC ACCGAGGACG CCGAGAGCGT CTTCTGGCTC GAAGCGGCGA GGGACGCCCT GTCCCGGCGC CACCTCCCGG GCGGCCAGGC CCTGGTCGAC CGCGTCGTGG ACATGGGGCT GGTGTGCAGC CTGGCCTGCT GGTTCGAGCA GGAGGGCGCC GACGCCCCGG CCTGGTACCG GGGGGCCGTG GACCCGGTGA CCGGTCCCGC TCTGGAGGCC GTCCACCGGC GTCCGCACGA GCCATGGACG GTGGGCGCGC TGGCCGCCCG GGCGGGGGTG TCGCGCGCGC TGTTCGCCAA GCGCTTCACC GAGGTCGTGG GCCAGTCGCC CCTGGGCTAC CTGACCGAGT GGCGGATGTA CACCGCCGAG GAGCTGCTGT CGGACCCCGA CCTGAGCGTG GCGAAGGTGG CCGGGGCCGT GGGCTACGCC GACCCCTCCT CGTTCAGCAC GGCCTTCAAA CGGCTGCGGG GGCTGAGCCC GCGCGAGTTC CGCCTGCGGA ACCTGCTGCC CGAGACCGGC ACCGCCCCTG GAACCGGCGC CGGGTCCAGC CCCTGA
|
Protein sequence | MDPLSDMLSG IRSAGASVSQ ASLEPPWTIR FTGGAPLTML TVISGGGSVV MSDGTALAIS AGDTALVRGP DPFHLADSPA SPRRPHRDHE IDCLDPDAGG GPVSAHPVGL PRLDGLRDGG PGGGLPTGLP DGGLPDGAPE GATTLLVAAY RATRSRHERL LRTLPRTLVL TEDAESVFWL EAARDALSRR HLPGGQALVD RVVDMGLVCS LACWFEQEGA DAPAWYRGAV DPVTGPALEA VHRRPHEPWT VGALAARAGV SRALFAKRFT EVVGQSPLGY LTEWRMYTAE ELLSDPDLSV AKVAGAVGYA DPSSFSTAFK RLRGLSPREF RLRNLLPETG TAPGTGAGSS P
|
| |