Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3549 |
Symbol | |
ID | 9247418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4258857 |
End bp | 4259870 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | transcriptional regulator, AraC family |
Protein accession | YP_003681456 |
Protein GI | 297562482 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.757715 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.622262 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAGA ATCAGGCCAT GAGGATTTTC TCGCGGCCCG GCCGCCACCA CGTGGCCGTC CTCGTCCGGC ACGGCCTGCT GCCCATCGAG GCGGGCATCG TCCACCGCCT GTTCGGCCAG GCCCGGAGCG CCGACGGGGA GCTGCTCTAC GAAGTCGTCA CCTGCGCGCT GGAACCGGGG GAGATCAGCA CCGACACCGA CTTCACGATC AACGTGGCCC ACGGACCGGA GGCCCTCGAC GAGGCGGACA CGGTGATCCT GCCCGCCGCC GACGAGGACT ACGGCGAACG CCCGCACGCC CCCCTCGCGC CGGCTCTGGC CGCGGCCGTC GCGCGCATCC CGCCGAACGC GCGCGTGGCC TCGATCTGCA CCGGCGCGTT CGTGCTCGCC GCGGCCGGGC TCCTGGACGG GTGCCGCGTG ACCACCCACT GGAAGTCCGC GGGCTACTTC CGCGCCATGT ACCCCGGCAT CGACCTGGAC CCGGACGTGC TGTACACCGA CAACGGGCGT GTGCTGACAG CTGCCGGGGT CGCCTCGGGC ATCGACCTGG GCCTGCACAT GATCCGGCTC GACCACGGCG CCGCCGTGGC CAACGAGGTG GCGCGCAGCA CCGTCGTCCC GCCCCACCGC GACGGCGGCC AGGCCCAGTA CATCCGCCGT CCCGTGCCCG CGCCGGAGCG TGCCGCGACG GGTCGGGCGC GCGCCTGGGC CCTGGAACAC CTGCACCTCC AGCCGGGCCT GCGGGAGATG GCCCTCCAGG AGGCCGTGAG CGTGCGTACC CTCACCCGCC GTTTCCGCGA CGAGGTGGGG CTCTCGCCCG GCCAGTGGGT CGCCCGGCAG CGCCTGGACC GGGCCCGGCA GCTGCTGGAG GAGTCGGACC TGCCGGTGGA CAGGGTCGCG CACGAGGCGG GTTTCGGCAC GGCGGCGTCG CTGCGCCAGC ACATGCACGC CGAACTGGGC GTGTCCCCCA GCGCCTACCG GCGCACCTTC CGGGGCGCAC CGGACCCGGC CTGA
|
Protein sequence | MRKNQAMRIF SRPGRHHVAV LVRHGLLPIE AGIVHRLFGQ ARSADGELLY EVVTCALEPG EISTDTDFTI NVAHGPEALD EADTVILPAA DEDYGERPHA PLAPALAAAV ARIPPNARVA SICTGAFVLA AAGLLDGCRV TTHWKSAGYF RAMYPGIDLD PDVLYTDNGR VLTAAGVASG IDLGLHMIRL DHGAAVANEV ARSTVVPPHR DGGQAQYIRR PVPAPERAAT GRARAWALEH LHLQPGLREM ALQEAVSVRT LTRRFRDEVG LSPGQWVARQ RLDRARQLLE ESDLPVDRVA HEAGFGTAAS LRQHMHAELG VSPSAYRRTF RGAPDPA
|
| |