Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4547 |
Symbol | |
ID | 9248428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5389225 |
End bp | 5390190 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | transcriptional regulator, LuxR family |
Protein accession | YP_003682440 |
Protein GI | 297563466 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGACC CGAGCCGATG GGAAACCGGC CTGCCCGCCC TCGTGCTGGC CGCCGCCGCC CTGCTCGGCG GCGCGGCGGG TCTGCTGATC GCGCCCCACG CCGCCGCCGT CGTCCTGGCC TCGACCCTCG TGCAACTGGG CGCCACCGCG CTGGGCTGGC GGCTCACCGG CCGTTCCGCG GTGGTGACCG GCGTGGTCGC GGTCGTGACC CTGGGCATCG CCCTCGAACC CACTCTCCGC GAACTGCTGC GGCTGGGCGC GGTTCCCTGG ACCGTCCTGG CGCTCGCCTT CGGCACCCTC AACCTCGTGC GCCGGGCCCG GAGCACGACC GAGCTGGTGT CCGGTGTGGC GCTGTCGTCG GCGGCCGCGG CCGTGTCGGC GGTCGTCACC GCGCAGGGCG GAGCCCCCCT GGTGCCGAGC CTGCTCGCCT CGTCGGTTCC GGTCCTGGGC GGTGCCCTGG TGGCCACCGT GCAGCGGCTC AACGAGGCAC GCCGGGACAG GCTGTCGCGC CCCGTCGTCC CGCCGATCTC GACCGGAGCC GACACGGGGG ACGCCCGGCA GGCGCTTCTC CTGGTGGCGC TGCGGGCGGA GAGCATGTTG ACGGCCGCGC GCGACGCGAC CGCCGCGGCC CAGGCACGCG ACCTGCGCAC GGTCGCACTA CAGGGGCTCA ACGTGCCCGG CGCCGCCTCC GGGCCGAGGC TGACCGTGCG CATCGACGTG GCGGCCGCCG GAGCGGAGGG GGAACAGGAG GCCCCCGAAC CCGACAGGGT GGAGACGCCG GCCCTGTCGG AGCGCGAGGG CGAGATCGCG CGCCTGCTGA CCACGGGCGC CTCCAACGCG CGGATCGCCC GCGAGCTCTA CCTGAGCGAG GCCACCGTCA AGGGGCACGT GTCCCGGATG ATGCGCCGTT TCGACTGCGG CAACCGCACC CAGCTCGCCC TGATGGCCGC CCGGTGGTTC GGCTGA
|
Protein sequence | MTDPSRWETG LPALVLAAAA LLGGAAGLLI APHAAAVVLA STLVQLGATA LGWRLTGRSA VVTGVVAVVT LGIALEPTLR ELLRLGAVPW TVLALAFGTL NLVRRARSTT ELVSGVALSS AAAAVSAVVT AQGGAPLVPS LLASSVPVLG GALVATVQRL NEARRDRLSR PVVPPISTGA DTGDARQALL LVALRAESML TAARDATAAA QARDLRTVAL QGLNVPGAAS GPRLTVRIDV AAAGAEGEQE APEPDRVETP ALSEREGEIA RLLTTGASNA RIARELYLSE ATVKGHVSRM MRRFDCGNRT QLALMAARWF G
|
| |