Gene Information |
Locus tag | Ndas_1588 |
Symbol | |
ID | 9245438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1942656 |
End bp | 1944479 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003679523 |
Protein GI | 297560549 |
COG category | |
COG ID | |
TIGRFAM ID | |
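
The coordinates and lengths listed above can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1942656
end_bp = 1944479
gene_length_bp = 1824
protein_length_aa = 607

# Assumption: coordinates are 1-based and inclusive on both ends.
computed_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
computed_protein_length = computed_length // 3 - 1

print(computed_length, computed_protein_length)
```

Under these assumptions the computed values agree with the listed gene length and protein length.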
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.127314 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACG ACGGCTGGCC GCTGGAGACC GGCCTGCTGC GCGCGGTGTT CGACAGCGTG GGCGCGGGGA TGTTCGTCAT CGACACCACC GGCCGGATCA CCGCCGCCAA CCCCTACGCG CAGCACATCC TCGGCCGGCC CAGGGAGCGG ATGCTGGGCC TGGACCTGCA CGATCTGCTG CACCGCGACG CGGAGGGGGA GAGGAGCTCC AGGGAGGACT GCCGCGTCCT GGCCGTGCTC GCCAGCGGCC ACCCGGCCGA GGGGAGCAGC GAGTCCTTCC TGCGCGGGGA CGGCACGCTC GTGCCCATCA TCTGGGCGAC CACCCCGCTC CAGCACGAGG GACGCCTGGA GGGCGCGGTC ATCGTCTTCC ACGACTTCCG GGCGCACCGC GACGCCGCCG AGCAGACCGC GGCGCACCTG GCGGCCCTGG AGGAGCTCAC CGGCCGGTTG AGCCTGGTCG CGGAGATCTC CGTGGTCCTG GTCTCCACCC TGGACGTGCC CGAGATGCTG GACAGGCTGG TCCGGCTGCT GGTGCCGGAA CTGGGGGACT GGGCGGTGGT CGACATGGTC ACCGACGGGC GCACCAAGGA GGTGGAGCGC GTCGCCGTGC ACGCCCTCGG CGACCAGGAC ACGGCCGAGG CGCTCAAGGG CACGCTGCCG CCGCTGTCCT CCGACGCCCA CGCGGCGCTG ATGCGCGCCC TGTACAGCAC CCAGCCCGTC CTGTTCGTCG CCGGTGACCT CGCCCGTGAA CCGGACGAAC CGCTCGCCCG GGCGCACCGG CGGCTGTTCG ACCGCCTGGG CGGGCACTCC GCGGTGGTGG TGCCGCTGCA CACCCGCAGG CAGGTCTTCG GGGCGCTGAC CGTGGCCCGC ACCGGGGAGC GGCCCGCCTA CACCGACGCC GAGATCCTCG TGCTGGGGGA CGTGGCCCGG CGCGCCGGTC TGGTGATGGA GAGCGCCCAG CTCTTCGCGC AGCAGCGCCA CGTCGCCGAG ACCATGCAGC GCCAGCTCCT GACCCCCCTG CCGCAGGTCG ACCACCTGCG CTTCGCCGCC CGCTACCAGC CCGCGCAGCA GGCCGCCGAG ATCGGCGGGG ACTGGTACGA CGCCTTCCTG CTCGCCGACG GCGTCATGAC CATGGTCATC GGCGACGTGG TCGGCCACGA CCTCCAGGCC GCCGCGCACA TGGCCGAGGT CCGCAACATG CTGCGCGCCC TGGCGTGGGA CCGCCAGGAA CCGCCCAGCC TGATCATGCG GCGGCTGGAC GAGGCCATGA CCAACACCAG CGACGCCCCC ATGGCCACCG CCGTCTTCGC CCGTATCGAG GGCCCCGAGG GCGGGCCCTG GTGTCTGCAC TGGGTCAACG CCGGACACCC CCCGCCGCTG CTCGTCACCG CCGACGGCCG CACCCGCTAC CTGGAGGACG GCCACGGACC CCTGCTGGGC ATGAGCGCCG CCCTGCACCT GGGACTGGAC TGGCCCGACG CCCGCGAGGA GATCCCCGCG GAGTCCACCC TGCTGCTGTA CACCGACGGC CTCGTCGAGA GCCGCGACCA CCCCATCGAC ACGGGCCTGG CCAACCTGCG CCGCCACGCC GCCGCCCTGG CCCGCCGCGA CGTCGAGGAC TTCTGCGACG AACTCCTCGC CCGCATCTCC CCCCGCGGTG ACGACGTCGC CCTGCTCGCC CTGCGCATCC CGGCGGCGGG AGAGGGGGCG GGCGAGGACA CCGCGCCGCC CCAGCACGCG CACAGCCCCG CCGCGCCCGA CCGCGGGGCG CCCGGCGCGG CAGTCGAGGA CAGCCCGGTC 
AGGGACACCT CCGAGCGGGG GTGA
|
Protein sequence | MSDDGWPLET GLLRAVFDSV GAGMFVIDTT GRITAANPYA QHILGRPRER MLGLDLHDLL HRDAEGERSS REDCRVLAVL ASGHPAEGSS ESFLRGDGTL VPIIWATTPL QHEGRLEGAV IVFHDFRAHR DAAEQTAAHL AALEELTGRL SLVAEISVVL VSTLDVPEML DRLVRLLVPE LGDWAVVDMV TDGRTKEVER VAVHALGDQD TAEALKGTLP PLSSDAHAAL MRALYSTQPV LFVAGDLARE PDEPLARAHR RLFDRLGGHS AVVVPLHTRR QVFGALTVAR TGERPAYTDA EILVLGDVAR RAGLVMESAQ LFAQQRHVAE TMQRQLLTPL PQVDHLRFAA RYQPAQQAAE IGGDWYDAFL LADGVMTMVI GDVVGHDLQA AAHMAEVRNM LRALAWDRQE PPSLIMRRLD EAMTNTSDAP MATAVFARIE GPEGGPWCLH WVNAGHPPPL LVTADGRTRY LEDGHGPLLG MSAALHLGLD WPDAREEIPA ESTLLLYTDG LVESRDHPID TGLANLRRHA AALARRDVED FCDELLARIS PRGDDVALLA LRIPAAGEGA GEDTAPPQHA HSPAAPDRGA PGAAVEDSPV RDTSERG
|
| |
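
The gene and protein sequences above can be spot-checked against each other by translating the first and last few codons. The following is a minimal sketch, not a full translator: `CODONS` is a hypothetical, deliberately partial lookup table holding only the codons that occur in the two fragments, and `translate` is an illustrative helper name. The codon assignments used here are the same in the standard code and in translation table 11.

```python
# Partial codon table: only the codons needed for the two fragments below.
CODONS = {
    "ATG": "M", "AGC": "S", "GAC": "D", "GGC": "G", "TGG": "W", "CCG": "P",
    "AGG": "R", "ACC": "T", "TCC": "S", "GAG": "E", "CGG": "R", "GGG": "G",
    "TGA": "*",  # stop codon
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    # Walk the fragment codon by codon, ignoring any incomplete trailing codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODONS[dna[i:i + 3]]  # raises KeyError for codons outside the partial table
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First 21 bases and last 24 bases of the gene sequence listed above.
head = translate("ATGAGCGACG ACGGCTGGCC G")
tail = translate("AGGGACACCT CCGAGCGGGG GTGA")
print(head, tail)
```

The two fragments correspond to the start (`MSDDGWP`) and end (`RDTSERG`) of the listed protein sequence. A complete check would need the full 64-codon table for translation table 11.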