Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5201 |
Symbol | |
ID | 9249094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 346615 |
End bp | 348801 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | protein serine phosphatase with GAF(s) sensor(s) |
Protein accession | YP_003683087 |
Protein GI | 297564114 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.171311 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.162808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCAGCAC ACGACGCGTC GAAGGTCGCT CGACGAGAGT TCCCTCCCGC CCCGGAGACC GCGGCGGCGG CCCGCGAGTT CGTTCACGAC ACCCTCCTGT CCTGGGGTGT GACCGACCCC TCCGACGACG TGATCCTGCT GGTGAGCGAG CTGGTCACCA ACGCCGTCAT CCACGCGCGC TCGTCCCTGG ACGTGACCGT GCGCCGGTCG GAGGGCGCGA CCGAGGTGAT GGTCACCGAC TCCGTGCCCG AACGCGCGGT CCCGCAGGCC GGTCCCCTGT CGGTGGACAC CTCCCCGTCC CGGGAGAACG GGCGCACCGG CGGTCTGGGG CTGGCCCTGG CGTCGGCGAT CGCGTCGAGC TGGGGCGTGA GCTACGGGCG CAACGACAAG GCCGTGTGGT TCCGGATCGA GGACGACTCG GGGGAGCGCG CCTCCCTGGA GTCCCCGCCG GTGCGCAGGA GCCCCTCCCG TACCGGGCGC CCCGCCCCGT GGTCGGCGCT GGACTCCGCC CTGGGCTCGC GGCTGTCCCT GCCGCAGCTG CTGGAGCGCA CCGTGGAGCA CTCGGCCACG GCGCTGGGCG CGGACGCGGC CTACATCACG CTGGCCACGT CCGACGAGAC CATGTGGGAG GTGCGCGCCG CGGTCGGGCT GACCCCGGGC GGCGCCCCGT GGCGGCCGCT GCGCGTGCGC ACGGAGGAGA TCTTCCCGTC GTCGGCGCCG GAGCCGGGCG CGGTGATCAA CGACGACCTG ATGATCGCCC GCGCCAGCCG GGGGCGCCTG TCGCGGGCGG GGATGCGTTC GCTGGTCACG GCGCCGCTCA TCGTGGACGG GCGGGTGGTC GGCCTGCTGG GGGTGGCCTC CCGGCGGGCC CGGCACTTCG GGGCGTCGGC GGCCAAGCGG CTCCAGGAGG GCGCGAACCT GATCGCGCTG CCGGTGGAGA GGGCCCGGCT GGCCGAGGTG GAGCTGAGCA GGCGGGCCTC GCTGAGCTTC CTGGCCGAGG CCAGCGACCT GCTCGCGGGC ACGCTGGACG AGCGGATGAC GGGCGCGCTG GCGGCGCAGC TGATCACCTC GCGGCTGGGC CGGTGGTGCG CCATCGAGAC GATCAACGAG ATGGGCGTCT CGCAGCTGAC GCACGTGGTG CACAGCGACG AGAACTACAA CGACATCCTG CGCGCGCTGC TGACCGGGCT GCCGCCGCAC GAGCAGAGGG AGCCGCAGCC GCTGTGGTCC CCGGCCGAGC TGGCCAAGGA GGGGCTGGAC GAACAGCTCG TCCGGGAGCT GGCGAGCGGT CCGGCGATCT GCGTCCCGCT GGTGGCGCAC GGGCGCGCGC TGGGGCGGAT GACGATCGGC AAGAACGAGT CGGACGACTT CACCCGCGAG GAGGTGGACG TCGCCGACGA CCTGAGCAGC CGGGTGGCGT CGGCGATGGA GAACGCGCGG CTGCACGAGA AGCAGGCCGC GATGAGCGAC GCCCTCCAGC GCAGCCTGCT GCCGGCCAAG GAGAAGGAGC CGGTGATCCC GAACGTGGAC CACGCGGTGT TCTACCGTCC GGCCGACGAG CGCAACGTGG TGGGCGGGGA CTTCTACGAC GTGTTCGCGG CGAGCGGCCG GTGGTGCTTC GCCATCGGCG ACGTGTGCGG GACGGGCCCG GAGGCGGCGG CCGTGACCGG TCTGGCGCGG CACACGCTGC GGGCGCTGGC CAAGGAGGGG TTCACGCCCT CGCACATCAT GCAGCGGCTG AACATGGCGA TCCTGGACGA GAACACCTCG ACCCGGTTCC TGACGATGCT GTACGGGGAG ATGACCCCGG CCACGGACGG CGGGGGCGGG ATGCGGCTGC GGATGGTCTC GGCCGGGCAC CCGCTGCCGC TGCGGCTGAA CCAGAAGGGC GAGGTACTGC CGTTCGGCGC CTCGCAGCCG CTGCTGGGCG CGTTCGAGGA CGTGGGGTTC ACGACCGAGA ACGTGGACAT CCGCCCCGGC GAGGTGGTGC TGGCGGTGAC CGACGGCGTG ACCGAGCGGC GCAGCAACAC GGACATGCTC GGCGACGAGG GGCTGATGGA GATCTTCTCC GGCTGCGCGG GGCTGACCGC CCAGGCGGTG ATCAGCCGCA TCGACCGGGA GCTGGAGGAG TACGCCCCGG GCGGGCGCAG CGACGACACC GCGATGCTGG TGCTGCGCTT CCTCTAG
|
Protein sequence | MPAHDASKVA RREFPPAPET AAAAREFVHD TLLSWGVTDP SDDVILLVSE LVTNAVIHAR SSLDVTVRRS EGATEVMVTD SVPERAVPQA GPLSVDTSPS RENGRTGGLG LALASAIASS WGVSYGRNDK AVWFRIEDDS GERASLESPP VRRSPSRTGR PAPWSALDSA LGSRLSLPQL LERTVEHSAT ALGADAAYIT LATSDETMWE VRAAVGLTPG GAPWRPLRVR TEEIFPSSAP EPGAVINDDL MIARASRGRL SRAGMRSLVT APLIVDGRVV GLLGVASRRA RHFGASAAKR LQEGANLIAL PVERARLAEV ELSRRASLSF LAEASDLLAG TLDERMTGAL AAQLITSRLG RWCAIETINE MGVSQLTHVV HSDENYNDIL RALLTGLPPH EQREPQPLWS PAELAKEGLD EQLVRELASG PAICVPLVAH GRALGRMTIG KNESDDFTRE EVDVADDLSS RVASAMENAR LHEKQAAMSD ALQRSLLPAK EKEPVIPNVD HAVFYRPADE RNVVGGDFYD VFAASGRWCF AIGDVCGTGP EAAAVTGLAR HTLRALAKEG FTPSHIMQRL NMAILDENTS TRFLTMLYGE MTPATDGGGG MRLRMVSAGH PLPLRLNQKG EVLPFGASQP LLGAFEDVGF TTENVDIRPG EVVLAVTDGV TERRSNTDML GDEGLMEIFS GCAGLTAQAV ISRIDRELEE YAPGGRSDDT AMLVLRFL
|
| |