Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1913 |
Symbol | |
ID | 9245763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2332906 |
End bp | 2333952 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003679846 |
Protein GI | 297560872 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0831866 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.645841 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTCA CCCGCGGACG CGGCGTCCTG AGCGCCGCCG CGATCACCTC CGTCCTCGCC ATGACCGCCT GCGGCAACGC CGACCTGCCC CCGGCCGCGG CCACCGGCGA CGGCGGCACC GTCATCACCT ACAACTCCCC CGCCGAGTGG GGCAACTACG GCGAGGTCCT GGCCGCCTTC ACCGAGCGGA CCGGGATCCA GGCCCCCAAC GACCCGAAGA ACTCCGGGCA GGCCCTGGCC GCGCTCCAGG CCGAGAAGGG CGCGCCCGTC GCCGACGTCG CCTACACCGG CATCGCCTTC GCCGGACAGC TCGTGGAGGC CGGGGTCCTG CAGTCCTACG TGCCCGAGGG CGCGGAGGAG GTCCCCGAGG ACCTGCGCGA TCCCGACGGG AACTGGACGG CCGTCCACAC CGGCACCATC GCCTTCATCG TCAACGAGGA CCACCTGGAC GGCGCGCCCG TGCCGAGCAG CTGGGAGGAC CTCCTCGACC CCGCCTACGA GGGCAAGGTC GGCTACCTCG ACCCCACCCA GGCGGCCGTG GGCTACTCCG CCGCGACGGC GGTCAACCAC GCGCTCGGCG GCGACCTGAC CGACTGGGGG CCGGGTCTGG ACTACCTGGC CGAGCTGAAG GAGAACGGCG CCTCCACCTC CGCCCAGACC GCCACCGCCA AGGTCGCCCA GGGCGAGATC CCCATCCTCA TCGACACCGA CTTCAACGGC TACAAGCTCC GCGACGAGGG CGCCGACGTC AGCGTCGTCA TCCCCGAGGA GGGATCGCTG CAGATCCCCT ACATCGTCGG CCTGGTCGAG GGCGCCCCCA ACGCCGACAA CGGCAGGGAG CTGCTGGACT TCTACTTCTC CGAGCAGGGC CAGGGCCTCT TCGCGGACGG TTACATGCGC CCGGTGGTCG GCCAGATGCC CGAGGAGCTC GCCGACCGGG TCCTGCCCGA GTCCGACTAC GAGCGCGCGA TCACCATCGA CTACCTCGAA CAGGGCGAGC GGCAGCAGGA GTTCATCGAC CTGTACCAGA GCGAGGTCGG CTTCTAG
|
Protein sequence | MTLTRGRGVL SAAAITSVLA MTACGNADLP PAAATGDGGT VITYNSPAEW GNYGEVLAAF TERTGIQAPN DPKNSGQALA ALQAEKGAPV ADVAYTGIAF AGQLVEAGVL QSYVPEGAEE VPEDLRDPDG NWTAVHTGTI AFIVNEDHLD GAPVPSSWED LLDPAYEGKV GYLDPTQAAV GYSAATAVNH ALGGDLTDWG PGLDYLAELK ENGASTSAQT ATAKVAQGEI PILIDTDFNG YKLRDEGADV SVVIPEEGSL QIPYIVGLVE GAPNADNGRE LLDFYFSEQG QGLFADGYMR PVVGQMPEEL ADRVLPESDY ERAITIDYLE QGERQQEFID LYQSEVGF
|
| |