Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1553 |
Symbol | |
ID | 9245403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1903193 |
End bp | 1904956 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003679488 |
Protein GI | 297560514 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.525667 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.106292 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGGCA GTTCCTTGAG GAAACGCACC CTCGGCTTCG TCGCGCTCGG CGCGGCGGCT TCCATCCTCC TTTCGGCGTG CGGGGGCGGT GGCTCCAACG GCGGAGACTC CGCGTCCGAC GCCGCCTTCG ACCAGGGCTC GACCGAGGTC GTCAACGCGT CCGACCAGAC CGGCGGCACC CTGCGCTACG CCATCTCCTC GGACTTCGAC TCCACCGACC CGGGCGACAC CTACTACGGG TTCAGCTGGA ACTTCACCCG CTACTACGCC CGTACGCTGC TGGCCTTCAC CCCGGCGCCG GCCCAGGAGT CCACCGAGCT GACCACGGAC ATGGCCGCTG GCATGCCCGA GCCCAACGAG GACTTCACCG AGTGGACCAT CAAGATCCAG GAGGGCCTCA AGTACGAGGA CGGCTCCGAG ATCACGGCGC AGGACATCAA GTACGCCATC GCGCGCTCGA ACTACAACGG CGGTGAGCTG CCCAACGGTC CGCGCTACTT CGAGGCGCAC CTGGACCAGG AGCCCTTCAA CGTCTACGAG GTCGACGACC CGCTGGAGAC CTTCACCGCG GTCGAGACCC CGGACGACTA CACCCTGGTC TTCCACCTCA AGGACCCCTT CTCCGAGTTC CCCTACGTGC TGACCCAGCC GCAGACCGCC CCGGTGCCCG TGGAGGCCGA CCGCGGCGCG CTGTACAAGG AGAAGGTCCT CTCCTCGGGC CCGTACAAGT TCGAGGGCAA CTACGAGCCC GGCGTCCAGC TCAACCTGGT CCGCAACGAG CACTGGGACG CCGAGACCGA CCCGATCCGC CCGGCCCTGC CGGATGAGGT CACCGTCCAG ATCGGCATCG ACCAGGACGA GATCGACCAG CGCCTGGTCA ACGGCGACCT CGACGTCGAC CTGTCCGGCG TCGGCGTCGG CCCGGCGATG AAGAGCAGCC TGCTCACCGA CGAGGACGCC CAGGCCAACC TGGACAACCC GTACTCGGGC GCCCTGCGCT ACGTCAACAT CCACACCCCG GTCATCGAGG ACGTGGCCTG CCGCCAGGCG ATCATGTACG CGGCCGACCG GGACAGCCTG CACCGCGCCT GGGGCGGCGA GACCGGCGGC GACATCGCCA CCAACCTGCT CCCGCCGACC ATCCAGGGCT CGAACCCGGA GTCGGACCTG TACCCCTCCG ACGACGACAA GGGCGACCTG GCCGCCGCCG AGGCCAAGCT GGAGGAGTGC GGCGAGGCCG AGGGCTTCAG CACCACCATC GCGGTCCGCG ACGGCCGTCC CAACGACATC GCCACCGCGG AGTCCCTCCA GGAGTCCCTC AAGCGCGTGG GCATCGAGGT CGACATCCAG ACCTTCCCGG CCGAGGACTT CTTCGCCCAG TACGCGGGCT CGCAGGACTA CGTCCGTGAG AACAACATCG GCCTGAGCGT CTCCGGCTGG ATCCCCGACT GGGCCACCGG CTACGGCTTC GCCTCCAAGA TCACCGACGG CGACGCCATC CAGGCCACGG GCAACTACAA CACCTCCGAG CTCGACGACC CGGAGATCAA CGCCCTGTGG GACGAGGCGC TGGCCACCGA GGACCCGGAC GAGCGCGCCA GCATCTACGA GCAGATCGAC ACCCTGGTCA TGGAGCAGGC GGCCATCCTG CCGGTCGTCT TCGACCGCGC GCTGTTCTAC CGCTCGGACG AGCTGACCAA CGTCTACTAC ACGTCCTCGT ACGCGATGTA CGACTTCATG GCCCTGGGCG TGGACCGGGG TTAG
|
Protein sequence | MRGSSLRKRT LGFVALGAAA SILLSACGGG GSNGGDSASD AAFDQGSTEV VNASDQTGGT LRYAISSDFD STDPGDTYYG FSWNFTRYYA RTLLAFTPAP AQESTELTTD MAAGMPEPNE DFTEWTIKIQ EGLKYEDGSE ITAQDIKYAI ARSNYNGGEL PNGPRYFEAH LDQEPFNVYE VDDPLETFTA VETPDDYTLV FHLKDPFSEF PYVLTQPQTA PVPVEADRGA LYKEKVLSSG PYKFEGNYEP GVQLNLVRNE HWDAETDPIR PALPDEVTVQ IGIDQDEIDQ RLVNGDLDVD LSGVGVGPAM KSSLLTDEDA QANLDNPYSG ALRYVNIHTP VIEDVACRQA IMYAADRDSL HRAWGGETGG DIATNLLPPT IQGSNPESDL YPSDDDKGDL AAAEAKLEEC GEAEGFSTTI AVRDGRPNDI ATAESLQESL KRVGIEVDIQ TFPAEDFFAQ YAGSQDYVRE NNIGLSVSGW IPDWATGYGF ASKITDGDAI QATGNYNTSE LDDPEINALW DEALATEDPD ERASIYEQID TLVMEQAAIL PVVFDRALFY RSDELTNVYY TSSYAMYDFM ALGVDRG
|
| |