Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5565 |
Symbol | |
ID | 9249468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 765561 |
End bp | 767219 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003683450 |
Protein GI | 297564477 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.379464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.358408 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCACCA CCCCCCTGCG CCAGCCCCCC AGCCGACGCC GCTTCCTCGG CCTGGTGGGC GCCTCCACCG CCGCGGCGCT CACCGCCGGA ACGCTCACCG GCTGCGGCTC GGAGTCGTCC AGCGGCGGAG GGTCCGCCTC CTCCGCCAGC TCGCTGGACG ACCTGATCCC CACGCACATC CCGTTCCAGG GGGTCACCCC CGACATCCCG GGACAGCACG GCGCCCCGGA CGGGTTCACC GCCTACCCGC AGGAGTTCGT CCGGGCCGTG TCCGAGGCCC CCGGCAGGGG CGGGAGCTAC ACCGCGATGA CCCCGCTGTG GGGGCCGATC CCGCCCGGCC TGGGCGACAA CTCCTTCTTC GAGTACGTCA ACGGGCGTCT GGGCGCCACG GTGGAGTTCA ACTTCCAGGA CGGCAACTCG GTCATCGACA AGATGAACGC GGTGATCGCC GGGCGCGACG TCGCCGACAT CACGATGATC CCCGACTGGG TCATCAACCT GATCCCGCAG TTCAACCGGG CCGTCGGCGA GCTGTTCGAG GACCTGACGC CGCACCTGGC CGGCGACGCG GCGCAGGCCT ACCCCCTGCT GGCCAACCTG GACAGCGACG CCTGGCGCTG GAACGTCTTC AACCAGCAGC TGCACGGGGT GCCCTGGCCC GCCGAGCCCT TCGGCAACTG GGTCCTGTAC CGGCGCGACC TCCTGGAGGA GTACGGCCTG GAGGCCCCGA CCAGCCCCGA CGACCTGTTC GCGATCGGCG AGGAGGTCAA CGACCCGGAC AACAACCGCT GGGCCTTCGG CGACTTCAAC CTCACCATGC GCCAGGTCTT CGGCGCGCCC AAGCAGTGGC GCTACTCGGG CGGCGAGCTC ATCCACATGT TCGAGACCGA GGAGTGGAGG GCCAGCATCG AGTACATGCG CAGGGTGTTC GACGCGGGGC TCGTCCACCC CGACATCGTC GCCCTGGGGG ACAACTCCAA GGAGCTGCTC AACTCCGGGC AGATCCTCTT CAACCAGGAC GGCATCGGCG CCTGGCACGA GGCCTACATG CAGATGCTCG GCGACAACCC CGACTTCCGC CTGGACCTGA TGCCGGCCTT CGGCAACGGC GGCGCCGACC CGGTGATGCA CCGCAGCGAC CCCTCCGCCC AGTCCGTGTT CGTACGCAAG GGCATGGAGC CGGAGCAGGT CGAGGAGATC CTCGGCATCA TCAACTACTG CGCCGCTCCG TTCGGCACGC GGGAGTACAT GGACTACCGC TACGGCGAGG CGGGCGCGCA CCACGAGCTC AACGACGAGG GCGCCCCGCA GCTCACCGAC ACCGGCAACG GCGAGGTCAA CGACGGCTAC TACTTCATCA GCGGACGCCC CCAGGCGATC ACCGAGAGCC AGTACCCGGA CTTCGTGCCA TGGAAGTGCG ACTGGTACAA CCACGCCGCC CAGTTCACCG AGGACGACCC GTTCGCGGGC ATCCGCATCC AGCGCCCCGA GCGCTTCTCC GGGGCCGAGA CCCCGATGAC CGACAGGGTC AACGACATCA TCCGGGGCCG CGAGGACCTC AGCGCCCTGG ACCAGGCCGT CGCGGACTGG CGCCGCGACG GCGGCGACGA GGGCCGCGAG TTCTACATGC GGGTCCTCCA GGAGCACGGC CGTGACTGA
|
Protein sequence | MRTTPLRQPP SRRRFLGLVG ASTAAALTAG TLTGCGSESS SGGGSASSAS SLDDLIPTHI PFQGVTPDIP GQHGAPDGFT AYPQEFVRAV SEAPGRGGSY TAMTPLWGPI PPGLGDNSFF EYVNGRLGAT VEFNFQDGNS VIDKMNAVIA GRDVADITMI PDWVINLIPQ FNRAVGELFE DLTPHLAGDA AQAYPLLANL DSDAWRWNVF NQQLHGVPWP AEPFGNWVLY RRDLLEEYGL EAPTSPDDLF AIGEEVNDPD NNRWAFGDFN LTMRQVFGAP KQWRYSGGEL IHMFETEEWR ASIEYMRRVF DAGLVHPDIV ALGDNSKELL NSGQILFNQD GIGAWHEAYM QMLGDNPDFR LDLMPAFGNG GADPVMHRSD PSAQSVFVRK GMEPEQVEEI LGIINYCAAP FGTREYMDYR YGEAGAHHEL NDEGAPQLTD TGNGEVNDGY YFISGRPQAI TESQYPDFVP WKCDWYNHAA QFTEDDPFAG IRIQRPERFS GAETPMTDRV NDIIRGREDL SALDQAVADW RRDGGDEGRE FYMRVLQEHG RD
|
| |