Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1044 |
Symbol | |
ID | 9244890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1286890 |
End bp | 1288380 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003678993 |
Protein GI | 297560019 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0364384 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.362635 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCACCTGC GCAAGATCCC TCCCGGCGCC TGCCTGGCAG CCGCCTCCGC CCTCCTGCTG ACCGCCTGCC TCCCCTCCGG CGGCGACACC GCCGCATCCG AGGAGGACGG GCGCATCGCG GTGGCGATGC TGCTGCCGCC GAGGTCGGCC CTCTCGCCGC TCACCGACGA CGCCTTCAAG CTCTCGCGCT GGAGCACCGC CGAGACCCTC GTCGTCCTGG ACGAGGTCGG CGACCCCCAG CCCGGCCTGG CCACCGACTG GACCCAGGTG GACGACACCA CCTGGACCTT CACCATCCGC GACGGGGTGA CCTTCCACGA CGGCACGGAG CTGACCGCCG ACGTGGCCGC CCACTCCCTG GAGTACGCCG CGCAGGCCTC GCCCAAGCCG CGCATCCTCG ACGGCGTCGA ACTCGGGGTC GAGGCCGAGG GCGACACGGT CACCGTCACC ACCGCCGAGG CCGACCCGCT CGTCCCCCAG CGCCTGTCCT CGCCGCAGCT GTCGATCCTG GCCGAGAGCG CCTACGAGGG CGACACGGTC AGCCCGGCCG GCACCGGCTC GGGCCCCTTC GAACTGGTCG AGGTCAACGG CACCACCTCC GCGAGCCTGG ACCGCTACGA CGGCTACTGG GGTGAGGCCG CCCTGGCCCC GGGCATCGAC GTCAGCTTCG TGCCCGACGG CACCGCCCGC GCCGCCGCCC TGCGCACCGG CGAGGCCGAC ATCGTCGAGG CCGTGCCCAC CTCCCAGGCC GCCCTGCTGG ACCCGGAACT CATCACCGAG GTCCCGATGC CGCGCACCAA CACCCTCTAC CTCAACACCC AGGACGGGCC CTTCACCGAC CCCGGCCTGC GCGCCGCCGC CCGCGAGGCC ATCGACCGCC CGACGCTGGT GGACGGCGTC TACGAGGGGC GCGCGGACGC GGCCGAGGGA CTGCTGGGCC CCGCGCTGCC CTGGGCCGCC GACCGCCCCG AACGCCCGGA GGCCGCCGAG GCCGCCGACC CCGACGGCGC CGCCATCACC CTGGCCACCT TCACCGACCG CCCCGAACTG CCCGAGGTGG CCACCGTCCT GGAGCAGCAG CTGGAGGAGG CCGGGTTCGA GGTCGAGCAG GTCGTGCGCG AGTACGCCAA CATCGAGGAG GACGCCCTCA ACGGCGAGTT CGACGCGTTC ATCCTCTCCC GCGCCACCGT TCTGGACTCC GGCGACCCCG TCGCCTACAT GACGAGCGAC TTCTCCTGCG ACGGGTCCTT CAACATCGCC CAGCTGTGCG ACGAGGACGT GGACGCGGCC CTGGAGGAGG CCGAGCGCAC CCCCGCGGGC GACGAGCGGC GCGCCGCGAT CCTGGAGGCC GAGGCCGCCG TCCTGGCCAC CGACGCCGCC ATCCCCATGC TGCACGAACG CGTCATCCAG GGCGACGCCG CCAACGTCGT GGACTCGGCC AAGGACCCGC GCGAGCGCCT GCTCGTCACC CCGCGGACCC GGCTGAACTG A
|
Protein sequence | MHLRKIPPGA CLAAASALLL TACLPSGGDT AASEEDGRIA VAMLLPPRSA LSPLTDDAFK LSRWSTAETL VVLDEVGDPQ PGLATDWTQV DDTTWTFTIR DGVTFHDGTE LTADVAAHSL EYAAQASPKP RILDGVELGV EAEGDTVTVT TAEADPLVPQ RLSSPQLSIL AESAYEGDTV SPAGTGSGPF ELVEVNGTTS ASLDRYDGYW GEAALAPGID VSFVPDGTAR AAALRTGEAD IVEAVPTSQA ALLDPELITE VPMPRTNTLY LNTQDGPFTD PGLRAAAREA IDRPTLVDGV YEGRADAAEG LLGPALPWAA DRPERPEAAE AADPDGAAIT LATFTDRPEL PEVATVLEQQ LEEAGFEVEQ VVREYANIEE DALNGEFDAF ILSRATVLDS GDPVAYMTSD FSCDGSFNIA QLCDEDVDAA LEEAERTPAG DERRAAILEA EAAVLATDAA IPMLHERVIQ GDAANVVDSA KDPRERLLVT PRTRLN
|
| |