Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3658 |
Symbol | |
ID | 9247527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4388668 |
End bp | 4389768 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003681562 |
Protein GI | 297562588 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.720757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAATGC TGCGGCTTCC GGCGGCACTC GCGGCCCTGG CCCTCGCGGT CAGCGCCTGC TCGGGTGGCG GCGGGAACAG CGACGGCGGT TCGGGTCAGT ACCCCAGGAA CGAGACCCTG TACACCACGG GTACGGCCTG GGAGGCGCCG ACCAGCTGGA ACCCGATGAT GCGGGGCCAG TTCGCGGTCG GCACCAACGG CCTGGTCTAC GAGTCGCTCT TCCACTACGA CGCGGACGCG GGAGAGTACG TCCACTGGCT CGCCGAGAGC GACGAGTGGA CCTCGGAGAC CGAGCACGTG ATCACCCTGC GCGAGGGCGT CACGTGGAAC GACGGCGAGC CCTTCGTCGC CCAGGACGTG GTCACCACGC TGGAACTCGG CCAGGTCCCC GGAGTCCCCT ACAGCAACGT CTGGGACTAC ATCGAGAGCG TCGAGGCCAC CGACGAGCGC ACGGTCACCG TCACCTTCTC GGAGAGCCGT CCGCAGGAGT GGATGAACTG GGCCTACTCC AACCCCATCG TCCCGGACCA CATCTGGGCC GGCATGGAGG AGAGCCAGGT CGCCGACAGC CCCAACGAGA ACCCGGTCGG CACCGGCCCC TACGTCTACG AGTCGCACAC CGACGACCGC ATGGTCTGGG AGCGCAACGA CGAGTGGTGG GCCATCGAGG CCCTCGACAT GACGATGGAC GCCCGCTACA TCGTCGACAT CGTCAACGCC TCCAACGAGG TCACGATGGG CATGCTGAAC CAGGGCGAGG TCGACCTCTC CAACAACTTC CTGCCCGGTA TCGACCAGGT CCTCAACAGC AACGAGACCA TCACCAGCTT CTACGACGGC CCCCCGTACA TGAAGAGCGC CAACACGGCG TGGCTCATCC CGAACCACAC CCGTGAGCCG CTCAACGACA CGGCGTTCCG CCAGGCCCTG GCCCACTCGA TCAACATCAC CCAGATCGTC GAGGGCCCGT ACGCCAACCT GGTCCAGGCG GCCAACCCCA CGGGTGATGA TGCGGGGCCA GTTCGCGGTC GGCACCAACG GCCTGGTAAC GAGTCATATA TATCTACGAC GAGGTCGCGG AAGAGAACGT CCACTGGCTA A
|
Protein sequence | MRMLRLPAAL AALALAVSAC SGGGGNSDGG SGQYPRNETL YTTGTAWEAP TSWNPMMRGQ FAVGTNGLVY ESLFHYDADA GEYVHWLAES DEWTSETEHV ITLREGVTWN DGEPFVAQDV VTTLELGQVP GVPYSNVWDY IESVEATDER TVTVTFSESR PQEWMNWAYS NPIVPDHIWA GMEESQVADS PNENPVGTGP YVYESHTDDR MVWERNDEWW AIEALDMTMD ARYIVDIVNA SNEVTMGMLN QGEVDLSNNF LPGIDQVLNS NETITSFYDG PPYMKSANTA WLIPNHTREP LNDTAFRQAL AHSINITQIV EGPYANLVQA ANPTGDDAGP VRGRHQRPGN ESYISTTRSR KRTSTG
|
| |