Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5074 |
Symbol | |
ID | 9248963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 218337 |
End bp | 219680 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003682961 |
Protein GI | 297563988 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTACA GAGAAACGGA GACGGCCGTG AAGACACCCC CCACAGCACT CCTGTCCGCC TCGACGGCGC TGGTACTGGC CGCGGCACTG ACCGGCTGCG GTTCCGGCGA GGATTCCGGT GACCAGACGC TGACCTACTG GGCCAGCAAC CAGGGAGCGA GCGTGGAGGA GGACCGGGAG GTCCTCCAGC CCGTGCTGGA CCGCTTCACC GAGGAGACCG GGGTGGAGGT CGAGCTGGAG GTCATCCCCT GGAGCGAGCT GTACAACCGC ATCCTCACCG CGGTGAGCAG CGGCGACGGC CCCGACGTGC TCAACATCGG CAACACCTGG GCGGCCAGCC TCCAGGAGAC CGGCGCCTTC GTGCCCTACG AGGGCGCGGA CCTGGAGGCG GTGGGCGGCG AGGGGCGCTT CGTCGGCACC AGCTTCGCCA CCGGCGGCGC CGAGGGCCAG ACGCCGACGT CGGTGCCGCT CTACGGGCTG TCCTACGCGC TGTTCTACAA CCCCACCATG TTCGAGGAGG CGGGCATCGA GGAACCGCCC GCGACCTGGG ACGAGTTCGT CGACACCGCG GACGAGCTGA CCAGGGACAC CGACGACGAC GGCGACGTCG ACCAGTACGG GTTCGTGCTG GAGGGCGGCA ACGAGCGGCA GAACTCCCAC ATGGCCTTCA TCCTCGGCCA GCAGCAGGGC GGACGGCTGT GGGGCGAGGA CGGGCCCTCC TTCTCCTCCG ACGAGCAGGT CGCCGCGGTC AAGCAGTGGG TGGACCTGAT GGCCGTGGAG GAGGTCGTCG ACCCCAGCAG CGCCGAGTTC AGCGACGGAA CCCAGGGCAT CAGCGACTTC GTCGACGGGC GCGCGGCCAT GATCATCGTG CAGGGCAGCG CCCGCACCAG CATGGCCGCC CGCGGTTTCG AGGACTACGA GGTCGCCCAG GTGCCGATGC TCGACCCGCT GCCGGGCGAG CCCATCCAGA GCCACGCGGC CGGGATCAAC ATCAGCGTCT TCAACGACAC CGACGACAAG GAGGGCGCTC TGCGGCTGGT CGAGCACCTG ACCAGCCCGG AGGAGCAGGT GTACCTGTCC CAGGAGTTCC AGACGCTGCC GGTGGCCACC GAGGCCTACG ACAGCGAGGA GCTGCGGAGC GAGTCCATGG AGACCTTCCG CACGATCCTG ACCGAGCACT CCGCTCCGAT GCCGCTGATC CCCGAGGAGG GCCAGATGGA GACGGTGCTC GGCGAGGCGA TCGGCGGGCT CTTCGCCCGG GTGGCGACCG GAGACGAGGT CACCGAGGCC GACGTCCGCC AGGCCATGGA GGCGGCCGAG ACCCAGATGG ACGCCGCGAA CTAG
|
Protein sequence | MAYRETETAV KTPPTALLSA STALVLAAAL TGCGSGEDSG DQTLTYWASN QGASVEEDRE VLQPVLDRFT EETGVEVELE VIPWSELYNR ILTAVSSGDG PDVLNIGNTW AASLQETGAF VPYEGADLEA VGGEGRFVGT SFATGGAEGQ TPTSVPLYGL SYALFYNPTM FEEAGIEEPP ATWDEFVDTA DELTRDTDDD GDVDQYGFVL EGGNERQNSH MAFILGQQQG GRLWGEDGPS FSSDEQVAAV KQWVDLMAVE EVVDPSSAEF SDGTQGISDF VDGRAAMIIV QGSARTSMAA RGFEDYEVAQ VPMLDPLPGE PIQSHAAGIN ISVFNDTDDK EGALRLVEHL TSPEEQVYLS QEFQTLPVAT EAYDSEELRS ESMETFRTIL TEHSAPMPLI PEEGQMETVL GEAIGGLFAR VATGDEVTEA DVRQAMEAAE TQMDAAN
|
| |