Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3337 |
Symbol | |
ID | 9247199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3989551 |
End bp | 3990834 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003681249 |
Protein GI | 297562275 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.806542 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGCA GAAGAACCCA CAAGTACGTT CCGCTCCCCG CCGCCGCCGC GGCGCTGGTC CTGGCCGCGA CGGCGTGCGG GAGCGGAGGA GGCGGGGCCT CCGGTTCCGA CGGCCTCCTG GTCTGGATCA TGCAGGGCAC CAACCCCGAC GAGACGGGGT TCTTCGAGGC GGCCAACGCC GCGTTCACCG AGGAGACCGG CATCGAGGTG GACGTCGAGT TCGTGCCGTG GCAGGACGCC CAGAACAAGA TCTCCACCGC CATCGCGGGC GGCACCATGC CCGACGTCGC CGAGCTCGGC AACACCTTCA CCCCCGGTTT CGCCGACGCC GGAGCCCTGC ACGACCTGTC CGGCTACGAC ATCGACACCT CCCAGTACAT CCCCGGCCTG ATGGAGATGG GCCAGCTCGA CGACGGTGTC TACGGCGTGC CCTGGTACGC CTCCATCCGC TCCGTCGTCT ACCGCACCGA CGTCTTCGAG GAGCACGGCC TGGAGGTCCC CGAGAACTGG GAGGAGCTGC GCGAGACCGC CCTGGCCCTG TCCGAGGCCG AGGAGGACAT GATCGCCTTC CCCGTGCCCG GAGACGCCCA GTACTCGGTC ATGCCGTGGA TCTGGGGCGG CGGCGGGGAG ATCGCGGTCG AGCAGCCCGA CGGCACCTGG GTCTCGGAGA TCGACAGCGA GGAGGCCCGT GCCGGGATCG GGTTCTTCAC CGGCCTGGCC CTGGAGGACA ACACCTCCAC CACCGGCGCC GTCAACTGGA ACGAGATCGC CGTCATGGAG GCCGTCGCGG AGGAGGAGGC CGCCATGGCC ATCCTCGGCA GCGCCAACCC CAAGGCCATC CTGGAGGCCA ACCCCGACCT GGAGGGCAGG CTGGGCTCCT TCACCCTGCC CGGCCAGGAC GGCGGGTACA TGCCCTCCTT CGCGGGCGGC TCGCTGCTGT CGGTCTTCGA GGGCACCGGC CAGGAGGAGG CCGCCTGGCA GTACGTCCAG CACCTGACCG GCGAGGAGTT CGGCATGCGC TGGTCCGAGG AGACCGGCTT CTTCCCGGGC GTGGTCGACC GGGTCGACAC CTTCTCCTCC TCCGCCGACC CCATCCTGGA GCCCTTCGCC GTCCAGCTCA ACGAGGCCAG CCGGGGCGTG CCCGTCACCC CCGCCTGGAC CCAGGTCGAG GCCGAGAAGG TCCTGGTCGG CATGCAGCAG GACATCCTCA ACGGCGAGGC CACCGTGGAC GAGGCCACCG AGAACGCGGC CGACGAGATC GAGCGCATCC TCAACGGGGG GTAG
|
Protein sequence | MARRRTHKYV PLPAAAAALV LAATACGSGG GGASGSDGLL VWIMQGTNPD ETGFFEAANA AFTEETGIEV DVEFVPWQDA QNKISTAIAG GTMPDVAELG NTFTPGFADA GALHDLSGYD IDTSQYIPGL MEMGQLDDGV YGVPWYASIR SVVYRTDVFE EHGLEVPENW EELRETALAL SEAEEDMIAF PVPGDAQYSV MPWIWGGGGE IAVEQPDGTW VSEIDSEEAR AGIGFFTGLA LEDNTSTTGA VNWNEIAVME AVAEEEAAMA ILGSANPKAI LEANPDLEGR LGSFTLPGQD GGYMPSFAGG SLLSVFEGTG QEEAAWQYVQ HLTGEEFGMR WSEETGFFPG VVDRVDTFSS SADPILEPFA VQLNEASRGV PVTPAWTQVE AEKVLVGMQQ DILNGEATVD EATENAADEI ERILNGG
|
| |