Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0694 |
Symbol | |
ID | 9244536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 854431 |
End bp | 855678 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003678645 |
Protein GI | 297559671 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.333505 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATTCC GCAGCGCACG AGCGGTCACG GGCATCGCCG CGATCGCACT GATGGCGACC GCGTGCAGTG GTGGCGACGA GGGCGGGGAG GCCGCCGCCG AGGGCAGCCT CGTCATCTGG TCCGACCCCG AGCGCGCCGA CGCCATCAAG GCGGCCGCCC AGGAGTTCGC CGAGGCCAAC GGCATCGAGG TCGAGGTCCA GGGCCTGACC TTCGGCGACA TCCAGGGCGA CGTGCTCAAC GCCCACCAGG CCGGAAACGC CCCCGACGTC TTCATCGGCG CCCACGACTG GACGGGCAAC CTCGTGCGCA ACGGCGCCGT CCAGCCCATC GAGCTGCCCC AGGACCGCGC CTCCGGCCTG GACGAGACCT CGTTGCAGGC CCTCAACTAC GACGGCCAGC TCTTCGGTGT TCCCTACTCC CAGGAGAACA TCTTCCTCAT GCGCAACACC GACCTGGCGC CGGACGCCCC CGCGACCTTC GAGGAGATGG TCGAGGTCGG CACCGAGCTC AAGGACTCCG GTGAGACCAG CGAGGTCCTG TCCATGGCCG TGGGCCAGGA GGGCGACCCC TACCGGATGA ACGCCCTGTT CACCTCCGCG GGCGGCTACC TCTTCGGCCA GGACGAGGAG GGCAACTGGG ACCCGACCGA CCTGGGCGTG GGCACCGACG AGTCCATCGC GGCCATGGAG AAGGTCGCCG AGTACGGCGA GGCCGGGGAG GGCGTGCTGC GCCGCTCCAT CACCCTGGAG AACGACGCCT CCCTGTTCTA CGAGGGCGAG GCCCCCTTCT TCGTCGCGGG TCCGTGGAAC GTCGCCGACG CCAACGAGGC GGGCGTCAAC TACGAGATCA GCCCCATCCC CGGCTTCGAG GGCGAGGAGC CCGCCAGCCC CTACATCGGC TACCAGGCGT TCTTCGTCAC CGAGGGCAGC GCCAACAGCG CCCTGGCCCA GGAGTTCGTG ACCAACTACG TCACCGACAC CGACTTCGTC CTCAGCCTCT ACGAGGCCGA CCCCCGCATG CCGGTGCAGA CCGAGGCCCT GGAGAGCGTC TCGGCCGACG ACCCCACCAT CGCCGCGATC TCCGAGGCCG AGGCCGGGGC CGAGGGCATG CCGATGCCCT CCATCCCGGA GATGGGCGAG ACGTGGGAGC CGCTGGGCAT CGCCCAGGCC GCCGTCATCG CCGGTGAGGA CGTGCGCGAG GCCATGGAGG CCACCCACGA AACGATCGCC TCGCAGATCG GCGAGTAG
|
Protein sequence | MRFRSARAVT GIAAIALMAT ACSGGDEGGE AAAEGSLVIW SDPERADAIK AAAQEFAEAN GIEVEVQGLT FGDIQGDVLN AHQAGNAPDV FIGAHDWTGN LVRNGAVQPI ELPQDRASGL DETSLQALNY DGQLFGVPYS QENIFLMRNT DLAPDAPATF EEMVEVGTEL KDSGETSEVL SMAVGQEGDP YRMNALFTSA GGYLFGQDEE GNWDPTDLGV GTDESIAAME KVAEYGEAGE GVLRRSITLE NDASLFYEGE APFFVAGPWN VADANEAGVN YEISPIPGFE GEEPASPYIG YQAFFVTEGS ANSALAQEFV TNYVTDTDFV LSLYEADPRM PVQTEALESV SADDPTIAAI SEAEAGAEGM PMPSIPEMGE TWEPLGIAQA AVIAGEDVRE AMEATHETIA SQIGE
|
| |