Gene Information |
Locus tag | Ndas_0659 |
Symbol | |
ID | 9244501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 807245 |
End bp | 808522 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003678610 |
Protein GI | 297559636 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
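The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(807245, 808522)
print(length_bp, protein_length(length_bp))
```

For this record, the sketch yields 1278 bp and 425 aa, matching the Gene Length and Protein Length fields.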
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.579585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.728331 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTTCC CCAAGATCGC CGCGGCCACG GCCGTGGTCC TGCTCGCCTC CGCGTGCGGC GGCTCCGACT CCGGTGACGG GGGCGGCGAG GCCGACAGCC TGACCGTCTG GGTCATGGGC ACCTCCCAGG AGCCCCTCGT CGAGTACTTC GAGGACGTCG AGGCGCGCTT CCAGGAGAGC AACCCGGACG TGGCCGTCAA CGTCGAGTTC ATCCCCTGGC CCGACGCGCA GGAGGCCATC ACCAACGCCC TCGCCGGCGG TGACGCCCCC GACGTGCTGG AGGTCGGCAA CGACCAGGTC GCGGGCTGGG CCGCGCAGGG CGCCCTGATG GACATCACCG AGCAGGTCGA CGGCTGGGAC GCCGCCGCGG GCATCGACGA GAACGCCCTG GAGTACGGCA CCTACGAGGG CGTCCAGTAC GGCGTCCCGT GGTTCTCGGG CGTGCGCACC CTGTACTACC GCGCGGACTG GCTGGAGGAG ATCGGCCACG AGCCGCCGAC GACCTGGGAC GAGCTCGTCG AGGTGGCCGA GGCCATCGAG GAGGAGTACG ACGTCCCCGG CTTCGCCGCG CCGACCGACT TCACCAACGG CATCGCCAGC TTCATCTGGA GCAACGGCGG CTCGATCGCG GAGCAGAACG GCGAGGAGTG GGAGGGCACC CTCACCGACC CCGCCACCGT CGAGGCGATC GAGTTCTACT CCAGCCTGAC CACCGACGGC ATCTCGCCGC AGGACTACGT CGGCCAGAAC GAGCTCATCG CGCTGGCCGA CATGGCCAAC AGCCAGCTGG GCATGTACAT CGACGGCGGC TGGGCGATCG GCTCCATGGA GGAGCAGGCC GAGGACCCGG CGGTCATCGA GAACATCGTC GCCGCGCCCA TCCCCGGCGC CGAGGGCATC GCCCCGGCCT TCGCGGGCGG TTCCGCCCTG ACGGTGTTCA CCACCACCGA GCACCCGGAC CTGGCCTTCG AGCTGCTCAC CGTCCTCGGT GACGAGGAGG GCGGCCAGGG CTACGCCGAC GTCGCGGGCT TCTTCCCGGC CTACCCGCAC CTGCTGGAGA GCGAGACGTA CCAGGAGGAC CCCGCCACCG CCGCTGCCGC CGAGCAGATG CAGCACACCC AGTTCTTCCC GACCACCCCG CGCTGGACCG CGGCCGACCA GGACAACAAG ATCCTGCCGG GCGCCGTCCT GGAGATCGTG CAGGGAGGCG ACGCCGAGGA GGTCCTGGCG GCGGCCAACG AGGAGCTGAC CTCGATCCTC AACGAGCCGG TCGAGTAG
|
Protein sequence | MRFPKIAAAT AVVLLASACG GSDSGDGGGE ADSLTVWVMG TSQEPLVEYF EDVEARFQES NPDVAVNVEF IPWPDAQEAI TNALAGGDAP DVLEVGNDQV AGWAAQGALM DITEQVDGWD AAAGIDENAL EYGTYEGVQY GVPWFSGVRT LYYRADWLEE IGHEPPTTWD ELVEVAEAIE EEYDVPGFAA PTDFTNGIAS FIWSNGGSIA EQNGEEWEGT LTDPATVEAI EFYSSLTTDG ISPQDYVGQN ELIALADMAN SQLGMYIDGG WAIGSMEEQA EDPAVIENIV AAPIPGAEGI APAFAGGSAL TVFTTTEHPD LAFELLTVLG DEEGGQGYAD VAGFFPAYPH LLESETYQED PATAAAAEQM QHTQFFPTTP RWTAADQDNK ILPGAVLEIV QGGDAEEVLA AANEELTSIL NEPVE
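The gene and protein sequences above can be checked against each other by translating the nucleotide sequence with translation table 11. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, the space-separated 10-base blocks are assumed to be display formatting only, and the sequence is assumed to be given 5' to 3' on the coding strand. Table 11 assigns the same amino acids to codons as the standard code; it differs in the set of permitted start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon-to-residue map in NCBI order (bases T, C, A, G); identical
# amino acid assignments for translation tables 1 and 11.
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product("TCAG", repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove whitespace used for display blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence shown above.
print(translate("ATGAGGTTCC CCAAGATCGC CGCGGCCACG"))
```

Translating the first three blocks gives MRFPKIAAAT, the first block of the protein sequence, and the final blocks (AACGAGCCGG TCGAGTAG) give NEPVE followed by the TAG stop codon. Applying `gc_fraction` to the full gene sequence would allow the same kind of check against the GC content field.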