Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1164 |
Symbol | |
ID | 9245014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1420290 |
End bp | 1421255 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | periplasmic solute binding protein |
Protein accession | YP_003679111 |
Protein GI | 297560137 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.231796 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGACCG CCGTGACGGC TCCGAAGGCG CTCCTGCGCG CCGCGACCGC GTCCGCCCTG GCCGGGGCGG TCCTGCTCGG CACGGCCTCC TGCTCCTCGC AGGAGGACGG GGGCGAGGGG ATCGTGGTGA CCACCAACAT CCTGGGCGAC CTCACGCGGG AGGTCGTGGG CGAGGAGGCC GAGGTCACCG TCCTGATGAA GCCCAACGCG GACCCGCACT CCTTCGGCAT CTCCGCTCGG GAGGCCGCGC TCGTGGAGAA CGCGGCGCTC GTGGTCTACA ACGGGCTCGG CCTGGAGGAG GGCGTGCTGC GCAACGTCGC CGCCGCCGAG GAGGCCGGGA TACCCGCCCT GGAGGTCGGC GCCCAGGTGG ACCCCCTGGC CTTCTCCCCC GGCGGGGACG CCGCGGACAA CGAGGACGAG GGCGAGCCCG ACCCGCACTT CTGGACCGAC CCCCGGCGGG TCGTCCGGGC CGTGGAGCTG ATCGCCGAAC ACGTGGTCGC CGAGGTGGAC GGCGTGGACG CCGAGGCCGT GCGCGCCAAC GCCGAGGCCT ACACCGCACA GCTGGAGGAG CTGGACGCGT GGATGGCCGA GGAGTTCGCC GCGATCCCGG AGGAGGACCG CAACCTGGTG ACCAACCACC ACGTCTTCGG CTACCTCGCC GAACGCTACG GGTTCGAGGT CGTGGGCGCC GTGATCCCCA GCGGCACCAC CCTGGCCTCG CCCAGCAGCT CCGACCTCAA GTCCCTGGCC GACGCCGTGA GCGCGGCGGG CGTCGAGGCC GTCTTCGCCG ACTCCTCCCA GCCCGACCGC CTGGCCACGG CCATGGCGGA GGAGGCGGGC GTGCACATCG AGGTCGTCCC CCTGTTCTCC GAGTCGCTCA GCGAGGAGGG CGGGGGCGCG GCCACCTACC TGGAGATGAT GCGCTCCAAC ACCGAGGCCA TCGCCACCGG ACTGCGTGGC GAGTGA
|
Protein sequence | MVTAVTAPKA LLRAATASAL AGAVLLGTAS CSSQEDGGEG IVVTTNILGD LTREVVGEEA EVTVLMKPNA DPHSFGISAR EAALVENAAL VVYNGLGLEE GVLRNVAAAE EAGIPALEVG AQVDPLAFSP GGDAADNEDE GEPDPHFWTD PRRVVRAVEL IAEHVVAEVD GVDAEAVRAN AEAYTAQLEE LDAWMAEEFA AIPEEDRNLV TNHHVFGYLA ERYGFEVVGA VIPSGTTLAS PSSSDLKSLA DAVSAAGVEA VFADSSQPDR LATAMAEEAG VHIEVVPLFS ESLSEEGGGA ATYLEMMRSN TEAIATGLRG E
|
| |