Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3999 |
Symbol | |
ID | 9247871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4782517 |
End bp | 4783875 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003681902 |
Protein GI | 297562928 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.220113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.250177 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCACACCA CCTACGCCGT CCGGGCGACC GCCGTCGCCG CCTGCGGACT GCTGCTCACC GGGTGCGCCG GGACGGGCGC CCTCCCCTCC GAGGACGGCG CCGTCCGGCT GACCGTCGCG ATCGTGTCCA ACCCGCAGAT GCAGGACGCG ATCTCCCTGG AGTCGCGGTT CCGCGCGGAG CACCCCGGCA TCGCGCTCGA CTTCGTGTCC CTGCCCGAGA ACGAGGCCCG CGCCAAGATC ACCACCTCCG TGGCCACCGG CGGGGGCGAG TTCGACGTCG TCATGATCAG CAATTACGAG ACCCGCCAGT GGGCCGAGTA CGGCTGGCTG GAGAACCTGC AACCCTCCAT CGACGCCGCC GGGGGCTACG ACCACGAGGA CTTCATCCCC TCCATCAGGG AGGACCTGTC CCACGAGGGC GACATGTACT CGGTGCCCTT CTACGGCGAG TCCTCCTTCC TCGCCTACCG CAAGGACCTG TTCGAGCAGG CCGGGGTGGA GATGCCGCCC GACCCCACCT GGGAGGAGGT GGCGGACCTG GCCGCCGAGC TCGACGGCGT GGAGCCGGGG GTCTCGGGGA TCTGCCTGCG CGGCCTCGCG GGCTGGGGCG AGGTGCTGAG CCCCTTCAAC AGCGTCCTGA ACACCTTCGG CGGGCGCTGG TACGACGAGG ACTGGAACGC CGAGATCGAC TCCCCCGAGT TCCGGCGCGC GGCCGAGTTC TACGTGGGCC TGGCGCGTGA GCACGGCCAG CCGGGCGCGG CCAACAGCGG GTTCGGGGAC TGCCTGAACC GCTACTCCCA GGGCCGGGCG GCCATGTTCT ACGACTCCAC CTCCATGGTC AGCACCATCG AGGACCCGGA CTCGGCGACC GTGGCCGGGC TCAACGGCTA CGCCGCGGCG CCGGTGGCCG AGACCGACTA CGGCGGCTGG CTCTACACCT GGGCGCTGGG CGTCCCCTCC ACCTCCGAGC ACAAGGAGGA GGCGTGGGCG TTCCTGGAGT GGATGACCGA CAAGGACTAC GTGCGCACCG TCGCCGAGGA GTACGGCTGG CAGCGGGTGC CGCCCGGCAA CCGGCTCTCC ACGTTCGAGG TCCCCGAGTA CCGGGAGGCC GCCCGGGCCT ACGCCGAGCC CATGCTCCAG GGCATCCAGG AGGCGGACCC CGAGGACCCG GGCACGCGCC CGGTCCCCTA CGAGGGCATC GGCTTCCTCG CCATCCCCGA GTTCCAGGAC CTGGGCACCC GGGTCAGCCA GCAGCTGAGC GCGGCCATAG CCGGGCAGAT CACCGTCGAG CAGGCGCTCG AACAGAGCCA GGAGTACGCC GAGGTCGTCG GTGGGACCTA CAGGGAGGAC GACCGATGA
|
Protein sequence | MHTTYAVRAT AVAACGLLLT GCAGTGALPS EDGAVRLTVA IVSNPQMQDA ISLESRFRAE HPGIALDFVS LPENEARAKI TTSVATGGGE FDVVMISNYE TRQWAEYGWL ENLQPSIDAA GGYDHEDFIP SIREDLSHEG DMYSVPFYGE SSFLAYRKDL FEQAGVEMPP DPTWEEVADL AAELDGVEPG VSGICLRGLA GWGEVLSPFN SVLNTFGGRW YDEDWNAEID SPEFRRAAEF YVGLAREHGQ PGAANSGFGD CLNRYSQGRA AMFYDSTSMV STIEDPDSAT VAGLNGYAAA PVAETDYGGW LYTWALGVPS TSEHKEEAWA FLEWMTDKDY VRTVAEEYGW QRVPPGNRLS TFEVPEYREA ARAYAEPMLQ GIQEADPEDP GTRPVPYEGI GFLAIPEFQD LGTRVSQQLS AAIAGQITVE QALEQSQEYA EVVGGTYRED DR
|
| |