Gene Information |
Locus tag | Ndas_1085 |
Symbol | |
ID | 9244931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1332883 |
End bp | 1334238 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003679033 |
Protein GI | 297560059 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
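The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1332883
end_bp = 1334238
gene_length_bp = 1356
protein_length_aa = 451


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coded_residues(length_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1356
print(coded_residues(gene_length_bp))  # 451
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.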
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.474529 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCACC CACCACGCAG ACCCAGCCCG CCGACCGGGC CGCCTCCGCC GTTCTCGCGG CGCCGCGTGC TCCTGGGCAC GGGAGCCCTC GCCCTGGGCT CGGCCCTGGG CGCCGCCGGA TGCGCCCCCG CCCCCGGTTC GGGATCGACC ACGCAGGTGC GGTTCTGGAG CCTGTTCCAG GGCGGCGACG GCGCCCGGGT GCAGACCATG CTGGACGCGG TGCGCGAACA GGCCCCGCAC CTGGACGTCA CCCCCAGCAC ACTGGCCTGG GGACCGCCGT ACTACACCAA GCTGGCGGTG GCCTCCGTGG GCGGTCGGGC CCCCGAGACG GCCGTGCTGC ACCTGTCCCG CCTGCCCGGG TACGCCCCCG GCGGGCTGCT CGAACCCTTC GACCTGGACC TGCTGGCCGA GTTCGGGGTC ACCGCCGAGG ACTTCGTACC CGACCTGTGG GAACGCGGCA TCCACGACGG CGCCACCTAC GCCGTCCCGC TGGACACCCA CCCGGTGATC GTCTTCTACG ACGCCGAAGT CGCCGACCGG GCCGGTCTGC TCGACGGGGA CGGGAAGCTG ACCGGGATGG ACTCCCCCGA GGGGTTCCTC GCGGCCTCCC GGGCGCTGGC CGAGGCCGGG GGCGGCAACG GCGTCTCCTA CGGGCACGTC AACGACGACT CCCAGGGGTG GCGGCTGTTC TGGATGCTGT ACAACCAGAC CGGCGCGTCC ATGGAGCTGC CCGGGGGCGG ACCGGCGGTG TTCGACCGCG ACGCGGCGCT GCGCGTGTAC TCCTTCCTCG CCGAACTGCT CGACGGCCGG ACGTCGGAGC CGGACCTGGA CTACCCCACC GCCCTGGCGG CCTTCGCCTC GGGGCGCTCG GCGATGCTCG TGTGCGGGGA GTGGGAGCTG CCCTACCTGT CGGAACACGT GGAGAACCTG GGGGCGGCCC CCTTCCCCAC GGTCTTCGAC CAGCCCGGCG GGTACGCCGA CTCCCACGCC TTCGTGCTGC CCCGCCAGGG CGACCCCGAC CCCGCACGGG TGCGCGCCGC CCACGAGTTC GTGGCGCTCA TGGTGCGCAA CAGCCTGATC TGGGGCGAGG CGGGCCACAT CCCGGCCTAC TCGCCGATCG CCCAGTCGCC GGAGTACCTG GCGCTGGACC CGCAGTCGGA CTACGCCGCC GCCGGGGAGA CCCCCGTGCT CGACCCCGAG GTGTGGTTCG CCGGGGCCGG ATCGCGGTTC CACTCCGACG TGAGCGAGGC GCTGCGCACG GCCCTGACCG GCGACGGACC CGAGGCGGCG GTGGACCACC TGGGCCGGAC CCTGGACTCC TGGGCCGCCC GCACCAACCC GGGAGGCCAG GAATGA
|
Protein sequence | MPHPPRRPSP PTGPPPPFSR RRVLLGTGAL ALGSALGAAG CAPAPGSGST TQVRFWSLFQ GGDGARVQTM LDAVREQAPH LDVTPSTLAW GPPYYTKLAV ASVGGRAPET AVLHLSRLPG YAPGGLLEPF DLDLLAEFGV TAEDFVPDLW ERGIHDGATY AVPLDTHPVI VFYDAEVADR AGLLDGDGKL TGMDSPEGFL AASRALAEAG GGNGVSYGHV NDDSQGWRLF WMLYNQTGAS MELPGGGPAV FDRDAALRVY SFLAELLDGR TSEPDLDYPT ALAAFASGRS AMLVCGEWEL PYLSEHVENL GAAPFPTVFD QPGGYADSHA FVLPRQGDPD PARVRAAHEF VALMVRNSLI WGEAGHIPAY SPIAQSPEYL ALDPQSDYAA AGETPVLDPE VWFAGAGSRF HSDVSEALRT ALTGDGPEAA VDHLGRTLDS WAARTNPGGQ E
|
| |
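The gene sequence above is listed in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly even though the gene lies on the minus strand. The following is a minimal sketch of how the 10-base blocks could be cleaned, checked for GC content, and translated. It assumes the codon assignments of translation table 11 match the standard code (the two tables differ in permitted start codons, which this sketch does not model); the helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated with bases T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last 15 bases of the gene sequence in the record.
head = "ATGCCGCACC CACCACGCAG ACCCAGCCCG"
tail = "GGAGGCCAG GAATGA"
print(translate(head))  # MPHPPRRPSP
print(translate(tail))  # GGQE
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be compared against the record in the same way.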