Gene Information |
Locus tag | Ndas_1926 |
Symbol | |
ID | 9245776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2346178 |
End bp | 2347239 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003679859 |
Protein GI | 297560885 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
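The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it matches the listed 1062 bp), and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Ndas_1926.
length_bp = gene_length(2346178, 2347239)
print(length_bp)                           # 1062
print(expected_protein_length(length_bp))  # 353
```

Both results agree with the Gene Length and Protein Length fields, which is what one would expect for a CDS that is neither spliced nor a pseudogene.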
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.932979 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCA CCGCACACCC GCGCCGCATG CTCGCCGCGC TCGCCGGGAC GGCGGCGCTG GCCCTGGTGA GCGCCTCGTG CGCACCGGTG CGCGAGGACG ACCCCGACAC GCTGGTGGTG AGCACCTTCG CCTTCGCCAC CGAGGAGTTC ACCGAGGTGG TGGCCGACCC CTTCGAGGCC GAGACCGGGA TCCGGGTGGT CCTGGACACC GGCAACAACG CCGGGCGGCT CACCAAGCTC AGGATCAACG CCGAGACGCC CGACACCGAC GTCGTGCTCA TCTCCGACTA CTACGCCCAG ATCGGCAAGG ACATGGGGCT GTTCGCCCCC GTCGACCCCG CCGACGTGCC CAACCTCGAC GCCATCCAGC CCTGGGCGGT GGACCCGGAC GGGTACGGCC CCGCCTACAC CTTCCAGCTC CTGGGCCTGC TCTACCGCAC CGACCTCGTC GAGGAGGCCC CCGACTCCTG GGACGACCTG TGGGCCGAGC CCGAGGGAGG GTACGTGCTG CCCGACATCT CGGTCTCGGC CGGTCCGATG TTCGTCCTGG CCGCGGGCGA ACACTTCGGC TCGGGTCCCT CCGACCCCGA CGCCGGCTTC GAGGCGATGG GCCGGATCGG CGCGGACGCG CTCCAGTTCT ACACCGGCTC CACCGAGCTC ACCAGCCTGC TCGAACGCGG TGAGATCGCC ATGGCGCCCG GCCTGGACAA CTTCGCCATG GGCTCGGTGG AGGCCGGGCA GCCGATCGGC TTCGCCGCGC CCGAACAGGG CCGGGTGATG ACCGCCAACA CCGTCCAGGT GGTCGACGGG GCGCCCAACG AGGCCGGTGC ACTGGCCTTC GTCGACTTCC TGCTGCGCCC CGAGATCCAG GAGGGGATGG CCGAGGCCCT CTACGACAAG CCCGTGGCGC TGGAGGCCGA CCCCACCCCG CTCATGGAGC GGGTGTCGGG ACAGGCCGCC TCCTCCCCGT CCGACAGCGG CTACCACCAG GGCGACCTGG CCCTCATCGC GCAGGAGCGC TCCACCTGGC TGGACCGCTT CACCGAGGAG GTGGCGCGGT GA
|
Protein sequence | MSRTAHPRRM LAALAGTAAL ALVSASCAPV REDDPDTLVV STFAFATEEF TEVVADPFEA ETGIRVVLDT GNNAGRLTKL RINAETPDTD VVLISDYYAQ IGKDMGLFAP VDPADVPNLD AIQPWAVDPD GYGPAYTFQL LGLLYRTDLV EEAPDSWDDL WAEPEGGYVL PDISVSAGPM FVLAAGEHFG SGPSDPDAGF EAMGRIGADA LQFYTGSTEL TSLLERGEIA MAPGLDNFAM GSVEAGQPIG FAAPEQGRVM TANTVQVVDG APNEAGALAF VDFLLRPEIQ EGMAEALYDK PVALEADPTP LMERVSGQAA SSPSDSGYHQ GDLALIAQER STWLDRFTEE VAR
|
| |
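The gene and protein sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which start codons are permitted. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate a forward-strand CDS, dropping one trailing stop codon if present."""
    seq = clean_sequence(raw)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# The first three blocks of the gene sequence give the first ten residues.
print(translate("ATGAGCCGCA CCGCACACCC GCGCCGCATG"))  # MSRTAHPRRM
```

Passing the full Gene sequence field to `translate` and `gc_percent` allows a comparison against the listed Protein sequence and the rounded GC content of 72%. Because this gene lies on the + strand, no reverse complement is needed before translating.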