Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0752 |
Symbol | |
ID | 9244594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 922297 |
End bp | 923616 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | putative extracellular solute-binding protein |
Protein accession | YP_003678703 |
Protein GI | 297559729 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.31197 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCGACA CCACCCCCCA CCACCCCCCA CCACACGTCG GGCCCGGTCG CCGGGCCCTG CTCACCGGCG CCCTGGCCGC GGCCGGACTG GCCGCCGCGG GCTGCGCGCC CCCCGACGCA CTGCTGACCG GGGACACCCG CCTGCGGCAG TGGAACCTCT TCTCGGGCGG CGACGGCCTG CGGATGATCG AGATGCACGA CGCCTACCGG GCCGAGCACC CGGAGGTCGA CTTCCGGGCG ACCACCTTCA CCTGGGGCTC CCCCTTCTAC ACCAAGGTCG CGATGGGCGC GGCGGGAGGG CGCGGCGCCG ACATCGCCAC CGTGCACGTC TCCCGCCTGG AGAGCCTGGC CCCCGGGAGG CTGCTGGACC CGGTCGACCC GGCGCTGCTG GCCGAGGCCG GGATCGACGA CACCGTGATC CCGCCCAACG TCTGGGAGAA GTGCTTCTTC GACGGGCAGC TGTACGCGGT CCCCATCGAC ACGCACGTCC TGATCCAGTA CCACAACCTC GACGTGTGCC GCGAGGCCGG ACTGCTCGAC GCCGACGACC GGCTGGTCCA GGTCAGCGGG CTGGACGACT ACATGGCCAT GCTCCGCGAG ATCAAGGCGG TCACCGGGGC CTACGGCCTG TCCGTCGACA CCTGGCAGCC CTGGCCCAAC TTCTGGGCGC TCTACCGCCA GCAGGACGGG GAACTCCTCC TGGGCGAGGA CGACTTCACC ATGGACGACG ACAAGGCCCT GGCGGCCATG GAGGTCATGT ACCGGCTCTC CGAGGAGGAG CTGGCGCCCC GCCACTCGAT GCTGGCCGAC ACCGCGGCCA ACCTCTCCAA CGGCAGGGCC GGGCTGATGA TCCACGGCAA CTGGGAGATC CCGACGCTGG AGGCCGCCGG AACGGCCTTC TCGGCGTCCC AGTTCCCCGA CGTCTTCGGC AACCGCCGCA CCCGAGGCGA CTCGCACTGC TACGTGTTCC CGCACCAGCG CGACCCCGAC CCCGAGCGGA TCCGGGCCGC CGTCGGATAC GCCGCATGGA TGCTGCGCCA CAGCCTCACC TGGGCCGGGG GCGGCCACAT CCCCGCCTAC CGGCCCGTGG TCGAGAGCGC CGAGTACGAG GCGCTGCACC CCCAGTCCGC GTACCGCGAG GCGGCCGAGA ACGTGCAGTT CGAGCCCGAG GCCTGGTTCA GCGGCTCGGC GGGGCGCCTC CAGGAGGAGG CCAACGGCCC GCTCACCACC CTCCACCAGG GGACCCAGAC ACCCGAACAG GCGCTGGAAC AGCTCAAGGG AGCCATCCGC GACCTGCTGA CCGTGCCGTC ACCGGTGTGA
|
Protein sequence | MRDTTPHHPP PHVGPGRRAL LTGALAAAGL AAAGCAPPDA LLTGDTRLRQ WNLFSGGDGL RMIEMHDAYR AEHPEVDFRA TTFTWGSPFY TKVAMGAAGG RGADIATVHV SRLESLAPGR LLDPVDPALL AEAGIDDTVI PPNVWEKCFF DGQLYAVPID THVLIQYHNL DVCREAGLLD ADDRLVQVSG LDDYMAMLRE IKAVTGAYGL SVDTWQPWPN FWALYRQQDG ELLLGEDDFT MDDDKALAAM EVMYRLSEEE LAPRHSMLAD TAANLSNGRA GLMIHGNWEI PTLEAAGTAF SASQFPDVFG NRRTRGDSHC YVFPHQRDPD PERIRAAVGY AAWMLRHSLT WAGGGHIPAY RPVVESAEYE ALHPQSAYRE AAENVQFEPE AWFSGSAGRL QEEANGPLTT LHQGTQTPEQ ALEQLKGAIR DLLTVPSPV
|
| |