Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4837 |
Symbol | |
ID | 9248723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5729648 |
End bp | 5731303 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | ABC transporter related protein |
Protein accession | YP_003682726 |
Protein GI | 297563752 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCACCG CCCGAGTACG AGCACAGTCC CAGCTCACCC TCAACGACAT CACCCACCGC TACGGCGACC ACACCGTGCT CGACCGTGTC ACCCTCACCA TCAGCCCCGG CGAGCGCGTC GGGATCATCG GCGACAACGG CTCGGGCAAG AGCACGCTGC TGCGCCTGCT CGCCGGGGAG GAGGAGCCCA CCAACGGCGA CCTCACGGTC CGGGCTCCGG GAGGAACCGG CTACCTGCCG CAGACCCTGG AGGCGGTGGG GGTCTCCACC GGCACCGTCG GCGACGCGAT CGACTCCGCC CTGGCCGTCC TGCGCGACAT GGAGGCGCGC CTGCGCGCGG CGGAGAGCAC GCTCGGCTCG GCCACCCCGG AGGAGCTGGA CGCCTACGGC GACCTGCTGG CCCGGTTCGA GGCGCGCGGC GGCTACGACG CCGACGCGCG CGTCGACGCC GGACTGCACG GGCTGGGCCT TCCCGGCCTG GACCGCGACC GCCCGCTGAG GACCCTCTCC GGCGGGGAGC GCTCGCGCCT GGCCCTGGCC GCGATGCTGG CGGCCTCGCC CGAACTGCTC CTGTTGGACG AGCCCACCAA CGACCTGGAC GACCGCGCGG TCGCCTGGTT GGAGGACCGG CTCCGCCGCC ACCGGGGCAC GGTCGTGGCC GTCACCCACG ACCGGGCGTT CCTGGCCAAC GTGACCGACA CCGTCCTGGA GGTCGACCAC GACCTGCGCC GGATCCACCG GTACGGCAAC GGCTACGACG GGTTCCTGGC CGCGAAGGCC TCGGCCAGGG CCCGGTGGAT CCGCGAGTAC GAGGAGTGGA GGGCCGACCT GGCCCGGCAG CGCAGGCTGG CCGAGAACAG CATCGCGGCC CTGAAGGCCA TCCCGCGCAA GATGGAGAAG GCCGCGTTCG GCCACGGGGC TTTCAAGATG CGGGGGGCCG CGCACGGTTC CATGGGCCGC GTCCGCAACG CCAAGGAGAG GGTGGAGAGG CTGACCGCCG ACCCGGTGGC GCCGCCGCCG GTACCGCTGC GGATGACCGC CGGGTTCACC GGATCCCGCA CCGGTGAGAG CGCTCCGGCC GCCTCGCTGG AGGGGGTCCG GGTGGACCAC CGCCTGGAGG TGCCCTGCCT GGACCTCGCT CCGGGCGAAC GCGTGCTGGT CACCGGGCCG AACGGGGCGG GCAAGACCAC CCTGCTCCGG CTCCTGTCCG GGGACGCGGT CCCCGACGAG GGGCGGGTTT CGGTCCCCGG CCGGGTGGGT TACCTGCGTC AGGAGGACGG TTCCTTCGCC GCCGGACAGA CCGTGCTCGG CGCCTACGCC GCGGGGCGGC CCGGGTTCGC CGAGGACCAC CGGGACGAGC TCGCCTCGCT CGGGCTGTTC CGGCCCGGGG AGCTGGACCA GCCGGTGGCG TCCCTGTCGG TGGGCCAGCG CCGCAGGATC GAGGTCGCCC GGCTGACGTC GGGTGCCTAC GACCTGCTGC TCCTGGACGA GCCGACCAAC CACCTCTCCC CCGGTCTCGT GGAGGACCTG GAGGAGGCGC TGACACGTTA CGAGGGCACG CTGGTGGTCG TCACCCACGA CCGCAGGACC CGACAGCGCT TCACCGGCCG CCACCTGGAA CTACGGGAGG GGATCGTGGT CAGCGACTCC GTGTGA
|
Protein sequence | MLTARVRAQS QLTLNDITHR YGDHTVLDRV TLTISPGERV GIIGDNGSGK STLLRLLAGE EEPTNGDLTV RAPGGTGYLP QTLEAVGVST GTVGDAIDSA LAVLRDMEAR LRAAESTLGS ATPEELDAYG DLLARFEARG GYDADARVDA GLHGLGLPGL DRDRPLRTLS GGERSRLALA AMLAASPELL LLDEPTNDLD DRAVAWLEDR LRRHRGTVVA VTHDRAFLAN VTDTVLEVDH DLRRIHRYGN GYDGFLAAKA SARARWIREY EEWRADLARQ RRLAENSIAA LKAIPRKMEK AAFGHGAFKM RGAAHGSMGR VRNAKERVER LTADPVAPPP VPLRMTAGFT GSRTGESAPA ASLEGVRVDH RLEVPCLDLA PGERVLVTGP NGAGKTTLLR LLSGDAVPDE GRVSVPGRVG YLRQEDGSFA AGQTVLGAYA AGRPGFAEDH RDELASLGLF RPGELDQPVA SLSVGQRRRI EVARLTSGAY DLLLLDEPTN HLSPGLVEDL EEALTRYEGT LVVVTHDRRT RQRFTGRHLE LREGIVVSDS V
|
| |