Gene Information |
Locus tag | Ndas_3996 |
Symbol | |
ID | 9247868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4778532 |
End bp | 4780310 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | ABC transporter related protein |
Protein accession | YP_003681899 |
Protein GI | 297562925 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
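The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start/End values are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: one codon per residue, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4778532, 4780310)
print(length, protein_length(length))
```

With the coordinates listed here, this reproduces the reported 1779 bp gene length and 592 aa protein length.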
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.181167 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTGG TCAATCTTCA GGACGTGTCC CTGGCCTACG GGCCGCTCGT ACTCCTCGAC AAGGTGTCGC TCGGCGTCGA CGAGGGTGAG CGCATCGGCG TCGTCGGCCG CAACGGCGGC GGCAAGTCCA CGCTCATCTC CGTGCTCTCG GGCATCACCC GGCCCGACTC CGGGCGGGTG GTGCACAACC GCGGGCTGCG CATCGGCTTC CTGCACCAGC GCGACACCTT CCCCGACTCC ACAGTCGGGG AGTTCGTCCT GGGCGACCGG GCCGAGCACG AGTGGGCGGG CGACGCCCGC GTCCGCGACA TCCTGCGCGG ACTCCTCGGC GGCTGGGCCC TCGACACCCC CATGAGCGGC CTGTCCGGCG GCGAGCGCCG CCGCTCCGGC CTGGCCCGGC TCCTGGTGGG CGACCACGAC CTCATCGTGC TCGACGAGCC CACCAACCAC CTCGACATCG AGGGCATCGC CTGGCTCGCC GAGCACCTGC GCGCCCGTCC CGAGGCGCTC GTGGTCGTCA CCCACGACCG CTGGTTCCTG GACGCCGTCA CCACCCGCAC CTGGGAGGTC GGACGCGGCG CGGTCGAGCG CTACGAGGGC GGTTACGCCG CCTACGTCCT GGCCAAGGCC GAGCGCGAGC GCCAGGCGGC CGCCGCCGAG GAGCGCCGCC AGAACCTCAT GCGCAAGGAG CTGGCCTGGC TGCGCCGCGG CGCCCCGGCC CGCACCTCCA AGCCCAAGTT CCGCATCGAG GCGGCCAACC AGCTCATCGC CGACGAGCCG CCGCCCCGCG ACACCGTCGA ACTGGTCAAG TTCGCCAGCT CCCGGCTGGG CAAGACCGTC ATCGACCTCA AGAACGTGTC CGCGTCCGTC CCCGACCGGA CCCTGCTCGA CCACCTCACC TGGCAGCTGG GCCCCGGCGA CCGGGTCGGC CTGGTCGGGG TCAACGGCGC GGGCAAGTCC ACCCTGCTCA GGATCCTGGC CGGGGAGCGC GAGCCCGACT CCGGCAGCGT GCGCACCGGC CGGACCGTCA AGCTCGCCCA CCTGTCCCAG AACGTCGCCG AACTCGACCC CGCCGCCCGC CCCCTCCAGG CGGTCATGGA CGTGCGCGAG CACGTCACCA TCGGCAAGCG CGACTACACC GCCAGCCAGA TGCTGGAGCG GTTCGGGTTC CGCGGGGAGC GCCAGTGGAC CCCGATCGGC GACCTGTCCG GCGGTGAGCG ACGCCGCCTC CAGCTGCTGC GGCTGCTCAT GGACGAGCCC AACGTGCTGC TGCTGGACGA GCCCACCAAC GACCTGGACA TCGAGACGCT CACCGAGCTG GAGGACCTGC TCGACGGCTG GCCCGGCTCC CTGGTGCTGG TCAGCCACGA CCGCTACTTC CTGGAGCGCA TCACCGACCG CGTGCTGGCT CTGATGGGCG ACGGCGGCCT GGCCTTCCTG CCCGGCGGCG TGGACGAGTA CCTCCAGCGC CGCGCGGCCA GCGCGGGCGA GGCCACCGTG CCGCTGGGCG CCTCCGAGGC CGCCGCGGCC CAGGCCCCGG CGGAGCAGCC CGCGGTGTCG GCGGCCGAGC GGCGCGCCGC GCAGAAGGAG ATGCAGCGCG TCGAGCGGCG CATGGACCGC ATCGCCAAGC GGGAGGCGGA GCTGCACGAG CTGATGGCCG CCGCGGCCGA GGACTACACC CGGCTCGCCG AACTCGACGC AGAGGCCAAG GCCCTGGCCG TCGAGCGCGG GGAGCTGGAG GAGGTCTGGC TGGAGCAGGC CGAACTCGTC GGGGAGTGA
|
Protein sequence | MNLVNLQDVS LAYGPLVLLD KVSLGVDEGE RIGVVGRNGG GKSTLISVLS GITRPDSGRV VHNRGLRIGF LHQRDTFPDS TVGEFVLGDR AEHEWAGDAR VRDILRGLLG GWALDTPMSG LSGGERRRSG LARLLVGDHD LIVLDEPTNH LDIEGIAWLA EHLRARPEAL VVVTHDRWFL DAVTTRTWEV GRGAVERYEG GYAAYVLAKA ERERQAAAAE ERRQNLMRKE LAWLRRGAPA RTSKPKFRIE AANQLIADEP PPRDTVELVK FASSRLGKTV IDLKNVSASV PDRTLLDHLT WQLGPGDRVG LVGVNGAGKS TLLRILAGER EPDSGSVRTG RTVKLAHLSQ NVAELDPAAR PLQAVMDVRE HVTIGKRDYT ASQMLERFGF RGERQWTPIG DLSGGERRRL QLLRLLMDEP NVLLLDEPTN DLDIETLTEL EDLLDGWPGS LVLVSHDRYF LERITDRVLA LMGDGGLAFL PGGVDEYLQR RAASAGEATV PLGASEAAAA QAPAEQPAVS AAERRAAQKE MQRVERRMDR IAKREAELHE LMAAAAEDYT RLAELDAEAK ALAVERGELE EVWLEQAELV GE
|
| |
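The sequences above are printed in space-separated blocks of ten characters, and the gene sequence is given in coding orientation (it begins with ATG and ends with TGA) even though the gene lies on the minus strand. The sketch below shows one way to strip the block formatting, compute GC content, and translate the CDS. It is an illustration under stated assumptions, not the method used to build this record: the helper names are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = clean_sequence("ATGAATCTGG TCAATCTTCA GGACGTGTCC")
print(translate(prefix))
```

Applied to the first 30 bases, the translation gives MNLVNLQDVS, which matches the first block of the protein sequence listed above. Running the same functions on the full gene sequence would allow the reported GC content and protein sequence to be checked.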