Gene Information |
Locus tag | Ndas_3351 |
Symbol | |
ID | 9247215 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4004083 |
End bp | 4005933 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | ABC transporter related protein |
Protein accession | YP_003681263 |
Protein GI | 297562289 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.46018 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGATC CATCGAAAGT AGACACCGGC TCGAAACCGG GTGCGCCCGA ACCCCGGTTC GCCCAGCTCC GGGTGCTGTG GTCGTTCGTG CGCCCGCACC GGAACAAGCT GGCGCTGGGC CTGGTGCTGG CCCTGTTCGG CTCGGCGCTC GAACTCGCCA ACCCGATGGT GATCAAGCTG GTCCTGGACA CCGTCTCCGG CGGGGGCGGC CTGCTCGTGC CGATCGCCCT GCTGCTGGGC CTGTTCGTGC TGGGCACGGT GTCCGGCCTG TGGCACTGGA TCCTCCTGGG CACCGTCGCC GAGAAGGTGG TGCTCGACGC GCGCACCTCG CTGGTGCGCC GCTACTTCCG CGCAGCGCTC ATCCCGCTGT CGCGCCGCTC CTCCGGCGAG CTCGTCACCC GGGCGACCTC CGACACGGTC CTGCTGCGCG AGGCCGCCTC CAGCAGCGTC ATCAGCCTCA TCAACGGCGG CGTGCTGCTG GTGGGAACGC TGGTCATGAT GGGCGTGCTG GACCTGTTCC TGCTCACGGT CACCTTCGTC GCGGTGCTCG TGGTCACCGT CCTGTTCCTG ACGCTGATGC CCGCCCTGGC CAAGGCGCAG GAGAGGGCGC AGAACTCCCT GGGCCTGATG GGCGGCATGC TCGACGGCGC GCTGCGCGCG GTCCGCACGG TCAAGGTCAG CCGGGCCGAG GAGCGCCTGA GCGGCCAGAT CCTCGAACAC GCGCGGGAGT CCGCGCGGCA CGGCGTGCGC TCGGTGCGGC GCGAGGCGGT CGCCTGGACG ATCGCGTTCA GCGGGATCCA GCTCGCCATC ATCTCCATCC TGGGCGTGGG CGCGCTGCGG GTGTCCTCGG GCGCGATCGA GGTCTCCACC CTCATCGCCT TCCTGCTCTA CGCGTTCACC CTGATGACCC CGGTCATGGA GCTGTCCCAG AGCGTCACCA CCCTCCAGTC GGGCGTGGCC GCGGCCAAGC GCATCCGCGA GGTGGAGGCC ATTCCGCTCG AACCCTCCTC CGAGGCGGCG GACACGGACG CGCCGGTCCC CTCCCCGGAC GGGGACCGTT CGGGCGCGCT GCTGGAACTG CGCGGGGTCA CCGCGCGGTA CGCGCCCGGC GCCGAGTCCG CGCTGGACGG CGTGGACCTG GCCGTCCCCC GGCGCGGGCA CACCGCGATC GTGGGGCCCT CCGGCGCGGG CAAGACCACC GTGTTCTCGC TGCTGCTGCG CTTCCTCGAA CCCGAGGAGG GGCAGCTGTT CCTGGACGGG ACCCCCTACC GGGAGCTCAC TCCCGGGCAG GTGCGCGGCC GCTTCGCCTA CGTCGAGCAG GACACCCCGG TCGTCCCCGG CACCATCCGG GAGAACCTGC TGTTCAGCCA CCCCGACGCC ACCGAGGAGG AGGTGCGCCG GGTCCTGGGC CAGGTGCGGC TGGCCGACAA GATCGACGCC CTGGAGGAGG GGCTGGACAC CCCGCTGGAC GCCACGTCCT TCTCCGGGGG CCAGCGCCAG CGCATCGCCC TGGCCCGCGC CCTGCTGCGC TCGCCGGACG TGCTGCTGCT GGACGAGGCC ACCTCGCAGG TGGACGCGAT CACCGAGGCC GCCATCACCG AGAGCGTGCG CGCCCACGCC GCGCGGGCGG CCGTGGTGAC CATCGCGCAC CGGCTGTCCA CCGTGATCCA CGCCGACACC ATCGTGCTGA TGGAGGACGG ACGGGTGCGG GCCAGGGGCA CGCACCGGGA GCTGATGGAC CGGGACGACC TGTACCGGGA GCTGGTCACG GCACTGCACA TCGCCGAGTC CGGGGCTCCG 
GACCCGGGCG GTGACCGGGC CGAGGCGGAC CGGGTGACGC CGGTCACGTG A
|
Protein sequence | MTDPSKVDTG SKPGAPEPRF AQLRVLWSFV RPHRNKLALG LVLALFGSAL ELANPMVIKL VLDTVSGGGG LLVPIALLLG LFVLGTVSGL WHWILLGTVA EKVVLDARTS LVRRYFRAAL IPLSRRSSGE LVTRATSDTV LLREAASSSV ISLINGGVLL VGTLVMMGVL DLFLLTVTFV AVLVVTVLFL TLMPALAKAQ ERAQNSLGLM GGMLDGALRA VRTVKVSRAE ERLSGQILEH ARESARHGVR SVRREAVAWT IAFSGIQLAI ISILGVGALR VSSGAIEVST LIAFLLYAFT LMTPVMELSQ SVTTLQSGVA AAKRIREVEA IPLEPSSEAA DTDAPVPSPD GDRSGALLEL RGVTARYAPG AESALDGVDL AVPRRGHTAI VGPSGAGKTT VFSLLLRFLE PEEGQLFLDG TPYRELTPGQ VRGRFAYVEQ DTPVVPGTIR ENLLFSHPDA TEEEVRRVLG QVRLADKIDA LEEGLDTPLD ATSFSGGQRQ RIALARALLR SPDVLLLDEA TSQVDAITEA AITESVRAHA ARAAVVTIAH RLSTVIHADT IVLMEDGRVR ARGTHRELMD RDDLYRELVT ALHIAESGAP DPGGDRAEAD RVTPVT
|
| |
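The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon. The function names are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information table of this record.
START_BP = 4004083
END_BP = 4005933
GENE_LENGTH_BP = 1851
PROTEIN_LENGTH_AA = 616


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, ASSUMING the gene length
    includes one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the ones reported in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: the coordinate span gives 1851 bp, and 1851 bp corresponds to 617 codons, that is, 616 residues plus the stop codon.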