Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1036 |
Symbol | |
ID | 9244882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1276322 |
End bp | 1278061 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | ABC transporter related protein |
Protein accession | YP_003678985 |
Protein GI | 297560011 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0188433 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0545959 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCCGCG CCTTCCTCAA GGCCCTGGGA CCCGAGCAGG CCGGGCCCAT GCGCGCGAGC CTCGCGCTCA CCACCGTCGT CTCGGCCCTC CAGGGCGTGC TGTTCGCCCT CCTGGTGCCC GTGCTGTCCC ACCTCCTGGG CCCCGACCCG GACCGGGCCT GGCCCTGGGC GGCCGTCCTG CTCGCCGCCA CCGCCGTCTA CGCTGTCCTG CGCGCGGGCA GCCTGTACCT CAACTTCCGC GTCGGCGGCG CGCTCTCGCG GGCCCTGCAC CACCGGCTCG GCGACCACGT CGTCCGCCTG CCCCTGGGAT GGTTCACCGG AGGCCGGGTC GGGGAGCTCA ACCGCCTCGC CACCGACGGC GTCTCCCGGG CCACCAGCCT GCCCGTGCAC CTGTACCCGC CCCTGGCCGA CGCGGTGGTC ACCCCCCTCG TCTCGGTCCT GGCCCTGTTC GTGTGGGACT GGCGGATCGC GCTCGCGGCC GCCGCCTGCC TCCCGCTGCT GTGGATGGTC TTCACCCTGT CCGGGGAGGC GGTCGGGCGC AACGACGCCG CACGCGACGC CGTCACCGAC GAGGCCGCCG ACCGGGTCCT GGAGTACGCC CGCGCCCAGC CCGTGCTGCG CGCCTTCGGC AGGACGGACA AGGGCGGCCA GCGCCTCGAC GCGGCGCTGG AGGCCGAGCA CGGCGCCGCC CGCCGCCTGC TGCTGCGCGC GGTCCCCGCC CTGCTCGGCT ACTCCTTCGC GGTGCGCCTG GCCTTCGGCC TGCTCCTGGT CGCCACCGTG TTCCTCGCGC TCGGCGGCAC CCTCGACGCT CCCCTGGCGG TCGCGCTGCT CGTGCTCGTG GCCAGGTTCG TCCACCCCCT GTCCGGCGCG GCCGACCAGG GGGCCGCCCT GAGGATGGCG ACGAACGGGC TCGACCGGAT CAACGCCGTC CTGGAGGCGC GCCCCCTCCC CGAACCGGAC ACGCCGGTGC CGCCCCGGGG CGCGGACGTG GAGTTCGACG ACGTGTCCTT CTCCTACGCC CCGGACGGGC CCCGCGTCCT GGACGGGGTC TCCTTCCGGG CCGAACCCGG GACGCTCACC GCCCTGGTCG GGCCCTCGGG TTCGGGCAAG ACCACGGTGG CCCGGCTGCT GGCGCGCTTC CACGACGTCG ACGCGGGCAG CGTGCGCCTC GGCGGGGTGG ACGTGCGCTC GGTCGGCAGC GAGGAGCTGT CCCGGCACGT GGCGATGGTC TTCCAGGACG TGTACCTGTT CGACGCGAGC ATCGCCGAGA ACGTGCTCCT GGCCGACCCC GCGGCCACGC GGGAGGACCT GGACCGGGTG GCCGCCGCCT CCGGCCTCGA CGCGGTGGTG GCCGAGTTGC CGGACGGCTG GGACACCCGC GTCGGCGAGG GCGGCGCCTC GCTCTCGGGC GGGCAGAAGC AGCGGGTCTC CATCGCCCGC GCCCTGCTCA AGGACGCGCC GGTCGTCGTC CTGGACGAGG CCTCGGCCGC CCTGGACGCG GAGAACGAGG CGCTGCTGAC CGCGACCGCC GTTTCCCTGG CGCGGGAGCG CACGGTGCTC GTCATCGCCC ACCGGCCCGC CACCGTGGCC GCCGCGGACC GGGTGGTCTT CCTCGACGCC GGACGCGTCG CGGAGGCCGG AACCCCGGCC GAACTCCTCG CCGCGGGAGG GCGCCACGCC GAGTTCGCCC GTGCGCGCGA GCGGGCCCGG GGCTGGAGGC TGACCGCCGA ACCCTCCTGA
|
Protein sequence | MIRAFLKALG PEQAGPMRAS LALTTVVSAL QGVLFALLVP VLSHLLGPDP DRAWPWAAVL LAATAVYAVL RAGSLYLNFR VGGALSRALH HRLGDHVVRL PLGWFTGGRV GELNRLATDG VSRATSLPVH LYPPLADAVV TPLVSVLALF VWDWRIALAA AACLPLLWMV FTLSGEAVGR NDAARDAVTD EAADRVLEYA RAQPVLRAFG RTDKGGQRLD AALEAEHGAA RRLLLRAVPA LLGYSFAVRL AFGLLLVATV FLALGGTLDA PLAVALLVLV ARFVHPLSGA ADQGAALRMA TNGLDRINAV LEARPLPEPD TPVPPRGADV EFDDVSFSYA PDGPRVLDGV SFRAEPGTLT ALVGPSGSGK TTVARLLARF HDVDAGSVRL GGVDVRSVGS EELSRHVAMV FQDVYLFDAS IAENVLLADP AATREDLDRV AAASGLDAVV AELPDGWDTR VGEGGASLSG GQKQRVSIAR ALLKDAPVVV LDEASAALDA ENEALLTATA VSLARERTVL VIAHRPATVA AADRVVFLDA GRVAEAGTPA ELLAAGGRHA EFARARERAR GWRLTAEPS
|
| |